Senior/lead SRE
Company: RIT Solutions, Inc.
Location: Glendale, CA 91205
Description:
Title: Senior/lead SRE
Location: Glendale, CA - (hybrid 3 Days onsite)
Job Description:
Bachelor of Science degree in computer science or related field or equivalent experience in technical operations and software engineering.
There will be 3 rounds of interviews for this position.
Core must-haves:
1) Linux and Windows system administration (candidates with Linux-only backgrounds will be considered).
2) At least 1 programming language
3) Cloud skills and public cloud hosting background
Must-haves:
Candidates are expected to have expert-level systems administration skills on both the Linux and Windows platforms, and must have experience with CI/CD platforms (GitHub Actions, GitLab CI), systems automation (Chef/Ansible/Terraform), systems development (Go, Python, Ruby), cloud automation tools (Boto, CloudFormation, Terraform), source control, cloud hosting, container computing, web technologies, and the DevOps team culture.
This position will also bring expertise on systems, operational excellence and application stability, security, performance, and capacity management, as well as documentation. This position works closely with Imagineering Technology Studio teams to brainstorm, architect, gather requirements, troubleshoot, and provide stellar customer support.
Misc. Info: Candidates must provide the last four digits of their SSN, birth month and day (mm/dd), and email address.
Design: Leading project/planning efforts, architectural design, engineering, attending meetings w/ various teams. - 30%
Build: Implementing, integrating and configuring solutions, tools, infrastructure and systems. - 30%
Run: Systems Administration and Application Support - Level 2 & Maintenance and Support - 30%
Basic Qualifications
Technical Requirements (e.g. languages, computer programs):
Understand how to install and configure operating systems, specifically with expertise in Linux and Windows Server.
Knowledge of software development Continuous Integration (CI) pipelines (GitLab CI, GitHub Actions)
Experience with Source Control Management systems (Git)
Experience in public and private cloud hosting services (AWS, Google Cloud, Azure, OpenStack, CloudStack) as well as familiarity with container computing (e.g. Docker, ECS, Kubernetes, Terraform).
Recognized as a subject matter expert on at least one OS and proficient in multiple operating systems, including OS performance monitoring, setup, configuration, tuning, and troubleshooting.
Proficient in web or web server technologies: Java, Node.js, Tomcat, IIS, Apache/nginx, MySQL, PostgreSQL, etc., including being able to perform basic setup, configuration, and troubleshooting.
Understand internet technologies and network protocols, including HTTP, basic load balancing configurations, security zones, VIPs, SNMP, REST, and DNS.
Able to implement existing base standards for new systems and/or applications with mentoring for all of the following:
o Site monitoring and instrumentation
o Application monitoring and instrumentation
o System monitoring and instrumentation
o Resiliency and performance
Able to diagnose simple to complex system problems.
Has experience with one or more load balancer platforms (setting up pools, VIPs, layer 7 routing, debugging).
Able to author tools and scripts to be used by others to automate repeatable production tasks in standard languages like Bash, Ruby, Python, or Go.
Advanced skills in at least one programming language such as Python, PHP, Ruby, Java, Go, Swift or C++ and able to build unit test suites for all software being developed.
Experience supporting and/or developing backend tools or services
Able to perform and provide in-depth analysis on load test runs against a moderately complex system.
Demonstrates exceptional troubleshooting methodology, including the ability to author new methodologies and teach them to the SRE team.
Independently resolve moderately to highly complex system and application incidents.
Able to identify and propose system and application fixes for performance bottlenecks.
Able to evaluate new application requirements for capacity and run-time best practices.
Able to evaluate new system and/or infrastructure solutions for technical feasibility against known requirements and standards.
Effective at dealing with change: Able to transition in role or handle a significant modification to workflow or technology with minimal ramp-up time and with very little guidance.
Communication and Leadership Requirements:
Preferred Qualifications: Experience with relevant internet technologies and with implementing, administering, and supporting production websites and backend support systems.
Required Education: BS degree in computer science