Compute Site Reliability Engineer (SRE) - Kubernetes

Company: Apple

Location: Seattle, WA 98115

Description:

Summary
Imagine what you could do here. At Apple, great ideas have a way of becoming great products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish. Join the Apple Services Engineering team as a site reliability engineer to help support and scale cloud services for thousands of development and operations engineers. This is a hands-on role to maintain and enhance SRE practices for a private cloud service to accelerate our ability to reliably and consistently deliver thousands of applications.

Description
As a Compute Site Reliability Engineer, you will be responsible for maintaining, monitoring, and improving the reliability, scalability, and performance of our Kubernetes-based infrastructure. You'll work closely with senior SREs, developers, and other engineers to ensure high availability and optimize our containerized applications. This is a fantastic opportunity for someone eager to grow their expertise in Kubernetes and cloud-native technologies.

AS AN SRE AT APPLE, YOU WILL:

- Operate, monitor, and triage all aspects of our production and non-production environments.

- Design, build, and implement innovative solutions for past, present, and future issues.

- Prepare alert-handling procedures and runbooks, and collaborate with other SRE teams.

- Participate in on-call rotations to troubleshoot and resolve production issues, minimizing downtime.

- Automate the deployment and orchestration of services in the cloud environment, along with other routine processes.

- Actively participate in capacity planning, scale testing, and disaster recovery exercises.
