Alibaba Cloud-SRE-Seattle

Apply Now

Company: Alibaba Cloud

Location: Seattle, WA 98115

Description:

Job Description

We are a product family team that owns ComputeNest, OOS (CloudOps Orchestration Service), and ROS (Resource Orchestration Service). Our team aims to offer a one - stop platform for partners to effectively manage and deploy their products. Moreover, we strive to provide a modern DevOps/CloudOps product or feature family, enabling easy and secure management and operation of cloud resources.
Design, write cloud product or ops system development code, and deploy.
Review code developed by other developers and provide feedback to ensure best practices (e.g., code sytle guidelines, design principles, security standards, and accuracy, testability, and efficiency).
Contribute to existing documentation or educational content and adapt content based on product or system updates and customer feedback.
Triage product or system issues and debug/track/resolve by analyzing the sources of issues and the impact on (virtual) server, network, or service operations and quality. Lead incident response, root cause analysis (RCA), and post-mortems to improve system resilience.
Participate in, or lead design reviews with peers and stakeholders to decide amongst available technologies.
Implement monitoring, logging, and alerting systems to proactively identify and resolve issues.
Optimize infrastructure performance and costs.
Partner with DevOps and development teams to integrate SRE principles into CI/CD pipelines.
Stay updated on Alibaba Cloud's evolving services and advocate for innovative solutions.
Refine deployment strategies, disaster recovery plans, and scalability benchmarks.

Position Requirement

Minimum qualification:
Bachelor's degree in Computer Science, a related field, or equivalent practical experience.
3+ years in SRE, DevOps, or cloud infrastructure roles, with hands-on expertise in Alibaba Cloud or other cloud providers.
Strong scripting skills (Python, Bash, etc.) and experience with automation frameworks.
Infrastructure as Code (IaC), CI/CD pipelines, and configuration management tools (e.g., Terraform, Ansible).
Monitoring tools (e.g., Prometheus, Grafana) and logging systems (e.g., ELK Stack).
Networking and security fundamentals in cloud environments.
Excellent problem-solving, communication, and collaboration abilities.
Experience working in agile, fast-paced environments.

Preferred qualification:
Alibaba Cloud or other cloud Certified Expert (Preferred).
Experience with large-scale distributed systems and microservices architectures.
Contributions to open-source tools or communities related to cloud automation.
Mandarin in oral and written is a plus.

The pay range for this position at commencement of employment is expected to be between $104,400 and $171,000/year. However, base pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience.

If hired, employee will be in an "at-will position" and the Company reserves the right to modify base salary (as well as any other discretionary payment or compensation program) at any time, including for reasons related to individual performance, Company or individual department/team performance, and market factors.

Alibaba U.S. based full time regular employees have access to medical, dental, and vision insurance, a 401(k) plan and basic life insurance, and wellbeing benefits like FSA, subject to the terms and conditions of the applicable plans then in effect. U.S. based employees are also eligible to receive up to 12 paid holidays, accrue up to 15 paid vacation days for this position, and receive up to 72 hours paid sick time (front-loaded) per calendar year.

Similar Jobs