Alibaba Cloud - Server Hardware Maintenance and Reliability Engineer - Sunnyvale, CA

Company: Alibaba Cloud

Location: Sunnyvale, CA 94087

Description:

Job Description

Alibaba Cloud, established in 2009, is the cloud computing service brand of Alibaba Group. As a leading provider of cloud computing services, Alibaba Cloud is dedicated to offering reliable cloud computing products and technical support to enterprises and individual users.

We are part of the Alibaba Cloud Server R&D team, focusing on stability and operations. Our primary responsibilities include delivering and managing the operating systems for massive-scale servers, monitoring their performance, handling repairs, identifying and mitigating risks, ensuring the efficient allocation of resources, and maintaining the reliability of the servers.

1. Be responsible for full-lifecycle maintenance of massive-scale servers, from delivery to decommissioning, including architecture design and platform construction for maintenance systems such as NPI (New Product Introduction), OS installation, maintenance/repair, hardware monitoring, hardware reconfiguration, and server management, to ensure that delivery quality and delivery efficiency meet SLA.

2. Participate in the development of server reliability standards, platform construction, and risk emergency handling, ensuring stability and enhancing business safety and sustainability.

3. Stay updated with the latest trends and cutting-edge technologies in the server maintenance field and incorporate relevant technologies into Alibaba Cloud's business scenarios to drive optimization and innovation.

Position Requirement

1. Educational background in Communications, Computer Science, or Electronics Engineering, with a strong foundation in computer hardware fundamentals.

2. 3+ years of SRE experience at a scale of 10k+ servers, with extensive experience in maintenance architecture design and DevOps, and proficiency in Linux/UNIX and in languages such as shell, Python, and SQL.

3. Strong communication skills and teamwork spirit, with excellent customer service orientation. Proficient in reading and writing both English and Chinese.

Experience in hardware delivery maintenance, reliability improvements, data-driven maintenance, and large-scale integration of Aliyun products is a plus.

The pay range for this position at commencement of employment is expected to be between $133,200 and $219,600/year. However, base pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience.

If hired, employee will be in an "at-will position" and the Company reserves the right to modify base salary (as well as any other discretionary payment or compensation program) at any time, including for reasons related to individual performance, Company or individual department/team performance, and market factors.
