Alibaba Cloud-OS Reliability & Operations Specialist-Seattle
Apply NowCompany: Alibaba Cloud
Location: Seattle, WA 98115
Description:
Job Description
We are Operating System maintenance Team, providing technical solutions for internal (HostOS) and external (GuestOS) customers. We specialize in OS maintenance, fault resolution, and efficiency optimization to ensure system stability, business continuity, and an exceptional user experience.
We are seeking an experienced operation and support engineer to maintain our operating system and resolve OS related issues.
1. Responsible for ensuring the daily stability of the operating system, participating in emergency fault response and stability activities, quickly resolving OS-related issues, and providing professional consulting services.
2. Proactively conduct system maintenance tasks, identify potential risks in advance, and eliminate hidden dangers to ensure stable operation.
3. Regularly review problems, analyze the current stability status and improvement areas of customers, propose optimization suggestions, and continuously enhance user experience.
4. Independently or jointly participate in customer visits to deeply understand their needs, extract stability value, promote the integration of the operation platform with customers, and assist them in improving basic software operation capabilities and problem closure efficiency.
5. Collaborate with development teams to deepen cooperation in vertical fields and comprehensively enhance overall stability and operation experience.
6. Responsible for implementing and maintaining the operation platform, improve its functions based on reviews and customer feedback, enhance automation and intelligence levels, and increase user usability and satisfaction.
Position Requirement
Education
Bachelor's degree in Computer Science, Information Technology, or a related field.
Experience
1. 3+ years of experience in operating system maintainance or operations;
2. Experience in large-scale system operations is preferred;
3. Familiarity with distributed systems, big data processing, or AIOPS technologies is preferred.
Technical Skills
1. Familiar with Linux kernel and its working mechanisms;
2. Experience in performance tuning and troubleshooting;
3. Proficient in C, and at least one scripting language such as Shell or Python;
4. Knowledge of front-end technologies, including JavaScript and frameworks like React;
5. Knowledge of distributed software principles, understanding of the architecture of common open-source big data software (such as Hadoop, Flink, Spark, Kafka), and familiar with cloud-native technology stacks.
Soft Skills
1. Clear logical thinking, excellent communication skills, and the ability to collaborate effectively with users and teams;
2. Strong sense of product ownership, able to proactively drive project progress and solve problems.
The pay range for this position at commencement of employment is expected to be between $133,200/year and $219,600/year. However, base pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience.
If hired, employee will be in an "at-will position" and the Company reserves the right to modify base salary (as well as any other discretionary payment or compensation program) at any time, including for reasons related to individual performance, Company or individual department/team performance, and market factors.
Alibaba U.S. based full time regular employees have access to medical, dental, and vision insurance, a 401(k) plan and basic life insurance, and wellbeing benefits like FSA, subject to the terms and conditions of the applicable plans then in effect. U.S. based employees are also eligible to receive up to 12 paid holidays, accrue up to 15 paid vacation days for this position, and receive up to 72 hours paid sick time (front-loaded) per calendar year.
We are Operating System maintenance Team, providing technical solutions for internal (HostOS) and external (GuestOS) customers. We specialize in OS maintenance, fault resolution, and efficiency optimization to ensure system stability, business continuity, and an exceptional user experience.
We are seeking an experienced operation and support engineer to maintain our operating system and resolve OS related issues.
1. Responsible for ensuring the daily stability of the operating system, participating in emergency fault response and stability activities, quickly resolving OS-related issues, and providing professional consulting services.
2. Proactively conduct system maintenance tasks, identify potential risks in advance, and eliminate hidden dangers to ensure stable operation.
3. Regularly review problems, analyze the current stability status and improvement areas of customers, propose optimization suggestions, and continuously enhance user experience.
4. Independently or jointly participate in customer visits to deeply understand their needs, extract stability value, promote the integration of the operation platform with customers, and assist them in improving basic software operation capabilities and problem closure efficiency.
5. Collaborate with development teams to deepen cooperation in vertical fields and comprehensively enhance overall stability and operation experience.
6. Responsible for implementing and maintaining the operation platform, improve its functions based on reviews and customer feedback, enhance automation and intelligence levels, and increase user usability and satisfaction.
Position Requirement
Education
Bachelor's degree in Computer Science, Information Technology, or a related field.
Experience
1. 3+ years of experience in operating system maintainance or operations;
2. Experience in large-scale system operations is preferred;
3. Familiarity with distributed systems, big data processing, or AIOPS technologies is preferred.
Technical Skills
1. Familiar with Linux kernel and its working mechanisms;
2. Experience in performance tuning and troubleshooting;
3. Proficient in C, and at least one scripting language such as Shell or Python;
4. Knowledge of front-end technologies, including JavaScript and frameworks like React;
5. Knowledge of distributed software principles, understanding of the architecture of common open-source big data software (such as Hadoop, Flink, Spark, Kafka), and familiar with cloud-native technology stacks.
Soft Skills
1. Clear logical thinking, excellent communication skills, and the ability to collaborate effectively with users and teams;
2. Strong sense of product ownership, able to proactively drive project progress and solve problems.
The pay range for this position at commencement of employment is expected to be between $133,200/year and $219,600/year. However, base pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience.
If hired, employee will be in an "at-will position" and the Company reserves the right to modify base salary (as well as any other discretionary payment or compensation program) at any time, including for reasons related to individual performance, Company or individual department/team performance, and market factors.
Alibaba U.S. based full time regular employees have access to medical, dental, and vision insurance, a 401(k) plan and basic life insurance, and wellbeing benefits like FSA, subject to the terms and conditions of the applicable plans then in effect. U.S. based employees are also eligible to receive up to 12 paid holidays, accrue up to 15 paid vacation days for this position, and receive up to 72 hours paid sick time (front-loaded) per calendar year.