Site Reliability Engineer
Apply NowCompany: Frontend Arts
Location: Scottsdale, AZ 85254
Description:
- On-call production support and managing 24/7 support environment
- Applicant should have full understanding of Various API's and Middlewares like Apigee,Vordel, Data power and Nodejs
- Applicant should expertise in configuring, supporting and manage Rancher, Kubernetes and Docker Containerization
- 3+ Years of Experience in Incident Management, Change Management and Problem Management.
- 1-2 years of Experience in Infrastructure Support, Configuration and Release Management.
- 2-3 years of hands on experience with Tools including Splunk, Grafana, Loki, APPDynamics or other APM solutions
- 2+ years of Experience with Application support built on-prem and native cloud environments
- Able to code - Java, SQL, PromSQL, Shell and Python.
Key Skills:
# SRE , # incident management , # change management , # problem management , # infra support , # Application Support , # Coding
- Minimum Experience : 10 Yrs
Roles & Responsibilities:
Root cause analysis, management communication and client relationship management in partnership with Infrastructure Support team members.
Ensuring all production changes are made in accordance with life-cycle methodology and risk guidelines.
Ability to work on-call production support and Managing in a 24/7 support environment
Understanding and working knowledge of infrastructure environments.
Excellent problem management skills and relentless drive for root cause and execute measures to reduce repeat occurrence.
Good communication (Verbal/written) and Interpersonal Skills
Education:
- A Bachelor's degree in engineering or computer science, or a Bachelor's degree with significant work experience in technology Industry