Cloud Site Reliability Engineer

Apply Now

Company: Delinea

Location: Santa Clara, CA 95051

Description:

Our software runs on public clouds with 99.9% or better uptime and is mission critical for our customers. Our cloud operations team is where the rubber meets the road and needs innovative Site Reliability Engineers. Join a professional team of smart and hard-working professionals building enterprise-class cloud-based services in the rapidly growing market of Identity and Access Management. As a Site Reliability Engineer, you'll be part of the Cloud DevOps team and report to the Cloud Operations manager. Together with the team you'll be responsible for maintaining, monitoring and alerting on our application uptime, and manage deployment and escalations.

Responsibilities:
Manage our cloud application using common DevOps and Agile practices to successfully keep uptime and delivery
About 50% of your time should be spent automate the site systems to self-manage and self-heal
As site reliability engineer you should spend about 50% of your time dealing with software deploy, incidents response, on-call duty and manual intervention

Skills & Experience:
Experience in public clouds (Preferably Azure and AWS)
Deep understanding and knowledge of modern monitoring and alerting tools such as ELK stack, Nagios, Prometheus, Qualys, Dome9, etc.
Working knowledge of scripting language such as Python, PowerShell, Bash, etc.
Experience with configuration management software like puppet, salt, chef, etc.
Experience with some of the DevOps standard tools such as docker, Cloudera, Hadoop, terraform, Jenkins, git, consul, Vault, etc.
Experience managing Windows and Linux servers
Strong problem-solving skills
Excellent communication and documentation skills
Skills & Requirements Qualifications

Similar Jobs