Site Reliability Engineer

Company: Omni Inclusive

Location: Providence, RI 02902

Description:

SRE:
Design and implement resilient IT infrastructure solutions, focusing on high availability and performance.
Establish and monitor Service Level Objectives (SLOs) and Service Level Agreements (SLAs).

Incident and Problem Management:
Proactively monitor and troubleshoot infrastructure issues to minimize Mean Time to Recovery (MTTR).
Conduct root cause analysis (RCA) for P1/P2 incidents and implement preventive measures.

Automation and Toil Reduction:
Develop and implement Infrastructure-as-Code (IaC) solutions using tools like Terraform, Ansible, or similar.
Automate repetitive tasks to improve operational efficiency and reduce human intervention.

Observability and Monitoring:
Set up and manage observability tools such as Grafana, Prometheus, ELK, or Azure Monitor.
Ensure comprehensive logging, metrics, and alerting for all critical systems.

DevOps Integration:
Collaborate with DevOps teams to integrate CI/CD pipelines into infrastructure workflows.
Support containerized environments (e.g., Docker, Kubernetes) and orchestration platforms.

Required Skills and Qualifications:
Bachelor's degree in Computer Science, IT, or a related field.
Proven experience in managing and scaling IT infrastructure in on-premise, cloud, or hybrid environments.
Proficiency in one or more cloud platforms (e.g., AWS, Azure, GCP).
Strong scripting and programming skills (e.g., Python, Bash, PowerShell).
Hands-on experience with automation tools such as Terraform, Ansible, or Chef.
Familiarity with observability tools (e.g., Grafana, ELK).
Solid understanding of networking, virtualization, and storage concepts.

(1.) Depending on the work environment, the subject matter expert may lead or actively participate in a work group that needs specialized knowledge.
(2.) Meet all agreed-upon turnaround times for deliverables, deliverable reviews, and deliverable sign-off.
(3.) Understand, articulate, and implement best practices related to their area of expertise.
(4.) Provide guidance on how their area of capability can resolve an organizational need, and actively participate in all phases of the solution life cycle. Design solutions and best practices to meet clients' objectives.
(5.) Work with clients to identify business challenges and contribute to client deliverables by refining, analyzing, and structuring relevant data.
