SRE (Reliability Engineering)

Apply Now

Company: SysMind Tech

Location: Hartford, CT 06106

Description:

Job Title

RE (Reliability Engineering) Consultant

Relevant Experience

(in Yrs)

10+ Yrs

Technical/Functional Skills

Site Reliability Engineering

Automation

AWS Application Architect

Experience Required

The Reliability Engineering (RE) and Automation team is seeking a highly energetic Staff Reliability Engineer to join the Automation Engineering Team. The ideal candidate should have a strong background in SRE and IT operations, as well as proficiency in various programming languages.

Position requires a strong technical understanding of complex IT environments, cloud, and evolving technologies.

Roles & Responsibilities
  • Reliability Engineering
    • Manage communications and share industry best practices to support the RE Community of Practice
    • Accountable for the identification, development, catalog, and maintenance of reusable assets
    • Deliver cost effective innovative strategies to support emerging business opportunities.
    • Execute the strategic roadmap to support Reliability Engineering
    • Appy strong problem-solving skills strategic mindset with a focus on scalable continuous delivery approach
  • AWS Cloud expertise in microservice architecture
    • Champion the migration of applications to open-source platforms, PaaS, containers, serverless, event-based designs, and other cloud technology standards for cloud-enablement and platform agility.
  • Automation strategy
    • Execute the delivery of automation use cases to minimize manual activities for cloud migrations.
    • Collaborate with team members regarding process improvement opportunities and end to end automation enhancements.
    • Deliver increased automation and self-healing capabilities.
    • Provide technical expertise to automate toil reduction.
    • Coach and mentor Automation Engineers and other resources as appropriate
  • ITSM Expertise
    • Drive the implementation of processes: Incident Management and response skills, blameless postmortems, Change Management and Problem Management
  • Software engineering
    • Deep software and systems engineering expertise.
    • Ability to design systems and implement new software architecture patterns.
  • Hands on experience with Observability tools such as Dynatrace, SPLUNK, CloudWatch, CloudTrail is a plus
  • Solid understanding of technologies that support the services offered for cloud applications
  • Up to date knowledge of industry trends, emerging technologies in DevOps, Cloud Engineering and AI/ML
  • Familiarity with enterprise software solutions such as GitHub, Jenkins, Nexus, Ansible etc.
  • Solid understanding of AWS, DevSecOps practices, SAFe Agile methodologies
  • Familiarity with programming languages (Python, Lambda, Go, Java or JavaScript/Node.js)
  • Knowledgeable of Amazon Web Services including but not limited to EC2, S3, ECS, RDS, CloudWatch, SNS, CloudTrail, SQS, Service Catalog.
  • Expertise with cloud platforms like AWS and microservices architecture
  • Familiarity with programming languages (Python, Lambda, Go )
  • Experience in Infrastructure as Code (IaC) using CloudFormation & Terraform templates, YAML files, build specifications


Generic Managerial Skills
  • Ability to interact with diverse technical and non-technical groups in a matrix organization
  • Must have exceptional communication skills (written, oral, presentation and facilitation)
  • Understanding of robotics and artificial intelligence to improve services
  • Experience in strategy development to achieve business objectives
  • Understanding of networks and experience troubleshooting issues
  • Ability to develop, manage and communicate frameworks: e.g., Cloud Security Alliance
  • Excellent analytical and problem-solving skills

Similar Jobs