Site Reliability Engineer

Apply Now

Company: SysMind Tech

Location: Minneapolis, MN 55407

Description:

Relevant Experience

(in Yrs)
  • 5+ years of experience as SRE and knowledge of Platform ( AWS/Kubernetes )
  • 3+ years of Telemetry experience, Obsessive elimination of Single points of failure, Application config standards.

Technical/Functional Skills
  • Deep knowledge of platform (AWS/ Kubernetes etc) as platform engineer
  • Bridge between Platform and app engineering/ partners with application SRE
  • Telemetry, Obsessive elimination of single points of failure, application config standards
  • Works with enterprise platform, network, storage, etc. and external vendor teams to ensure Upgrade planning, platform migrations, app config standards, and high HA
  • Skill set is high in monitoring tools such as Grafana, Data Dog, EAPM, Splunk

Experience Required

6-8 Years as Site Reliability Engineer

Roles & Responsibilities
  • Primary roles have been to build telemetry for business and to support engineering in the same.
  • One to one dotted line relationship with Eng Leader, And Service Director over a suite of applications/ services.
  • High attention to reliability engineering, single points of failure, infrastructure capacity and tuning, and related telemetry/ trend analytics.
  • Defines and measures SLA/SLO/SLI.

Similar Jobs