Site Reliability Engineer

Apply Now

Company: Futran Tech Solutions Pvt. Ltd.

Location: Phoenix, AZ 85032

Description:

Role: Site Reliability Engineer

Location: Phoenix, AZ

Signal Fx/Splunk Observability Admin.

Responsibilities:
  • Introduce enterprise capabilities, tools, and innovation improving availability in a multi-cloud ecosystem by evolving observability, monitoring, logging, CI/CD integration(performance, smoke, regression, functional, chaos and environment propagation through automatic deployments)
  • Introduce continuous improvement, standardization/automation, capabilities to conduct destructive and resiliency testing
  • Consistent track record of troubleshooting and resolving issues in live production environments and implementing strategies to eliminate them
  • Driven approach to continually improving service levels
  • Build and manage systems, infrastructure, and applications through automation
  • Deploy, support, and monitor new and existing services, platforms, and application stacks
  • Engage in improving the whole lifecycle of services from inception through deployment, operations, and refinement
  • Provide hands-on technical expertise during service impacting events
  • Collaborate with other engineers on code reviews, internal infrastructure improvements and process enhancements
  • Use scalability testing to measure, tune and optimize system performance
  • Automate key SRE metrics and IT Service Operations processes including customer impact, % availability of critical business flows, SLO/SLI adherence, error budget, automate incident process for IT Service Operations through data integrating with unified communications, alerting/notification systems
  • Participate in periodic 24x7 on-call duties
  • Share support responsibilities for critical applications and customer journeys onboarded to SRE including remediation of issues through Agile, conduct blameless postmortems, root cause analysis and introduce continuous improvement solving problems once and for all with the goal of no repeats.

Required Qualifications
  • Experience with Observability/Monitoring technologies like Splunk, Signalfx, Splunk-OnCall, Rigor and Azure Monitoring
  • Experience with one or more Cloud Platforms (Azure, GCP, AWS)
  • Experience with Container technologies: Kubernetes, Docker, AKS
  • Experience setting up monitoring in infrastructure, applications and database
  • 3+ years of systems support analysis experience demonstrated through work or military experience
  • 2+ years of experience with one or more Agile tools used for tracking user stories or backlogs, such as JIRA
  • Excellent verbal, written, and interpersonal communication skills

Similar Jobs