Site Reliability Engineer
Apply NowCompany: Futran Tech Solutions Pvt. Ltd.
Location: Minneapolis, MN 55407
Description:
Site Reliability Engineer
Minneapolis, MN
Qualifications
Primary Responsibilities:
Required Qualifications:
Preferred Qualifications:
Minneapolis, MN
Qualifications
Primary Responsibilities:
- Lead the development and implementation of infrastructure and automation solutions to improve the reliability and scalability of our FinTech payments platform
- Collaborate with development teams to identify and resolve system issues, and participate in the design of new features and services
- Establish and maintain best practices for monitoring, logging, and alerting using tools like Datadog, Prometheus, and Grafana
- Configure and maintain services such as load balancers, relational & NoSQL databases, and messaging systems while ensuring high availability and performance
- Participate in on-call rotations and respond to incidents in a timely manner, ensuring quick resolution and effective communication with stakeholders
- Conduct regular system audits and capacity planning exercises to identify areas for improvement and ensure readiness for future growth
- Mentor and provide guidance to junior members of the team
Required Qualifications:
- 5+ years of experience as a Site Reliability Engineer, DevOps Engineer, Software Engineer or in IT Operations
- 3+ years of experience with automation and scripting tools such as Python, Bash, PowerShell, and Perl
- 2+ years of experience in at least one object-oriented programming language such as C# or JAVA
- Experience with configuration and maintenance of services such as load balancers, relational & NoSQL databases, and messaging systems
- Experience in monitoring and alerting tools such as Datadog, Prometheus, and Grafana
- Experience with incident response and post-mortem analysis
Preferred Qualifications:
- Bachelor's or Master's degree in Computer Science, Engineering, or technical field
- Excellent communication and interpersonal skills, with the ability to work collaboratively with development teams, stakeholders, and management
- Experience in problem-solving skills on complex technical issues and a proactive attitude towards identifying and addressing potential issues
- Experience with public cloud platforms, hybrid cloud environments, and migration strategies
- Experience with container technologies like Docker and Kubernetes
- Experience with HTTP API design, micro services, and event driven architecture
- Experience with configuration and deployment management tools such as Ansible, Terraform
- Demonstrated excellent communication and interpersonal skills, with the ability to work collaboratively with development teams, stakeholders, and management