SRE - Site Reliability Engineer
Apply NowCompany: CyberThink Inc.
Location: Southlake, TX 76092
Description:
Job Description:
As a Site Reliability Engineer, you will be responsible for supporting highly available backend applications and APIs in a production environment. You will participate in a 24x7 on-call rotation, triage issues reported by customers, and work closely with cross-functional teams to ensure the reliability, scalability, and performance of critical systems. Your role will include contributing to production readiness, monitoring, and troubleshooting, as well as participating in application design to embed reliability into every phase of development. This is a hands-on technical role suited for individuals with strong operational expertise and a proactive mindset.
Key Responsibilities:
The hourly range for roles of this nature are $40.00 to $80.00/hr. Rates are heavily dependent on skills, experience, location, and industry.
cyberThink is an Equal Opportunity Employer.
As a Site Reliability Engineer, you will be responsible for supporting highly available backend applications and APIs in a production environment. You will participate in a 24x7 on-call rotation, triage issues reported by customers, and work closely with cross-functional teams to ensure the reliability, scalability, and performance of critical systems. Your role will include contributing to production readiness, monitoring, and troubleshooting, as well as participating in application design to embed reliability into every phase of development. This is a hands-on technical role suited for individuals with strong operational expertise and a proactive mindset.
Key Responsibilities:
- Provide production support for backend applications/APIs and participate in on-call rotation.
- Troubleshoot, debug, and resolve issues across application, network, and infrastructure layers.
- Monitor application health using tools like Splunk and AppDynamics, and implement alerting strategies.
- Collaborate with developers and infrastructure teams during production readiness and deployments.
- Participate in application design to ensure availability, observability, and scalability requirements.
- Coordinate and execute system maintenance tasks such as OS upgrades and disaster recovery exercises.
- Work with multiple teams, including Change Management, Network, and IT Operations, for issue resolution.
- Mentor junior team members and support continuous process improvements across production services.
- Utilize messaging systems like Kafka and contribute to CI/CD process enhancements.
- Analyze incidents and provide root cause analysis and preventative recommendations.
- Bachelor's degree in Computer Science or related discipline.
- 5-7+ years of experience supporting production backend applications/APIs in a live environment.
- Strong troubleshooting and debugging skills across application, database, and network layers.
- Experience in on-call rotation and managing high-priority incidents under pressure.
- Solid understanding of application architecture and performance tuning.
- Hands-on experience with Splunk, AppDynamics, and building application alerts.
- Working knowledge of F5 (GTM/LTM), networking, routing, and load balancing.
- Familiarity with NoSQL databases and messaging technologies such as Kafka.
- Exposure to CI/CD processes and tools.
- Experience with technologies such as .NET or Java in a support/development context.
- Strong communication skills and ability to work in a fast-paced, distributed team environment.
The hourly range for roles of this nature are $40.00 to $80.00/hr. Rates are heavily dependent on skills, experience, location, and industry.
cyberThink is an Equal Opportunity Employer.