Senior Site Reliability Engineer
Company: Simple Solutions
Location: Jacksonville, FL 32210
Job Description
Position: Senior Site Reliability Engineer (SRE) - REMOTE
What is your mission?
You will be part of a small, close-knit team responsible for the day-to-day operation and monitoring of a key operational platform. You'll work closely with development and the other SRE team members to streamline deployments, improve the infrastructure, and work on projects. As we continue to mature, you'll develop and implement tooling to automate our existing processes. The job requires participating in a 24/7 on-call rotation so that we can meet our 99.99% uptime SLA.
What you'll love about your mission:
This role will be great for someone who loves to be hands-on and knee-deep in all things SRE, and who wants to expand their knowledge of Terraform, Packer, Octopus Deploy, Linux, Web Application Firewalls, DataDog, and more.
There are lots of opportunities to get involved in architecting a cloud-based solution for the long term. Plus, we're a global company, so you'll be working with people across the globe!
Requirements
The superpowers we are looking for:
Must be a US Citizen or a US Green Card holder living in the US
Must live in the Eastern or Central time zone
8-10+ years of experience in design, implementation, monitoring, and setup of cloud infrastructure across multiple regions and clouds, especially AWS.
High proficiency with modern CI/CD processes, including implementing monitoring and deployment at scale
Experience working across teams such as Development, IT, and Information Security
Familiarity with industry automation tools (like Terraform, Packer, Ansible, Octopus Deploy) to build and maintain cloud infrastructure
A passion for developing and maintaining custom tools and scripts as needed to reduce toil
Understanding of what it takes to maintain high service levels
Good understanding of networking protocols and components such as HTTP, DNS, TCP/IP, IP subnetting, load balancers, etc.
Understanding of common scripting languages (bash, PowerShell, Python, etc.). Experience working in at least one object-oriented language (C#, Node.js, Java, etc.)
Knowledge of Kubernetes and containers
Experience working in Linux environments
Experience with monitoring tools, including cloud monitoring tools
Database management and monitoring experience (backups, restores, migrations); DBA-level skills are not necessarily required
Strong organization and time-management; a can-do attitude, detail-oriented while seeing the big picture
Experience participating in compliance initiatives (SOC 2, GDPR, FedRAMP), including working with security questionnaires
Excellent technical, analytical, and problem-solving skills required
Must be able to work independently and efficiently in a fast-paced, team-oriented environment
Education & Preferred Qualifications
Minimum of 8-10 years in a Site Reliability Engineer role
Bachelor's degree in computer science or related field, or equivalent professional experience