Site Reliability Engineer

Company: Air-tek

Location: Toronto, ON M4E 3Y1

Description:

About us

Air-tek is a Canada-based software company with a powerful suite of unique products that have already achieved a significant share of a huge global market. The product-market fit is excellent, and customers are lining up to buy. Although our global customers know us, we intentionally operate in stealth mode during this growth phase.

Our diverse team shares a passion for solving complex problems, a drive to innovate, and a desire to create a passenger-centric travel industry.

Based in Toronto, our inclusive culture is built on trust, collaboration, delivering a great product, and continuous personal development. We love what we do, and we support the team around us.

About the team

The SRE Team is dedicated to ensuring the reliability, scalability, and performance of Air-tek's critical systems and services. We bridge the gap between development and operations by applying software engineering principles to operational challenges, fostering a culture of reliability, automation, and continuous improvement.

As a member of the team, you will work with a multitude of modern technologies, contribute to the vision of the SRE team, partner with other engineering teams to tackle new ideas and challenges, and have a direct impact on Air-tek's ability to sustain our rapid growth.

In this role, you will:
    • Ensure the uptime and reliability of Air-tek's platform in accordance with company SLOs.
    • Ensure the successful deployment of new code and services into our hosted platform.
    • Analyze and tune systems to operate at maximum efficiency.
    • Reduce the toil of manual work through automation and new tooling.
    • Collaborate with other engineering teams to integrate reliability into the software development lifecycle.
    • Be a member of the team's on-call rotation, responding to and resolving critical issues.


Skills and Experience
    • Bachelor's degree in computer science, software engineering, or equivalent.
    • 5+ years of relevant experience.
    • Experience with production monitoring and logging tools such as Amazon CloudWatch and Datadog.
    • Experience with some or all of the following tools we leverage:
        • System administration: Docker, Linux
        • Cloud: Amazon Web Services
        • Databases: MongoDB Atlas, PostgreSQL, Amazon Aurora
        • CI/CD: GitHub Actions, Argo CD
        • Environment management: Pulumi, Terraform, Kubernetes
        • Data streaming platforms: Kafka, RabbitMQ
    • Experience with programming and scripting languages such as C# .NET, Node.js, PowerShell, and Bash.
    • Strong analytical and problem-solving skills, with the confidence to tackle difficult problems.
    • Good written and oral communication skills with the ability to explain technical concepts and designs clearly and succinctly.


Why join us?
    • Be part of a collaborative, inclusive team that values innovation and creativity.
    • Work with exciting, modern technologies and gain hands-on experience across a diverse range of projects.
    • Contribute to solutions that make a tangible impact.
    • Enjoy opportunities for professional growth and development in a supportive environment.
