Director Cloud and Hosting Services

Apply Now

Company: Pearson Education

Location: Durham, NC 27713

Description:

Job Description

Director of Site Reliability Engineering (SRE)

Location: Durham, NC (Hybrid - 3 days onsite per week)

Job Type: Full-time

Reports To: VP Site Reliability

About Pearson

At Pearson, our mission is to provide world-class learning experiences that help people advance in their lives. As we continue to expand our digital learning platforms, we are committed to building a scalable, resilient, and high-performing technology ecosystem that supports our global audience.

About the Role

We are seeking a Director of Site Reliability Engineering (SRE) to lead the transformation of our reliability strategy, foster a high-performance engineering culture, and position Pearson's products for scale and growth. This role requires a blend of technical leadership, operational excellence, and people development to embed modern SRE practices across the organization.

As a senior leader, you will be responsible for shaping Pearson's SRE strategy, evolving our cloud infrastructure, and championing automation-first principles to improve the availability, performance, and security of our systems. You will lead the cultural shift towards reliability as a core engineering discipline while enabling teams to move faster with confidence.

This role requires onsite collaboration in our Durham, NC office three days per week, fostering strong cross-functional partnerships, mentorship, and hands-on leadership in a hybrid environment.

Key Responsibilities

Strategic & Cultural Leadership

Define and execute a modern SRE strategy that aligns with Pearson's business and product growth objectives.
Lead the cultural transformation towards automation, observability, and self-service reliability engineering.
Establish and refine service reliability metrics (SLIs, SLOs, and error budgets) to drive continuous improvement.
Partner with Engineering, Security, and Product teams to develop a scalable, high-performance platform.
Advocate for best-in-class reliability practices, ensuring teams adopt a proactive approach to system resiliency.

Operational Excellence & Reliability Engineering

Own the availability, performance, and scalability of Pearson's mission-critical cloud services.
Drive automation-first approaches to incident response, monitoring, and cloud infrastructure management.
Improve incident response processes, reducing mean time to detect (MTTD) and mean time to resolve (MTTR).
Implement modern observability and monitoring solutions, ensuring real-time visibility into system health.
Strengthen CI/CD practices, enabling seamless and safe deployments through progressive delivery strategies.

People Leadership & Team Development

Build, mentor, and scale a high-performing SRE team that drives innovation and operational excellence.
Cultivate an inclusive and collaborative engineering culture, fostering learning, career growth, and technical mastery.
Develop and implement talent strategies, ensuring a strong pipeline of SRE expertise.
Encourage cross-team knowledge sharing, ensuring best practices in reliability and automation are widely adopted.

Qualifications & Technical Expertise

10+ years of experience in site reliability engineering, cloud infrastructure, or software engineering, with 5+ years in leadership roles.
Proven expertise in AWS architecture, cloud operations, and best practices for high-scale, distributed systems.
Strong experience with Terraform (Infrastructure-as-Code) to manage cloud infrastructure at scale.
Hands-on experience with GitHub Actions for CI/CD and GitOps methodologies for managing deployments.
Deep understanding of SRE principles, observability, and incident management.
Strong ability to influence engineering and leadership teams, aligning technology with business needs.
Excellent communication and stakeholder management skills, with a track record of driving organization-wide adoption of reliability practices.

Preferred Qualifications

Experience with AIOps and automation-driven reliability solutions.
Knowledge of cloud cost optimization (FinOps) and strategies for efficient infrastructure management.
Background in regulated industries (e.g., finance, healthcare, education) with complex compliance and reliability needs.

Why Join Pearson?

Lead the transformation of our reliability engineering strategy, driving cultural and technical change.
Work with cutting-edge technologies, including AWS, Terraform, GitHub Actions, and GitOps.
Be part of a collaborative, high-impact engineering culture with a balance of onsite engagement and flexibility.
Competitive compensation, benefits, and opportunities for career growth at a global leader in education technology.

Director Cloud and Hosting Services

Similar Jobs