Summary: The SRE Coach (Principal SRE – Enablement & Training) role focuses on designing and delivering comprehensive SRE training programs and bootcamps, coaching engineers on SRE principles, and fostering SRE adoption within the organization. The position requires collaboration with various teams to align training with business needs and the development of learning materials tailored for multi-cloud environments. The role is based in Manchester or London with a hybrid working arrangement.
Key Responsibilities:
- Design and deliver hands-on SRE training programs and technical bootcamps across beginner to advanced levels.
- Coach and mentor engineers and teams on SRE principles, including SLOs, SLIs, error budgets, incident management, and reliability practices.
- Lead workshops and embedded coaching sessions to drive SRE adoption, automation, and observability across the organization.
- Collaborate with engineering, operations, and product teams to align training with business and platform needs.
- Develop and maintain learning content, assessments, and certification pathways, continuously improving materials based on feedback and industry trends.
- Support multi-cloud environments and tailor content for AWS, Azure, GCP, and private cloud platforms.
Key Skills:
- SRE
- Reliability Engineering
- AWS
- Azure
- GCP
- Terraform
- Ansible
- CloudFormation
- CI/CD
- Monitoring
- Observability
- Incident Management
- SLO
- SLI
- Error Budgets
- DevOps
- Automation
Salary (Rate): undetermined
City: London
Country: United Kingdom
Working Arrangements: hybrid
IR35 Status: undetermined
Seniority Level: undetermined
Industry: IT
I am hiring for an SRE Coach (Principal SRE – Enablement & Training).
Location: Manchester (preferred) or London (optional), hybrid (50–60% remote)