£500 per day
Outside IR35
Remote
London, UK
Summary: The role of DevOps/Site Reliability Engineer involves designing, building, and maintaining scalable cloud infrastructure while ensuring system reliability and performance. The position requires a strong background in automation and observability, with a focus on driving DevOps best practices within a mission-driven organization. The engineer will collaborate closely with AI and platform engineering teams to enhance system security and reliability. This is a unique opportunity to impact mental health at scale through technology.
Key Responsibilities:
- Design, build and maintain scalable infrastructure using Infrastructure as Code (IaC)
- Implement observability and monitoring systems using tools like Grafana
- Support and improve incident response and escalation processes
- Drive automation across the development and deployment pipelines
- Work closely with AI and platform engineering teams to ensure system reliability and security
- Develop and scale systems to support both B2C and B2B product lines
- Champion DevOps best practices and SRE principles
Key Skills:
- Extensive experience working in a DevOps or SRE role, preferably within product-led or security-focused tech companies
- Expert-level knowledge of AWS and managing cloud services at scale
- Strong experience with IaC tools such as Terraform or CloudFormation
- Proven track record setting up observability, monitoring and alerting systems - ideally with Grafana or similar
- Incident response and systems reliability experience in high-scale environments
- Proficiency in building automation tools and optimising CI/CD pipelines
- Security-focused mindset with knowledge of platform hardening and secure provisioning
- Excellent communication and collaboration skills across engineering functions
Salary (Rate): £500 daily
City: London
Country: UK
Working Arrangements: Remote
IR35 Status: Outside IR35
Seniority Level: Mid-Level
Industry: IT
Location: UK Remote
Rate: £500 per day, Outside IR35
Contract Length: 12 Months
Start Date: Immediate
We are looking for a skilled DevOps/Site Reliability Engineer with a strong background in cloud infrastructure and a passion for automation, observability, and high-performance systems. You'll be instrumental in provisioning and securing scalable systems, ensuring uptime and performance, and driving a culture of reliability engineering across the business.
Key responsibilities:
* Design, build and maintain scalable infrastructure using Infrastructure as Code (IaC)
* Implement observability and monitoring systems using tools like Grafana
* Support and improve incident response and escalation processes
* Drive automation across the development and deployment pipelines
* Work closely with AI and platform engineering teams to ensure system reliability and security
* Develop and scale systems to support both B2C and B2B product lines
* Champion DevOps best practices and SRE principles
Required skills and experience:
* Extensive experience working in a DevOps or SRE role, preferably within product-led or security-focused tech companies
* Expert-level knowledge of AWS and managing cloud services at scale
* Strong experience with IaC tools such as Terraform or CloudFormation
* Proven track record setting up observability, monitoring and alerting systems - ideally with Grafana or similar
* Incident response and systems reliability experience in high-scale environments
* Proficiency in building automation tools and optimising CI/CD pipelines
* Security-focused mindset with knowledge of platform hardening and secure provisioning
* Excellent communication and collaboration skills across engineering functions
This is a unique opportunity to shape DevOps practices within a mission-driven, AI-led organisation impacting mental health at scale.
Apply now to be considered.
