Site Reliability Engineer

Posted Today by itecopeople

£85 Per hour
Outside
Hybrid
Cheltenham, England, United Kingdom

Summary: The role of eDV DevOps Engineer / Site Reliability Engineer (SRE) focuses on enhancing platform reliability and automation for secure technology platforms supporting UK government organizations. The position requires strong expertise in AWS, Kubernetes, Terraform, and CI/CD practices, with a commitment to improving system performance and observability. The engineer will collaborate with various teams to ensure the resilience of critical systems while adhering to security protocols. Active eDV clearance is mandatory for this contract role based in Cheltenham with a hybrid working arrangement.

Key Responsibilities:

  • Collaborate with software engineering teams to improve subsystem reliability and performance.
  • Work with system administrators to automate operational processes and reduce manual effort.
  • Enhance monitoring and observability capabilities to proactively detect and resolve issues.
  • Support development environments to improve delivery speed and quality.
  • Contribute to the evolution of infrastructure, DevOps practices, and CI/CD pipelines.
  • Research and evaluate new technologies and tools to support engineering decisions.
  • Develop expertise across multiple technical and business domains.

Key Skills:

  • Active eDV clearance is essential.
  • Configuration management tools such as Ansible, Chef, or similar.
  • Strong Terraform experience.
  • Docker containers and container orchestration platforms (Kubernetes, OpenShift, Docker Swarm).
  • Maintaining and using CI/CD tooling such as Jenkins.
  • Monitoring and observability experience with Prometheus, Grafana, or InfluxDB.
  • Event-driven integration and messaging systems such as RabbitMQ or other AMQP solutions.
  • Strong Linux command line, administration, and shell scripting experience.
  • Solid understanding of relational databases and SQL.
  • Network security protocols.
  • Working with cloud platforms, ideally AWS (EC2, RDS, S3, Lambda); Azure a plus.

Salary (Rate): £85.00/hr

City: Cheltenham

Country: United Kingdom

Working Arrangements: hybrid

IR35 Status: outside IR35

Seniority Level: undetermined

Industry: IT

Detailed Description From Employer:

eDV DevOps Engineer / Site Reliability Engineer (SRE) – AWS, Kubernetes – Contract Outside IR35

We are supporting a specialist engineering consultancy delivering secure technology platforms to high-profile UK government organisations. They are seeking an eDV-cleared DevOps Engineer / Site Reliability Engineer (SRE) with strong experience across AWS, Kubernetes, Terraform, CI/CD and Linux environments to support the continued growth of critical cross-domain systems. This contract role will focus on improving platform reliability, automation, infrastructure as code, observability and DevOps practices across both cloud and on-premise environments. You will work closely with software engineers, platform engineers and operations teams to ensure highly secure, scalable and resilient systems supporting sensitive government programmes.

Location: Cheltenham (Hybrid – 3 days onsite)

Rate: £500–£650 per day Outside IR35

Security Clearance: Active eDV Clearance required

Start Date: ASAP

As a DevOps / Site Reliability Engineer, you will be responsible for ensuring the availability, performance, and reliability of services supporting sensitive government programmes. You will collaborate with multiple feature development teams and BAU/support teams to evolve both cloud and on-premise infrastructure, delivery pipelines, and observability tooling. The role will focus on improving system reliability, monitoring, automation, and performance, while proactively identifying and mitigating operational risks. This position may also involve participation in an on-call rota, which could include occasional 24/7 call-out support.

Required Skills & Experience:

  • Active eDV clearance is essential.
  • Configuration management tools such as Ansible, Chef, or similar.
  • Strong Terraform experience.
  • Docker containers and container orchestration platforms (Kubernetes, OpenShift, Docker Swarm).
  • Maintaining and using CI/CD tooling such as Jenkins.
  • Monitoring and observability experience with Prometheus, Grafana, or InfluxDB.
  • Event-driven integration and messaging systems such as RabbitMQ or other AMQP solutions.
  • Strong Linux command line, administration, and shell scripting experience.
  • Solid understanding of relational databases and SQL.
  • Network security protocols.
  • Working with cloud platforms, ideally AWS (EC2, RDS, S3, Lambda); Azure a plus.

To apply, please send your CV to Laura at lramm@itecopeople.co.uk.

Services advertised are those of an Employment Business.