Site Reliability Engineer

Site Reliability Engineer

Posted 1 week ago by 1753250713

£550 Per day
Outside
Undetermined
Edinburgh

Summary: The Site Reliability Engineer (SRE) role involves transitioning a public sector organization from on-premise systems to cloud-based services. The position requires collaboration within an agile team to enhance platform resilience, automation, and observability using a modern tech stack. Key responsibilities include ensuring service continuity and disaster recovery readiness. The role demands strong technical expertise in Unix/Linux, automation, containerization, and cloud services, particularly AWS.

Key Responsibilities:

  • Enhance platform resilience, automation, and observability.
  • Ensure service continuity and disaster recovery readiness.
  • Work with a modern tech stack including RHEL, Ansible, Oracle, AWS, and container platforms.
  • Collaborate within an agile team environment.

Key Skills:

  • Strong Unix/Linux expertise, particularly with RHEL 7/8/9 and Red Hat Satellite.
  • Automation skills, including Ansible, shell scripting (Bash/Perl), and infrastructure-as-code principles.
  • Containerisation experience with Docker and Kubernetes/OpenShift.
  • CI/CD knowledge, including pipeline configuration and Git-based workflows.
  • Monitoring and observability tools, such as Prometheus, Grafana, InfluxDB, and Nagios.
  • Cloud proficiency, especially with AWS services (EC2, S3, VPC, NLB) and automation tools like Terraform or CDK.
  • Desirable skills include experience with MongoDB, Python, CommVault, Oracle virtualisation (KVM/LVM), and AWS EKS.

Salary (Rate): £550 per day

City: Edinburgh

Country: United Kingdom

Working Arrangements: undetermined

IR35 Status: outside IR35

Seniority Level: undetermined

Industry: IT

Detailed Description From Employer:

Your new company and role

A leading public sector organisation is undergoing a major shift from on-premise systems to cloud-based services. As a Site Reliability Engineer (SRE), you’ll join a collaborative, agile team focused on enhancing platform resilience, automation, and observability.

You’ll work across a modern tech stack, including RHEL, Ansible, Oracle, AWS, and container platforms like OpenShift and Kubernetes, playing a key role in ensuring service continuity and disaster recovery readiness.

What you'll need to succeed

To thrive in this role, you’ll bring:

  • Strong Unix/Linux expertise, particularly with RHEL 7/8/9 and Red Hat Satellite.
  • Automation skills, including Ansible, shell scripting (Bash/Perl), and infrastructure-as-code principles.
  • Containerisation experience, with Docker and Kubernetes/OpenShift.
  • CI/CD knowledge, including pipeline configuration and Git-based workflows.
  • Monitoring and observability tools, such as Prometheus, Grafana, InfluxDB, and Nagios.
  • Cloud proficiency, especially with AWS services (EC2, S3, VPC, NLB) and automation tools like Terraform or CDK.
  • Desirable skills include experience with MongoDB, Python, CommVault, Oracle virtualisation (KVM/LVM), and AWS EKS.

What you need to do now

If you're interested in this role, click 'apply now' to forward an up-to-date copy of your CV, or call us now.

If this job isn't quite right for you, but you are looking for a new position, please contact us for a confidential discussion about your career.