HPC Systems Engineer

HPC Systems Engineer

Posted Today by Totaljobs

£465 Per day
Outside
Hybrid
Swansea (SA1)

Summary: The HPC Systems Engineer role focuses on deploying, configuring, optimizing, and supporting high-performance computing environments, particularly large-scale Linux clusters and GPU-accelerated systems. The position is primarily remote with occasional site travel to Swansea. The engineer will also be responsible for automating provisioning and managing configurations at scale. This role requires strong expertise in Linux administration and HPC systems management.

Key Responsibilities:

  • Deploy, configure, optimise and support enterprise Linux across HPC clusters
  • Automate provisioning and manage configuration at scale (PXE, Kickstart, Ansible/Puppet)
  • Install, configure, and optimise HPC schedulers (e.g. Slurm) and MPI environments
  • Deploy and manage GPU (NVIDIA/CUDA) and high-performance storage solutions
  • Monitor, benchmark, and tune system performance across compute, network, and storage
  • Implement authentication, security controls, and system hardening for multi-user environments
  • Support HPC software stacks, toolchains, and container runtimes (Spack, EasyBuild, Apptainer)
  • Maintain documentation and support user access/workflows

Key Skills:

  • Strong Linux administration in HPC or large-scale environments
  • Experience with automation and cluster provisioning
  • Knowledge of Slurm, MPI, and parallel computing
  • Experience with GPU/CUDA environments
  • Understanding of system performance tuning
  • Familiarity with identity management and security best practices
  • Previous experience operating as an HPC Systems Engineer

Salary (Rate): £465 per day

City: Swansea

Country: United Kingdom

Working Arrangements: hybrid

IR35 Status: outside IR35

Seniority Level: undetermined

Industry: IT

Detailed Description From Employer:

Job Title: HPC Systems Engineer - Enterprise Linux & Cluster Infrastructure

Location: Hybrid to Remote - Swansea (mostly remote & expenses for all site travel)

Day Rate: £465 per day - payable to Limited Company / Outside IR35

Duration: 4 months initially

Pay Frequency: Weekly

Start Date: ASAP

Overview

We are seeking an experienced HPC Systems Engineer to deploy, configure, optimise and support high-performance computing (HPC) environments. This includes large-scale Linux clusters, GPU-accelerated systems, and associated storage, networking, and authentication infrastructure.

Although this role is hybrid, it's expected to be largely remote / WFH.

Responsibilities

  • Deploy, configure, optimise and support enterprise Linux across HPC clusters
  • Automate provisioning and manage configuration at scale (PXE, Kickstart, Ansible/Puppet)
  • Install, configure, and optimise HPC schedulers (e.g. Slurm) and MPI environments
  • Deploy and manage GPU (NVIDIA/CUDA) and high-performance storage solutions
  • Monitor, benchmark, and tune system performance across compute, network, and storage
  • Implement authentication, security controls, and system hardening for multi-user environments
  • Support HPC software stacks, toolchains, and container runtimes (Spack, EasyBuild, Apptainer)
  • Maintain documentation and support user access/workflows

Requirements

  • Strong Linux administration in HPC or large-scale environments
  • Experience with automation and cluster provisioning
  • Knowledge of Slurm, MPI, and parallel computing
  • Experience with GPU/CUDA environments
  • Understanding of system performance tuning
  • Familiarity with identity management and security best practices
  • Previous experience operating as an HPC Systems Engineer