Negotiable
Outside
Hybrid
Swansea, West Glamorgan, UK
Summary: We are looking for an experienced HPC Linux Systems Engineer to manage and optimize high-performance computing environments, primarily focusing on large-scale Linux clusters and GPU-accelerated systems. The role is hybrid, with a significant emphasis on remote work. The engineer will be responsible for deploying, configuring, and supporting various HPC infrastructures, ensuring optimal performance and security. This position requires strong Linux administration skills and experience in HPC environments.
Key Responsibilities:
- Deploy, configure, optimise and support enterprise Linux across HPC clusters
- Automate provisioning and manage configuration at scale (PXE, Kickstart, Ansible/Puppet)
- Install, configure, and optimise HPC schedulers (eg Slurm) and MPI environments
- Deploy and manage GPU (NVIDIA/CUDA) and high-performance storage solutions
- Monitor, benchmark, and tune system performance across compute, network, and storage
- Implement authentication, security controls, and system hardening for multi-user environments
- Support HPC software stacks, toolchains, and container runtimes (Spack, EasyBuild, Apptainer)
- Maintain documentation and support user access/workflows
Key Skills:
- Strong Linux administration in HPC or large-scale environments
- Experience with automation and cluster provisioning
- Knowledge of Slurm, MPI, and parallel computing
- Experience with GPU/CUDA environments
- Understanding of system performance tuning
- Familiarity with identity management and security best practices
- Previous experience operating as an HPC Linux Systems Engineer
Salary (Rate): £465 Daily
City: Swansea
Country: UK
Working Arrangements: hybrid
IR35 Status: outside IR35
Seniority Level: undetermined
Industry: IT
Job Title: HPC Linux Systems Engineer - Enterprise Linux & Cluster Infrastructure
Location: Hybrid to Remote - Swansea (mostly remote & expenses for all site travel)
Day Rate: £465 per day - payable to Limited Company/Outside IR35
Duration: 4 months initially
Pay Frequency: Weekly
Start Date: ASAP
Overview
We are seeking an experienced HPC Linux Systems Engineer to deploy, configure, optimise and support high-performance computing (HPC) environments. This includes large-scale Linux clusters, GPU-accelerated systems, and associated storage, networking, and authentication infrastructure.
Although this role is hybrid, it's expected to be largely remote/WFH.
Responsibilities
- Deploy, configure, optimise and support enterprise Linux across HPC clusters
- Automate provisioning and manage configuration at scale (PXE, Kickstart, Ansible/Puppet)
- Install, configure, and optimise HPC schedulers (eg Slurm) and MPI environments
- Deploy and manage GPU (NVIDIA/CUDA) and high-performance storage solutions
- Monitor, benchmark, and tune system performance across compute, network, and storage
- Implement authentication, security controls, and system hardening for multi-user environments
- Support HPC software stacks, toolchains, and container runtimes (Spack, EasyBuild, Apptainer)
- Maintain documentation and support user access/workflows
Requirements
- Strong Linux administration in HPC or large-scale environments
- Experience with automation and cluster provisioning
- Knowledge of Slurm, MPI, and parallel computing
- Experience with GPU/CUDA environments
- Understanding of system performance tuning
- Familiarity with identity management and security best practices
- Previous experience operating as an HPC Linux Systems Engineer