GCP Public Cloud Infrastructure Architect (HPC, GKE)

GCP Public Cloud Infrastructure Architect (HPC, GKE)

Posted 5 days ago by W3Global

Negotiable
Undetermined
Hybrid
London, England, United Kingdom

Summary: The GCP Public Cloud Infrastructure Architect will lead the design and deployment of cloud infrastructure for enterprise and public sector clients in the UK, focusing on scalable and secure solutions. This role involves managing High-Performance Computing (HPC) workloads and deploying applications using Google Kubernetes Engine (GKE). The ideal candidate will possess extensive experience in cloud infrastructure design and a strong proficiency in Google Cloud services. Certification as a Google Professional Cloud Architect is mandatory for this position.

Key Responsibilities:

  • Architect and implement infrastructure solutions on Google Cloud for compute-intensive and enterprise-grade environments.
  • Design and optimize HPC workloads leveraging GCP services like Compute Engine, Preemptible VMs, TPUs/GPUs, Filestore, and Parallel File Systems.
  • Architect and manage GKE clusters (standard and autopilot), ensuring high availability, performance, and security.
  • Develop and manage CI/CD pipelines using tools like Cloud Build, GitLab, ArgoCD, or Jenkins integrated with GKE.
  • Automate deployment workflows and platform management using Terraform, Helm, and Kustomize.
  • Implement Istio service mesh, ingress controllers, network policies, and GCP IAM for secure microservice communication.
  • Monitor and optimize workloads with Prometheus, Grafana, Cloud Operations Suite, and GKE Autopilot insights.
  • Collaborate with DevOps, security, and development teams to implement best practices in multi-tenant Kubernetes environments.
  • Troubleshoot issues with containerized applications, networking, and scaling strategies.
  • Collaborate with application, security, and platform teams to ensure optimized performance, cost control, and compliance.
  • Provide architectural guidance, detailed design documentation, and technical enablement for internal and customer engineering teams.

Key Skills:

  • Bachelor's degree in Computer Science, Engineering, or related field.
  • 10+ years of experience in cloud infrastructure design, DevOps, or system architecture.
  • 5+ years of experience in cloud infrastructure engineering, with a strong focus on Google Cloud Platform (GCP).
  • In-depth knowledge and hands-on experience with core GCP services (e.g., Compute Engine, GKE, VPC, Cloud Storage, Cloud IAM, Cloud DNS, Cloud Load Balancing).
  • Proficiency in Infrastructure as Code (IaC) tools, especially Terraform.
  • Strong scripting and automation skills (e.g., Python, Bash, Go).
  • Experience with CI/CD pipelines and DevOps methodologies.
  • Solid understanding of networking concepts (TCP/IP, DNS, VPNs, firewalls).
  • Experience with monitoring and logging tools (e.g., GCP Operations Suite, Prometheus, Grafana).
  • Strong problem-solving, debugging, and analytical skills.
  • Proven expertise in GCP infrastructure, GKE, and HPC workload architecture.
  • Experience in optimizing HPC environments including batch scheduling, job queuing (e.g., Slurm), and shared/distributed storage.
  • Strong understanding of Kubernetes internals, pod scheduling, autoscaling, and node management.
  • Proficient in Infrastructure as Code (Terraform, Deployment Manager).
  • Hands-on experience with Docker, Helm, Istio, and container security scanning tools (e.g., Trivy, Aqua).
  • Experience integrating observability and monitoring tools for GKE.
  • Strong proficiency in Terraform, Linux administration, and container orchestration tools.
  • Fluent in English (written and verbal).
  • Certification: Google Professional Cloud Architect (mandatory).

Salary (Rate): undetermined

City: London

Country: United Kingdom

Working Arrangements: hybrid

IR35 Status: undetermined

Seniority Level: undetermined

Industry: IT

Detailed Description From Employer:

GCP Public Cloud Infrastructure Architect (HPC, GKE) | (UK) Location: United Kingdom (Remote/Hybrid Eligible) Language Requirement: Fluent in English Certification Required: Google Professional Cloud Architect (Mandatory)

About The Role

We are seeking a highly skilled GCP Public Cloud Infrastructure Architect to lead cloud infrastructure design and deployment for enterprise and public sector clients in the UK. You will focus on architecting scalable, secure infrastructure, managing HPC (High-Performance Computing) workloads, and deploying containerized applications using Google Kubernetes Engine (GKE).

Key Responsibilities

  • Architect and implement infrastructure solutions on Google Cloud for compute-intensive and enterprise-grade environments.
  • Design and optimize HPC workloads leveraging GCP services like Compute Engine, Preemptible VMs, TPUs/GPUs, Filestore, and Parallel File Systems.
  • Architect and manage GKE clusters (standard and autopilot), ensuring high availability, performance, and security.
  • Develop and manage CI/CD pipelines using tools like Cloud Build, GitLab, ArgoCD, or Jenkins integrated with GKE.
  • Automate deployment workflows and platform management using Terraform, Helm, and Kustomize.
  • Implement Istio service mesh, ingress controllers, network policies, and GCP IAM for secure microservice communication.
  • Monitor and optimize workloads with Prometheus, Grafana, Cloud Operations Suite, and GKE Autopilot insights.
  • Collaborate with DevOps, security, and development teams to implement best practices in multi-tenant Kubernetes environments.
  • Troubleshoot issues with containerized applications, networking, and scaling strategies.
  • Collaborate with application, security, and platform teams to ensure optimized performance, cost control, and compliance.
  • Provide architectural guidance, detailed design documentation, and technical enablement for internal and customer engineering teams.

Required Qualifications

  • Bachelor's degree in Computer Science, Engineering, or related field.
  • 10+ years of experience in cloud infrastructure design, DevOps, or system architecture.
  • Bachelor's degree in Computer Science, Engineering, Mathematics, a related quantitative field, or equivalent practical experience.
  • 5+ years of experience in cloud infrastructure engineering, with a strong focus on Google Cloud Platform (GCP).
  • In-depth knowledge and hands-on experience with core GCP services (e.g., Compute Engine, GKE, VPC, Cloud Storage, Cloud IAM, Cloud DNS, Cloud Load Balancing).
  • Proficiency in Infrastructure as Code (IaC) tools, especially Terraform.
  • Strong scripting and automation skills (e.g., Python, Bash, Go).
  • Experience with CI/CD pipelines and DevOps methodologies.
  • Solid understanding of networking concepts (TCP/IP, DNS, VPNs, firewalls).
  • Experience with monitoring and logging tools (e.g., GCP Operations Suite, Prometheus, Grafana).
  • Strong problem-solving, debugging, and analytical skills.
  • Proven expertise in GCP infrastructure, GKE, and HPC workload architecture.
  • Experience in optimizing HPC environments including batch scheduling, job queuing (e.g., Slurm), and shared/distributed storage.
  • Strong understanding of Kubernetes internals, pod scheduling, autoscaling, and node management.
  • Proficient in Infrastructure as Code (Terraform, Deployment Manager).
  • Hands-on experience with Docker, Helm, Istio, and container security scanning tools (e.g., Trivy, Aqua).
  • Experience integrating observability and monitoring tools for GKE.
  • Strong proficiency in Terraform, Linux administration, and container orchestration tools.
  • Fluent in English (written and verbal).
  • Certification: Google Professional Cloud Architect (mandatory).

Preferred Qualifications

  • Hands-on experience with GPU/TPU workloads, Slurm, or Intel MPI/OpenMPI in cloud HPC environments.
  • Experience deploying hybrid and multi-cloud solutions using Anthos or GCVE.
  • Familiarity with CI/CD, Cloud Build, Artifact Registry, and security scanning tools.
  • Familiarity with Anthos, GKE Autopilot, or hybrid/multi-cloud Kubernetes environments.
  • Experience with GitOps workflows (ArgoCD, Flux).
  • Exposure to workload identity, GCP Workload Identity Federation, and K8s RBAC.
  • Consulting or client-facing experience in industries like life sciences, financial services, or manufacturing with HPC needs.