Network Architect

Posted 7 days ago by 1761200683

Negotiable
Outside
Remote
USA

Summary: The Network Architect role is a hands-on position focused on developing and deploying ultra-high-speed, resilient, and scalable interconnects for GPU-accelerated data centers and compute clusters. The ideal candidate will possess strong problem-solving skills and a deep understanding of network security protocols, routing, switching, and automation. This position requires collaboration with various teams to ensure the delivery of highly available networks for extreme-performance workloads. A minimum of 6-8 years of experience in managing large-scale hybrid data center networks is essential.

Key Responsibilities:

  • Lead the architecture, design, and deployment of global-scale data center interconnects and fabrics for HPC, AI, and GPU computing clusters.
  • Develop high-performance data center fabrics using InfiniBand, Ultra Ethernet, and related technologies.
  • Optimize carrier interconnects, intra- and inter-DC routing, and dark fiber deployments to ensure low latency and high reliability.
  • Partner with system, OS, GPU, and HPC teams to deliver scalable, highly available networks for extreme-performance workloads.
  • Implement network monitoring, telemetry, troubleshooting, and continuous performance improvement processes.
  • Drive technology selection, vendor engagement, and lifecycle management for Data Center hardware and software.

Key Skills:

  • Minimum 6-8 years of experience building, managing, and supporting large-scale hybrid data center networks.
  • Experience developing automation pipelines with Python, Ruby, Go, or other languages used in infrastructure automation.
  • SME in networking technologies: InfiniBand, TCP/UDP, IPv4/IPv6, BGP/MP-BGP, VPN, L2 switching, EVPN, VXLAN, Segment Routing, MPLS, IS-IS, DWDM.
  • Experience automating SDN/NFV/NFVI infrastructure.
  • Experience using an automated configuration management system (Ansible, Salt, etc.).

Salary (Rate): undetermined

City: undetermined

Country: USA

Working Arrangements: remote

IR35 Status: outside IR35

Seniority Level: undetermined

Industry: IT

Detailed Description From Employer:

Top Skills:
Core Network Architecture & Design Skills
Data Center fabric/interconnect architecture (HPC, AI, GPU clusters)
InfiniBand, Ultra Ethernet, high-performance networking protocols
Hybrid Data Center network design & scalability
Dark fiber deployments, low-latency routing, global-scale interconnects

Network Architect
This is a hands-on architecture position focused on developing and deploying ultra-high-speed, resilient, and scalable interconnects for GPU-accelerated data centers and compute clusters. Outstanding problem-solving abilities, a comprehensive understanding of network security protocols and standards, routing, switching, and automation, and a deep understanding of fundamental network theory are critical to success.

Key Responsibilities:

  • Lead the architecture, design, and deployment of global-scale data center interconnects and fabrics for HPC, AI, and GPU computing clusters.
  • Develop high-performance data center fabrics using InfiniBand, Ultra Ethernet, and related technologies.
  • Optimize carrier interconnects, intra- and inter-DC routing, and dark fiber deployments to ensure low latency and high reliability.
  • Partner with system, OS, GPU, and HPC teams to deliver scalable, highly available networks for extreme-performance workloads.
  • Implement network monitoring, telemetry, troubleshooting, and continuous performance improvement processes.
  • Drive technology selection, vendor engagement, and lifecycle management for data center hardware and software.

What We're Looking For:

  • Minimum 6-8 years of experience building, managing, and supporting large-scale hybrid data center networks.
  • Experience developing automation pipelines with Python, Ruby, Go, or other languages used in infrastructure automation.
  • SME in networking technologies: InfiniBand, TCP/UDP, IPv4/IPv6, BGP/MP-BGP, VPN, L2 switching, EVPN, VXLAN, Segment Routing, MPLS, IS-IS, DWDM, etc.
  • Experience automating SDN/NFV/NFVI infrastructure.
  • Experience using an automated configuration management system (Ansible, Salt, etc.).