Sr. Linux AI System Engineer

Posted 1 day ago by F2ONSITE

Negotiable
Undetermined
Remote

Summary: The Sr. Linux AI System Engineer role involves stabilizing an AI environment that was built following less than best practices. The position requires expertise in Linux systems engineering, GPU/HPC, and AI/ML platform support, with a focus on debugging and resolving issues. This is a remote position, with a preference for candidates in Texas or the Central time zone, and an immediate start for a one-month assignment. Candidates must also pass a CJIS background check because the role involves CJIS data.

Key Responsibilities:

  • Linux Systems Engineering
  • GPU / HPC expertise
  • AI/ML platform support
  • Cloud + DevOps + MLOps
  • Full AI Build Experience
  • Configuration management, monitoring, diagnostics, and issue resolution for Lenovo servers and GPU nodes, NVIDIA Cumulus Ethernet SN5000 Series, Ubuntu Linux, Kubernetes, Portworx SDS, and GPU passthrough from the OS to Kubernetes pods
  • Familiarity with Centific
  • Strong debugging and diagnostic skills
  • Ability to map out and document an environment for future administrators
  • Strong written and verbal communication skills
  • Proven experience working in a collaborative multi-vendor environment
  • Full-time weekdays and daytime hours with potential overtime, non-standard work hours, nights, and weekends during active issue resolution
  • Working with CJIS data requiring a fingerprint-based background check

Key Skills:

  • Linux Systems Engineering
  • GPU / HPC expertise
  • AI/ML platform support
  • Cloud + DevOps + MLOps
  • Configuration management
  • Strong debugging and diagnostic skills
  • Documentation skills
  • Strong written and verbal communication skills
  • Collaborative experience in multi-vendor environments
  • Familiarity with Centific
  • Ability to work flexible hours
  • Compliance with CJIS background check requirements

Salary (Rate): £56.25 hourly

City: undetermined

Country: undetermined

Working Arrangements: remote

IR35 Status: undetermined

Seniority Level: undetermined

Industry: IT

Detailed Description From Employer:

Sr. Linux AI System Engineer

  • Remote - TX and/or CST preferred
  • Start ASAP - one month assignment

The AI environment was built following less than best practices. It is experiencing instability, and the resources to debug and resolve these issues are not currently available.

Technical Requirements/Scope

  • Linux Systems Engineering
  • GPU / HPC expertise
  • AI/ML platform support
  • Cloud + DevOps + MLOps
  • Full AI Build Experience
  • Configuration management, monitoring, diagnostics, and issue resolution of:
    • Lenovo servers and Lenovo GPU nodes
    • NVIDIA Cumulus Ethernet SN5000 Series
    • Ubuntu Linux
    • Kubernetes
    • Portworx SDS
    • GPU passthrough from the OS to Kubernetes pods
  • Familiarity with Centific would be a huge plus
  • Strong debugging and diagnostic skills
  • Ability to map out and document an environment for future administrators
  • Strong written and verbal communication skills
  • Proven experience working in a collaborative multi-vendor environment
  • Full-time weekdays and daytime hours
    • Potential overtime, non-standard work hours, nights, weekends, and extended days during active issue resolution
  • Will be working with CJIS data. A CJIS background check is a fingerprint-based, national criminal history search mandated by the FBI's Criminal Justice Information Services (CJIS) Division. It screens individuals who access sensitive law enforcement data for compliance.

Additional Information

  • All candidates are encouraged to apply, but many positions require strict drug screening and background checks set by our customers.
  • F2OnSite supports and adheres to all state laws regarding background checks.