Summary: The Sr. Linux AI System Engineer role involves addressing instability in an AI environment built with less than optimal practices. The position requires expertise in Linux systems engineering, GPU/HPC, and AI/ML platform support, with a focus on debugging and resolving issues. This is a remote position, preferably for candidates in Texas or the Central Standard Time zone, with an immediate start for a one-month assignment. Candidates will also need to undergo a CJIS background check due to the nature of the data handled.
Key Responsibilities:
- Linux Systems Engineering
- GPU / HPC expertise
- AI/ML platform support
- Cloud + DevOps + MLOps
- Full AI Build Experience
- Configuration management, monitoring, diagnostics, and issue resolution of Lenovo servers and GPU nodes, NVIDIA Cumulus Ethernet SN5000 Series, Ubuntu Linux, Kubernetes, Portworx SDS, and GPU passthrough from the OS to Kubernetes pods
- Familiarity with Centific
- Strong debugging and diagnostic skills
- Ability to map out and document an environment for future administrators
- Strong written and verbal communication skills
- Proven experience working in a collaborative multi-vendor environment
- Full-time weekdays and daytime hours with potential overtime, non-standard work hours, nights, and weekends during active issue resolution
- Working with CJIS data requiring a fingerprint-based background check
Key Skills:
- Linux Systems Engineering
- GPU / HPC expertise
- AI/ML platform support
- Cloud + DevOps + MLOps
- Configuration management
- Strong debugging and diagnostic skills
- Documentation skills
- Strong written and verbal communication skills
- Collaborative experience in multi-vendor environments
- Familiarity with Centific
- Ability to work flexible hours
- Compliance with CJIS background check requirements
Salary (Rate): £56.25 hourly
City: undetermined
Country: undetermined
Working Arrangements: remote
IR35 Status: undetermined
Seniority Level: undetermined
Industry: IT
Detailed Description From Employer:
Sr. Linux AI System Engineer
- Remote - TX and/or CST preferred
- Start ASAP - one month assignment
The AI environment was built following less-than-best practices. It is experiencing instability, and there are no resources available to debug and resolve these issues.
Technical Requirements/Scope
- Linux Systems Engineering
- GPU / HPC expertise
- AI/ML platform support
- Cloud + DevOps + MLOps
- Full AI Build Experience
- Configuration management, monitoring, diagnostics, and issue resolution of:
  - Lenovo servers and Lenovo GPU nodes
  - NVIDIA Cumulus Ethernet SN5000 Series
  - Ubuntu Linux
  - Kubernetes
  - Portworx SDS
  - GPU passthrough from the OS to Kubernetes pods
- Familiarity with Centific would be a huge plus
- Strong Debugging and Diagnostic Skills
- Ability to map out and document an environment for future administrators
- Strong written and verbal communication skills
- Proven experience working in a collaborative multi-vendor environment
- Full-time weekdays and daytime hours
- Potential overtime, non-standard work hours, nights, weekends, and extended days during active issue resolution
- Will be working with CJIS data. A CJIS background check is a fingerprint-based, national criminal history search mandated by the FBI's Criminal Justice Information Services (CJIS) Division. It screens individuals who access sensitive law enforcement data.
Additional Information
- All candidates are encouraged to apply, but many positions require a strict drug screening and background check mandated by our customers.
- F2OnSite supports and adheres to all state laws regarding background checks.