Negotiable
Outside
Remote
USA
Summary: The AI/ML Engineer role focuses on deploying and optimizing AI models on specific platforms while implementing advanced pipelines and conducting performance testing. The position requires expertise in GPU utilization and cost analysis to ensure resource efficiency. This is a long-term remote position based in the USA. The role is classified as outside IR35.
Key Responsibilities:
- Deploy and optimize AI models on both Systalyze and Baseten platforms
- Implement and benchmark RAG (Retrieval-Augmented Generation) pipelines
- Conduct comprehensive performance testing and optimization
- GPU utilization analysis and CUDA optimization
- Cost analysis and resource efficiency evaluation
- Model inference latency and throughput benchmarking
Key Skills:
- Experience with AI model deployment and optimization
- Knowledge of RAG pipelines
- Performance testing and optimization skills
- Proficiency in GPU utilization and CUDA
- Cost analysis and resource efficiency evaluation
- Benchmarking skills for model inference
Salary (Rate): undetermined
City: undetermined
Country: USA
Working Arrangements: remote
IR35 Status: outside IR35
Seniority Level: undetermined
Industry: IT
Role: AI/ML Engineer
Location: Remote
Duration: Long Term
Primary Responsibilities
- Deploy and optimize AI models on both Systalyze and Baseten platforms
- Implement and benchmark RAG (Retrieval-Augmented Generation) pipelines
- Conduct comprehensive performance testing and optimization
- GPU utilization analysis and CUDA optimization
- Cost analysis and resource efficiency evaluation
- Model inference latency and throughput benchmarking