Rate: Negotiable | IR35: Outside | Location: Remote, USA
Summary: The AI/ML Engineer role focuses on deploying and optimizing AI models on AWS, Azure, and Google Cloud Platform, implementing and benchmarking RAG (Retrieval-Augmented Generation) pipelines, and conducting performance testing. It requires advanced Python, intermediate C++, and experience with ML frameworks (PyTorch, TensorFlow, Hugging Face Transformers, LangChain), Docker, and Kubernetes. The role is fully remote and classified as outside IR35.
Salary (Rate): undetermined
City: undetermined
Country: USA
Working Arrangements: remote
IR35 Status: outside IR35
Seniority Level: undetermined
Industry: IT
AI/ML Engineer
Primary Responsibilities
- Deploy and optimize AI models on AWS, Azure and Google Cloud Platform
- Implement and benchmark RAG (Retrieval-Augmented Generation) pipelines
- Conduct comprehensive performance testing and optimization
- Perform cost analysis and evaluate resource efficiency
- Benchmark model inference latency and throughput
Required Technical Skills
Core AI/ML Expertise:
- Programming Languages: Python (advanced), C++ (intermediate)
- ML Frameworks: PyTorch, TensorFlow, Hugging Face Transformers, LangChain
Platform & Infrastructure:
- Containerization: Docker
- Orchestration: Kubernetes
- Cloud Platforms: AWS (EC2 P/G instances), Azure (NC/ND series), Google Cloud Platform (A2/N1 instances)