AI/ML Engineer

Posted 1 day ago by 1756797990

Negotiable
Outside
Remote
USA

Summary: The AI/ML Engineer role focuses on deploying and optimizing AI models across major cloud platforms, implementing advanced pipelines, and conducting performance testing. The position requires expertise in various programming languages and machine learning frameworks, as well as experience with containerization and orchestration technologies. The role is fully remote and classified as outside IR35.

Key Responsibilities:

  • Deploy and optimize AI models on AWS, Azure and Google Cloud Platform
  • Implement and benchmark RAG (Retrieval-Augmented Generation) pipelines
  • Conduct comprehensive performance testing and optimization
  • Perform cost analysis and evaluate resource efficiency
  • Benchmark model inference latency and throughput

Key Skills:

  • Programming Languages: Python (advanced), C++ (intermediate)
  • ML Frameworks: PyTorch, TensorFlow, Hugging Face Transformers, LangChain
  • Containerization: Docker
  • Orchestration: Kubernetes
  • Cloud Platforms: AWS (EC2 P/G instances), Azure (NC/ND series), Google Cloud Platform (A2/N1 instances)

Salary (Rate): undetermined

City: undetermined

Country: USA

Working Arrangements: remote

IR35 Status: outside IR35

Seniority Level: undetermined

Industry: IT

Detailed Description From Employer:

Primary Responsibilities

  • Deploy and optimize AI models on AWS, Azure and Google Cloud Platform
  • Implement and benchmark RAG (Retrieval-Augmented Generation) pipelines
  • Conduct comprehensive performance testing and optimization
  • Perform cost analysis and evaluate resource efficiency
  • Benchmark model inference latency and throughput

Required Technical Skills

Core AI/ML Expertise:

  • Programming Languages: Python (advanced), C++ (intermediate)
  • ML Frameworks: PyTorch, TensorFlow, Hugging Face Transformers, LangChain

Platform & Infrastructure:

  • Containerization: Docker
  • Orchestration: Kubernetes
  • Cloud Platforms: AWS (EC2 P/G instances), Azure (NC/ND series), Google Cloud Platform (A2/N1 instances)