ML Engineer (Remote) (Southampton)

Posted Today by Outlier AI

Apply

Negotiable

Undetermined

Remote

Bordon, England, United Kingdom

Apply

Application Programming Interface (API) Artificial Intelligence Automation Data Acquisition Generic Programming Input/Output Java (Programming Language) JavaScript (Programming Language) Machine Learning Nice (Unix Utility) Outliers Parsing Python (Programming Language) Software Development SQL (Programming Language) Systems Integration Web SQL Databases Workflows

Summary: The ML Engineer role at Outlier involves enhancing AI agents through human feedback, focusing on training Large Language Models for complex architectural workflows. Candidates should have a strong background in backend engineering and experience with multi-turn system interactions. The position offers remote working options and seeks individuals passionate about AI and software development. Ideal candidates will have a proven track record in building production-grade software and integrating various tools and APIs.

Key Responsibilities:

Enhance AI agents by providing human feedback.
Train Large Language Models to function as proactive, multi-step agents.
Design, coordinate, and optimize complex architectural workflows.
Build and maintain production-grade software with modular separation.
Provide clear technical feedback on complex system behaviors.
Integrate agents with live tools and APIs to solve real-world problems.
Implement persistent state and session discovery to track agent progress.
Identify subtle failures such as privacy leaks and authority escalation.

Key Skills:

2+ years of experience in backend engineering, AI automation, or complex systems integration.
Proven ability to build production-grade software with modular separation.
Strong command of at least two major programming languages (e.g., Python, JavaScript, Go, Java).
Experience with SQL databases.
Practical experience in live environments and multi-turn system interactions.
Outstanding attention to detail and technical feedback skills.
Expertise in building multi-stage coordination tasks.
Hands-on experience integrating agents with live tools and APIs.
Experience identifying subtle failures in systems.

Salary (Rate): undetermined

City: Bordon

Country: United Kingdom

Working Arrangements: remote

IR35 Status: undetermined

Seniority Level: undetermined

Industry: IT

Detailed Description From Employer:

About The Project Find out more about this role by reading the information below, then apply to be considered. Outlier helps the world’s most innovative companies improve their AI agents by providing human feedback. Do you want to shape the future of autonomous agents like OpenClaw? We collaborate with leading AI organizations to train Large Language Models (LLMs) to function as proactive, multi-step agents. Our projects focus on teaching these systems how to design, coordinate, and optimize complex, real-world architectural workflows. Whether you are a passionate orchestration guru or experienced software developer — we want you to help us train the world's most advanced generative systems.

Ideal Qualifications 2+ years of experience in backend engineering, AI automation, or complex systems integration. Proven ability to build and maintain production-grade software with modular separation (e.g., distinct services for data parsing, logic processing, and reporting). Strong command of at least two major languages (e.g., Python, JavaScript, Go, or Java) and experience working with SQL databases. Practical experience building for live, non-mocked environments and handling multi-turn system interactions. Outstanding attention to detail and the ability to provide clear, high-density technical feedback on complex system behaviors. Nice to have Expertise building multi-stage coordination tasks where data acquisition leads to reasoned output. Hands-on experience integrating agents with live tools such as Supabase, Gmail, and various APIs to solve real-world problems. High level of comfort implementing persistent state and session discovery using to track agent progress. Experience identifying subtle failures like privacy leaks, authority escalation, or indirect prompt injections.

xlqdzyr Remote working/work at home options are available for this role.

Apply

Inside IR35

Outside IR35

Permanent Employee

IR35

Umbrella Companies

Limited Companies

First Time Contractors

What Is IR35?

InsideIR35

Outside IR35

The Cost of IR35

IR35 Assessments

IR35 Rules

IR35 Compliance

Expenses

Foreign Companies

Overseas Contractors

Limited Companies

Sole Traders

What Is An Umbrella Company?

Choosing an Umbrella Company

Tax and Pay

Tax Avoidance

Fees (Margin)

National Insurance

Holiday Pay

Expenses

Pensions

Maternity Pay

Sick Pay

What Is A Limited Company?

Limited Company vs Sole Trader

Incorporation

Taxes

Filing Responsibilities

Bookkeeping

Insurance

Expenses

Buying a Car or Van

Capital Allowances

Benefits In Kind

Pensions

Employing A Spouse

Managing Excess Money

Dormant Companies

Closing Your Company

Withdrawing Money

Business Asset Disposal Relief

How To Become A Contractor

Inside IR35 Checklist

Outside IR35 Checklist

Self-Assessment Tax Returns

Mortgages

Pensions

Working Multiple Contracts

What is the £100k Abatement?

Inside IR35

Outside IR35

Permanent Employee

IR35

Umbrella Companies

Limited Companies

First Time Contractors

What Is IR35?

InsideIR35

Outside IR35

The Cost of IR35

IR35 Assessments

IR35 Rules

IR35 Compliance

Expenses

Foreign Companies

Overseas Contractors

Limited Companies

Sole Traders

What Is An Umbrella Company?

Choosing an Umbrella Company

Tax and Pay

Tax Avoidance

Fees (Margin)