Summary: The role of AI Model Tester at Mercor involves conducting model testing and grading by assessing outputs from AI models. This fully remote contract position requires a detail-oriented generalist to support benchmarking and quality assurance processes. The commitment ranges from 10 to 40 hours per week, with an immediate start date. Compensation is set at $40 per hour, paid weekly via Stripe Connect.
Key Responsibilities:
- Conduct model testing and grading by running prompts through models and assessing preliminary outputs.
- Support benchmarking and quality assurance by collaborating in QA review processes to ensure prompt tasks and rubrics meet the required standard of rigor.
- Maintain the consistency and reliability of tasks before they are integrated into official benchmarks.
- Work independently and asynchronously to meet deadlines while contributing to frontier AI work.
Key Skills:
- Detail-oriented generalist skills.
Salary (Rate): $40.00/hr
City: undetermined
Country: United Kingdom
Working Arrangements: remote
IR35 Status: inside IR35
Seniority Level: undetermined
Industry: IT
About The Job
Mercor connects elite creative and technical talent with leading AI research labs. Mercor is headquartered in San Francisco, and our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.
Position: Generalist
Type: Contract
Compensation: $40/hour
Location: Remote
Duration: 10/17/25–10/19/25
Commitment: 10–40 hours/week
Role Responsibilities
- Conduct model testing and grading by running prompts through models and assessing preliminary outputs.
- Support benchmarking and quality assurance by collaborating in QA review processes to ensure prompt tasks and rubrics meet the required standard of rigor.
- Maintain the consistency and reliability of tasks before they are integrated into official benchmarks.
- Work independently and asynchronously to meet deadlines while contributing to frontier AI work.
Qualifications
Must-Have: Detail-oriented generalist skills.
Start Date: Immediately; applications reviewed on a rolling basis.
Interview Process: 20-minute initial screening.
Compensation & Legal: Hourly contractor classification. Paid weekly via Stripe Connect.
Application Process (Takes 20–30 mins to complete)
- Upload resume
- AI interview based on your resume
- Submit form
Resources & Support
For details about the interview process and the platform, see: https://talent.docs.mercor.com/welcome/welcome
For any help or support, reach out to: support@mercor.com
PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.