AI Model Tester - Fully Remote

AI Model Tester - Fully Remote

Posted Today by Mercor

Negotiable
Inside
Remote
United Kingdom

Summary: The role of AI Model Tester at Mercor involves conducting model testing and grading by assessing outputs from AI models. This fully remote contract position requires a detail-oriented generalist to support benchmarking and quality assurance processes. The commitment ranges from 10 to 40 hours per week, with an immediate start date. Compensation is set at $40 per hour, paid weekly via Stripe Connect.

Key Responsibilities:

  • Conduct model testing and grading by running prompts through models and assessing preliminary outputs.
  • Support benchmarking and quality assurance by collaborating in QA review processes to ensure prompt tasks and rubrics meet rigor.
  • Maintain consistency and reliability before integration into official benchmarks.
  • Work independently and asynchronously to meet deadlines while contributing to frontier AI work.

Key Skills:

  • Detail-oriented generalist skills.

Salary (Rate): £40.00/hr

City: undetermined

Country: United Kingdom

Working Arrangements: remote

IR35 Status: inside IR35

Seniority Level: undetermined

Industry: IT

Detailed Description From Employer:

About The Job Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark , General Catalyst , Peter Thiel , Adam D'Angelo , Larry Summers , and Jack Dorsey .

Position: Generalist

Type: Contract

Compensation: $40/hour

Location: Remote

Duration: 10/17/25–10/19/25

Commitment: 10–40 hours/week

Role Responsibilities

  • Conduct model testing and grading by running prompts through models and assessing preliminary outputs.
  • Support benchmarking and quality assurance by collaborating in QA review processes to ensure prompt tasks and rubrics meet rigor.
  • Maintain consistency and reliability before integration into official benchmarks.
  • Work independently and asynchronously to meet deadlines while contributing to frontier AI work.

Qualifications

Must-Have Detail-oriented generalist skills.

Start Date Immediately; applications reviewed on a rolling basis.

Interview Process 20 mins initial screening.

Compensation & Legal Hourly contractor classification. Paid weekly via Stripe Connect.

Application Process (Takes 20–30 mins to complete)

  • Upload resume
  • AI interview based on your resume
  • Submit form

Resources & Support For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome

For any help or support, reach out to: support@mercor.com

PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.