Model Testing Expert - Fully Remote

Posted Today by Mercor

Summary: The Model Testing Expert at Mercor conducts model testing and grading, supports benchmarking and quality assurance, and maintains consistency in AI outputs. This is a remote, hourly contractor position requiring a commitment of 10–20 hours per week. The ideal candidate is detail-oriented and able to work independently in a fast-paced environment. The position is available immediately and pays $40 per hour.

Key Responsibilities:

  • Conduct model testing and grading by running prompts through models and assessing preliminary outputs.
  • Support benchmarking and quality assurance by collaborating in QA review processes to ensure prompt tasks and rubrics are sufficiently rigorous.
  • Maintain consistency and reliability before integration into official benchmarks.
  • Work independently and asynchronously to meet deadlines while contributing to frontier AI work.

Key Skills:

  • Detail-oriented approach to tasks.
  • Ability to thrive in fast-paced, high-precision environments.

Salary (Rate): $40.00/hr

City: undetermined

Country: United Kingdom

Working Arrangements: remote

IR35 Status: inside IR35

Seniority Level: undetermined

Industry: IT

Detailed Description From Employer:

About The Job

Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.

Position: Generalist

Type: Hourly contractor

Compensation: $40/hour

Location: Remote

Duration: 10/17/25 – 10/19/25

Commitment: 10–20 hours/week

Role Responsibilities

  • Conduct model testing and grading by running prompts through models and assessing preliminary outputs.
  • Support benchmarking and quality assurance by collaborating in QA review processes to ensure prompt tasks and rubrics are sufficiently rigorous.
  • Maintain consistency and reliability before integration into official benchmarks.
  • Work independently and asynchronously to meet deadlines while contributing to frontier AI work.

Qualifications

Must-Have

  • Detail-oriented approach to tasks.
  • Ability to thrive in fast-paced, high-precision environments.

Start Date: Immediately

Compensation & Legal

Hourly contractor

Paid weekly via Stripe Connect

Application Process (Takes 20–30 mins to complete)

  • Upload resume
  • AI interview based on your resume
  • Submit form

Resources & Support

For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome

For any help or support, reach out to: support@mercor.com

PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.