AI Quality Analyst (LLM) | $30/hr Remote

Posted Today by Crossing Hurdles

Negotiable | Remote | EMEA

Summary: The AI Model Evaluator reviews outputs from large language models and autonomous agent systems and provides structured feedback to improve model performance. The position is fully remote and requires a commitment of 10 to 40 hours per week. Candidates need strong analytical skills, experience in AI evaluation, close attention to detail, and the ability to communicate insights clearly to stakeholders.

Salary (Rate): $30.00/hr

City: undetermined

Country: undetermined

Working Arrangements: remote

IR35 Status: undetermined

Seniority Level: undetermined

Industry: IT

Detailed Description From Employer:

Position: AI Model Evaluator (LLM & Agent Systems)

Type: Hourly contract

Compensation: $20–$30/hour

Location: Remote

Commitment: 10–40 hours/week

Role Responsibilities

  • Evaluate outputs from large language models and autonomous agent systems using defined rubrics and quality standards.
  • Review multi-step agent workflows, including screenshots and reasoning traces, to assess accuracy and completeness.
  • Apply benchmarking criteria consistently while identifying edge cases and recurring failure patterns.
  • Provide structured, actionable feedback to support model refinement and product improvements.
  • Participate in calibration sessions to ensure consistent evaluation alignment across reviewers.
  • Adapt to evolving guidelines and ambiguous scenarios with sound judgment.
  • Document findings clearly and communicate insights to relevant stakeholders.

Requirements

  • Strong experience in LLM evaluation, AI output analysis, QA/testing, UX research, or similar analytical roles.
  • Proficiency in rubric-based scoring, benchmarking frameworks, and AI quality assessment.
  • Excellent attention to detail with strong decision-making skills in ambiguous cases.
  • Proficient English communication skills (written and verbal).
  • Ability to work independently in a remote environment.
  • Comfortable committing to structured evaluation workflows and evolving guidelines.

Application Process (Takes 20 Min)

  • Upload resume
  • Interview (15 min)
  • Submit form