Negotiable
Undetermined
Remote
United Kingdom
Summary: The role of English Language Audio Model Trainer at Mercor involves recording clear and concise spoken descriptions of visual content for AI model training. The position is remote and offers flexible hours, requiring excellent verbal communication skills and attention to detail. Candidates will collaborate with AI researchers to ensure high-quality data for model development. Training support will be provided, making it accessible for those with varying levels of experience.
Key Responsibilities:
- Record clear, concise, and natural-sounding descriptions of visual content.
- View a series of images and generate spoken descriptions (typically 2–3 minutes each).
- Ensure high-quality recordings free from background noise or distortion.
- Follow linguistic, timing, or stylistic guidelines set by the research team.
- Collaborate with AI researchers and QA teams to review and iterate on data quality.
Key Skills:
- Excellent verbal communication and enunciation skills.
- Native or near-native fluency in English (additional language fluency is a plus).
- Strong attention to detail and ability to follow precise guidelines.
- Prior experience with voice recording or data annotation is a plus, but not required.
- Comfortable working independently and handling repetitive tasks consistently.
Salary (Rate): £20.00/hr
City: undetermined
Country: United Kingdom
Working Arrangements: remote
IR35 Status: undetermined
Seniority Level: undetermined
Industry: IT
At Crossing Hurdles, we work as a referral partner. We refer candidates to Mercor that collaborates with the world’s leading AI research labs to build and train cutting-edge AI models.
Organization: Mercor
Position : English Language Audio Model Trainer
Referral Partner: Crossing Hurdles
Type : Hourly Contract (Remote)
Compensation : $20/hr
Location : Remote
Duration :10–40 hrs/week, flexible and asynchronous
Requirements: (Training support will be provided)
- Excellent verbal communication and enunciation skills
- Native or near-native fluency in English (additional language fluency is a plus)
- Strong attention to detail and ability to follow precise guidelines
- Prior experience with voice recording or data annotation is a plus, but not required
- Comfortable working independently and handling repetitive tasks consistently
Role Responsibilities:
- Record clear, concise, and natural-sounding descriptions of visual content
- View a series of images and generate spoken descriptions (typically 2–3 minutes each)
- Ensure high-quality recordings free from background noise or distortion
- Follow linguistic, timing, or stylistic guidelines set by the research team
- Collaborate with AI researchers and QA teams to review and iterate on data quality
Application process: (Takes 20 min)
- Upload resume
- AI interview based on your resume (15 min)
- Submit form