Linguist

Linguist

Posted 1 day ago by SGS Consulting

Negotiable
Undetermined
Remote
Remote

Summary: The role of a Linguist requires a graduate degree in Linguistics and proficiency in a non-English language, with a focus on linguistic analysis of AI model outputs and large datasets. The position involves writing guidelines for AI projects, conducting research on language typology, and collaborating with native speakers. Strong communication skills and coding abilities in Python are essential, along with experience in Natural Language Processing techniques. The role is remote and targets candidates with 0-3 years of experience.

Key Responsibilities:

  • Perform linguistic analyses on large datasets.
  • Perform linguistic error analysis of AI model outputs, determining frequent and severe error categories.
  • Write and revise guidelines for human annotation and AI projects, including translation tasks.
  • Conduct typological and sociolinguistic research on various languages.
  • Perform linguistic analyses for Responsible AI in multilingual settings.
  • Conduct linguistic literature reviews on NLP-adjacent topics and summarize findings.
  • Compare quality of deliveries between vendors, identify error patterns, and provide feedback.
  • Provide information or guidance on linguistic knowledge aspects.
  • Collaborate with native speakers in various languages.
  • Communicate results of linguistic analyses to engineers and research scientists.

Key Skills:

  • Strong written and spoken communication skills, especially in business and research.
  • Native speaker of a non-English language (preferably Thai, Vietnamese, Dutch, Portuguese).
  • Working knowledge in other languages is a plus; proficiency in low-resource languages is valued.
  • Ability to code in Python and query databases using SQL; other coding languages for data analysis are a plus.
  • Ability to work independently, prioritize, plan, and track work.
  • Education or training in project management is a plus.
  • Self-motivation is a must.
  • Working knowledge of international language-classification standards is valued.

Salary (Rate): £36,000 yearly

City: undetermined

Country: undetermined

Working Arrangements: remote

IR35 Status: undetermined

Seniority Level: Entry Level

Industry: Other

Required YOE: 0-3 Years
REQUIRED: Must have a graduate degree in Linguistics. A graduate degree in Literature or English is not an appropriate substitution. A degree in Computer Science with a specialization in NLP is not an appropriate substitution.
REQUIRED: Must be native speaker of a non-English language (preferably Thai, Vietnamese, Dutch, Portuguese)

Main duties:
Perform linguistic analyses on large datasets.
Perform linguistic error analysis of AI model outputs, determining what the most frequent and severe error categories are.
Write and revise guidelines for human annotation and other AI projects, including but not limited to translation tasks.
Conduct typological and sociolinguistic research on a large number of languages, highlighting their similarities and differences.
Perform linguistic analyses for Responsible AI (toxic language, hate speech, gender bias and other cultural biases) in massively multilingual settings.
Conduct linguistic literature reviews on various NLP-adjacent topics, and summarize findings.
Compare the quality of deliveries between vendors, identify error patterns, and provide actionable feedback.
Provide information or guidance relative to any aspect of linguistic knowledge (typology, morpho-syntax, sociolinguistics, classification, phonetics/phonology, pragmatics, etc.).
Reach out to and collaborate with native speakers in various languages.
Communicate results of linguistic analyses to engineers and research scientists.

Skills:
Must have strong written and spoken communication skills, especially business and research communication.
Must be native speaker of a non-English language (preferably Thai, Vietnamese, Dutch, Portuguese)
Working knowledge in other languages is a plus. Proficiency in a low-resource language is valued.
Must be able to code in Python (must) and query databases using SQL, other coding languages used for data analysis are a plus.
Must be able to independently work through complex requests and perform under pressure.
Strong ability to work independently, prioritize, plan, and track work, as well as report progress
education or training in the basics of project management is a plus
self-motivation is a must
Working knowledge of international language-classification standards is valued.

Education:
Graduate degree in Linguistics or related field is a must; PhD is a plus a background or specialization in corpus linguistics is a plus experience with field work is a plus
A graduate degree in Literature or English is not an appropriate substitution
Degree in Computer Science with a specialization in NLP is not an appropriate substitution
Must have a very firm grasp of the following linguistic fields: language typology, syntax, morphology, sociolinguistics (especially dialectology and discourse analysis), corpus linguistics, writing systems, pragmatics, phonology.
Must have some experience with applying basic Natural Language Processing techniques.

Experience:
Experience working cross-functionally
Experience collaborating with machine learning, NLP, or software engineers, or data scientists
Experience contributing to research papers
Important: Preferably no known conflicts of interest in the fields of machine translation, ASR, TTS, or LLM research (as the candidates need to be contributing to research papers)