£90,000 Per year
Inside
Hybrid
Greater London, England, United Kingdom
Summary: An industry-leading AI research organization is looking for a Python Engineer to join their data engineering team on a long-term contract. The role involves building and maintaining data infrastructure for large language model research, collaborating with AI researchers, and developing data pipelines for various datasets. This position offers a competitive salary and a hybrid working model based in Central London.
Key Responsibilities:
- Build and maintain scalable Python-based data pipelines
- Preprocess, clean, and filter large unstructured datasets (text, images, video)
- Implement risk filtering and quality control processes
- Collaborate with ML researchers and engineers to support experimental workflows
- Contribute to the efficiency and reliability of data onboarding for AI model training
Key Skills:
- 3–6 years’ experience in software or data engineering
- Proficient in Python with hands-on experience using libraries like Pandas, NumPy, or FastAPI
- Experience working with data pipelines (ETL/ELT), especially with unstructured datasets
- Familiarity with ML workflows and datasets used in model development
- Experience with PyTorch or other ML frameworks (bonus)
- Prior exposure to video/image/text processing at scale (bonus)
- Background in research-focused or high-scale data environments (bonus)
Salary (Rate): £90,000 yearly
City: Central London
Country: United Kingdom
Working Arrangements: hybrid
IR35 Status: inside IR35
Seniority Level: Mid-Level
Industry: IT
Overview: An industry-leading AI research organisation is seeking a Python Engineer to join their data engineering team on a long-term contract. You’ll help build and maintain the data infrastructure powering large language model (LLM) research. This is a unique opportunity to work closely with cutting-edge AI researchers, developing pipelines that onboard and process large-scale text, image, and video datasets.
What's on offer:
- Annual salary up to £90,000 DOE
- Weekly PAYE payments
- Hybrid model: 3 days in Central London office (non-negotiable)
- Contract until December 2025, with strong potential for extension
- Work at the intersection of software engineering and AI research
- Contribute directly to practical ML tooling and infrastructure
Key responsibilities:
- Build and maintain scalable Python-based data pipelines
- Preprocess, clean, and filter large unstructured datasets (text, images, video)
- Implement risk filtering and quality control processes
- Collaborate with ML researchers and engineers to support experimental workflows
- Contribute to the efficiency and reliability of data onboarding for AI model training
Required skill and experience:
- 3–6 years’ experience in software or data engineering
- Proficient in Python with hands-on experience using libraries like Pandas, NumPy, or FastAPI
- Experience working with data pipelines (ETL/ELT), especially with unstructured datasets
- Familiarity with ML workflows and datasets used in model development
Bonus points:
- Experience with PyTorch or other ML frameworks
- Prior exposure to video/image/text processing at scale
- Background in research-focused or high-scale data environments
______________________________
Contract until December 2025
Up to £90,000 per annum (Inside IR35, PAYE)
Central London (Hybrid - 3 days onsite)