£48 Per hour
Undetermined
Undetermined
London Area, United Kingdom
Summary: The role of Data Engineer / Software Engineer focuses on supporting a pioneering project centered on LLM agents for research. The position involves onboarding and filtering various datasets while collaborating with researchers to enhance data management through automation. The successful candidate will play a crucial role in ensuring data quality and readiness for intelligent agents. This opportunity is entirely dedicated to an AI-driven project, emphasizing effective engineering practices.
Key Responsibilities:
- Transform raw datasets into formats suitable for filtering pipelines.
- Clean, enhance, and filter data using client-provided pipelines.
- Apply filtering results to original datasets, repackage them, and support re-ingestion workflows.
- Collaborate with cross-functional teams to ensure data readiness and quality.
- Contribute to system design and pipeline automation efforts.
- Build and maintain Python-based data pipelines.
- Engage in technical discussions with researchers and stakeholders.
- Execute data mitigation and filtering tasks with minimal overhead.
- Develop tools that enable LLM agents to process and manage data automatically.
Key Skills:
- Solid background in data engineering or software engineering.
- Proficient in Python and pipeline development.
- Experience with data cleaning, transformation, and validation.
- Understanding of modern data storage systems and formats.
- Strong communication and collaboration skills.
- Prior work with text, image, and video datasets.
- Familiarity with data risk mitigation practices.
- Experience using client-specific data pipelines or filtering frameworks (preferred).
- Knowledge of machine learning concepts (preferred).
- Exposure to front-end technologies like JavaScript is a plus (preferred).
- Background in research or AI/ML project environments is beneficial (preferred).
Salary (Rate): £48.00/hr
City: London
Country: United Kingdom
Working Arrangements: undetermined
IR35 Status: undetermined
Seniority Level: undetermined
Industry: IT
We are seeking a Data Engineer / Software Engineer to support a pioneering project focused on LLM agents for research . You will play a key role in onboarding and filtering text, image, and video datasets used by our client’s research teams. The goal is to proactively mitigate risks associated with these datasets through effective engineering and automation. This position offers the opportunity to work 100% on an AI-driven project, collaborating with researchers to build tools that allow intelligent agents to manage and improve datasets autonomously.
Responsibilities:
- Data Preprocessing : Transform raw datasets into formats suitable for filtering pipelines.
- Filtering : Clean, enhance, and filter data using client-provided pipelines .
- Post-processing : Apply filtering results to original datasets, repackage them, and support re-ingestion workflows.
- Collaborate with cross-functional teams to ensure data readiness and quality.
- Contribute to system design and pipeline automation efforts.
- Build and maintain Python-based data pipelines.
- Engage in technical discussions with researchers and stakeholders.
- Execute data mitigation and filtering tasks with minimal overhead.
- Develop tools that enable LLM agents to process and manage data automatically.
Required Skills & Qualifications
- Solid background in data engineering or software engineering .
- Proficient in Python and pipeline development .
- Experience with data cleaning, transformation, and validation .
- Understanding of modern data storage systems and formats.
- Strong communication and collaboration skills.
- Prior work with text, image, and video datasets .
- Familiarity with data risk mitigation practices.
Preferred Skills
- Experience using client-specific data pipelines or filtering frameworks.
- Knowledge of machine learning concepts .
- Exposure to front-end technologies like JavaScript is a plus.
- Background in research or AI/ML project environments is beneficial.