Senior Data Engineer

Senior Data Engineer

Posted Today by Exalto Consulting

£625 Per day
Outside
Hybrid
England, United Kingdom

Summary: The Senior Python Data Engineer role requires candidates to possess active SC Clearance and UK residency, focusing on machine learning projects for a government department. The position involves remote/hybrid work with occasional travel and emphasizes skills in Python, machine learning frameworks, and data quality management. Candidates should be prepared to start in early January and have experience in building reproducible data pipelines and experiment tracking.

Key Responsibilities:

  • Develop and implement machine learning solutions using Python and relevant libraries.
  • Conduct data exploration and quality assessments, including visualisation and profiling.
  • Manage labelling workflows and ensure quality assurance in data annotation.
  • Build reproducible data processing and training pipelines.
  • Track experiments and manage model artifacts using MLflow or similar tools.
  • Utilize version control systems for code and dataset management.

Key Skills:

  • Proficiency in Python with applied machine learning experience.
  • Experience with ML frameworks such as PyTorch or TensorFlow.
  • Strong skills in data exploration and visualisation tools.
  • Familiarity with pipeline orchestration tools like Airflow or Prefect.
  • Knowledge of experiment tracking tools like MLflow or Weights & Biases.
  • Version control experience with Git and dataset management tools.
  • Domain knowledge in signal processing is a bonus.

Salary (Rate): £625pd

City: undetermined

Country: United Kingdom

Working Arrangements: hybrid

IR35 Status: outside IR35

Seniority Level: Senior

Industry: IT

Detailed Description From Employer:

Senior Python Data Engineer (Active SC Clearance) You must hold active and current SC Clearance, have UK Residency and hold a UK passport.

Location: Remote/hybrid with occasional travel to offices (North and South) and customer site (South).

Rate: £625pd, outside IR35.

Contract Length: 6 months initially.

This project is at the cutting edge of machine learning, working with a talented team delivering important advances for a large government department.

Essential Skills and Experience Needed:

  • Python (applied ML) : pandas, numpy, Jupyter, strong scripting and reusable code
  • ML frameworks : PyTorch and or TensorFlow (plus scikit-learn)
  • Data exploration & quality : EDA, visualisation (matplotlib/plotly), dataset profiling, class balance, outliers
  • Labelling workflows : hands-on annotation and ontology work, label QA (inter-annotator agreement), confidence scoring, audit trail mindset
  • Pipeline building : reproducible pre-processing and training pipelines (not necessarily Kubernetes), eg Python pipelines, Airflow/Prefect/Luigi-style orchestration, and good experiment hygiene
  • Experiment tracking : MLflow and or Weights & Biases (runs, metrics, artefacts, model registry)
  • Version control : Git, plus dataset/versioned artefacts (DVC or similar is a plus)
  • Deployment not required , but ability to package work for T&E style repeatable runs is important
  • Domain bonus : signal processing experience (spectrograms, time-series, acoustic features), because the data is sonar/acoustics

Please apply immediately if you hold active and current SC Clearance and have the skills required above, as this is an urgent requirement, you must be able to start the contract in the first or second week of January.