Summary: The Senior Data Engineer role focuses on developing and optimising large-scale distributed data pipelines within a data-intensive SaaS platform in a regulated industry. The position requires strong engineering skills, particularly in Python and Spark, and involves mentoring other engineers while managing complex data workflows. This is a hands-on role that emphasises autonomy and technical leadership; it is not a machine learning or data science position.
Salary (Rate): Negotiable
City: undetermined
Country: UK
Working Arrangements: remote
IR35 Status: undetermined
Seniority Level: Senior
Industry: IT
Senior Data Engineer (Distributed Data Processing)
UK (Outside IR35), Belgium, Netherlands, or Germany (B2B)
Fully Remote
We're looking for a Senior Data Engineer to join a data-intensive SaaS platform operating in a complex, regulated industry.
This is a hands-on senior IC role focused on distributed data processing, Spark-based pipelines, and Python-heavy engineering. You'll be working on large-scale batch data workflows that power pricing, forecasting, and operational decision-making systems.
The role requires strong engineering judgement, the ability to operate autonomously, and the confidence to mentor others while delivering under tight timelines. This is not an ML, Data Science, or GenAI role.
What You'll Be Doing
Design, build, and evolve large-scale distributed data pipelines using Spark/PySpark (see the illustrative sketch after this list).
Develop production-grade Python data workflows that implement complex business logic.
Work with Databricks for job execution, orchestration, and optimisation.
Own and optimise cloud-based data infrastructure (AWS preferred, Azure also relevant).
Optimise data workloads for performance, reliability, and cost.
Collaborate with engineers, domain specialists, and delivery teams on client-facing projects.
Take ownership of technical initiatives and lead by example within the team.
Support and mentor other engineers.
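Purely for illustration, here is a minimal sketch of the kind of batch PySpark workflow described in the list above: read raw records, apply business logic, and write a curated, partitioned output. The bucket, table, and column names (raw_meter_readings, reading_kwh, account_id) are hypothetical placeholders, not details of the client's platform.

```python
# Illustrative sketch only: a minimal batch PySpark job.
# All paths and column names below are hypothetical.
from pyspark.sql import SparkSession, functions as F

spark = SparkSession.builder.appName("daily-usage-batch").getOrCreate()

# Read a batch of raw records (hypothetical Parquet source).
raw = spark.read.parquet("s3://example-bucket/raw_meter_readings/")

# Business logic: drop invalid rows, aggregate usage per account per day.
daily = (
    raw.filter(F.col("reading_kwh") >= 0)
       .withColumn("reading_date", F.to_date("reading_ts"))
       .groupBy("account_id", "reading_date")
       .agg(
           F.sum("reading_kwh").alias("total_kwh"),
           F.count("*").alias("reading_count"),
       )
)

# Write a curated output, partitioned by date for downstream pricing/forecasting jobs.
daily.write.mode("overwrite").partitionBy("reading_date").parquet(
    "s3://example-bucket/curated/daily_usage/"
)
```

On Databricks, a job along these lines would typically run as a scheduled workflow against Delta tables rather than raw Parquet paths, but the overall structure is the same.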
Must-Have Experience
Proven experience as a Senior Data Engineer.
Strong Python software engineering foundation.
Hands-on Spark experience in production (PySpark essential).
Real-world experience using Databricks for data pipelines (Spark depth matters most).
Experience with large-scale or parallel data processing.
Ownership of cloud infrastructure (AWS and/or Azure).
Comfortable operating with senior-level autonomy and responsibility.
Experience mentoring or supporting other engineers.
Nice-to-Have Experience
Experience working with time-series data.
Background in utilities, energy, or other data-heavy regulated industries.
Exposure to streaming technologies (Kafka, event-driven systems), though the role is primarily batch-focused (a brief streaming sketch follows below).
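For context on the streaming nice-to-have, the fragment below sketches a minimal Spark Structured Streaming read from Kafka. The broker address, topic name, and console sink are placeholders, and running it assumes the spark-sql-kafka connector is available on the cluster.

```python
# Illustrative sketch only: a minimal Structured Streaming read from Kafka.
# Broker and topic are placeholders; requires the spark-sql-kafka connector.
from pyspark.sql import SparkSession, functions as F

spark = SparkSession.builder.appName("kafka-ingest-sketch").getOrCreate()

# Subscribe to a hypothetical topic of raw events.
events = (
    spark.readStream.format("kafka")
         .option("kafka.bootstrap.servers", "broker:9092")
         .option("subscribe", "raw-events")
         .load()
)

# Kafka delivers key/value as binary; cast the value to a string for parsing downstream.
decoded = events.select(F.col("value").cast("string").alias("payload"))

# Console sink for demonstration; a production job would land data in cloud storage.
query = decoded.writeStream.format("console").outputMode("append").start()
query.awaitTermination()
```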