Data Integration Engineer with Gen AI

Data Integration Engineer with Gen AI

Posted 2 weeks ago by Recruitment.ai

Negotiable
Undetermined
Remote
Remote

Summary: The Data Integration Engineer role focuses on developing AI infrastructure for healthcare by transforming complex data into reliable pipelines. The position requires expertise in ETL processes and collaboration with a Data Integration Manager to meet partner requirements. Candidates should have a strong background in data engineering, particularly within healthcare data systems. This is a contract position that allows for remote work.

Key Responsibilities:

  • Design and build ETL pipelines (PySpark, SQL, Azure) to process data from Epic, LIMS, PACS, and other healthcare systems
  • Develop data quality validation frameworks and troubleshoot schema, transformation, and performance issues
  • Build reusable integration components that speed up future implementations
  • Collaborate with a Data Integration Manager to turn partner requirements into technical solutions

Key Skills:

  • 5+ years in data engineering or analytics with strong ETL experience
  • Proficiency in PySpark and advanced SQL
  • Hands-on experience with healthcare data (EHR, claims, clinical, lab)
  • Familiarity with Azure Databricks, Data Factory, or similar cloud platforms
  • Knowledge of healthcare standards (FHIR, HL7v2, LOINC, ICD-10) is a plus
  • Epic Clarity experience is a strong plus

Salary (Rate): undetermined

City: undetermined

Country: undetermined

Working Arrangements: remote

IR35 Status: undetermined

Seniority Level: undetermined

Industry: IT

Detailed Description From Employer:

Position - Data Integration Engineer

Location - Remote Role

Type - Contract Only on W2


We''re building AI infrastructure for healthcare — and we need a data engineer who loves turning messy, complex healthcare data into reliable, production-grade pipelines.


What You''ll Do

  • Design and build ETL pipelines (PySpark, SQL, Azure) to process data from Epic, LIMS, PACS, and other healthcare systems
  • Develop data quality validation frameworks and troubleshoot schema, transformation, and performance issues
  • Build reusable integration components that speed up future implementations
  • Collaborate with a Data Integration Manager to turn partner requirements into technical solutions

What You Bring

  • 5+ years in data engineering or analytics with strong ETL experience
  • Proficiency in PySpark and advanced SQL
  • Hands-on experience with healthcare data (EHR, claims, clinical, lab)
  • Familiarity with Azure Databricks, Data Factory, or similar cloud platforms
  • Knowledge of healthcare standards (FHIR, HL7v2, LOINC, ICD-10) is a plus
  • Epic Clarity experience is a strong plus

Tech Stack Azure Databricks · PySpark · Data Factory · GitHub Actions · Terraform · FHIR · Python