Summary: The Data Engineer role involves designing, building, and maintaining data pipelines for large-scale distributed systems, as well as developing backend services for data-intensive applications. The position requires collaboration with engineering teams to deploy scalable services and optimize SQL queries. Candidates should have a strong background in infrastructure and software engineering, particularly in data engineering. The role is fully remote, allowing for independent work while maintaining communication with stakeholders.
Key Responsibilities:
- Design, build, and maintain robust data pipelines for large-scale, distributed systems
- Develop and support backend services for high-throughput, data-intensive applications
- Optimize and manage SQL queries across both traditional RDBMS and modern lakehouse architectures
- Collaborate with engineering teams to deploy scalable services using Docker and CI/CD pipelines
- Monitor, troubleshoot, and improve data workflows to ensure reliability and performance
- Contribute to infrastructure automation and deployment processes
- Work independently to solve technical challenges while maintaining clear communication with stakeholders
Key Skills:
- 3+ years of experience in infrastructure and software engineering with exposure to data engineering
- Strong proficiency in SQL across relational databases and lakehouse environments
- Experience building and supporting backend services in distributed systems
- Hands-on experience with CI/CD pipelines (e.g., GitHub, Jenkins) and containerization (Docker)
- Proficiency in Python, C#, Kafka, shell scripting, and Linux/Unix systems
- Proven ability to work independently and proactively in a remote environment
Salary (Rate): undetermined
City: undetermined
Country: undetermined
Working Arrangements: remote
IR35 Status: undetermined
Seniority Level: undetermined
Industry: IT
Preferred Qualifications:
- Experience with Java
- Familiarity with cloud storage solutions (e.g., blob/object storage)
- Experience with infrastructure-as-code tools such as Terraform
- Exposure to Kubernetes for container orchestration
- Experience with workflow orchestration tools (e.g., Airflow, Cloud Composer)
- Familiarity with Apache Beam or similar data processing frameworks