Sr Data Engineer - Remote - Manufacturing or Steel company exp required - 10+ Years only
Posted 5 days ago by SUNRAY INFORMATICS
Negotiable
Undetermined
Remote
Summary: The role of Sr Data Engineer requires extensive experience (10+ years) in the manufacturing or steel industry, focusing on building and optimizing ETL/ELT data pipelines. The position is remote with occasional travel and emphasizes proficiency in various technologies including Python, SQL, and cloud storage solutions. Candidates should also have experience with DevOps practices and API development.
Key Responsibilities:
- Building, maintaining, and optimizing cloud-agnostic ETL/ELT data pipelines using Python, Pandas, and PySpark, and orchestrating workflows with Apache Airflow and the Kedro framework.
- Advanced SQL/KQL query development and optimization across Oracle, MSSQL, and MySQL databases.
- Working with cloud object storage across providers and designing reliable, scalable data lake or Lakehouse solutions.
- Developing and consuming RESTful APIs for data services and integration.
- Automating tasks with Linux shell scripting.
- Applying DevOps practices, including CI/CD for data pipelines, with tools such as Git and Docker.
Key Skills:
- 10+ years of experience in data engineering, specifically in the manufacturing or steel industry.
- Strong experience with Python, Pandas, PySpark, and ETL/ELT processes.
- Advanced SQL/KQL skills for query development and optimization.
- Experience with cloud object storage solutions (e.g., ADLS, S3, GCS).
- Proficiency in developing RESTful APIs.
- Linux shell scripting for automation.
- Familiarity with DevOps practices and CI/CD tools.
Salary (Rate): undetermined
City: undetermined
Country: undetermined
Working Arrangements: remote
IR35 Status: undetermined
Seniority Level: undetermined
Industry: IT
Title: Sr Data Engineer with Manufacturing or Steel company exp required - 10+ Years only
Location: Remote (occasional travel required)
- Strong experience in building, maintaining, and optimizing cloud-agnostic ETL/ELT data pipelines using Python, Pandas, and PySpark, and orchestrating workflows with tools like Apache Airflow and the Kedro framework.
- Advanced SQL/KQL query development and optimization across Oracle, MSSQL, and MySQL databases (hosted on-premises or via PaaS offerings).
- Experience working with cloud object storage across providers (e.g., ADLS, S3, GCS) and designing reliable, scalable data lake or Lakehouse solutions.
- Developing and consuming RESTful APIs (FastAPI) for data services and integration.
- Proficiency in Linux shell scripting for automation.
- Experience with DevOps practices, including CI/CD and deployment for data pipelines, and use of tools such as Git and Docker.
Best Wishes
Naveen P
naveen at sunrayinformatics dot com