Negotiable
Outside
Remote
USA
Summary: The Data Engineer role focuses on ETL/ELT and database management, requiring candidates to have 2 to 8 years of experience in data science and engineering. The position is fully remote, with a preference for candidates comfortable working in the PST time zone. Responsibilities include designing data pipelines, collaborating with teams, and ensuring data quality and security. The role offers potential for conversion to full-time employment after a 12-month contract period.
Key Responsibilities:
- Design and develop robust ETL/ELT pipelines for structured and unstructured data.
- Hands-on database administrator skills to manage or identify root cause and resolve issues stemming from the management of structured and/or unstructured data sources.
- Automate data ingestion, transformation, and validation processes to improve efficiency and reduce manual effort.
- Collaborate with biostatistics, data scientists, analysts, and stakeholders to support AI/ML model development and deployment.
- Build and optimize data systems for performance, scalability, and reliability, and ensure data quality, integrity, and security across platforms.
- Develop and maintain documentation for data architecture, processes, and workflows.
- Monitor and troubleshoot data pipeline issues and implement proactive solutions.
- Experienced in Rest API connection.
Key Skills:
- Minimum of a Bachelor's Degree (required); Advanced Degree preferred in fields such as Computer Science, Data Science, or related quantitative field.
- 2 to 8 years of experience in data science, data pipelining, data mining, data migration, and data engineering.
- Proficient with SQL, R, Python, and ETL solutions.
- Advanced SQL knowledge and experience with relational databases.
- Experience with data visualization software such as Power BI, Python, RShiny.
- Certification preferred in SAS or SQL.
- Hands-on experience with data pipeline and workflow management tools.
Salary (Rate): undetermined
City: undetermined
Country: USA
Working Arrangements: remote
IR35 Status: outside IR35
Seniority Level: undetermined
Industry: IT
DivIHN (pronounced divine ) is a CMMI ML3-certified Technology and Talent solutions firm. Driven by a unique Purpose, Culture, and Value Delivery Model, we enable meaningful connections between talented professionals and forward-thinking organizations. Since our formation in 2002, organizations across commercial and public sectors have been trusting us to help build their teams with exceptional temporary and permanent talent.
Visit us at to learn more and view our open positions.
Meghna at
Location: Remote
Duration: 12 Months with possibility of extension
- Design and develop robust ETL/ELT pipelines for structured and unstructured data.
- Hands-on database administrator skills to manage or identify root cause and resolve issues stemming from the management of structured and/or unstructured data sources.
- Automate data ingestion, transformation, and validation processes to improve efficiency and reduce manual effort.
- Collaborate with biostatistics, data scientists, analysts, and stakeholders to support AI/ML model development and deployment.
- Build and optimize data systems for performance, scalability, and reliability, and ensure data quality, integrity, and security across platforms.
- Develop and maintain documentation for data architecture, processes, and workflows.
- Monitor and troubleshoot data pipeline issues and implement proactive solutions.
- Experienced in Rest API connection.
- Minimum of a Bachelors' Degree (required)
- Advanced Degree preferred
- Desired fields of study include Computer Science, Computer Engineering, Data Science, Statistics, Informatics, Information Systems or related quantitative field.
- Certification is preferred in SAS or SAS-based, or SQL
- Proficient with SQL, R, Python and some forms of ETL solutions;
- Advanced SQL knowledge and experience with relational databases, query authoring (SQL) as well as working familiarity with a variety of databases;
- Experience with contributing to the building and optimizing data pipelines, architectures, and data sets;
- software/tools:
- SAS include SAS base/graph/listing
- Relational SQL
- Web Rest API connection
- Data visualization software such as Power BI, Python, RShiny
- Data pipeline and workflow management tools
DivIHN is an equal opportunity employer. DivIHN does not and shall not discriminate against any employee or qualified applicant on the basis of race, color, religion (creed), gender, gender expression, age, national origin (ancestry), disability, marital status, sexual orientation, or military status.