Summary: Mindrift is seeking a Python Data Scraping Engineer for the Tendem project, focusing on specialized data scraping workflows within a hybrid AI and human system. This part-time remote role requires technical expertise in web scraping and data processing, emphasizing critical thinking and quality control. The position offers flexibility for professionals with relevant experience to contribute to innovative AI projects. Candidates will utilize tools like Apify and OpenRouter to ensure accurate data extraction from complex websites.
Key Responsibilities:
- Own end-to-end data extraction workflows across complex websites, ensuring complete coverage, accuracy, and reliable delivery of structured datasets.
- Leverage internal tools (Apify, OpenRouter) alongside custom workflows to accelerate data collection, validation, and task execution while meeting defined requirements.
- Ensure reliable extraction from dynamic and interactive web sources, adapting approaches as needed to handle JavaScript-rendered content and changing site behavior.
- Enforce data quality standards through validation checks, cross-source consistency controls, adherence to formatting specifications, and systematic verification prior to delivery.
- Scale scraping operations for large datasets using efficient batching or parallelization, monitor failures, and maintain stability against minor site structure changes.
Key Skills:
- At least 3 years of relevant experience in data engineering, web scraping, automation, or software development.
- A Bachelor's or Master's degree in Engineering, Applied Mathematics, Computer Science, or a related technical field is a plus.
- Strong experience in Python web scraping (BeautifulSoup, Selenium or similar), including dynamic content (JS, AJAX, infinite scroll) and APIs via proxies.
- Proven ability to extract data from complex structures (hierarchies, archived pages, inconsistent HTML).
- Solid background in data cleaning, normalization, and validation, delivering structured datasets (CSV, JSON, Google Sheets).
- Hands-on experience with LLMs and AI frameworks to enhance automation and problem-solving.
- Strong attention to detail and commitment to data accuracy.
- Self-directed work ethic with ability to troubleshoot independently.
- A link to GitHub is a plus.
- English proficiency: Upper-intermediate (B2) or above.
Salary (Rate): undetermined
City: undetermined
Country: United Kingdom
Working Arrangements: remote
IR35 Status: undetermined
Seniority Level: undetermined
Industry: IT
hackajob is collaborating with Mindrift to connect them with exceptional professionals for this role. Mindrift is looking for highly skilled Python Data Scraping Engineers to join the Tendem project and drive specialized data scraping workflows within our hybrid AI + human system. In this role, which we call an AI Pilot at Mindrift, you'll collaborate with Tendem Agents that handle repetitive tasks, while you provide critical thinking, domain expertise, and quality control to deliver accurate and actionable results. This part-time remote opportunity is ideal for technical professionals with hands-on experience in web scraping, data extraction, and data processing.
What We Do
The Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe.
About The Role
This is a freelance role on the Tendem project. As a Python Data Scraping Engineer, you'll handle data scraping tasks that require technical precision in web extraction and processing, using the tools we provide (Apify and OpenRouter) alongside your own approaches.
Key Responsibilities
- Own end-to-end data extraction workflows across complex websites, ensuring complete coverage, accuracy, and reliable delivery of structured datasets.
- Leverage internal tools (Apify, OpenRouter) alongside custom workflows to accelerate data collection, validation, and task execution while meeting defined requirements.
- Ensure reliable extraction from dynamic and interactive web sources, adapting approaches as needed to handle JavaScript-rendered content and changing site behavior.
- Enforce data quality standards through validation checks, cross-source consistency controls, adherence to formatting specifications, and systematic verification prior to delivery.
- Scale scraping operations for large datasets using efficient batching or parallelization, monitor failures, and maintain stability against minor site structure changes.
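To make the batching, failure monitoring, and validation responsibilities above more concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not Mindrift's or Tendem's actual workflow: the field names in `REQUIRED_FIELDS`, the helper names, and the `fetch` callable (which would wrap whatever scraper or tool is used for a given site) are all hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed

# Hypothetical schema: the fields every delivered record must contain.
REQUIRED_FIELDS = ("url", "title", "price")


def validate(record):
    """Return a list of problems; an empty list means the record passed."""
    problems = [
        f"missing {field}"
        for field in REQUIRED_FIELDS
        if record.get(field) in (None, "")
    ]
    price = record.get("price")
    if price is not None and not isinstance(price, (int, float)):
        problems.append("price is not numeric")
    return problems


def batched(items, size):
    """Yield consecutive slices of at most `size` items."""
    for start in range(0, len(items), size):
        yield items[start:start + size]


def scrape_all(urls, fetch, batch_size=50, workers=8):
    """Run fetch(url) -> dict over urls in batches.

    Returns (valid, rejected, failed) so that nothing is silently dropped:
    rejected holds records that failed validation, failed holds URLs whose
    fetch raised an exception.
    """
    valid, rejected, failed = [], [], []
    with ThreadPoolExecutor(max_workers=workers) as pool:
        for batch in batched(urls, batch_size):
            futures = {pool.submit(fetch, url): url for url in batch}
            for future in as_completed(futures):
                url = futures[future]
                try:
                    record = future.result()
                except Exception as exc:  # record the failure, keep going
                    failed.append((url, repr(exc)))
                    continue
                problems = validate(record)
                if problems:
                    rejected.append((record, problems))
                else:
                    valid.append(record)
    return valid, rejected, failed
```

Keeping rejected records and failed URLs in separate lists is one way to support systematic verification prior to delivery: failures can be retried, and rejected records can be reviewed by hand.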
Compensation
On this project, contributors can earn up to $37 per hour equivalent, depending on their level and pace of contribution. Compensation varies across projects depending on scope, complexity, and required expertise, so other projects on the platform may offer different earning levels.
How To Get Started
Simply apply to this post, qualify, and get the chance to contribute to projects that match your technical skills, on your own schedule. From coding and automation to fine-tuning AI outputs, you’ll play a key role in advancing AI capabilities and real-world applications.
Requirements
- At least 3 years of relevant experience in data engineering, web scraping, automation, or software development (required).
- A Bachelor's or Master's degree in Engineering, Applied Mathematics, Computer Science, or a related technical field is a plus.
- Strong experience in Python web scraping (BeautifulSoup, Selenium or similar), including dynamic content (JS, AJAX, infinite scroll) and APIs via proxies.
- Proven ability to extract data from complex structures (hierarchies, archived pages, inconsistent HTML).
- Solid background in data cleaning, normalization, and validation, delivering structured datasets (CSV, JSON, Google Sheets).
- Hands-on experience with LLMs and AI frameworks to enhance automation and problem-solving.
- Strong attention to detail and commitment to data accuracy.
- Self-directed work ethic with ability to troubleshoot independently.
- A link to GitHub is a plus.
- English proficiency: Upper-intermediate (B2) or above (required).
Benefits
Why might this freelance opportunity be a great fit for you?
- Work fully remotely on your own schedule with just a laptop and a stable internet connection.
- Gain hands-on experience in a unique hybrid environment where human expertise and AI agents collaborate seamlessly, a distinctive skill set in a rapidly growing field.
- Participate in performance-based bonus programs that reward high-quality work and consistent delivery.