Negotiable
Undetermined
Remote
United Kingdom
Summary: The Computational Data Science Problem Creation Expert role involves designing and verifying computational data science problems that simulate real-world analytical workflows. This contract position is remote and requires collaboration with AI labs and technology companies to create high-quality datasets for training AI models. Candidates will leverage their extensive data science experience to ensure problems are deterministic and reproducible. The role emphasizes rigor, correctness, and clarity in problem creation and documentation.
Key Responsibilities:
- Design realistic data science problems grounded in business scenarios
- Cover the full data science lifecycle, including data ingestion & cleaning, exploratory analysis, feature engineering, statistical analysis & modeling, validation, and interpretation
- Implement verified Python solutions
- Ensure all problems are fully deterministic and reproducible
- Clearly document business context, data inputs & schemas, analytical logic, and exact expected outputs
Key Skills:
- MSc or PhD in Data Science, Statistics, Mathematics, Computer Science, or a related field
- 5+ years of professional data science experience
- Ownership of end-to-end data science pipelines
- Strong Python skills (pandas, numpy, scipy, scikit-learn, statsmodels)
- Solid grounding in statistics and modeling
- Ability to define explicit, verifiable outputs
- Cross-industry experience (finance, healthcare, telecom, government, e-commerce) - nice to have
- Teaching, mentoring, publications, or case studies - nice to have
- Consulting or research-oriented industry background - nice to have
Salary (Rate): £50.00 hourly
City: undetermined
Country: United Kingdom
Working Arrangements: remote
IR35 Status: undetermined
Seniority Level: undetermined
Industry: IT
Computational Data Science Problem Creation Expert (Contract, Remote)
Type: Independent Contractor
Location: Remote
Schedule: Flexible, project-based
hackajob partners with innovative companies to connect top-tier technical talent with high-impact, cutting-edge projects . For this role, we are working closely with a platform that collaborates with leading AI labs and technology companies to build high-quality datasets used to train and evaluate advanced AI systems. Together, we are onboarding senior data scientists to design fully deterministic, end-to-end data science problems that reflect how real-world data science is done.
About the Role
This is a problem creation and verification role , not execution or production work. You will design computational, real-world data science problems that simulate complete analytical workflows — from raw data to validated, reproducible outputs. Your work will directly contribute to training and evaluating AI models on professional data science reasoning.
What You’ll Do
You will:
- Design realistic data science problems grounded in business scenarios
- Cover the full data science lifecycle , including:
- Data ingestion & cleaning
- Exploratory analysis
- Feature engineering
- Statistical analysis & modeling
- Validation and interpretation
- Implement verified Python solutions
- Ensure all problems are fully deterministic and reproducible
- Clearly document:
- Business context
- Data inputs & schemas
- Analytical logic
- Exact expected outputs
What We’re Looking For
Required:
- MSc or PhD in Data Science, Statistics, Mathematics, Computer Science, or a related field
- 5+ years of professional data science experience
- Ownership of end-to-end data science pipelines
- Strong Python skills (pandas, numpy, scipy, scikit-learn, statsmodels)
- Solid grounding in statistics and modeling
- Ability to define explicit, verifiable outputs
Nice to Have:
- Cross-industry experience (finance, healthcare, telecom, government, e-commerce)
- Teaching, mentoring, publications, or case studies
- Consulting or research-oriented industry background
This role is focused on rigor, correctness, and clarity .