Negotiable
Outside
Remote
USA
Summary: The Data Scientist role requires a professional with over 3 years of experience in coordinating and prioritizing tasks for engineering teams, particularly in the realm of GenAI architectures and related technologies. The position emphasizes proficiency in programming languages and cloud-based AI solutions, alongside a strong capability in API design and data workflows. The ideal candidate will also have experience in leading GenAI deployments and building data visualization frameworks.
Key Responsibilities:
- Coordinate, plan, and prioritize tasks for medium-to-large engineering teams.
- Utilize knowledge of GenAI architectures, RAG pipelines, vector databases, and prompt engineering.
- Develop and maintain proficiency in Python, JavaScript, and TypeScript.
- Implement and optimize retrieval systems using FAISS, Pinecone, and Milvus.
- Work with cloud-based LLM providers like OpenAI and AWS Bedrock.
- Design and integrate APIs for AI/ML systems.
- Build ETL workflows and data cleaning pipelines for production.
- Query and optimize MongoDB databases.
- Design reporting and dashboard solutions using tools like Plotly and PowerBI.
- Utilize Git, Docker, and CI/CD pipelines effectively.
- Debug and optimize performance of systems.
- Drive high-quality initiatives to completion in fast-paced environments.
Key Skills:
- Proven experience in coordinating engineering teams.
- Strong knowledge of GenAI architectures and RAG pipelines.
- Proficiency in Python, JavaScript, and TypeScript.
- Familiarity with FAISS, Pinecone, and Milvus.
- Understanding of cloud-based LLM providers.
- Experience in API design and integration.
- Experience in building ETL workflows.
- Practical experience with MongoDB.
- Experience designing reporting solutions.
- Skilled with Git, Docker, and CI/CD pipelines.
- Strong debugging and optimization skills.
- Ability to drive initiatives in fast-paced environments.
Salary (Rate): undetermined
City: undetermined
Country: USA
Working Arrangements: remote
IR35 Status: outside IR35
Seniority Level: undetermined
Industry: IT
3+ years of experience preferred.
Required Skills & Experience
- Proven experience in coordinating, planning, and prioritizing tasks for medium-to-large engineering teams.
- Strong knowledge of GenAI architectures, RAG pipelines, vector databases, and prompt engineering.
- Proficiency in Python, JavaScript, and TypeScript.
- Familiarity with FAISS, Pinecone, Milvus, and retrieval optimization.
- Understanding of cloud-based LLM providers like OpenAI, Anthropic, Google Vertex AI, and AWS Bedrock.
- Experience in API design and integration architecture for AI/ML systems
- Experience in building ETL workflows and data cleaning pipelines for production.
- Practical experience with MongoDB, including querying and optimization.
- Experience designing reporting and dashboard solutions with Plotly, Dash, PowerBI, or custom tools.
- Skilled with Git, Docker, and CI/CD pipelines (GitHub Actions preferred).
- Strong debugging and performance optimization skills.
- Ability to drive high-quality initiatives to completion in fast-paced environments.
Preferred Qualifications
- Experience leading production-level GenAI or RAG deployments.
- Experience building scalable data visualization frameworks and dashboards.
- Proficiency with cloud platforms (Azure, AWS, or Google Cloud Platform) for data and AI.
- Background in defining and implementing AI system metrics and KPIs
- Experience in vendor evaluation and management for AI services
- Strong ability to collaborate and mentor across engineering, product, and data science teams.