Machine Learning (Multi Modal)

Machine Learning (Multi Modal)

Posted 2 weeks ago by microTECH Global LTD on Linkedin

Negotiable
Undetermined
Undetermined
Egham, England, United Kingdom

Summary: Join a leading AI research team focused on next-generation mobile technologies as a PhD-level intern or recent graduate. The role involves developing innovative audio-visual AI solutions and collaborating with top researchers to transform machine learning concepts into production-ready software. Candidates will engage in research and implementation of advanced methods while contributing to impactful publications. This position offers a unique opportunity to work on cutting-edge technology in a high-impact team environment.

Key Responsibilities:

  • Develop and prototype innovative solutions in multimodal on-device AI (audio + video).
  • Research and implement methods such as contrastive learning, model compression, or multimodal LLMs.
  • Tackle real-world challenges with efficient, scalable code using PyTorch or TensorFlow.
  • Contribute to research publications and internal reports.

Key Skills:

  • PhD student or recent graduate in ML/AI, Computer Science, Engineering, or a related field.
  • First-author publications in top AI/ML venues (CVPR, NeurIPS, ICML, ICLR, etc.).
  • Strong skills in Python and/or C/C++, and hands-on experience with modern ML frameworks.
  • Familiarity with Git and sound software engineering practices.
  • Excellent communication and problem-solving abilities.
  • Experience in emotion recognition, foundational face models, or deception detection (bonus).
  • Knowledge of multi-task learning, embedded AI, or distributed ML systems (bonus).
  • Contributions to open-source ML libraries (bonus).
  • Expertise in AI pipeline optimization and profiling (bonus).

Salary (Rate): undetermined

City: Egham

Country: United Kingdom

Working Arrangements: undetermined

IR35 Status: undetermined

Seniority Level: undetermined

Industry: IT