Summary: The Senior Software Engineer role focuses on bridging high-scale system architecture with advanced AI technologies. The engineer will be responsible for building and managing model and agent runtimes across hybrid-cloud environments, ensuring scalable and secure AI capabilities. Collaborating with cross-functional teams and driving operational excellence through robust monitoring solutions are key aspects of the position. The ideal candidate will possess deep systems expertise and a strong understanding of AI/ML lifecycles.
Key Responsibilities:
- Bridge the gap between high-scale system architecture and frontier AI.
- Build Model & Agent Runtimes to govern model and autonomous agent execution across a hybrid-cloud footprint.
- Adopt a platform-first mindset to ensure scalable, secure, and governed AI capabilities.
- Architect the future of runtimes by designing and implementing high-performance hosting environments for specialized models and autonomous agents.
- Scale hybrid-cloud infrastructure by building multi-tenant platform capabilities for seamless agent and model execution across diverse cloud and on-prem environments.
- Drive operational excellence through robust solutions for real-time monitoring, distributed tracing, and automated evaluation of agents & models in production.
- Participate in a rotational platform operations team to ensure the reliability and uptime of critical AI services.
- Collaborate with cross-functional AI teams to streamline workflows and accelerate the delivery of AI-driven value.
Key Skills:
- Deep systems expertise with proven experience building scalable back-end systems, focusing on containerization and orchestration (4 years of in-depth Kubernetes experience is a must).
- AI/ML fluency with a strong understanding of the lifecycle of LLMs and autonomous agents, including inference optimization and RAG architectures.
- Operational rigor with a track record of implementing observability (logging, metrics, tracing) in complex, distributed environments.
- Problem-solving mindset with the ability to turn abstract AI research into production-grade, hardened platform features.
- In-depth knowledge of software engineering with experience coding applications or services in a high-level programming language, preferably Python, and a basic knowledge of related fields.
- Demonstrated problem-solving and time management skills.
- Strong technical aptitude for designing and implementing software solutions.
- Experience with modern application development frameworks.
- Knowledge of professional software engineering practices across the full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations.
- Deep hands-on technical expertise and excellent verbal and written communication skills.
- Experience with Agile software development techniques.
Salary (Rate): £61.50 hourly
City: undetermined
Country: undetermined
Working Arrangements: remote
IR35 Status: undetermined
Seniority Level: undetermined
Industry: IT
Preferred Qualifications:
- Ability to use a wide variety of open-source technologies and cloud-based services.
- Experience writing software for hybrid cloud (Google Cloud Platform, AWS, Azure).
- Experience building high-performance, highly available, and scalable distributed systems.
- Experience developing software for healthcare-related industries.