Sr Software Engineer

Sr Software Engineer

Posted Today by Talent Software Services, Inc

Negotiable
Undetermined
Remote
Remote

Summary: The role of SR Software Engineer focuses on bridging high-scale system architecture with advanced AI technologies. The engineer will be responsible for building and managing model and agent runtimes across hybrid-cloud environments, ensuring scalable and secure AI capabilities. Collaboration with cross-functional teams and driving operational excellence through robust monitoring solutions are key aspects of the position. The ideal candidate will possess deep systems expertise and a strong understanding of AI/ML lifecycles.

Key Responsibilities:

  • Bridge the gap between high-scale system architecture and frontier AI.
  • Build Model & Agent Runtimes to govern model and autonomous agent execution across a hybrid-cloud footprint.
  • Adopt a platform-first mindset to ensure scalable, secure, and governed AI capabilities.
  • Architect the future of runtimes by designing and implementing high-performance hosting environments for specialized models and autonomous agents.
  • Scale hybrid-cloud infrastructure by building multi-tenant platform capabilities for seamless agent and model execution across diverse cloud and on-prem environments.
  • Drive operational excellence through robust solutions for real-time monitoring, distributed tracing, and automated evaluation of agents & models in production.
  • Participate in rotational platform operational team to ensure the reliability and uptime of critical AI services.
  • Collaborate with cross-functional AI teams to streamline workflows and accelerate the delivery of AI-driven value.

Key Skills:

  • Deep systems expertise with proven experience building scalable back-end systems, focusing on containerization and orchestration (4 years of in-depth Kubernetes is a must).
  • AI/ML fluency with a strong understanding of the lifecycle of LLMs and autonomous agents, including inference optimization and RAG architectures.
  • Operational rigor with a track record of implementing observability (logging, metrics, tracing) in complex, distributed environments.
  • Problem-solving mindset with the ability to turn abstract AI research into production-grade, hardened platform features.
  • In-depth knowledge of software engineering with experience coding applications or services in a high-level programming language, preferably Python, and a basic knowledge of related fields.
  • Demonstrated problem-solving and time management skills.
  • Strong technical aptitude for designing and implementing software solutions.
  • Experience with modern application development frameworks.
  • Knowledge of professional software engineering practices and best practices for the full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations.
  • Deep hands-on technical expertise, excellent verbal and written communication skills.
  • Experience with Agile software development techniques.

Salary (Rate): £61.50 hourly

City: undetermined

Country: undetermined

Working Arrangements: remote

IR35 Status: undetermined

Seniority Level: undetermined

Industry: IT

Detailed Description From Employer:

Duties:

  • Bridge the gap between high-scale system architecture and frontier AI.
  • Build Model & Agent Runtimes to govern model and autonomous agent execution across a hybrid-cloud footprint.
  • Adopt a platform-first mindset to ensure scalable, secure, and governed AI capabilities.
  • Architect the future of runtimes by designing and implementing high-performance hosting environments for specialized models and autonomous agents.
  • Scale hybrid-cloud infrastructure by building multi-tenant platform capabilities for seamless agent and model execution across diverse cloud and on-prem environments.
  • Drive operational excellence through robust solutions for real-time monitoring, distributed tracing, and automated evaluation of agents & models in production.
  • Participate in rotational platform operational team to ensure the reliability and uptime of critical AI services.
  • Collaborate with cross-functional AI teams to streamline workflows and accelerate the delivery of AI-driven value.

What You’ll Bring:

  • Deep systems expertise with proven experience building scalable back-end systems, focusing on containerization and orchestration (4 years of in-depth Kubernetes is a must).
  • AI/ML fluency with a strong understanding of the lifecycle of LLMs and autonomous agents, including inference optimization and RAG architectures.
  • Operational rigor with a track record of implementing observability (logging, metrics, tracing) in complex, distributed environments.
  • Problem-solving mindset with the ability to turn abstract AI research into production-grade, hardened platform features.
  • In-depth knowledge of software engineering with experience coding applications or services in a high-level programming language, preferably Python, and a basic knowledge of related fields.
  • Demonstrated problem-solving and time management skills.
  • Strong technical aptitude for designing and implementing software solutions.
  • Experience with modern application development frameworks.
  • Knowledge of professional software engineering practices and best practices for the full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations.
  • Deep hands-on technical expertise, excellent verbal and written communication skills.
  • Experience with Agile software development techniques.

Preferred Qualifications:

  • Ability to use a wide variety of open-source technologies and cloud-based services.
  • Experience writing software for hybrid cloud (Google Cloud Platform, AWS, Azure).
  • Experience building high-performance, highly available, and scalable distributed systems.
  • Experience developing software for healthcare-related industries.