Site Reliability Engineering with Java and Agentic AI experience - W2 - Remote, must reside in EST or CST (Posted by SAM)

Site Reliability Engineering with Java and Agentic AI experience - W2 - Remote, must reside in EST or CST (Posted by SAM)

Posted Today by Global Force USA

Negotiable
Undetermined
Remote
Remote

Summary: The role is for a Site Reliability Engineer with a strong background in Java and experience in Agentic AI. Candidates should have over 7 years of relevant experience and be proficient in various technologies including Kubernetes and SQL. The position is remote, requiring candidates to reside in the EST or CST time zones.

Key Responsibilities:

  • Ensure the reliability and performance of production systems.
  • Automate infrastructure and operational workflows.
  • Implement and maintain CI/CD pipelines.
  • Monitor and optimize cloud infrastructure.
  • Apply SRE principles to improve system reliability.

Key Skills:

  • 7+ years of Site Reliability Engineering or Production Engineering experience.
  • Strong proficiency in Java and SRE.
  • Experience with Agentic AI.
  • SQL knowledge.
  • Familiarity with Kubernetes and Docker.
  • Hands-on experience with Azure cloud infrastructure.
  • Experience with CI/CD pipelines (GitHub Actions).
  • Knowledge of observability platforms, preferably Dynatrace.
  • Deep understanding of SRE principles (SLIs, SLOs, error budgets).

Salary (Rate): undetermined

City: undetermined

Country: undetermined

Working Arrangements: remote

IR35 Status: undetermined

Seniority Level: undetermined

Industry: IT

Detailed Description From Employer:

Required Qualifications

  • 7+ years of Site Reliability Engineering or Production Engineering experience
  • Must be strong in Java and SRE and have Agentic AI experience.
  • SQL
  • Kubernetes

Strong hands-on experience with:

  • Azure cloud infrastructure
  • Kubernetes and Docker
  • Java production systems
  • CI/CD pipelines (GitHub Actions)
  • Observability platforms (Dynatrace strongly preferred)
  • Demonstrated experience automating infrastructure and operational workflows
  • Deep understanding of SRE principles (SLIs, SLOs, error budgets)