Negotiable
Inside
Onsite
Warwick -United Kingdom
Summary: The Application and Monitoring SME role is focused on defining and implementing monitoring strategies for enterprise applications and infrastructure at the Warwick office. The position requires strong expertise in monitoring tools and collaboration with various teams to enhance application performance and incident management. The candidate must possess SC Clearance and be able to travel to the Warwick office for training. Key responsibilities include establishing monitoring frameworks, conducting root cause analysis, and providing training to IT staff.
Key Responsibilities:
- Define and implement monitoring frameworks for enterprise applications, Middleware, and infrastructure.
- Deploy and configure monitoring tools (eg, Dynatrace, AppDynamics, Splunk, Prometheus, Grafana, Datadog).
- Establish KPIs, dashboards, and alerting mechanisms for proactive issue detection.
- Analyze application performance, identify bottlenecks, and recommend optimization strategies.
- Collaborate with development and operations teams to ensure monitoring is integrated into CI/CD pipelines.
- Conduct root cause analysis for recurring incidents and propose long-term solutions.
- Act as SME during major incidents, providing technical expertise and rapid resolution guidance.
- Develop automated monitoring scripts and self-healing mechanisms to reduce downtime.
- Document monitoring processes, playbooks, and best practices.
- Partner with business units to understand application requirements and translate them into monitoring solutions.
- Provide training and mentorship to IT staff on monitoring tools and methodologies.
- Communicate performance insights and recommendations to leadership in a clear, actionable manner.
Key Skills:
- Strong expertise in at least two enterprise monitoring platforms (eg, Dynatrace, AppDynamics, Splunk, Datadog).
- Solid understanding of application architectures (microservices, cloud-native, APIs).
- Hands-on experience with Scripting languages (Python, PowerShell, Bash) for automation.
- Knowledge of cloud platforms (AWS, Azure, GCP) and container orchestration (Kubernetes, Docker).
- Familiarity with ITIL processes (Incident, Problem, Change Management).
- Excellent analytical, troubleshooting, and communication skills.
Salary (Rate): £268/day
City: Warwick
Country: United Kingdom
Working Arrangements: on-site
IR35 Status: inside IR35
Seniority Level: undetermined
Industry: IT
Role Title: Application and Monitoring SME
Location: Warwick Technology Park, Gallows Hill, Warwick CV34 6DA - Full Office
Duration: 31/12/2026
Role Description:
Key Requirements:
Working from Warwick office, ideally within 1 hours travel
Wokingham office considered but needs to travel to Warwick office for training
SC Clearance
Primary skills - L1 monitoring/observability, ITIL process, incident, MIM, etc
Secondary skills - L1 troubleshooting, work with multi-team and engage with L2 support engineers and partners
Description:
- Monitoring Strategy & Implementation
- Define and implement monitoring frameworks for enterprise applications, Middleware, and infrastructure.
- Deploy and configure monitoring tools (eg, Dynatrace, AppDynamics, Splunk, Prometheus, Grafana, Datadog).
- Establish KPIs, dashboards, and alerting mechanisms for proactive issue detection.
- Application Performance Management
- Analyze application performance, identify bottlenecks, and recommend optimization strategies.
- Collaborate with development and operations teams to ensure monitoring is integrated into CI/CD pipelines.
- Conduct root cause analysis for recurring incidents and propose long-term solutions.
- Incident & Problem Management
- Act as SME during major incidents, providing technical expertise and rapid resolution guidance.
- Develop automated monitoring scripts and self-healing mechanisms to reduce downtime.
- Document monitoring processes, playbooks, and best practices.
- Stakeholder Collaboration
- Partner with business units to understand application requirements and translate them into monitoring solutions.
- Provide training and mentorship to IT staff on monitoring tools and methodologies.
- Communicate performance insights and recommendations to leadership in a clear, actionable manner.
Required Skills & Experience
- Strong expertise in at least two enterprise monitoring platforms (eg, Dynatrace, AppDynamics, Splunk, Datadog).
- Solid understanding of application architectures (microservices, cloud-native, APIs).
- Hands-on experience with Scripting languages (Python, PowerShell, Bash) for automation.
- Knowledge of cloud platforms (AWS, Azure, GCP) and container orchestration (Kubernetes, Docker).
- Familiarity with ITIL processes (Incident, Problem, Change Management).
- Excellent analytical, troubleshooting, and communication skills.
Preferred Qualifications
- Certifications in monitoring tools (eg, Dynatrace Certified Professional, Splunk Certified Admin).
- Experience with observability practices (metrics, logs, traces).
- Exposure to DevOps practices and CI/CD pipelines.
- Strong stakeholder management and ability to influence technical decisions