Negotiable
Undetermined
Remote
Remote
Summary: The Application Performance Monitoring (APM) Engineer role focuses on leveraging expertise in Dynatrace and observability to design, implement, and manage observability solutions that ensure optimal application performance and system reliability. The position requires proactive issue detection across complex distributed environments and collaboration with various teams to resolve performance issues. The ideal candidate will have a strong background in performance monitoring and a deep understanding of observability principles. This is a contract position with a duration of 12 months, potentially extendable.
Key Responsibilities:
- Design, implement, and maintain end-to-end observability solutions using Dynatrace.
- Monitor application, infrastructure, cloud, and user experience metrics across production and non-production environments.
- Configure and optimize Dynatrace OneAgent deployments, management zones, tagging strategies, and alerting policies.
- Develop custom dashboards, reports, and monitoring solutions for business and technical stakeholders.
Utilize Dynatrace Query Language (DQL) to analyze logs, metrics, traces, and business events. - Perform root cause analysis using distributed tracing, service flow analysis, and AI-powered Davis insights.
- Create proactive alerting, anomaly detection, and automated incident response workflows.
Collaborate with development, DevOps, SRE, infrastructure, and support teams to resolve performance issues. - Establish observability standards, best practices, and governance frameworks.
Support application performance tuning and capacity planning initiatives. - Analyze service-level indicators (SLIs), service-level objectives (SLOs), and system health metrics.
Create executive and operational reports highlighting application performance trends and system reliability. - Participate in production support, incident management, and post-incident reviews.
Key Skills:
- Bachelor's degree in Computer Science, Information Technology, Engineering, or related field.
- 5+ years of experience in Application Performance Monitoring, Observability, or Site Reliability Engineering.
- Hands-on experience with Dynatrace platform administration and monitoring.
- Strong expertise in Dynatrace Query Language (DQL).
- Experience with distributed tracing, log analytics, and metrics analysis.
- Knowledge of modern observability principles including Metrics, Logs, Traces, and Events.
- Experience monitoring cloud-native and microservices-based applications.
- Strong troubleshooting and performance analysis skills.
- Understanding of networking fundamentals, web technologies, and APIs.
- Experience working with Agile and DevOps methodologies.
Salary (Rate): £52.50 hourly
City: undetermined
Country: undetermined
Working Arrangements: remote
IR35 Status: undetermined
Seniority Level: undetermined
Industry: IT
Job Title: Application Performance Monitoring (APM) Engineer Dynatrace & Observability Location: Remote Employment Type: Contract W2 Only Duration: 12 Months with Possible extensions
Job Summary
We are seeking an experienced Application Performance Monitoring (APM) Engineer with strong expertise in Dynatrace, Observability, DQL, and Performance Monitoring. The ideal candidate will be responsible for designing, implementing, and managing enterprise-wide observability solutions, ensuring optimal application performance, system reliability, and proactive issue detection across complex distributed environments.
Key Responsibilities
- Design, implement, and maintain end-to-end observability solutions using Dynatrace.
- Monitor application, infrastructure, cloud, and user experience metrics across production and non-production environments.
- Configure and optimize Dynatrace OneAgent deployments, management zones, tagging strategies, and alerting policies.
- Develop custom dashboards, reports, and monitoring solutions for business and technical stakeholders.
Utilize Dynatrace Query Language (DQL) to analyze logs, metrics, traces, and business events. - Perform root cause analysis using distributed tracing, service flow analysis, and AI-powered Davis insights.
- Create proactive alerting, anomaly detection, and automated incident response workflows.
Collaborate with development, DevOps, SRE, infrastructure, and support teams to resolve performance issues. - Establish observability standards, best practices, and governance frameworks.
Support application performance tuning and capacity planning initiatives. - Analyze service-level indicators (SLIs), service-level objectives (SLOs), and system health metrics.
Create executive and operational reports highlighting application performance trends and system reliability. - Participate in production support, incident management, and post-incident reviews.
Required Qualifications:
- Bachelor's degree in Computer Science, Information Technology, Engineering, or related field.
- 5+ years of experience in Application Performance Monitoring, Observability, or Site Reliability Engineering.
- Hands-on experience with Dynatrace platform administration and monitoring.
- Strong expertise in Dynatrace Query Language (DQL).
- Experience with distributed tracing, log analytics, and metrics analysis.
- Knowledge of modern observability principles including Metrics, Logs, Traces, and Events.
- Experience monitoring cloud-native and microservices-based applications.
- Strong troubleshooting and performance analysis skills.
- Understanding of networking fundamentals, web technologies, and APIs.
- Experience working with Agile and DevOps methodologies.