Data Network Engineer - SRE, Telemetry, Observability, Monitoring
Posted 4 days ago by La Fosse Associates Limited
£600 Per day
City of London, Greater London, England, UK
Summary: The Data Network Engineer role focuses on leveraging telemetry, observability, monitoring, and performance within a high availability Network Infrastructure Site Reliability Engineering environment. The position requires collaboration across functions to embed observability into software development processes and to design and implement telemetry pipelines. Candidates should have experience with various observability tools and a strong understanding of cloud-native architectures. The ultimate goal is to enhance the value of observability tooling and develop actionable insights from telemetry data.
Key Responsibilities:
- Collaborate cross-functionally to ensure observability is embedded into the SDLC & CI/CD pipelines.
- Design and implement telemetry pipelines for metrics, logs, traces, and events.
- Develop observability standards, NMS tooling, dashboards, alerting frameworks, and SLOs.
- Integrate and optimize observability tools such as OpenTelemetry, Prometheus, Grafana, Splunk, and Elastic.
Key Skills:
- Experience within Network/Platform Observability, Networks SRE, or Platform Engineering roles in complex, distributed environments.
- Strong expertise with telemetry tools such as OpenTelemetry, Prometheus, Grafana, Splunk, Elastic, Loki, Jaeger, or similar.
- Proficiency in at least one programming language (e.g., Python, Go, Java) and infrastructure-as-code tools (e.g., Terraform, Helm).
- Deep understanding of cloud-native architectures (Kubernetes, microservices, service meshes).
Salary (Rate): £600 per day
City: City of London
Country: UK
Working Arrangements: undetermined
IR35 Status: outside IR35
Seniority Level: undetermined
Industry: IT
Data Network Engineer - SRE, Telemetry, Observability, Monitoring & Performance
Seeking a Network Engineer with experience of Telemetry, Observability, Monitoring & Performance, ideally within a high availability Network Infrastructure Site Reliability Engineering environment. The network strategy is highly focused on Next-Gen, Software Defined Networking, and in this role you will work at the intersection of software engineering, Networks SRE, and platform operations & engineering, with the ultimate aim of developing actionable insights from telemetry data and enhancing the value of observability tooling.
Previous experience might include:
- Collaborating cross-functionally to ensure observability is embedded into the SDLC & CI/CD pipelines.
- Designing & implementing telemetry pipelines for metrics, logs, traces, and events.
- Developing observability standards, NMS tooling, dashboards, alerting frameworks, and SLOs.
- Integrating & optimising observability tools such as OpenTelemetry, Prometheus, Grafana, Splunk & Elastic.
This role will require:
- Previous experience within Network/Platform Observability, Networks SRE, or Platform Engineering roles in complex, distributed environments.
- Strong expertise with telemetry tools such as OpenTelemetry, Prometheus, Grafana, Splunk, Elastic, Loki, Jaeger, or similar.
- Proficiency in at least one programming language (eg, Python, Go, Java) and infrastructure-as-code tools (eg, Terraform, Helm).
- Deep understanding of cloud-native architectures (Kubernetes, microservices, service meshes).
Highly desired:
- Industry experience in areas such as Media/Streaming, High Frequency Trading (eg, Investment Banking), Online Gaming, Hyperscalers, or High Availability, Low Latency Network Infrastructure.