Jabil
SiteReliabilityEngineer-ObservabilityPlatform
Neural analysis suggests this role is
optimal for Mid candidates.
“Site Reliability Engineer - Observability Platform at Jabil. Skills: Observability Platform, Grafana Tempo, OpenTelemetry, Site Reliability Engineering. Support deployment, configuration, scaling of Grafana Tempo. Integrate application services with OpenTelemetry”
Industry & Context.
Analyze distributed traces to identify latency bottlenecks and reliability risks
What They're Looking For.
Must Have
Bachelor’s degree in Computer Science, Engineering, or a related field, 3 years of experience with a Master's degree or 5 years of experience with a Bachelor's degree in Backend software development, 3 years of experience with a Master's degree or 5 years of experience with a Bachelor's degree in Application performance monitoring (APM), 3 years of experience with a Master's degree or 5 years of experience with a Bachelor's degree in Site reliability engineering or production systems support, Proficiency working in Linux environments, Using command-line tools, Programming or scripting experience, Foundational understanding of networking, Foundational understanding of distributed systems
Nice to Have
Experience with Kubernetes, Experience with containerized environments, Familiarity with observability tools such as Grafana, Familiarity with observability tools such as Prometheus, Familiarity with similar platforms, Exposure to OpenTelemetry, Exposure to distributed tracing systems, Experience with cloud platforms (AWS, Azure, or Google Cloud)
What You'll Do.
scaling of Grafana Tempo
Integrate application services with OpenTelemetry
Build and maintain Grafana dashboards
Evolve APM and observability platform architecture
Develop dashboards and telemetry pipelines
Monitor service health and performance
Analyze distributed traces
Support definition and monitoring of SLIs/SLOs
Contribute to operational reviews
Support infrastructure deployment and automation
Operate observability components in Kubernetes
Manage telemetry pipelines
Instrument services using distributed tracing
Support incident investigations
Document monitoring strategies
Document observability architecture
Contribute to shared reliability practices
How You'll Work.
Team & Collaboration
Partner closely with application engineers; Partner with application teams to instrument services; Contribute to shared reliability practices across engineering teams
Full Job Description
At Jabil (NYSE: JBL), we are proud to be a trusted partner for the world's top brands, offering comprehensive engineering, supply chain, and manufacturing solutions. With 60 years of experience across industries and a vast network of over 100 sites worldwide, Jabil combines global reach with local expertise to deliver both scalable and customized solutions. Our commitment extends beyond business success as we strive to build sustainable processes that minimize environmental impact and foster vibrant and diverse communities around the globe. # **Site Reliability Engineer (SRE)** ## **Based onsite in Lexington, KY** The Site Reliability Engineer (SRE) supports the development and operation of the organization's Application Performance Monitoring (APM) and Observability Platform. This role focuses on expanding the Grafana Tempo distributed tracing system into a comprehensive platform that delivers visibility into application performance, service dependencies, and overall system reliability. This position operates within the Cloud and Platform Engineering team and partners closely with application engineers to improve instrumentation, monitoring, and reliability practices across services. ## **Key Responsibilities** ### **Observability Platform Development** * Support the deployment, configuration, and scaling of Grafana Tempo for distributed tracing * Integrate application services with OpenTelemetry instrumentation * Build and maintain Grafana dashboards and visualizations to surface performance insights * Assist in the design and evolution of the APM and observability platform architecture ### **Reliability & Performance Monitoring** * Develop dashboards and telemetry pipelines to monitor service health and performance * Analyze distributed traces to identify latency bottlenecks and reliability risks * Support the definition and monitoring of Service Level Indicators (SLIs) and Service Level Objectives (SLOs) * Contribute to operational reviews and continuous reliabi
Applying for this Site Reliability Engineer - Observability Platform role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Workday
- Workday has a multi-step form — save your progress after every section.
- "Apply With LinkedIn" can fail or lose data; manual entry is more reliable.
- Watch for the "Submit for Review" final step — hitting "Save" alone does not submit.
- Job requisition numbers are useful when following up with HR by email.
ANONYMOUS · UNFILTERED
What do employees actually say about Jabil?
Real rants from real employees. Read before you apply.