Datadog
Technology
SeniorSoftwareEngineer-ObservabilityVisibility
Neural analysis suggests this role is
optimal for Senior candidates.
“Senior Software Engineer - Observability Visibility at Datadog. Skills: Observability, Resilience, Software engineering, Site reliability engineering. Define observability baselines. Evolve observability baselines”
What You'll Achieve.
Reduce operational risk; Drive adoption; Drive consistency; Improve engineering effectiveness; Improve service resilience
Industry & Context.
Root cause analysis; Troubleshooting
What They're Looking For.
Must Have
5+ years experience software engineering, 5+ years experience site reliability engineering, Hands-on experience observability, Hands-on experience resilience, Expertise identifying failure modes, Expertise analyzing failure modes, Expertise mitigating failure modes, Programming skills in Go, Programming skills in Python, Design reliable systems, Build reliable systems, Design maintainable systems, Build maintainable systems, Navigating complex technical challenges, Proposing efficient solutions, Proposing scalable solutions, Proposing easy-to-adopt solutions, Experience delivering AI-enabled software, Articulate when AI is appropriate, Communication skills, Collaboration skills, Mentorship skills
Nice to Have
Experience influencing technical direction
What You'll Do.
Define observability baselines
Evolve observability baselines
Define resilience baselines
Evolve resilience baselines
Measure service compliance
Assess remediation complexity
Drive sustainable solutions
Design scalable observability capabilities
Deliver scalable observability capabilities
Design scalable reliability capabilities
Deliver scalable reliability capabilities
Leverage AI-driven solutions
Enable service owners
Partner with platform teams
Partner with SRE teams
Partner with product teams
Partner with engineering teams
Provide technical leadership
Accelerate team growth
Conduct design reviews
Collaborative problem-solving
Promote operational excellence
How You'll Work.
Team & Collaboration
SRE teams; Platform teams; Product teams; Engineering teams; Cross-functional teams
Communication Scope
Technical direction
Full Job Description
The Observability Visibility SRE Team is part of the Observability and Resilience Enablement group within the SRE/Security organization. Observability and Resilience Enablement focuses on closing the loop between how Datadog engineers detect and respond to issues and incidents and how those learnings translate into measurable risk reduction and lower customer impact. The Observability Visibility team carries the organization's 100% visibility priority, defining observability and reliability baselines and ensuring services consistently meet them by default through scalable, automated, and sustainable solutions. As a Senior Software Engineer on this team, you will help define, implement and evolve observability and resilience standards across Datadog's engineering organization. You will build systems, tooling, libraries, and automation that make observability and reliability the default experience for service owners, reducing operational risk while driving adoption and consistency. This role combines software engineering and site reliability engineering to drive measurable improvements in engineering effectiveness and service resilience. You will work closely with SRE, platform and product teams to identify gaps, deliver scalable solutions and ensure long-term coverage and compliance with established standards. At Datadog, we place value in our office culture - the relationships and collaboration it builds and the creativity it brings to the table. We operate as a hybrid workplace to ensure our Datadogs can create a work-life harmony that best fits them. What You'll Do: Define and evolve observability and resilience baselines, ensuring alignment with measurable risk reduction goals across Datadog services. Measure service compliance against established standards, assess risk and remediation complexity and drive sustainable solutions to close identified gaps. Design and deliver scalable observability and reliability capabilities across the software development lifecycl
Applying for this Senior Software Engineer - Observability Visibility role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
ANONYMOUS · UNFILTERED
What do employees actually say about Datadog?
Real rants from real employees. Read before you apply.