MongoDB
Database
StaffTechnicalProgramManager,SiteReliabilityEngineering
Neural analysis suggests this role is
optimal for Senior candidates.
“Staff Technical Program Manager, Site Reliability Engineering at MongoDB. Skills: Technical Program Management, Site Reliability Engineering (SRE), Platform Scaling, Production Reliability, Cross-functional Coordination. Drive Program Planning & Execution. Define program scope, milestones, and success criteria”
What You'll Achieve.
smoother launches; clearer roadmaps; stronger reliability metrics; an SRE organization that's better-equipped to deliver predictability at scale; deliver reliably at scale
Industry & Context.
Own hard, ambiguous problems end to end; Motivated by ownership of hard problems end to end
What They're Looking For.
Must Have
8+ years in technical program management, engineering management, or a comparable technical role partnering with software engineering teams, Proven track record leading large-scale, cross-team platform initiatives through ambiguity and change, knowledge of production change management, software development lifecycle, and reliability metrics (SLOs, SLIs), Skilled at shaping roadmaps and managing dependencies, Able to query and interpret metrics, logs, or other data sources to inform decisions and communicate risk, Excellent communicator—clear, concise, and calm—across engineers, cross-functional partners, and executives, Low-ego, highly collaborative, and motivated by ownership of hard problems end to end
Nice to Have
Hands-on or close-partner experience with Kubernetes, cloud networking, or observability stacks (metrics, logs, tracing, alerting), Prior experience working with or alongside SRE teams, Background in large-scale cloud infrastructure or platform engineering, Familiarity with MongoDB Atlas or other modern cloud database platforms
What You'll Do.
Drive Program Planning & Execution
Manage dependencies across platform teams
Strengthen Production Reliability
Lead change management and launch readiness programs
Partner with SREs and product teams to define and operationalize SLOs/SLIs
and capacity signals to drive prioritization and continuous improvement
Lead Cross-Functional Coordination
Align SRE with Security
and other engineering teams
Coordinate cross-team incident response
Ensure clear follow-through
Build trust as the go-to driver of complex
Build Scalable Systems & Processes
Design lightweight frameworks and communication patterns that help SRE deliver reliably at scale
How You'll Work.
Team & Collaboration
Partner with SRE leaders and engineers; Coordinate cross-functional efforts across US and EMEA teams; Align SRE with Security, Compliance, Cloud platform, and other engineering teams; Coordinate cross-team incident response; Build trust as the go-to driver of complex, multi-team efforts; Build together with SRE, engineering, Security, and Compliance to co-create solutions
Communication Scope
Excellent communicator—clear, concise, and calm—across engineers, cross-functional partners, and executives
Process & Methodology
Drive Program Planning & Execution, Define program scope, milestones, and success criteria, Manage dependencies across platform teams, Keep work clearly tracked in Jira, Deliver on time, Shaping roadmaps, Managing dependencies
Full Job Description
As a TPM for SRE, you will partner with SRE leaders and engineers to scale the platform that underpins all of MongoDB’s cloud products. You will drive program execution, strengthen production reliability practices, and coordinate cross-functional efforts across US and EMEA teams. Success in this role means smoother launches, clearer roadmaps, stronger reliability metrics and an SRE organization that's better-equipped to deliver predictability at scale. This role can be based out of our Dublin or Cork office or remotely in Ireland. What You'll Do Drive Program Planning & Execution – Define program scope, milestones, and success criteria with SRE engineers and leaders. Manage dependencies across platform teams, keep work clearly tracked in Jira, and deliver on time Strengthen Production Reliability – Lead change management and launch readiness programs. Partner with SREs and product teams to define and operationalize SLOs/SLIs, and use incident data, metrics, and capacity signals to drive prioritization and continuous improvement Lead Cross-Functional Coordination – Align SRE with Security, Compliance, Cloud platform, and other engineering teams. Coordinate cross-team incident response, ensure clear follow-through, and build trust as the go-to driver of complex, multi-team efforts Build Scalable Systems & Processes – Design lightweight frameworks and communication patterns that help SRE deliver reliably at scale. Work yourself out of the "hero" role by leaving teams better-equipped to execute independently Requirements 8+ years in technical program management, engineering management, or a comparable technical role partnering with software engineering teams Proven track record leading large-scale, cross-team platform initiatives through ambiguity and change Strong knowledge of production change management, software development lifecycle, and reliability metrics (SLOs, SLIs) Skilled at shaping roadmaps and managing dependencies Able to query and interpret metrics, logs,
Applying for this Staff Technical Program Manager, Site Reliability Engineering role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
ANONYMOUS · UNFILTERED
What do employees actually say about MongoDB?
Real rants from real employees. Read before you apply.