MongoDB

Database

Staff Technical Program Manager, Site Reliability Engineering

Dublin, Ireland; Cork, Ireland; Ireland Remote Friendly

Market Sentiment

HIGH DEMAND

Neural analysis suggests this role is optimal for Senior candidates.

The Brief

“Staff Technical Program Manager, Site Reliability Engineering at MongoDB. Skills: Technical Program Management, Site Reliability Engineering (SRE), Platform Scaling, Production Reliability, Cross-functional Coordination. Drive Program Planning & Execution. Define program scope, milestones, and success criteria”

What You'll Achieve.

Smoother launches; clearer roadmaps; stronger reliability metrics; an SRE organization that's better-equipped to deliver predictability at scale

Industry & Context.

Database

Problems you'll solve

Own hard, ambiguous problems end to end

What They're Looking For.

Must Have

8+ years in technical program management, engineering management, or a comparable technical role partnering with software engineering teams; Proven track record leading large-scale, cross-team platform initiatives through ambiguity and change; Strong knowledge of production change management, software development lifecycle, and reliability metrics (SLOs, SLIs); Skilled at shaping roadmaps and managing dependencies; Able to query and interpret metrics, logs, or other data sources to inform decisions and communicate risk; Excellent communicator (clear, concise, and calm) across engineers, cross-functional partners, and executives; Low-ego, highly collaborative, and motivated by ownership of hard problems end to end

Nice to Have

Hands-on or close-partner experience with Kubernetes, cloud networking, or observability stacks (metrics, logs, tracing, alerting); Prior experience working with or alongside SRE teams; Background in large-scale cloud infrastructure or platform engineering; Familiarity with MongoDB Atlas or other modern cloud database platforms

What You'll Do.

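Drive Program Planning & Execution

Define program scope, milestones, and success criteria with SRE engineers and leaders

Manage dependencies across platform teams, keep work clearly tracked in Jira, and deliver on time

Strengthen Production Reliability

Lead change management and launch readiness programs

Partner with SREs and product teams to define and operationalize SLOs/SLIs, and use incident data, metrics, and capacity signals to drive prioritization and continuous improvement

Lead Cross-Functional Coordination

Align SRE with Security, Compliance, Cloud platform, and other engineering teams

Coordinate cross-team incident response, ensure clear follow-through, and build trust as the go-to driver of complex, multi-team efforts

Build Scalable Systems & Processes

Design lightweight frameworks and communication patterns that help SRE deliver reliably at scale

Work yourself out of the "hero" role by leaving teams better-equipped to execute independently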

How You'll Work.

Team & Collaboration

Partner with SRE leaders and engineers; Coordinate cross-functional efforts across US and EMEA teams; Align SRE with Security, Compliance, Cloud platform, and other engineering teams; Coordinate cross-team incident response; Build trust as the go-to driver of complex, multi-team efforts; Build together with SRE, engineering, Security, and Compliance to co-create solutions

Communication Scope

Excellent communicator—clear, concise, and calm—across engineers, cross-functional partners, and executives

Process & Methodology

Drive program planning and execution; Define program scope, milestones, and success criteria; Manage dependencies across platform teams; Keep work clearly tracked in Jira; Deliver on time; Shape roadmaps

Full Job Description

As a TPM for SRE, you will partner with SRE leaders and engineers to scale the platform that underpins all of MongoDB’s cloud products. You will drive program execution, strengthen production reliability practices, and coordinate cross-functional efforts across US and EMEA teams. Success in this role means smoother launches, clearer roadmaps, stronger reliability metrics and an SRE organization that's better-equipped to deliver predictability at scale. This role can be based out of our Dublin or Cork office or remotely in Ireland.

What You'll Do

Drive Program Planning & Execution – Define program scope, milestones, and success criteria with SRE engineers and leaders. Manage dependencies across platform teams, keep work clearly tracked in Jira, and deliver on time.

Strengthen Production Reliability – Lead change management and launch readiness programs. Partner with SREs and product teams to define and operationalize SLOs/SLIs, and use incident data, metrics, and capacity signals to drive prioritization and continuous improvement.

Lead Cross-Functional Coordination – Align SRE with Security, Compliance, Cloud platform, and other engineering teams. Coordinate cross-team incident response, ensure clear follow-through, and build trust as the go-to driver of complex, multi-team efforts.

Build Scalable Systems & Processes – Design lightweight frameworks and communication patterns that help SRE deliver reliably at scale. Work yourself out of the "hero" role by leaving teams better-equipped to execute independently.

Requirements

8+ years in technical program management, engineering management, or a comparable technical role partnering with software engineering teams

Proven track record leading large-scale, cross-team platform initiatives through ambiguity and change

Strong knowledge of production change management, software development lifecycle, and reliability metrics (SLOs, SLIs)

Skilled at shaping roadmaps and managing dependencies

Able to query and interpret metrics, logs,

SKILL SIGNAL 41 detected · ranked by frequency

Technical Program Management ×5

Kubernetes ×4

cloud networking ×4

observability stacks ×4

metrics ×4

logs ×4

tracing ×4

alerting ×4

cloud infrastructure ×4

platform engineering ×4

MongoDB Atlas ×4

cloud database platforms ×4

Cross-functional Coordination ×3

engineering management ×3

platform initiatives ×3

production change management ×3

software development lifecycle ×3

reliability metrics ×3

SLOs ×3

SLIs ×3

roadmaps ×3

dependency management ×3

metrics interpretation ×3

log interpretation ×3

data analysis ×3

risk communication ×3

Site Reliability Engineering (SRE) ×2

Platform Scaling ×2

Production Reliability ×2

program execution

production reliability practices

change management

BEHAVIOURAL

Low-ego; highly collaborative; motivated by ownership; clear; concise; calm

Role Details

Experience 8–10 yrs

Level Senior

Work Mode Remote

Category pto-site-reliability-engineering

AI-Extracted Insights

Domain Areas

production-change-management; software-development-lifecycle; reliability-metrics-slosslis; large-scale-cloud-infrastructure; platform-engineering; modern-cloud-database-platforms
