Monstro
Financial Technology
Site Reliability Engineer (SRE)
Neural analysis suggests this role is optimal for mid-level candidates.
“Site Reliability Engineer (SRE) at Monstro. Skills: Observability, Reliability engineering, Incident response, Automation. Define and maintain SLOs/SLIs. Build canonical dashboards and alerts”
What You'll Achieve.
Kill toil; ensure the next break is smaller, shorter, or doesn't happen at all
Industry & Context.
Bias toward fixing the system, not the symptoms
On-call rotation
What They're Looking For.
Must Have
Solid production experience on GCP, Comfortable on-call, Observability fundamentals, Working knowledge of Kubernetes, Working knowledge of API gateways, Working knowledge of identity systems, At least one IaC tool, Scripting / coding fluency (Python, Go, Bash), Good written communication, Bias toward fixing the system
Nice to Have
Apigee or another enterprise API gateway in production, BigQuery for log analytics or audit, Experience standing up observability from scratch, SOC2 or similar compliance environments
What You'll Do.
Define and maintain SLOs/SLIs
Build canonical dashboards and alerts
Instrument services for tracing
Reduce toil via automation
First responder for production alerts
Drive postmortems to closure
Clean written handoffs
How You'll Work.
Team & Collaboration
Work with the product team
Communication Scope
Good written communication
Process & Methodology
Track action items as audit evidence
Full Job Description
About Monstro
Monstro is the operating system for governed financial intelligence. We build governance and intelligence infrastructure that enables artificial intelligence to operate safely, explainably, and at institutional scale.
We exist because the level of financial guidance historically available to a small group should be accessible to many more people. By combining AI with deep institutional infrastructure, we help financial institutions deliver more personalized, responsible, and life-changing financial support to millions of individuals.
We’re building mission-critical systems in a highly regulated domain, and we care deeply about doing it right. If you’re motivated by meaningful problems, high standards, and shaping infrastructure that improves financial outcomes, you’ll feel at home here.
About the Role
Monstro is building a secure, multi-tenant platform on Google Cloud, and we’re hiring a Site Reliability Engineer to own the reliability and observability of that platform end-to-end. This is a hands-on role for someone who wants to do real SRE work - not a rebrand of L1 support. You’ll write the dashboards, define the SLOs, build the automation that kills toil, and take your turn on the on-call rotation that proves it all works. When something breaks at 2 AM, you’re the person who keeps it running; when nothing’s breaking, you’re the person making sure the next break is smaller, shorter, or doesn’t happen at all.
What You’ll Do
Observability and reliability engineering
- Define and maintain SLOs and SLIs for our tier-1 services: API gateway, application services, identity, and edge availability
- Build canonical dashboards and alerts in Google Cloud Monitoring, backed by structured logs and BigQuery log analytics
- Tune alert routing so every page is actionable; kill the rest
- Instrument services for distributed tracing and structured logging; push back on services that ship without it
- Own error budgets and use them to prioritize reliability work over feature w
How to Apply on Greenhouse
- Create a Greenhouse profile before applying — it saves time across multiple applications.
- Upload your resume as a PDF; the parser handles it better than Word.
- Answer all knockout questions carefully — wrong answers auto-reject before a human sees you.
- Enable email notifications to track application status in real time.