Monstro

Financial Technology

SiteReliabilityEngineer(SRE)

Bengaluru, Karnataka, India Remote Friendly

Market Sentiment

HIGH DEMAND

Neural analysis suggests this role is
optimal for Mid candidates.

The Brief

“Site Reliability Engineer (SRE) at Monstro. Skills: Observability, Reliability engineering, Incident response, Automation. Define and maintain SLOs/SLIs. Build canonical dashboards and alerts”

What You'll Achieve.

Kill toil; Ensure next break is smaller, shorter, or doesn't happen

Industry & Context.

Financial Technology

Problems you'll solve

Bias toward fixing the system, not the symptoms

Eligibility Requirements

On-call rotation

What They're Looking For.

Must Have

Solid production experience on GCP, Comfortable on-call, Observability fundamentals, Working knowledge of Kubernetes, Working knowledge of API gateways, Working knowledge of identity systems, At least one IaC tool, Scripting / coding fluency (Python, Go, Bash), Good written communication, Bias toward fixing the system

Nice to Have

Apigee or another enterprise API gateway in production, BigQuery for log analytics or audit, Experience standing up observability from scratch, SOC2 or similar compliance environments

What You'll Do.

Define and maintain SLOs/SLIs

Build canonical dashboards and alerts

Instrument services for tracing

Reduce toil via automation

First responder for production alerts

Drive postmortems to closure

Clean written handoffs

How You'll Work.

Team & Collaboration

Work with product team

Communication Scope

Good written communication

Process & Methodology

Track action items as audit evidence

Full Job Description

About Monstro Monstro is the operating system for governed financial intelligence. We build governance and intelligence infrastructure that enables artificial intelligence to operate safely, explainably, and at institutional scale. We exist because the level of financial guidance historically available to a small group should be accessible to many more people. By combining AI with deep institutional infrastructure, we help financial institutions deliver more personalized, responsible, and life-changing financial support to millions of individuals. We’re building mission-critical systems in a highly regulated domain, and we care deeply about doing it right. If you’re motivated by meaningful problems, high standards, and shaping infrastructure that improves financial outcomes, you’ll feel at home here. About the Role Monstro is building a secure, multi-tenant platform on Google Cloud, and we’re hiring a Site Reliability Engineer to own the reliability and observability of that platform end-to-end. This is a hands-on role for someone who wants to do real SRE work - not a rebrand of L1 support. You’ll write the dashboards, define the SLOs, build the automation that kills toil, and take your turn on the on-call rotation that proves it all works. When something breaks at 2 AM, you’re the person who keeps it running; when nothing’s breaking, you’re the person making sure the next break is smaller, shorter, or doesn’t happen at all What You’ll Do Observability and reliability engineering Define and maintain SLOs and SLIs for our tier-1 services: API gateway, application services, identity, and edge availability Build canonical dashboards and alerts in Google Cloud Monitoring, backed by structured logs and BigQuery log analytics Tune alert routing so every page is actionable — kill the rest Instrument services for distributed tracing and structured logging; push back on services that ship without it Own error budgets and use them to prioritize reliability work over feature w

Free ATS check

Applying for this Site Reliability Engineer (SRE) role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

Should you apply? AI reads your resume vs this job — match score, gaps to address, ATS keywords.

SKILL SIGNAL 61 detected · ranked by frequency

Incident response ×5

Automation ×5

SLOs ×3

SLIs ×3

Dashboards ×3

Alerts ×3

Log analytics ×3

Distributed tracing ×3

Error budgets ×3

Runbooks ×3

Severity triage ×3

Incident bridge ×3

Mitigation ×3

Revision rollback ×3

Traffic shift ×3

Scaling ×3

Edge block ×3

Credential rotation ×3

Incident comms ×3

Postmortems ×3

Action items ×3

Written handoffs ×3

Observability ×2

Reliability engineering ×2

Apigee X ×2

Cloud Run ×2

GKE Autopilot ×2

Cloud SQL ×2

Identity Platform ×2

Cloud Armor ×2

Cloud IDS ×2

Security Command Center ×2

Role Details

Experience 2–5 yrs

Level Mid

Work Mode Hybrid

Category engineering

AI-Extracted Insights

Domain Areas

governed-financial-intelligenceai-safetyexplainable-aiinstitutional-scale-airegulated-financial-domain

How to Apply on Greenhouse

Create a Greenhouse profile before applying — it saves time across multiple applications.
Upload your resume as a PDF; the parser handles it better than Word.
Answer all knockout questions carefully — wrong answers auto-reject before a human sees you.
Enable email notifications to track application status in real time.

ANONYMOUS · UNFILTERED

What do employees actually say about Monstro?

Real rants from real employees. Read before you apply.

Read Company Rants →