Obsidian Security

SaaS Security

Sr. Site Reliability Engineer

£95–117k Cheltenham, England, United Kingdom

The Brief

“Sr. Site Reliability Engineer at Obsidian Security. Skills: Site Reliability Engineering, DevOps, Production Engineering, Kubernetes, AWS, GCP, observability, monitoring, CI/CD, automation. Improve the reliability, availability, and resiliency of Obsidian’s production systems and distributed services. Build and maintain monitoring, alerting, dashboards, and observability tooling”

What You'll Achieve.

  • Production issues are detected and resolved quickly.
  • Monitoring and alerting provide clear, actionable operational insights.
  • Reliability metrics and operational practices improve over time.
  • Engineering teams can effectively troubleshoot and self-serve observability.
  • Automation reduces operational toil and improves platform stability.

Industry & Context.

SaaS Security
Problems you'll solve

troubleshooting; debugging

Eligibility Requirements

on-call operations

What They're Looking For.

Must Have

  • 3-6 years of experience in Site Reliability Engineering, DevOps, Production Engineering, or related roles
  • Experience operating and supporting production systems in AWS and/or GCP
  • Familiarity with Kubernetes and Helm in cloud-native environments
  • Experience with observability and monitoring tools such as Prometheus, Grafana, Datadog, or similar platforms
  • Exposure to CI/CD systems such as GitLab CI/CD, GitHub Actions, ArgoCD, or equivalent
  • Troubleshooting and debugging skills across distributed systems and microservices
  • Experience writing automation or infrastructure tooling using scripting or programming languages
  • Systems thinking and a collaborative engineering mindset

Nice to Have

  • AI agent development experience
  • Experience supporting SaaS platforms in production environments
  • Familiarity with incident management and postmortem practices
  • Exposure to infrastructure-as-code and GitOps workflows
  • Understanding of SLI/SLO concepts and operational metrics
  • Experience with enterprise-scale monitoring or customer-facing production systems

What You'll Do.

  • Improve the reliability, availability, and resiliency of Obsidian’s production systems and distributed services
  • Build and maintain monitoring, alerting, dashboards, and observability tooling
  • Support incident response and postmortem processes
  • Automate infrastructure operations

How You'll Work.

Team & Collaboration

Work closely with DevOps, Platform Engineering, and product teams. Partner with engineering teams to implement SLI/SLO practices, operational standards, and reliability-focused workflows.

Full Job Description

Founded in 2017, Obsidian Security was created to close a critical gap: securing the SaaS applications where modern business happens, platforms like Microsoft 365, Salesforce, and hundreds more. Backed by top investors including Greylock, Norwest Venture Partners, and IVP, we’ve built a complete SaaS security platform to reduce risk, detect and respond to threats, and prevent breaches at the source. Our team includes leaders who helped define the categories of endpoint and identity security at CrowdStrike, Okta, Cylance, and Carbon Black. Now, we’re transforming how SaaS is secured in the era of agentic AI.

Today, Obsidian is trusted by global enterprises like Snowflake, T-Mobile, and Pure Storage. We protect more than 200 organizations across North America, Europe, the Middle East, Southeast Asia, Australia, and New Zealand, including many of the world’s largest Fortune 1000 and Global 2000 companies. With strong global momentum, a growing partner ecosystem including SentinelOne, Databricks, and Google Cloud, and a major fundraise on the horizon, we’re scaling quickly toward long-term growth and IPO readiness. Join us as we define the future of SaaS security!

Sr. Site Reliability Engineer (SRE), Obsidian

At Obsidian, our Sr. Site Reliability Engineers ensure the reliability, scalability, and operational excellence of a complex multi-tenant SaaS platform serving enterprise and financial customers. As an SRE, you will work closely with DevOps, Platform Engineering, and product teams to improve system observability, incident response, and service resilience across the platform.

This is a hands-on engineering role focused on building operational excellence through monitoring, automation, debugging, and continuous improvement. You will help ensure that issues are detected and addressed quickly while contributing to systems that improve platform reliability at scale.

Key Responsibilities

Reliability Engineering: Improve the reliability, availability, and resiliency of Obs

How to Apply on Greenhouse

  • Create a Greenhouse profile before applying — it saves time across multiple applications.
  • Upload your resume as a PDF; the parser handles it better than Word.
  • Answer all knockout questions carefully — wrong answers auto-reject before a human sees you.
  • Enable email notifications to track application status in real time.
