Okta

Technology

Staff Site Reliability Engineer

₹35–55L (AI est.) · Bengaluru, India · Full Time
Market Sentiment: HIGH DEMAND

Neural analysis suggests this role is optimal for Senior candidates.

The Brief

“Staff Site Reliability Engineer at Okta. Skills: Site Reliability Engineering, Cloud Infrastructure, Automation. Design cloud infrastructure and operate production services.”

What You'll Achieve.

Improve service availability; Improve service scalability; Improve service performance; Improve service resilience; Reduce toil; Improve developer velocity

Industry & Context.

Technology
Problems you'll solve

Root cause analysis; Troubleshooting

Eligibility Requirements

On-call rotation

What They're Looking For.

Must Have

Experience operating production services in AWS and/or GCP; Deep expertise with Kubernetes in production; Experience troubleshooting Kubernetes issues; Extensive experience with Terraform and Helm; Software engineering skills in Golang and/or Python; Experience building automation

Nice to Have

Experience operating SaaS platforms; Experience with Kubernetes-based microservices; Experience supporting globally distributed production environments; Experience with GitOps and ArgoCD; Experience implementing AI-assisted operational tooling

What You'll Do.

Design cloud infrastructure

Operate production services

Participate in on-call rotation

Lead incident response efforts

Drive post-incident reviews

Measure error budgets

Improve error budgets

Partner with engineering teams

Improve service availability

Improve service scalability

Improve service performance

Improve service resilience

Improve observability

Develop infrastructure

Eliminate operational toil

Improve deployment safety

Improve operational workflows

Modernize existing workloads

Align workloads with platform

Build self-service platforms

Build operational guardrails

Build automation for developers

Lead reliability initiatives

Guide engineers on best practices

Influence architecture decisions

Influence operational decisions

Drive projects from conception

Drive production rollout

Drive long-term ownership

Explore AI-assisted engineering

Identify emerging technologies

How You'll Work.

Team & Collaboration

Partnering with software engineers; Partnering with architects; Partnering with product teams; Cross-functional teams; Globally distributed teams

Process & Methodology

Project management

Full Job Description

Secure Every Identity, from AI to Human

Identity is the key to unlocking the potential of AI. Okta secures AI by building the trusted, neutral infrastructure that enables organizations to safely embrace this new era. This work requires a relentless drive to solve complex challenges with real-world stakes. We are looking for builders and owners who operate with speed and urgency and execute with excellence. This is an opportunity to do career-defining work. We're all in on this mission. If you are too, let's talk.

Get to know Okta

Okta is The World’s Identity Company. We free everyone to safely use any technology—anywhere, on any device or app. Our Workforce and Customer Identity Clouds enable secure yet flexible access, authentication, and automation that transforms how people move through the digital world, putting Identity at the heart of business security and growth.

At Okta, we celebrate a variety of perspectives and experiences. We are not looking for someone who checks every single box, we’re looking for lifelong learners and people who can make us better with their unique experiences. Join our team! We’re building a world where Identity belongs to you.

The Engineering Opportunity

We are looking for an experienced Staff Site Reliability Engineer to join Okta's Emerging Products Group (EPG). Our mission is to build highly reliable, scalable, and secure cloud services that our customers can trust. We embrace an automation-first mindset and continuously invest in platform engineering, observability, and operational excellence to enable our engineering teams to move quickly and safely.

This role is ideal for an engineer who enjoys solving complex technical challenges at scale, building automation, and improving the reliability of production systems. You will serve as a technical leader within the EPG SRE organization, partnering closely with software engineers, architects, and product teams to design, build, and operate world-class cloud services. The ideal candid
