Okta
Technology
StaffSiteReliabilityEngineer
Neural analysis suggests this role is
optimal for Senior candidates.
“Staff Site Reliability Engineer at Okta. Skills: Site Reliability Engineering, Cloud Infrastructure, Automation. Design cloud infrastructure. Operate production services”
What You'll Achieve.
Improve service availability; Improve service scalability; Improve service performance; Improve service resilience; Reduce toil; Improve developer velocity
Industry & Context.
Root cause analysis; Troubleshooting
On-call rotation
What They're Looking For.
Must Have
Experience operating production services in AWS and/or GCP, Deep expertise with Kubernetes in production, Experience troubleshooting Kubernetes issues, Extensive experience with Terraform and Helm, Software engineering skills in Golang and/or Python, Experience building automation
Nice to Have
Experience operating SaaS platforms, Experience with Kubernetes-based microservices, Experience supporting globally distributed production environments, Experience with GitOps and ArgoCD, Experience implementing AI-assisted operational tooling
What You'll Do.
Design cloud infrastructure
Operate production services
Participate in on-call rotation
Lead incident response efforts
Drive post-incident reviews
Measure error budgets
Improve error budgets
Partner with engineering teams
Improve service availability
Improve service scalability
Improve service performance
Improve service resilience
Improve observability
Develop infrastructure
Eliminate operational toil
Improve deployment safety
Improve operational workflows
Modernize existing workloads
Align workloads with platform
Build self-service platforms
Build operational guardrails
Build automation for developers
Lead reliability initiatives
Guide engineers on best practices
Influence architecture decisions
Influence operational decisions
Drive projects from conception
Drive production rollout
Drive long-term ownership
Explore AI-assisted engineering
Identify emerging technologies
How You'll Work.
Team & Collaboration
Partnering with software engineers; Partnering with architects; Partnering with product teams; Cross-functional teams; Globally distributed teams
Process & Methodology
Project management
Full Job Description
Secure Every Identity, from AI to Human Identity is the key to unlocking the potential of AI. Okta secures AI by building the trusted, neutral infrastructure that enables organizations to safely embrace this new era. This work requires a relentless drive to solve complex challenges with real-world stakes. We are looking for builders and owners who operate with speed and urgency and execute with excellence. This is an opportunity to do career-defining work. We're all in on this mission. If you are too, let's talk. Get to know Okta Okta is The World’s Identity Company. We free everyone to safely use any technology—anywhere, on any device or app. Our Workforce and Customer Identity Clouds enable secure yet flexible access, authentication, and automation that transforms how people move through the digital world, putting Identity at the heart of business security and growth. At Okta, we celebrate a variety of perspectives and experiences. We are not looking for someone who checks every single box, we’re looking for lifelong learners and people who can make us better with their unique experiences. Join our team! We’re building a world where Identity belongs to you. The Engineering Opportunity We are looking for an experienced Staff Site Reliability Engineer to join Okta's Emerging Products Group (EPG). Our mission is to build highly reliable, scalable, and secure cloud services that our customers can trust. We embrace an automation-first mindset and continuously invest in platform engineering, observability, and operational excellence to enable our engineering teams to move quickly and safely. This role is ideal for an engineer who enjoys solving complex technical challenges at scale, building automation, and improving the reliability of production systems. You will serve as a technical leader within the EPG SRE organization, partnering closely with software engineers, architects, and product teams to design, build, and operate world-class cloud services. The ideal candid
Applying for this Staff Site Reliability Engineer role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
ANONYMOUS · UNFILTERED
What do employees actually say about Okta?
Real rants from real employees. Read before you apply.