Okta
StaffSiteReliabilityEngineer
Neural analysis suggests this role is
optimal for Senior candidates.
“Staff Site Reliability Engineer at Okta. Skills: Site Reliability Engineering, Kubernetes, Cloud Infrastructure, CI/CD. Design infrastructure. Build infrastructure”
Industry & Context.
Problem-solving skills; Troubleshooting; Root cause analysis
On-call rotation
What They're Looking For.
Must Have
8+ years in SRE, DevOps, or Infrastructure Engineering, 3–5 years of experience with Kubernetes (EKS/GKE), 3–5 years of experience with AWS and GCP, 3–5 years using Terraform, 5+ years of coding experience in Python, Go, or similar, Bachelor’s degree in Computer Science or equivalent hands-on experience
Nice to Have
Prior work in SaaS or high-scale, cloud-native environments
What You'll Do.
Design infrastructure
Operate infrastructure
Lead reliability initiatives
Lead modernization initiatives
Architect microservice applications
Enable microservice applications
Ensure production readiness
Implement infrastructure as code
Manage infrastructure as code
Automate provisioning
Automate configuration management
Drive observability improvements
Drive performance improvements
Drive cost efficiency improvements
Conduct blameless postmortems
Improve incident response
Lead technical projects
Manage project timelines
Manage technical dependencies
Foster reliability culture
Foster automation culture
Foster continuous learning culture
Collaborate with security partners
Collaborate with compliance partners
Ensure infrastructure adherence
Participate in on-call rotation
Use incidents as learning opportunities
How You'll Work.
Team & Collaboration
Partner with development teams; Collaborate with security partners; Collaborate with compliance partners; Mentor engineers across teams
Process & Methodology
Project management, Manage timelines, Manage dependencies
Full Job Description
Secure Every Identity, from AI to Human Identity is the key to unlocking the potential of AI. Okta secures AI by building the trusted, neutral infrastructure that enables organizations to safely embrace this new era. This work requires a relentless drive to solve complex challenges with real-world stakes. We are looking for builders and owners who operate with speed and urgency and execute with excellence. This is an opportunity to do career-defining work. We're all in on this mission. If you are too, let's talk. What You’ll Be Doing Design, build, and operate highly scalable, reliable, and secure infrastructure powering our production systems across AWS and GCP. Lead major reliability and modernization initiatives, including container platform migrations (e.g., ECS to EKS/GKE) and microservice enablement across multi-cloud environments. Serve as a technical authority in Kubernetes (EKS and GKE), cloud infrastructure (AWS and GCP), and modern CI/CD practices (GitOps, automation pipelines). Partner with development teams to architect and enable microservice-based applications, ensuring production readiness, scalability, and observability. Implement and manage infrastructure as code (Terraform, Ansible) to automate provisioning, scaling, and configuration management across multiple cloud providers. Drive improvements in observability, performance, and cost efficiency through robust monitoring, logging, and alerting systems that span AWS and GCP. Champion SRE best practices — defining SLOs/SLIs, conducting blameless postmortems, and continuously improving incident response. Lead complex technical projects from conception to completion, managing timelines, and technical dependencies across teams. Mentor engineers across teams, fostering a culture of reliability, automation, and continuous learning. Collaborate with security and compliance partners to ensure infrastructure adheres to best practices and standards (e.g., IAM Federation, Workload Identity). Participate in t
Applying for this Staff Site Reliability Engineer role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
ANONYMOUS · UNFILTERED
What do employees actually say about Okta?
Real rants from real employees. Read before you apply.