Kong

Technology

StaffSiteReliabilityEngineerProjectVolcano

$140–197k United States FULL TIME Remote Friendly
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is
optimal for Staff candidates.

The Brief

“Staff Site Reliability Engineer — Project Volcano at Kong. Skills: Site Reliability Engineering, Platform Engineering, Kubernetes, Developer Platforms. Own reliability end-to-end. Define SLOs”

Industry & Context.

Technology
Problems you'll solve

Root cause analysis; Troubleshooting

Eligibility Requirements

On-call practices

What They're Looking For.

Must Have

BS in Computer Science, Staff or Principal IC level experience, SRE or Platform Engineering experience, Building SRE practices for developer platforms, Building SRE practices for PaaS/SaaS products, Deep Kubernetes expertise

Nice to Have

Greenfield stage SRE/platform experience

What You'll Do.

Own reliability end-to-end

Define incident response practices

Architect platform infrastructure

Design Kubernetes infrastructure

Build Kubernetes infrastructure

Design multi-region infrastructure

Build multi-region infrastructure

Build GitOps backbone

Establish deployment automation

Establish canary pipelines

Establish preview environment provisioning

Scale managed data services

Design PostgreSQL clusters

Operate PostgreSQL clusters

Harden PostgreSQL clusters

Design Redis caching layers

Operate Redis caching layers

Harden Redis caching layers

Design object storage

Operate object storage

Harden object storage

Drive observability from day one

Instrument every service

Build meaningful dashboards

Build meaningful alerts

Build meaningful runbooks

Lead cross-functional reliability work

Bake reliability into architecture

Bake compliance into architecture

Mentor engineers on reliability

Define on-call practices

Build blameless engineering culture

Evaluate emerging technologies

Adopt emerging technologies

Make architectural decisions

How You'll Work.

Team & Collaboration

Engineering leadership; OCTO team; Product engineering; Security teams; Contributing teams

Process & Methodology

Roadmap planning

Full Job Description

Are you ready to unlock intelligence? If you don’t think you meet all of the criteria below but are still interested in the job, please apply. Nobody checks every box - we’re looking for candidates that are particularly strong in a few areas, and have some interest and capabilities in others. ABOUT THE ROLE Kong is building Project Volcano, an internal developer platform purpose-built for Kong's engineering ecosystem. Volcano will provide teams with on-demand preview environments, edge deployments, managed PostgreSQL, auth, realtime, and storage APIs all deeply integrated with Kong products. As the Staff SRE for Volcano, you will be the founding reliability voice for this platform. This role is a strategic initiative driven by the Office of the CTO (OCTO). You will partner directly with engineering leadership to define the platform's reliability posture, build its SRE practice from the ground up, and ensure Volcano can scale to serve all of Kong's customers. This is a high-visibility, high-impact role with direct influence on Kong's next generation developer platform. WHAT YOU'LL DO - Own reliability for Volcano end-to-end: Define and drive SLOs, error budgets, and incident response practices for all Volcano services — edge deployments, managed Postgres, auth, realtime, storage, and the control plane. - Architect the platform's infrastructure: Design and build the multi-region Kubernetes infrastructure, networking, and data plane that powers Volcano's edge deployment pipeline and backend-as-a-service capabilities. - Build the GitOps and CI/CD backbone: Establish deployment automation, canary pipelines, and preview environment provisioning using ArgoCD, Helm, and Terraform/Terragrunt — setting patterns the broader team will follow. - Scale managed data services: Design, operate, and harden multi-tenant PostgreSQL clusters, Redis caching layers, and object storage — with a focus on data isolation, performance, and disaster recovery. - Drive observability from day one:

Free ATS check

Applying for this Staff Site Reliability Engineer — Project Volcano role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

How to Apply on Ashby

  • Ashby is a fast modern ATS — most applications take under 3 minutes.
  • The resume parser is strong; verify parsed experience dates and job titles.
  • Custom screening questions are often scored algorithmically — answer completely.
  • Location field affects geo-based screening; use your actual metro area.

ANONYMOUS · UNFILTERED

What do employees actually say about Kong?

Real rants from real employees. Read before you apply.

Read Company Rants →