Nextiva
Technology
StaffEngineer-SiteReliability
Neural analysis suggests this role is
optimal for Senior candidates.
“Staff Engineer - Site Reliability at Nextiva. Skills: Site Reliability, Kafka, Vector Databases, Kubernetes, GCP. Own reliability, availability, scalability, performance of platforms. Support and optimize Kafka environments”
Industry & Context.
Troubleshooting; Root cause analysis
What They're Looking For.
Must Have
5+ years of relevant experience, 10+ years of relevant experience, 5+ years of production experience with GCP and Kubernetes, Experience managing and troubleshooting production GKE environments, Experience with incident management, Experience with root cause analysis, Experience with infrastructure automation, Experience with CI/CD practices, Infrastructure as Code experience, Automation and scripting experience, Experience with monitoring, Experience with alerting, Experience with metrics, Experience with logs, Experience with distributed tracing, Linux
Nice to Have
5+ years of hands on Kafka production experience, Experience with Weaviate is strongly preferred, Terraform experience preferred, Python, Go, Shell, or similar scripting experience preferred
What You'll Do.
performance of platforms
Support and optimize Kafka environments
Perform performance tuning
Perform capacity planning
Perform Kafka upgrades
Troubleshoot Kafka environments
Administer Vector Database platforms
Support Vector Database platforms
Optimize Vector Database platforms
Manage GCP environments
Support GCP environments
Manage GKE environments
Support GKE environments
Drive infrastructure automation
Drive operational excellence
Drive platform reliability initiatives
Lead production incident response
Lead root cause analysis
Lead post incident reviews
Build monitoring solutions
Maintain monitoring solutions
Build alerting solutions
Maintain alerting solutions
Build observability solutions
Maintain observability solutions
Maintain error budgets
Support database operations
Optimize database performance
Maintain database operational health
How You'll Work.
Team & Collaboration
Middleware Engineering team; Fast-moving team
Process & Methodology
CI/CD practices
Full Job Description
Redefine the future of customer experiences. One conversation at a time. At Nextiva, we’re reimagining how businesses connect, bringing together customer experience and team collaboration on a single, conversation centric platform. Powered by AI, driven by human innovation. Our culture is forward thinking, customer obsessed and built on the belief that meaningful connections drive better business outcomes. Whether it’s through our signature Amazing Service®, the technology we create, or the experiences we cultivate, connection is at the core of who we are. If you’re ready to collaborate with incredible people, make an impact, and help businesses everywhere deliver truly amazing experiences, this is where you belong. Location: This is an onsite role based at Nextiva’s Bengaluru office (Wilshire III by MFAR, 492, Hobli, RHB Colony, Mahadevapura, Bengaluru, Karnataka 560048). Working together onsite strengthens how we operate, enabling faster decisions, clearer communication, and stronger execution, so you can make a greater impact and move work forward with speed and clarity. In-Office Expectation: This role is expected to work onsite four days per week, with the potential to increase to five days per week, as required by the business. Specific scheduling and flexibility will be guided by your leader to support both team collaboration and individual productivity. We are seeking a Senior or Staff Site Reliability Engineer to join the Middleware Engineering team supporting NCC Next, Nextiva's AI Native platform. This role is responsible for the reliability, scalability, performance, and operational excellence of critical middleware and cloud infrastructure services. The ideal candidate will have strong experience with Kafka, Vector Databases, Kubernetes, GCP, observability, automation, and distributed systems, along with a passion for building highly reliable platforms at scale. If you enjoy owning systems end to end, writing clean automation, and working in a fast-movi
Applying for this Staff Engineer - Site Reliability role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
ANONYMOUS · UNFILTERED
What do employees actually say about Nextiva?
Real rants from real employees. Read before you apply.