Nebius
Cloud Infrastructure
SeniorSiteReliabilityEngineer(DevTools)
Neural analysis suggests this role is
optimal for Senior candidates.
“Senior Site Reliability Engineer (DevTools) at Nebius”
Industry & Context.
Full Job Description
About Nebius: Nebius is leading a new era in cloud infrastructure for the global AI economy. We are building a full-stack AI cloud platform that supports developers and enterprises from data and model training through to production deployment, without the cost and complexity of building large in-house AI/ML infrastructure. Built by engineers, for engineers. From large-scale GPU orchestration to inference optimization, we own the hard problems across compute, storage, networking and applied AI. Listed on Nasdaq (NBIS) and headquartered in Amsterdam, we have a global footprint with R&D hubs across Europe, the UK, North America and Israel. Our team of 1,500+ includes hundreds of engineers with deep expertise across hardware, software and AI R& D. The Role We're an SRE team within DevTools, looking for someone ready to help maintain and grow our systems. We run 25k builds a day in TeamCity, store 100 TB of artifacts in Artifactory, and work with a massive monorepo in GitLab — comparable in scale to what you'd find at FAANG companies. We modify GitLab and build our own TeamCity plugins to give users a product that meets their needs. We're also experimenting with AI — we have our own RAG setup and are figuring out how to operate in the age of agents. Our goal: understand users' problems and requests, define metrics that capture the problem, improve those metrics, and verify that the user's problem is actually gone. Your responsibilities will include: Improving services based on user feedback Building fault-tolerant, self-healing architecture Finding ways to speed up our systems and reduce user friction Modifying well-known closed- and open-source solutions Supporting our users We expect you to have: A combination of SRE and SWE experience (for us that's a 50/50 split). Our code is in Java/Kotlin, Go, Python, and Ruby An understanding of what's happening under the hood in Unix-like systems and the JVM A passion for improving the user experience The ability to adapt quickly
Applying for this Senior Site Reliability Engineer (DevTools) role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
ANONYMOUS · UNFILTERED
What do employees actually say about Nebius?
Real rants from real employees. Read before you apply.