NordVPN
Technology
StaffSiteReliabilityEngineer-PlatformEngineering
Neural analysis suggests this role is
optimal for Senior candidates.
“Staff Site Reliability Engineer - Platform Engineering at NordVPN. Skills: Site Reliability Engineering, Platform Engineering, Globally distributed systems, AI tooling. Design backend services. Operate backend services”
Industry & Context.
Problem-solver
What They're Looking For.
Must Have
Designing and operating globally distributed systems, Systems architecture, Linux administration at scale, Docker, Databases, Web servers, load balancing, and failover, Working with dedicated hardware, Python or another scripting/programming language, Familiarity with AI/LLM tooling
Nice to Have
Kubernetes in production environments, SaltStack, Advanced networking
What You'll Do.
Design backend services
Operate backend services
Make architectural decisions
Manage full lifecycle
Maintain infrastructure tooling
Improve infrastructure tooling
Automate infrastructure
How You'll Work.
Team & Collaboration
Platform engineering team
Full Job Description
The world’s most advanced VPN, and a whole lot more. If you’re a curious problem-solver who carves their own path, join the team behind Threat Protection Pro, the NordLynx protocol, and the fastest VPN on the planet—tools that put privacy, security, and control back in people’s hands. Your impact? Helping millions take back control of their online security, privacy, and data. NordVPN protects millions of users daily through a global edge infrastructure spanning thousands of servers across dozens of countries. The platform engineering team builds and operates the internal backend services that make this possible. We're looking for a Staff Site Reliability Engineer (SRE) to design, build, operate, and improve these systems. This is a high-ownership role - you'll architect solutions, ship them to production. You're the person others rely on when the architecture needs rethinking or a service needs to go from zero to globally deployed. MAIN RESPONSIBILITIES - Design and operate on-demand, globally distributed backend services - Make architectural decisions on how internal services integrate and scale - Manage the full lifecycle: planning, implementation, monitoring, incident response, postmortems - Maintain and improve infrastructure tooling and automation - Contribute to engineering standards, documentation, and operational maturity - Evaluate and integrate AI tooling (LLMs, Claude Code, model integrations) into engineering workflows CORE REQUIREMENTS - Designing and operating globally distributed systems - Systems architecture - service communication, data flow, resilience patterns - Linux administration at scale (systemd, kernel tuning, debugging production systems) - Docker - building, shipping, and running containers in production - Databases - PostgreSQL, MySQL, Redis, OpenSearch, VictoriaMetrics - Web servers, load balancing, and failover (Nginx, HAProxy, or similar) - Working with dedicated/bare-metal hardware - Python or another scripting/programming language -
Applying for this Staff Site Reliability Engineer - Platform Engineering role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Ashby
- Ashby is a fast modern ATS — most applications take under 3 minutes.
- The resume parser is strong; verify parsed experience dates and job titles.
- Custom screening questions are often scored algorithmically — answer completely.
- Location field affects geo-based screening; use your actual metro area.
ANONYMOUS · UNFILTERED
What do employees actually say about NordVPN?
Real rants from real employees. Read before you apply.