NVIDIA
AI, High-Performance Computing and Visualization
SoftwareEngineer,NVIDIAOpenShell
Neural analysis suggests this role is
optimal for Senior candidates.
“Software Engineer, NVIDIA OpenShell at NVIDIA. Skills: systems programming, distributed systems, control planes, Container/Sandbox Internals, Kubernetes, gRPC, Protobuf, mTLS, observability. Work across the full stack of a distributed systems platform, from crafting gRPC contracts to building secure sandbox runtimes. Implement and harden network security features, including policy enforcement, L4/L7 proxies, and secure inter-service communication using mTLS”
Industry & Context.
reasoning about state divergence; building reconciliation loops; designing crash recovery paths
What They're Looking For.
Must Have
Bachelor's degree in Computer Science, Electrical Engineering, or a related technical field, or equivalent experience, 8+ years of meaningful experience, Proficiency in systems programming, including building and debugging long-running services, async runtimes, and handling OS-level integration, Deep knowledge of distributed systems/control planes, including reasoning about state divergence, building reconciliation loops, and designing crash recovery paths, Experience with Container/Sandbox Internals, managing isolated workloads, process lifecycle, capabilities, and network namespaces, Familiarity with gRPC and Protobuf, including crafting machine-to-machine APIs with clean streaming semantics and version safety, Experience operating and extending workloads on Kubernetes, including working with compute drivers, image management, and detailed debugging, Ability to secure inter-service communication using mTLS, gateway registration flows, and non-browser identity verification, Proficiency in instrumenting systems with structured logging, health checks, and distributed tracing for production observability
Nice to Have
Familiarity with virtualization technologies and alternative runtimes, such as microVMs (e. g. , libkrun), Experience improving operator experience through CLI/TUI development, status reporting, and clear error messages, Comfort working at cross-language boundaries, specifically between Rust, Python, protobuf codegen, and shell scripting
What You'll Do.
Work across the full stack of a distributed systems platform
from crafting gRPC contracts to building secure sandbox runtimes
Implement and harden network security features
including policy enforcement
and secure inter-service communication using mTLS
Develop core platform components such as inference routing
ensuring model provider adapters
credential management
and protocol translation integrate seamlessly with the sandbox and gateway
Build reliable configuration and control plane systems that handle state divergence
implement reconciliation loops
and support safe merging and hot-reloading policies
Own the operability experience by creating effective CLI tools
managing release automation
and instrumenting all systems for observability with structured logging and distributed tracing
Full Job Description
NVIDIA is defining the next era of computing by tapping into the unlimited potential of AI, an era where our GPU acts as the brains of computers, robots, and self-driving cars. Joining the OpenShell team offers a unique opportunity to work on a highly advanced platform that enables this future. This core system provides secure, sandboxed runtimes essential for autonomous AI agents. The OpenShell platform is sophisticated, incorporating a control-plane gateway, a privacy-conscious inference router, declarative policy enforcement, and specialized container and VM-based sandbox execution environments. This is a chance to make a lasting impact on the world alongside some of the most forward-thinking and hardworking people on the planet. **What you’ll be doing:** * Work across the full stack of a distributed systems platform, from crafting gRPC contracts to building secure sandbox runtimes. * Implement and harden network security features, including policy enforcement, L4/L7 proxies, and secure inter-service communication using mTLS. * Develop core platform components such as inference routing, ensuring model provider adapters, credential management, and protocol translation integrate seamlessly with the sandbox and gateway. * Build reliable configuration and control plane systems that handle state divergence, implement reconciliation loops, and support safe merging and hot-reloading policies. * Own the operability experience by creating effective CLI tools, managing release automation, and instrumenting all systems for observability with structured logging and distributed tracing. **What we need to see:** * Minimum of a Bachelor's degree in Computer Science, Electrical Engineering, or a related technical field, or equivalent experience. * 8+ years of meaningful experience. * Proficiency in systems programming, including building and debugging long-running services, async runtimes, and handling OS-level integration. * Deep knowledge of distributed systems/control planes,
Applying for this Software Engineer, NVIDIA OpenShell role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Workday
- Workday has a multi-step form — save your progress after every section.
- "Apply With LinkedIn" can fail or lose data; manual entry is more reliable.
- Watch for the "Submit for Review" final step — hitting "Save" alone does not submit.
- Job requisition numbers are useful when following up with HR by email.
ANONYMOUS · UNFILTERED
What do employees actually say about NVIDIA?
Real rants from real employees. Read before you apply.