NVIDIA
Networking Systems & Software Architecture
SeniorSoftwareArchitect,AISystemsandNetworking
Neural analysis suggests this role is
optimal for Senior candidates.
“Senior Software Architect, AI Systems and Networking at NVIDIA. Skills: AI Systems, Networking, High-performance communication, Hardware-software co-optimization, Systems programming. Architecting and implementing high-performance communication and memory management libraries for distributed AI. Driving hardware-software co-optimization with GPU, DPU, NIC, and switch teams through GPUDirect RDMA, NVLink, and next-generation interconnects”
What You'll Achieve.
shipping production code
Industry & Context.
solving some of AI’s hardest infrastructure problems
What They're Looking For.
Must Have
12+ years in systems software and/or networking with demonstrated ownership of complex projects, Solid understanding of high-performance networking: InfiniBand, RoCE, RDMA, NVLink, GPUDirect, C/C++/Rust systems programming with comfort in performance profiling and low-level debugging, Understanding of ML systems concepts—transformer architectures, KV cache mechanics, model parallelism, or distributed training and inference patterns
Nice to Have
Knowledge of ML inference frameworks (vLLM, SGLang, TensorRT-LLM) and their communication requirements, Knowledge of storage networking (NVMe-oF, GPUDirect Storage, S3), Background of Reinforcement Learning systems
What You'll Do.
Architecting and implementing high-performance communication and memory management libraries for distributed AI
Driving hardware-software co-optimization with GPU
and switch teams through GPUDirect RDMA
and next-generation interconnects
Profiling and optimizing data movement across GPU memory
Integrating networking capabilities into AI serving stacks such as vLLM
Contributing to and maintaining open-source projects
prototyping experimental technologies to evaluate their viability
own modules and projects end-to-end—from scoping research questions to shipping production code
How You'll Work.
Team & Collaboration
Driving hardware-software co-optimization with GPU, DPU, NIC, and switch teams; mentoring engineers; conducting design reviews
Process & Methodology
ownership of complex projects, own modules and projects end-to-end
Full Job Description
An applied research team within NVIDIA’s Networking Systems & Software Architecture group is solving some of AI’s hardest infrastructure problems. The team builds systems-level software that moves data between GPUs, nodes, and storage at the speed modern AI demands—spanning low-level transport optimization, hardware-software co-design, and communication frameworks that plug directly into production AI stacks. The team's charter expands into emerging domains including quantum computing interconnects. The Senior Architect role is to own modules and projects end-to-end—from scoping research questions to shipping production code. It calls for a recognized expert who drives technical decisions, pulls in ideas from research and industry, and regularly prototypes new approaches to prove a point. The work lives at the boundary of applied research and production engineering! **What you will be doing:** * Architecting and implementing high-performance communication and memory management libraries for distributed AI * Driving hardware-software co-optimization with GPU, DPU, NIC, and switch teams through GPUDirect RDMA, NVLink, and next-generation interconnects * Profiling and optimizing data movement across GPU memory, system DRAM, NVMe, and network fabrics * Integrating networking capabilities into AI serving stacks such as vLLM, SGLang, and TensorRT-LLM * Contributing to and maintaining open-source projects, mentoring engineers, conducting design reviews, and prototyping experimental technologies to evaluate their viability **What we need to see:** * 12+ years in systems software and/or networking with demonstrated ownership of complex projects. * MS, PhD or equivalent experience in Computer Science, Computer Engineering, Electrical Engineering, or a related field. * Solid understanding of high-performance networking: InfiniBand, RoCE, RDMA, NVLink, GPUDirect. * Strong C/C++/Rust systems programming with comfort in performance profiling and low-level debugging. * Understandi
Applying for this Senior Software Architect, AI Systems and Networking role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Workday
- Workday has a multi-step form — save your progress after every section.
- "Apply With LinkedIn" can fail or lose data; manual entry is more reliable.
- Watch for the "Submit for Review" final step — hitting "Save" alone does not submit.
- Job requisition numbers are useful when following up with HR by email.
ANONYMOUS · UNFILTERED
What do employees actually say about NVIDIA?
Real rants from real employees. Read before you apply.