NVIDIA

Networking Systems & Software Architecture

PrincipalArchitect,AINetworking

$272–431k Santa Clara, California, United States FULL TIME Remote Friendly

Market Sentiment

HIGH DEMAND

Neural analysis suggests this role is
optimal for Principal candidates.

The Brief

“Principal Architect, AI Networking at NVIDIA. Skills: AI Networking, Distributed AI communication systems, High-performance networking, GPU accelerated systems, Systems software, Networking, Research, Architecture. Setting the long-term technical vision for distributed AI communication systems—GPU-to-GPU, GPU-to-storage, and cross-node data movement.. Conducting original research and prototyping next-generation networking solutions over RDMA, NVLink, and GPUDirect.”

What You'll Achieve.

Moves data between GPUs, nodes, and storage at the speed modern AI demands.; Ships industry-wide production-grade software.

Industry & Context.

Networking Systems & Software Architecture

Problems you'll solve

Solving some of AI’s hardest infrastructure problems.; Investigating fundamental bottlenecks in communication runtimes for large-scale AI workloads

What They're Looking For.

Must Have

15+ years in systems software and/or networking with deep expertise in high-performance networking (InfiniBand, RoCE, RDMA, NVLink), communication libraries (e. g. NIXL, NCCL, UCX, MPI, NVSHMEM), and GPU accelerated systems, with track record of defining and delivering complex, cross-team technical initiatives from research concept to production., MS, PhD or equivalent experience in Computer Science, Computer Engineering, Electrical Engineering, or a related field., Deep understanding of computer architecture, memory hierarchies, DMA engines, and OS-level networking., Understanding of ML systems concepts—transformer architectures, KV cache mechanics, model parallelism, or distributed training and inference patterns., Proficiency in programming languages such as C, C++, Rust and Python.

Nice to Have

Knowledge of ML inference frameworks (vLLM, SGLang, TensorRT-LLM) and their communication requirements., CUDA programming and NVIDIA GPU architecture expertise., Proved experience influencing product strategy and technical roadmap at a senior level., Major open-source contributions.

What You'll Do.

Setting the long-term technical vision for distributed AI communication systems—GPU-to-GPU

and cross-node data movement.

Conducting original research and prototyping next-generation networking solutions over RDMA

Driving hardware-software co-optimization with GPU

Investigating fundamental bottlenecks in communication runtimes for large-scale AI workloads (KV cache transfer

disaggregated prefill/decode

Integrating networking capabilities into AI serving stacks such as vLLM

Representing NVIDIA in industry forums and standards bodies.

Mentoring senior engineers across the organization.

Defining project scope from scratch.

Translating research breakthroughs into production-grade software.

How You'll Work.

Team & Collaboration

Driving hardware-software co-optimization with GPU, DPU, NIC, and network switch.; Mentoring senior engineers across the organization.

Communication Scope

Publishing findings; Representing NVIDIA in industry forums and standards bodies

Process & Methodology

Defining project scope from scratch., Delivering complex, cross-team technical initiatives from research concept to production., Influencing product strategy and technical roadmap at a senior level.

Full Job Description

An applied research team within NVIDIA’s Networking Systems & Software Architecture group is solving some of AI’s hardest infrastructure problems. The team builds systems-level software that moves data between GPUs, nodes, and storage at the speed modern AI demands—spanning low-level transport optimization, hardware-software co-design, and communication frameworks that plug directly into production AI stacks. The team's charter expands into emerging domains including quantum computing interconnects. This Principal Architect role leads the research agenda and architectural direction for how NVIDIA’s AI systems communicate at scale—across GPUs, DPUs, NICs, and heterogeneous storage. It requires someone who defines project scope from scratch, publishes original work, and translates research breakthroughs into production-grade software that ships industry-wide! **What you will be doing:** * Setting the long-term technical vision for distributed AI communication systems—GPU-to-GPU, GPU-to-storage, and cross-node data movement. * Conducting original research and prototyping next-generation networking solutions over RDMA, NVLink, and GPUDirect. * Driving hardware-software co-optimization with GPU, DPU, NIC, and network switch. Investigating fundamental bottlenecks in communication runtimes for large-scale AI workloads (KV cache transfer, disaggregated prefill/decode, model parallelism). * Integrating networking capabilities into AI serving stacks such as vLLM, SGLang, and TensorRT-LLM. * Publishing findings, representing NVIDIA in industry forums and standards bodies, and mentoring senior engineers across the organization. **What we need to see:** * 15+ years in systems software and/or networking with deep expertise in high-performance networking (InfiniBand, RoCE, RDMA, NVLink), communication libraries (e.g. NIXL, NCCL, UCX, MPI, NVSHMEM), and GPU accelerated systems, with track record of defining and delivering complex, cross-team technical initiatives from research conc

Free ATS check

Applying for this Principal Architect, AI Networking role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

Should you apply? AI reads your resume vs this job — match score, gaps to address, ATS keywords.

SKILL SIGNAL 46 detected · ranked by frequency

High-performance networking ×5

GPU accelerated systems ×5

Low-level transport optimization ×3

Computer architecture ×3

Memory hierarchies ×3

DMA engines ×3

OS-level networking ×3

Programming languages ×3

CUDA programming ×3

NVIDIA GPU architecture ×3

AI Networking ×2

Distributed AI communication systems ×2

Systems software ×2

Networking ×2

Research ×2

Architecture ×2

RDMA

NVLink

GPUDirect

InfiniBand

RoCE

NIXL

NCCL

UCX

MPI

NVSHMEM

Rust

Python

CUDA

AI infrastructure

AI systems communication at scale

Hardware-software co-design

BEHAVIOURAL

Mentoring senior engineers

Role Details

Seniority senior

Experience 15–99 yrs

Level Principal

Type FULL TIME

Education MS, PhD or equivalent experience in Computer Science, Comput

Salary Band 200k+

AI-Extracted Insights

Domain Areas

ai-infrastructureai-systems-communication-at-scaleml-systems-conceptstransformer-architectureskv-cache-mechanicsmodel-parallelismdistributed-training-and-inference-patternsml-inference-frameworks

How to Apply on Workday

Workday has a multi-step form — save your progress after every section.
"Apply With LinkedIn" can fail or lose data; manual entry is more reliable.
Watch for the "Submit for Review" final step — hitting "Save" alone does not submit.
Job requisition numbers are useful when following up with HR by email.

ANONYMOUS · UNFILTERED

What do employees actually say about NVIDIA?

Real rants from real employees. Read before you apply.

Read Company Rants →