NVIDIA
Technology
SystemsPerformanceEngineer,AgenticAIWorkloads–NewCollegeGrad2026
Neural analysis suggests this role is
optimal for Entry candidates.
“Systems Performance Engineer, Agentic AI Workloads – New College Grad 2026 at NVIDIA. Skills: Agentic AI Workloads, Systems Performance, Deep Learning. Develop simulators. Extend simulators”
Industry & Context.
Root cause analysis
What They're Looking For.
Must Have
MS, or PhD in CS, EE, Mathematics, programming skills in C++ and Python, Solid foundations in queueing theory, Solid foundations in traffic modeling, statistics background, Understanding of deep learning fundamentals, Understanding of LLMs, Understanding of modern inference serving frameworks
Nice to Have
Hands-on experience with traffic simulators, Hands-on experience with network simulators, Familiarity with roofline modeling, Familiarity with performance scaling, Experience running large-scale simulation campaigns, Experience building data pipelines, Prior work characterizing ML inference workloads, Prior work benchmarking ML inference workloads
What You'll Do.
Model network traffic
Model compute traffic
Characterize LLM serving workloads
Distill workloads into inputs
Run simulations at scale
Interpret simulation results
Identify performance bottlenecks
Translate findings into recommendations
Influence design of AI systems
How You'll Work.
Team & Collaboration
hardware teams; software teams; research teams
Full Job Description
NVIDIA is looking for a Deep Learning Architect to join our team working at the cutting edge of AI infrastructure. As agentic LLM workloads reshape the demands placed on modern datacenters, we need engineers who can model, simulate, and reason about complex system-level traffic at scale. If you have a passion for performance analysis, a strong quantitative foundation, and excitement about the future of AI systems, we'd love to talk. In this role, you will build and run simulations that capture the traffic dynamics of agentic AI workloads, mine the results for actionable insights, and help guide architectural decisions for next-generation datacenter and GPU systems. **What you 'll be doing:** * Develop and extend C++ and Python simulators that model system-level network and compute traffic for agentic LLM workloads in datacenter environments * Characterize real-world LLM serving workloads and distill them into representative simulator inputs * Run simulations at scale and apply statistical techniques to post-process and interpret results * Identify performance bottlenecks and translate findings into concrete architectural recommendations * Collaborate with hardware, software, and research teams to influence the design of future AI systems **What we need to see:** * Pursuing or recently completed a MS, or PhD in CS, EE, Mathematics, or a related field (or equivalent experience) * Strong programming skills in C++ and Python * Solid foundations in queueing theory and traffic modeling (e.g., Erlang models, Little's Law) * Strong statistics background: characterize huge datasets with percentiles, distributions, and clustering techniques such as K-means * Understanding of deep learning fundamentals, LLMs, and modern inference serving frameworks **Ways to stand out from the crowd:** * Hands-on experience with traffic or network simulators, even in an academic or course project context * Familiarity with roofline modeling and performance scaling of deep learning models at th
Applying for this Systems Performance Engineer, Agentic AI Workloads – New College Grad 2026 role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Workday
- Workday has a multi-step form — save your progress after every section.
- "Apply With LinkedIn" can fail or lose data; manual entry is more reliable.
- Watch for the "Submit for Review" final step — hitting "Save" alone does not submit.
- Job requisition numbers are useful when following up with HR by email.
ANONYMOUS · UNFILTERED
What do employees actually say about NVIDIA?
Real rants from real employees. Read before you apply.