NVIDIA

Senior Performance Modeling Architect, CPU Fabric and LLC

$152–288k Santa Clara, California, United States FULL TIME Remote Friendly

Market Sentiment

HIGH DEMAND

Neural analysis suggests this role is optimal for Senior candidates.

The Brief

“Senior Performance Modeling Architect, CPU Fabric and LLC at NVIDIA. Skills: Performance Modeling, CPU Cache Hierarchies, Interconnects, C++, SystemC, Python. Developing and maintaining high-fidelity, cycle-accurate performance models (C++/SystemC) for coherent interconnects and large-scale shared caches. Modeling and analyzing performance bottlenecks across varying scales”

What You'll Achieve.

Ensure next-level caches and coherent fabrics achieve ambitious performance goals

Industry & Context.

Problems you'll solve

architectural trade-offs; performance bottlenecks

What They're Looking For.

Must Have

Master’s or Ph.D. in Computer Engineering, Electrical Engineering, or Computer Science (or equivalent experience) with a focus on architecture

5+ years of experience

Strong understanding of CPU microarchitecture, memory consistency models, and cache coherency protocols

Proven experience in C++ or SystemC for cycle-accurate or functional modeling

Proficiency in Python or similar scripting languages for processing large datasets, generating performance visualizations, and automating simulation sweeps

Understanding of Network-on-Chip (NoC) topologies (Mesh, Ring, Torus), credit-based flow control, and arbitration logic

Nice to Have

Practical experience managing the functional safety (ISO 26262) requirements of automotive chips alongside the power-performance-area (PPA) limitations of data center hardware

Experience defining or using PMU (Performance Monitoring Unit) events to debug performance on real silicon or emulators

A background in using formal verification or mathematical modeling to prove the correctness of complex coherency state machines

A history of building your own internal tools or frameworks to accelerate architectural exploration rather than just using off-the-shelf simulators

Knowledge of emerging memory technologies like CXL (Compute Express Link) or HBM (High Bandwidth Memory) and how they interact with coherent fabrics

What You'll Do.

Developing and maintaining high-fidelity, cycle-accurate performance models (C++/SystemC) for coherent interconnects and large-scale shared caches

Modeling and analyzing performance bottlenecks across varying scales

Evaluating the performance impact of different coherency protocols and snooping filters

Running and analyzing industry-standard benchmarks to drive architectural trade-offs

Architectural definition and improvement of next-generation CPU Cache Hierarchies and interconnects

Build the 'source of truth' models that govern data movement across silicon

Ensure next-level caches and coherent fabrics achieve ambitious performance goals

How You'll Work.

Team & Collaboration

Collaborating with build and verification teams to correlate performance models with silicon; working with software teams to optimize drivers for the underlying hardware topology

Full Job Description

We are looking for a highly skilled Performance Modeling Architect to lead the architectural definition and improvement of our next-generation CPU Cache Hierarchies and interconnects. This is an outstanding chance to create scalable solutions that connect two fast-paced domains: the high-reliability, low-latency needs of Automotive and the massive efficiency, high-density demands of Data Center systems. You will build the "source of truth" models that govern data movement across our silicon, ensuring our next-level caches (L3/System Cache) and coherent fabrics achieve ambitious performance goals.

**What you'll be doing:**

As a core member of the architecture team, your daily work will involve:

* Developing and maintaining high-fidelity, cycle-accurate performance models (C++/SystemC) for coherent interconnects and large-scale shared caches.
* Modeling and analyzing performance bottlenecks across varying scales, from small-cluster automotive SoCs to massive, multi-mesh data center architectures.
* Evaluating the performance impact of different coherency protocols (e.g., CHI, ACE, or proprietary) and snooping filters.
* Running and analyzing industry-standard benchmarks (SPEC, MLPerf, Automotive-specific suites) to drive architectural trade-offs.
* Collaborating with build and verification teams to correlate performance models with silicon and working with software teams to optimize drivers for the underlying hardware topology.

**What we need to see:**

To be successful in this role, you should possess a deep technical foundation in computer architecture:

* A Master’s or Ph.D. in Computer Engineering, Electrical Engineering, or Computer Science (or equivalent experience) with a focus on architecture and 5+ years of experience.
* Strong understanding of CPU microarchitecture, memory consistency models, and cache coherency protocols.
* Proven experience in C++ or SystemC for cycle-accurate or functional modeling.
* Proficiency in Python or similar scripting languages f

SKILL SIGNAL 27 detected · ranked by frequency

SystemC ×3

Python ×3

high-fidelity, cycle-accurate performance models ×3

performance bottleneck analysis ×3

coherency protocols ×3

Network-on-Chip (NoC) topologies ×3

credit-based flow control ×3

arbitration logic ×3

functional safety (ISO 26262) ×3

power-performance-area (PPA) analysis ×3

Hardware Performance Counters ×3

Performance Monitoring Unit (PMU) events ×3

formal verification ×3

mathematical modeling ×3

custom tooling development ×3

architectural exploration ×3

Performance Modeling ×2

CPU Cache Hierarchies ×2

Interconnects ×2

CHI

ACE

CXL

HBM

Cross-Domain Versatility

systems-thinking approach to hardware development

cycle-accurate simulators

off-the-shelf simulators

BEHAVIOURAL

collaboration

Role Details

Seniority senior

Experience 5–10 yrs

Level Senior

Type FULL TIME

Education Master’s or Ph.D. in Computer Engineering, Electrical Engin

Salary Band 150k-200k

AI-Extracted Insights

Domain Areas

automotive-systems
data-center-systems
cpu-microarchitecture
memory-consistency-models
cache-coherency-protocols
network-on-chip-noc-topologies
credit-based-flow-control
arbitration-logic

How to Apply on Workday

Workday has a multi-step form — save your progress after every section.
"Apply With LinkedIn" can fail or lose data; manual entry is more reliable.
Watch for the "Submit for Review" final step — hitting "Save" alone does not submit.
Job requisition numbers are useful when following up with HR by email.
