NVIDIA

Technology

SystemsQualityandReliabilityEngineer

$136–265k Santa Clara, California, United States FULL TIME Remote Friendly
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is
optimal for Mid+ candidates.

The Brief

“Systems Quality and Reliability Engineer at NVIDIA. Skills: Systems Quality, Reliability Engineering, Failure Analysis. Own, build, manage RMA/FA debug. Perform root-cause analysis”

Industry & Context.

Technology
Problems you'll solve

Root-cause analysis; Debug; Troubleshooting

What They're Looking For.

Must Have

BS/MS in EE, Physics or related degree, 5+ yrs systems test/validation experience, Proven hands-on systems quality/reliability experience, Competence using lab equipment, Experience enabling reliability tests, Knowledge of Fault isolation techniques, Proficiency with high speed interfaces, Proficiency in Python, PERL, C++ on UNIX/Linux, Excellent knowledge of PCB card/system level test/debug

Nice to Have

Working knowledge of FA techniques and tools

What You'll Do.

Perform root-cause analysis

Conduct and lead debug/root-cause analysis

Collaborate with Systems Engineers

Collaborate with Hardware engineers

Collaborate with Software engineers

Collaborate with operations engineers

Scale root cause FA capabilities

Create FA result reports

Drive resolution plans

Drive containment plans

Drive mitigation plans

Oversee hardware quality performance

Monitor field quality data

Manage operational perf of FA

Ensure partner achieve KPIs

Oversee setup of new products

How You'll Work.

Team & Collaboration

Systems Engineers; Hardware engineers; Software engineers; Operations engineers; Factory floor partners

Full Job Description

We are seeking Systems Quality and Reliability Engineer to join our LPU team! NVIDIA has continuously reinvented itself over two decades. Our invention of the GPU in 1999 fueled the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing. NVIDIA is a “learning machine” that constantly evolves by adapting to new opportunities that are hard to solve, that only we can tackle, and that matter to the world. This is our life’s work, to amplify human imagination and intelligence! ## **What you 'll be doing:** You will own, build, and manage the RMA and FA debug and root-cause analysis for existing and new NVIDIA AI/ML products. You will conduct tests, and root-cause analysis. Other responsibilities include: * Conduct and lead debug and root-cause analysis of field RMAs. Collaborate with Systems Engineers, Hardware engineers, Software engineers, and operations engineers as required * Scale root cause FA capabilities within your organization * Create FA result reports that align with standard 8D or similar process * Analyze RMA, FA and repair data. Identify trends and raise quality alerts when necessary. Drive resolution, containment, and mitigation plans for such quality alerts * Oversee hardware quality performance, monitoring field quality data and associated metrics including RMA rates, MTBF, and Reliability Ratio * Manage operational perf of FA at CMs, ensuring partner achieve key perf indicators including FA cycle times, fault duplication rates and fault isolation rates * Oversee the setup of new products into Failure Analysis operations ## **What we need to see:** * BS/MS in EE, Physics or a related degree (or equivalent experience) * 5+ yrs of hands on systems test and/or validation engineering experience * Proven hands-on experience in systems quality and reliability Engineer * Competence using lab equipment such as oscilloscopes, logic a

Free ATS check

Applying for this Systems Quality and Reliability Engineer role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

How to Apply on Workday

  • Workday has a multi-step form — save your progress after every section.
  • "Apply With LinkedIn" can fail or lose data; manual entry is more reliable.
  • Watch for the "Submit for Review" final step — hitting "Save" alone does not submit.
  • Job requisition numbers are useful when following up with HR by email.

ANONYMOUS · UNFILTERED

What do employees actually say about NVIDIA?

Real rants from real employees. Read before you apply.

Read Company Rants →