NVIDIA

semiconductor

Principal Silicon Failure Analysis Engineer

$200–322k · Santa Clara, California, United States · FULL TIME · Remote Friendly

The Brief

“Principal Silicon Failure Analysis Engineer at NVIDIA. Skills: Silicon Failure Analysis (SiFA) Lab infrastructure leadership, high-availability lab operations, scalable lab environment, root-cause analysis of semiconductor issues, lab facilities and utilities management, failure analysis tool enablement and reliability, vendor and cross-functional partnerships, safety, chemical, ESD and regulatory governance, long-term SiFA lab infrastructure roadmap definition and execution. Lead the overall Si”

What You'll Achieve.

  • Enable a high-availability, safe, and scalable failure analysis environment.
  • Efficiently root-cause issues in our groundbreaking semiconductor products.
  • Ensure reliable, secure, and scalable lab operations aligned with NVIDIA’s technology roadmap.
  • Minimize disruption from upgrades, maintenance, outages, and construction.
  • Improve uptime, availability, MTBF, MTTR, and PM compliance.
  • Reduce downtime and ensure operational resilience.
  • Accurately track inventory, asset lifecycle status, and preventative maintenance schedules.
  • Enforce training, certification, safety, and IP protection requirements.
  • Support future silicon nodes, advanced packaging technologies, and increasing system complexity.
  • Resolve product yield, performance, reliability, and quality issues.
  • Deliver metric-driven reliability improvements.
  • Deliver on a multi-year scaling roadmap to meet the demands of the latest silicon, packaging, and system challenges.

Industry & Context.

semiconductor
Problems you'll solve

root cause analysis

What They're Looking For.

Must Have

  • Bachelor's degree or higher in Engineering or a related technical field, or equivalent experience
  • 15+ overall years of experience in semiconductor, R&D, or high-precision lab infrastructure
  • Demonstrated experience with capital equipment enablement, facilities coordination, and vendor management
  • Multi-functional leadership, communication, and execution skills

Nice to Have

  • Demonstrated end-to-end ownership of high-availability failure analysis labs to resolve product yield, performance, reliability, and quality issues
  • Proven experience enabling and sustaining complex capital tools with metric-driven reliability improvements
  • Achieved rigorous safety/compliance governance while delivering on a multi-year scaling roadmap to meet the demands of the latest silicon, packaging, and system challenges

What You'll Do.

Lead the overall Silicon Failure Analysis (SiFA) Lab infrastructure, ensuring a safe, highly available, and scalable environment

Own day-to-day lab operations and infrastructure readiness

Manage lab facilities and utilities including power, backup power, cooling water, DI/PCW, exhaust, vacuum, CDA, nitrogen, and specialty gases

Drive failure analysis tool enablement and reliability from delivery through sustained operation

Safety, chemical, ESD and regulatory governance

Define and execute the long-term SiFA lab infrastructure roadmap

How You'll Work.

Team & Collaboration

Lead vendor and cross-functional partnerships with Fault Isolation (FI), Physical Failure Analysis (PFA), Supplier Quality Engineering (SQE), Corporate Facilities, EHS, IT, Finance, Procurement, and equipment vendors and suppliers

Communication Scope

communication

Process & Methodology

multi-year planning, phased expansion

Full Job Description

NVIDIA is seeking a Principal Failure Analysis Engineer to lead Silicon Failure Analysis (SiFA) Lab Infrastructure, responsible for enabling a high-availability, safe, and scalable failure analysis environment. This role leads the lab framework, including facilities, utilities, tool enablement, safety, access control, and operational readiness, so that Fault Isolation (FI), Physical Failure Analysis (PFA), and Supplier Quality Engineering (SQE) teams can efficiently root cause our groundbreaking semiconductor products. The role partners closely with FI, PFA, SQE, Corporate Facilities, EHS, IT, Finance, Procurement, and equipment vendors to ensure reliable, secure, and scalable lab operations aligned with NVIDIA’s technology roadmap.

What You'll Be Doing:

  • Lead the overall Silicon Failure Analysis (SiFA) Lab infrastructure, ensuring a safe, highly available, and scalable environment that enables FI, PFA, and SQE teams to efficiently root-cause advanced semiconductor issues
  • Own day-to-day lab operations and infrastructure readiness, serving as the primary point of accountability for availability, reliability, and rapid resolution of infrastructure issues impacting failure analysis operations
  • Manage lab facilities and utilities including power, backup power, cooling water, DI/PCW, exhaust, vacuum, CDA, nitrogen, and specialty gases, coordinating upgrades, maintenance, outages, and construction to minimize disruption
  • Drive failure analysis tool enablement and reliability from delivery through sustained operation, ensuring preventive maintenance and improving uptime, availability, MTBF, MTTR, and PM compliance
  • Lead vendor and cross-functional partnerships with FI, PFA, SQE, Corporate Facilities, EHS, IT, Finance, Procurement, and equipment suppliers to reduce downtime and ensure operational resilience
  • Own consumables, inventory, and asset management including gases, chemicals, PPE, and materials, with accurate tracking of inventory, asset lifecycle status,


How to Apply on Workday

  • Workday has a multi-step form — save your progress after every section.
  • "Apply With LinkedIn" can fail or lose data; manual entry is more reliable.
  • Watch for the "Submit for Review" final step — hitting "Save" alone does not submit.
  • Job requisition numbers are useful when following up with HR by email.
