NVIDIA

Semiconductor

PrincipalSiliconFailureAnalysisEngineer

$200–322k Santa Clara, California, United States FULL TIME Remote Friendly
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is
optimal for Principal candidates.

The Brief

“Principal Silicon Failure Analysis Engineer at NVIDIA. Skills: Silicon Failure Analysis (SiFA) Lab Infrastructure leadership, High-availability lab operations, Scalable failure analysis environment enablement, Root cause analysis of semiconductor issues. Lead the overall Silicon Failure Analysis (SiFA) Lab infrastructure. Ensure a safe, highly available, and scalable environment”

What You'll Achieve.

Enablement of FI, PFA, and SQE teams to efficiently root-cause advanced semiconductor issues; Rapid resolution of infrastructure issues impacting failure analysis operations; Minimize disruption from upgrades, maintenance, outages, and construction; Improve uptime, availability, MTBF, MTTR, and PM compliance for failure analysis tools; Reduce downtime and ensure operational resilience; Accurate tracking of inventory, asset lifecycle status, and preventative maintenance schedules; Support future silicon nodes, advanced packaging technologies, and increasing system complexity

Industry & Context.

Semiconductor
Problems you'll solve

Root cause analysis

What They're Looking For.

Must Have

Bachelor's degree or higher in Engineering or a related technical field or equivalent experience, 15+ overall years of experience in semiconductor, R&D, or high-precision lab infrastructure, Demonstrated experience with capital equipment enablement, facilities coordination, and vendor management, multi-functional leadership, communication, and execution skills

Nice to Have

Demonstrated end-to-end ownership of high-availability failure analysis labs to resolve product yield, performance, reliability, and quality issues, Proven experience enabling and sustaining complex capital tools with metric-driven reliability improvements, Achieved rigorous safety/compliance governance while delivering on a multi-year scaling roadmap to meet the demands of the latest silicon, packaging, and system challenges

What You'll Do.

Lead the overall Silicon Failure Analysis (SiFA) Lab infrastructure

and scalable environment

Own day-to-day lab operations and infrastructure readiness

Manage lab facilities and utilities

Drive failure analysis tool enablement and reliability

ESD and regulatory governance

Define and execute the long-term SiFA lab infrastructure roadmap

How You'll Work.

Team & Collaboration

Partners closely with FI, PFA, SQE, Corporate Facilities, EHS, IT, Finance, Procurement, and equipment vendors; Lead vendor and cross-functional partnerships

Communication Scope

Communication skills

Process & Methodology

Multi-year planning, Phased expansion

Full Job Description

NVIDIA is seeking a Principal Failure Analysis Engineer to lead Silicon Failure Analysis (SiFA) Lab Infrastructure, responsible for enabling a high-availability, safe, and scalable failure analysis environment. This role leads the lab framework including facilities, utilities, tool enablement, safety, access control, and operational readiness so that Fault Isolation (FI), Physical Failure Analysis (PFA), and Supplier Quality Engineering (SQE) teams can efficiently root cause our groundbreaking semiconductor products. The role partners closely with FI, PFA, SQE, Corporate Facilities, EHS, IT, Finance, Procurement, and equipment vendors to ensure reliable, secure, and scalable lab operations aligned with NVIDIA’s technology roadmap. **What You 'll Be Doing:** * Lead the overall Silicon Failure Analysis (SiFA) Lab infrastructure, ensuring a safe, highly available, and scalable environment that enables FI, PFA, and SQE teams to efficiently root‑cause advanced semiconductor issues * Own day‑to‑day lab operations and infrastructure readiness, serving as the primary point of accountability for availability, reliability, and rapid resolution of infrastructure issues impacting failure analysis operations * Manage lab facilities and utilities including power, backup power, cooling water, DI/PCW, exhaust, vacuum, CDA, nitrogen, and specialty gases, coordinating upgrades, maintenance, outages, and construction to minimize disruption * Drive failure analysis tool enablement and reliability from delivery through sustained operation, ensuring preventive maintenance and improving uptime, availability, MTBF, MTTR, and PM compliance * Lead vendor and cross‑functional partnerships with FI, PFA, SQE, Corporate Facilities, EHS, IT, Finance, Procurement, and equipment suppliers to reduce downtime and ensure operational resilience * Own consumables, inventory, and asset management including gases, chemicals, PPE, and materials, with accurate tracking of inventory, asset lifecycle status,

Free ATS check

Applying for this Principal Silicon Failure Analysis Engineer role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

How to Apply on Workday

  • Workday has a multi-step form — save your progress after every section.
  • "Apply With LinkedIn" can fail or lose data; manual entry is more reliable.
  • Watch for the "Submit for Review" final step — hitting "Save" alone does not submit.
  • Job requisition numbers are useful when following up with HR by email.

ANONYMOUS · UNFILTERED

What do employees actually say about NVIDIA?

Real rants from real employees. Read before you apply.

Read Company Rants →