NVIDIA

Technology

PrincipalArchitect,SystemSoftware

$272–431k Santa Clara, California, United States FULL TIME Remote Friendly
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is
optimal for Senior candidates.

The Brief

“Principal Architect, System Software at NVIDIA. Skills: System software architecture, AI infrastructure, Data center manageability, Space systems. Own system architecture. Make stack resilient”

What You'll Achieve.

Deliver production-ready inference platform; Bring best orbital AI products to market; Operate orbital fleets; Resolve issues at Speed of Light; Deliver system software with each ODC module

Industry & Context.

Technology
Problems you'll solve

Creative solutions

What They're Looking For.

Must Have

15+ years system software experience, BS, MS, or PhD in EE/CS, Building AI infrastructure in space, Architecting platform software, Server architecture knowledge, Data center manageability knowledge, Full-stack integration knowledge, Data center health management experience, Hardware management interfaces knowledge, Redfish, MCTP, PLDM proficiency, C/C++ and Python skill, Server platform programming experience, SCM experience, Project management tools experience, Quality, reliability, telemetry performance ownership

Nice to Have

Platform software for space architecting, Startup or space data center initiative experience, Autonomous data center operations experience, x86 or ARM system architecture experience, NVIDIA AI software stack experience, NSA PHIPs security familiarity, Post-quantum networking familiarity, Aerospace standards familiarity, Technical leadership experience, Reviewing hardware schematics, Reviewing PCB layouts

What You'll Do.

Own system architecture

Co-architect interfaces

Own end-to-end architecture

Define manageability architecture

Architect rad-tolerant behaviors

Drive Redfish protocols

Define BMC feature set

Define boot architecture

Define redundancy strategy

Translate mission requirements

Ensure correct operation

Implement idle-power retention

Own telemetry performance

How You'll Work.

Team & Collaboration

Orbital hardware system architecture team; Constellation operators; Platform engineering; Mechanical engineering; Cloud customers; Constellation customers; Bring-up team

Communication Scope

Written communication; Oral communication

Process & Methodology

Jira

Full Job Description

Space-1 is NVIDIA's first Orbital Data Center (ODC) module — a Vera Rubin–class compute platform engineered for low-Earth orbit mission. It is the first step in a multi-generation orbital roadmap to speed up AI adoption. We are looking for a strong technical architect to own end-to-end system software architecture for Space-1 and successor orbital platforms. You will architect the full stack — application to libraries, from data center stack to BMC and BIOS firmware, manageability, and telemetry through the host OS, GPU and CPU drivers, and CUDA — to deliver a production-ready inference platform that operates reliably in the radiation, thermal-cycling, and remote-operations environment of LEO. You will partner closely with the orbital hardware system architecture team, drive customer use cases with constellation operators, align architecture with mission requirements, and bring the best orbital AI products to market. Join us at the forefront of technological advancement. **What you 'll be doing:** * Own system architecture for inference stack and other applications running on this class of products and make it resilient to any fault happening in space. * Co-architect with the orbital hardware system architecture team to define interfaces, partitioning, and trade-offs across silicon, board, firmware, OS, and AI workload layers for 5-year LEO missions. * Own end-to-end system software architecture for Space-1 and successor Orbital Data Center modules — covering data center stack, BMC firmware, BIOS, host OS, GPU/CPU drivers, CUDA, DCGM, and manageability telemetry as a single integrated stack. * Define the manageability architecture for an unreachable, autonomous data center: remote bring-up, in-orbit firmware update, dual-module redundancy, fault containment, recovery from SEU/SEFI events, and telemetry for fleets ranging from tens to millions of nodes. * Architect rad-tolerant system software behaviors — ECC handling, memory scrubbing, latch-up mitigation, determini

Free ATS check

Applying for this Principal Architect, System Software role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

How to Apply on Workday

  • Workday has a multi-step form — save your progress after every section.
  • "Apply With LinkedIn" can fail or lose data; manual entry is more reliable.
  • Watch for the "Submit for Review" final step — hitting "Save" alone does not submit.
  • Job requisition numbers are useful when following up with HR by email.

ANONYMOUS · UNFILTERED

What do employees actually say about NVIDIA?

Real rants from real employees. Read before you apply.

Read Company Rants →