NVIDIA

Artificial Intelligence

SeniorHPCArchitect,AutomationandAtScaleDeployment

$184–357k Santa Clara, California, United States FULL TIME Remote Friendly
The Brief

“Senior HPC Architect, Automation and At-Scale Deployment at NVIDIA. Skills: HPC architecture, GPU computing, automation, at-scale deployment, system administration, performance tuning, Deep Learning. support deployment and bringup of large-scale GPU compute clusters. enable computing hardware and software”

Industry & Context.

Artificial Intelligence
Problems you'll solve

analytical skills

What They're Looking For.

Must Have

8+ years of experience using in accelerated computing for datacenter/HPC-based Enterprise computing solutions, Solid understanding of accelerated computing scheduling and I/O stacks, C/C++/Pythonash programming/scripting experience, Experience working with engineering or academic research community supporting high performance computing or deep learning, Experience with parallel filesystems, BS (or equivalent experience) in Engineering, Mathematics, Physics, or Computer Science

Nice to Have

MS or PhD desirable, Deep Learning framework skills, Exposure to using and deploying telemetry and visualization pipelines, Exposure to container technology and Linux performance tools

What You'll Do.

support deployment and bringup of large-scale GPU compute clusters

enable computing hardware and software

contribute to breakthroughs in artificial intelligence and GPU computing

Provide insights on and implement at-scale system administration and tuning mechanisms for large-scale compute runs

work with accelerated computing and Deep Learning software and hardware platforms

craft improved workflows and develop new

leading differentiated solutions

and systems specialist to architect

develop and bring up large scale performance platforms

Provide engineering solutions to operationalize the latest GPU Computing products and software stacks

ensure technical relationships with internal and external engineering teams

machine learning/deep learning engineers in building creative solutions based on NVIDIA technology

Be an internal reference for system administration

at-scale system analysis

and other datacenter and large-scale GPU-accelerated system solutions among the NVIDIA technical community

How You'll Work.

Team & Collaboration

work with many scientific researchers, developers, and customers; interact with HPC, OS, GPU compute, and systems specialist; ensure technical relationships with internal and external engineering teams; assisting systems, machine learning/deep learning engineers

Communication Scope

teamwork and communication skills, both verbal and written

Free ATS check

Applying for this Senior HPC Architect, Automation and At-Scale Deployment role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

How to Apply on Workday

  • Workday has a multi-step form — save your progress after every section.
  • "Apply With LinkedIn" can fail or lose data; manual entry is more reliable.
  • Watch for the "Submit for Review" final step — hitting "Save" alone does not submit.
  • Job requisition numbers are useful when following up with HR by email.

ANONYMOUS · UNFILTERED

What do employees actually say about NVIDIA?

Real rants from real employees. Read before you apply.

Read Company Rants →