NVIDIA

AI

GPUPerformanceEngineer-NeuralReconstruction

$224–431k US, CA, Remote FULL TIME Remote Friendly
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is
optimal for Mid+ candidates.

The Brief

“GPU Performance Engineer - Neural Reconstruction at NVIDIA. Skills: GPU Performance, Neural Reconstruction, PyTorch, CUDA. Profile neural reconstruction workflows. identify bottlenecks”

What You'll Achieve.

make neural reconstruction faster; more scalable; more reliable; preserve reconstruction quality; numerical behavior; camera/lidar correctness; production reliability; catch performance and quality regressions early

Industry & Context.

AI
Problems you'll solve

root-causing CPU/GPU bottlenecks; synchronization overhead; memory pressure; kernel launch overhead; framework-level inefficiencies

What They're Looking For.

Must Have

BS, MS, PhD, or equivalent experience in Computer Science, Computer Engineering, Electrical Engineering, Applied Math, Robotics, Computer Vision, Machine Learning, or a related field, 12+ years of experience, programming skills in Python and C++, Hands-on experience with PyTorch or a similar tensor/autograd framework, Experience optimizing GPU-accelerated workloads using CUDA, C++/CUDA extensions, or related GPU programming approaches, Practical experience with profiling and performance analysis, Ability to develop benchmarks and validate that optimizations preserve correctness, numerical behavior, and user-visible quality, communication skills

Nice to Have

Experience with Gaussian Splatting, NeRF, differentiable rendering, rasterization, neural rendering, SLAM, 3D reconstruction, or robotics/autonomous-vehicle perception pipelines, Deep CUDA performance experience, Experience optimizing PyTorch workloads with custom operators, fused kernels, sparse tensors, distributed training, or distributed rendering, Familiarity with camera and lidar geometry, projection models, calibration, rolling shutter, depth rendering, or multi-sensor reconstruction, Experience improving large production ML systems where quality metrics, training speed, memory footprint, and developer velocity must be balanced

What You'll Do.

Profile neural reconstruction workflows

Improve CUDA and PyTorch performance

Optimize sparse and irregular rendering workloads

Translate Python bottlenecks into CUDA/C++

Validate performance improvements

Build repeatable benchmarks

Collaborate with researchers and engineers

How You'll Work.

Team & Collaboration

Collaborate with researchers; CUDA engineers; ML engineers; production teams

Communication Scope

explain performance tradeoffs; risks; results to research and engineering partners

Full Job Description

Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world. We are now looking for a GPU Performance Engineer for Neural Reconstruction! NVIDIA is building the future of computer graphics, simulation, robotics, and embodied AI. Neural reconstruction and Gaussian Splatting are changing how 3D worlds are collected, represented, optimized, and rendered. These workloads push the limits of GPU computing, differentiable rendering, computer vision, and production ML systems. In this role, you will help make neural reconstruction faster, more scalable, and more reliable. You will work across PyTorch, CUDA, C++, and GPU profiling to optimize training and rendering workflows used in sophisticated 3D reconstruction systems. The ideal candidate enjoys working close to the hardware while understanding the ML and 3D vision goals behind the system. **What You 'll Be Doing:** * Profile end-to-end neural reconstruction workflows and identify bottlenecks across data loading, initialization, training, rendering, evaluation, and export. * Improve CUDA and PyTorch performance for Gaussian Splatting and neural reconstruction workloads, including camera/lidar data, multiview batching, large-scene rendering, and memory-sensitive training paths. * Analyze GPU performance using tools such as Nsight Systems, Nsight Compute, NVTX, PyTorch Profiler, CUDA events, and benchmark dashboards. * Optimize sparse and irregular rendering workloads, including tile-level masking/culling, sparse gradients, batching, and multi-GPU execution. * Translate high-impact Python, NumP

Free ATS check

Applying for this GPU Performance Engineer - Neural Reconstruction role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

How to Apply on Workday

  • Workday has a multi-step form — save your progress after every section.
  • "Apply With LinkedIn" can fail or lose data; manual entry is more reliable.
  • Watch for the "Submit for Review" final step — hitting "Save" alone does not submit.
  • Job requisition numbers are useful when following up with HR by email.

ANONYMOUS · UNFILTERED

What do employees actually say about NVIDIA?

Real rants from real employees. Read before you apply.

Read Company Rants →