NVIDIA
AI
GPUPerformanceEngineer-NeuralReconstruction
Neural analysis suggests this role is
optimal for Mid+ candidates.
“GPU Performance Engineer - Neural Reconstruction at NVIDIA. Skills: GPU Performance, Neural Reconstruction, PyTorch, CUDA. Profile neural reconstruction workflows. identify bottlenecks”
What You'll Achieve.
make neural reconstruction faster; more scalable; more reliable; preserve reconstruction quality; numerical behavior; camera/lidar correctness; production reliability; catch performance and quality regressions early
Industry & Context.
root-causing CPU/GPU bottlenecks; synchronization overhead; memory pressure; kernel launch overhead; framework-level inefficiencies
What They're Looking For.
Must Have
BS, MS, PhD, or equivalent experience in Computer Science, Computer Engineering, Electrical Engineering, Applied Math, Robotics, Computer Vision, Machine Learning, or a related field, 12+ years of experience, programming skills in Python and C++, Hands-on experience with PyTorch or a similar tensor/autograd framework, Experience optimizing GPU-accelerated workloads using CUDA, C++/CUDA extensions, or related GPU programming approaches, Practical experience with profiling and performance analysis, Ability to develop benchmarks and validate that optimizations preserve correctness, numerical behavior, and user-visible quality, communication skills
Nice to Have
Experience with Gaussian Splatting, NeRF, differentiable rendering, rasterization, neural rendering, SLAM, 3D reconstruction, or robotics/autonomous-vehicle perception pipelines, Deep CUDA performance experience, Experience optimizing PyTorch workloads with custom operators, fused kernels, sparse tensors, distributed training, or distributed rendering, Familiarity with camera and lidar geometry, projection models, calibration, rolling shutter, depth rendering, or multi-sensor reconstruction, Experience improving large production ML systems where quality metrics, training speed, memory footprint, and developer velocity must be balanced
What You'll Do.
Profile neural reconstruction workflows
Improve CUDA and PyTorch performance
Optimize sparse and irregular rendering workloads
Translate Python bottlenecks into CUDA/C++
Validate performance improvements
Build repeatable benchmarks
Collaborate with researchers and engineers
How You'll Work.
Team & Collaboration
Collaborate with researchers; CUDA engineers; ML engineers; production teams
Communication Scope
explain performance tradeoffs; risks; results to research and engineering partners
Full Job Description
Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world. We are now looking for a GPU Performance Engineer for Neural Reconstruction! NVIDIA is building the future of computer graphics, simulation, robotics, and embodied AI. Neural reconstruction and Gaussian Splatting are changing how 3D worlds are collected, represented, optimized, and rendered. These workloads push the limits of GPU computing, differentiable rendering, computer vision, and production ML systems. In this role, you will help make neural reconstruction faster, more scalable, and more reliable. You will work across PyTorch, CUDA, C++, and GPU profiling to optimize training and rendering workflows used in sophisticated 3D reconstruction systems. The ideal candidate enjoys working close to the hardware while understanding the ML and 3D vision goals behind the system. **What You 'll Be Doing:** * Profile end-to-end neural reconstruction workflows and identify bottlenecks across data loading, initialization, training, rendering, evaluation, and export. * Improve CUDA and PyTorch performance for Gaussian Splatting and neural reconstruction workloads, including camera/lidar data, multiview batching, large-scene rendering, and memory-sensitive training paths. * Analyze GPU performance using tools such as Nsight Systems, Nsight Compute, NVTX, PyTorch Profiler, CUDA events, and benchmark dashboards. * Optimize sparse and irregular rendering workloads, including tile-level masking/culling, sparse gradients, batching, and multi-GPU execution. * Translate high-impact Python, NumP
Applying for this GPU Performance Engineer - Neural Reconstruction role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Workday
- Workday has a multi-step form — save your progress after every section.
- "Apply With LinkedIn" can fail or lose data; manual entry is more reliable.
- Watch for the "Submit for Review" final step — hitting "Save" alone does not submit.
- Job requisition numbers are useful when following up with HR by email.
ANONYMOUS · UNFILTERED
What do employees actually say about NVIDIA?
Real rants from real employees. Read before you apply.