NVIDIA
high-performance computing
SeniorAIFrameworksEngineer
Neural analysis suggests this role is
optimal for Senior candidates.
“Senior AI Frameworks Engineer at NVIDIA. Skills: AI Frameworks, C++, Python, GPU programming, DSL development, compiler development. Design APIs that prioritize user productivity, providing a "native" feel for developers accustomed to modern scientific computing and deep learning frameworks. Develop robust compilation infrastructure—including AST transformations and JIT-friendly execution—to lower Pythonic descriptions into high-performance GPU machine code”
What You'll Achieve.
bridge the gap between low-level hardware primitives and high-level developer productivity; empower the next generation of AI researchers and engineers with better tools
Industry & Context.
What They're Looking For.
Must Have
MS or PhD degree in Computer Science, Electrical Engineering, or related field (or equivalent experience), At least 3+ years of relevant experience, proficiency in Python and C++, specifically regarding the design of Python extensions and foreign function interfaces (FFI), Experience in library or framework development, with a focus on creating intuitive APIs for complex technical systems, Deep understanding of the Python ecosystem’s delivery stack, including building, testing, and distributing high-performance compiled extensions
Nice to Have
Active maintainer status or significant contributions to high-performance open-source libraries, AI frameworks or compiler projects (LLVM/MLIR), Understanding of compiler foundations, such as intermediate representations (IR), lowering passes, or AST manipulation, Experience with GPU Architecture and parallel programming models (CUDA)
What You'll Do.
Design APIs that prioritize user productivity
providing a "native" feel for developers accustomed to modern scientific computing and deep learning frameworks
Develop robust compilation infrastructure—including AST transformations and JIT-friendly execution—to lower Pythonic descriptions into high-performance GPU machine code
Optimize developer experience by creating debugging tools
profiler integrations
and validation methodologies that make writing and using kernels easy
Build production-grade delivery infrastructure for the open-source community
managing everything from package distribution (wheels
conda) to the user-facing documentation and testing
Full Job Description
We are now looking for a Senior AI Frameworks Engineer (C++/Python)! NVIDIA's high-performance computing platforms are powering the AI revolution across many applications and industries. Within our software stack, CUTLASS stands out as a popular open-source ecosystem dedicated to high-performance math primitives. Since 2017, it has provided the community with C++ template abstractions to implement custom GEMM and related computations efficiently on NVIDIA GPUs. We are building the next frontier of this ecosystem: Pythonic CUTLASS (CUTLASS DSL). This initiative aims to bring "speed-of-light" performance and powerful abstractions of our stack directly into the Python environment. Join the CUTLASS team and help bridge the gap between low-level hardware primitives and high-level developer productivity. If you are passionate about building elegant, high-performance DSLs and want to empower the next generation of AI researchers and engineers with better tools, apply today! **What you 'll be doing:** As a core contributor to the CUTLASS project, you will use your expertise in systems programming and API design to create a world-class developer experience for GPU programming and kernel delivery. * Design APIs that prioritize user productivity, providing a "native" feel for developers accustomed to modern scientific computing and deep learning frameworks. * Develop robust compilation infrastructure—including AST transformations and JIT-friendly execution—to lower Pythonic descriptions into high-performance GPU machine code. * Optimize developer experience by creating debugging tools, profiler integrations, and validation methodologies that make writing and using kernels easy. * Build production-grade delivery infrastructure for the open-source community, managing everything from package distribution (wheels, conda) to the user-facing documentation and testing. **What we need to see:** * MS or PhD degree in Computer Science, Electrical Engineering, or related field (or equiva
Applying for this Senior AI Frameworks Engineer role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Workday
- Workday has a multi-step form — save your progress after every section.
- "Apply With LinkedIn" can fail or lose data; manual entry is more reliable.
- Watch for the "Submit for Review" final step — hitting "Save" alone does not submit.
- Job requisition numbers are useful when following up with HR by email.
ANONYMOUS · UNFILTERED
What do employees actually say about NVIDIA?
Real rants from real employees. Read before you apply.