AI Software Engineer, Kernel Libraries - New College Grad 2026
“AI Software Engineer, Kernel Libraries - New College Grad 2026 at NVIDIA. Skills: AI systems software, inference systems software stack, GPU kernel technologies, LLM inference. Responsibilities: innovating and developing new AI systems technologies for efficient inference; designing, implementing, and optimizing kernels for high-impact AI workloads.”
What You'll Achieve.
Build AI systems software that accelerates AI inference, including large language models, agents, and other high-impact AI workloads
What They're Looking For.
Must Have
Master's degree in Computer Science, Electrical Engineering, or a related field (or equivalent experience); 2+ years of academic or industry experience with ML/DL systems development preferred; experience developing or using deep learning frameworks (e.g. PyTorch, JAX, TensorFlow, ONNX); Python and C/C++ programming skills
Nice to Have
PhD preferred; experience with inference engines and runtimes such as vLLM, SGLang, and MLC; background in domain-specific compiler and library solutions for LLM inference and training (e.g. FlashInfer, Flash Attention); expertise in inference engines like vLLM and SGLang; expertise in machine learning compilers (e.g. Apache TVM, MLIR); experience in GPU kernel development and performance optimization (especially using CUDA C/C++, cuTile, Triton, or similar); open source project ownership or contributions
What You'll Do.
Innovating and developing new AI systems technologies for efficient inference
Designing, implementing, and optimizing kernels for high-impact AI workloads
Designing and implementing extensible abstractions for LLM serving engines
Building efficient just-in-time domain-specific compilers and runtimes
Developing libraries, code generators, and GPU kernel technologies for NVIDIA's hardware architecture
Designing and building things like new abstractions, efficient attention kernel implementations, new LLM inference runtime components, and kernel code generators to accelerate large language models, agents, and other high-impact AI workloads
How You'll Work.
Team & Collaboration
Collaborating closely with other engineers at NVIDIA across deep learning frameworks, libraries, kernels, and GPU arch teams
Full Job Description
We're looking for outstanding AI systems engineers to develop groundbreaking technologies in the inference systems software stack! We build innovative AI systems software to accelerate AI inference. As a member of the team, you'll develop libraries, code generators, and GPU kernel technologies for NVIDIA's hardware architecture. This means designing and building things like new abstractions, efficient attention kernel implementations, new LLM inference runtime components, and kernel code generators to accelerate large language models, agents, and other high-impact AI workloads.

**What you'll be doing:**

* Innovating and developing new AI systems technologies for efficient inference
* Designing, implementing, and optimizing kernels for high-impact AI workloads
* Designing and implementing extensible abstractions for LLM serving engines
* Building efficient just-in-time domain-specific compilers and runtimes
* Collaborating closely with other engineers at NVIDIA across deep learning frameworks, libraries, kernels, and GPU arch teams
* Contributing to open source communities like FlashInfer, vLLM, and SGLang

**What we need to see:**

* Master's degree in Computer Science, Electrical Engineering, or a related field (or equivalent experience); PhD preferred
* 2+ years of academic or industry experience with ML/DL systems development preferred
* Strong experience in developing or using deep learning frameworks (e.g. PyTorch, JAX, TensorFlow, ONNX) and ideally inference engines and runtimes such as vLLM, SGLang, and MLC
* Strong Python and C/C++ programming skills

**Ways to stand out from the crowd:**

* Background in domain-specific compiler and library solutions for LLM inference and training (e.g. FlashInfer, Flash Attention)
* Expertise in inference engines like vLLM and SGLang
* Expertise in machine learning compilers (e.g. Apache TVM, MLIR)
* Strong experience in GPU kernel development and performance optimizations (especially using CUDA C/C++, cuTile, Trit
How to Apply on Workday
- Workday has a multi-step form — save your progress after every section.
- "Apply With LinkedIn" can fail or lose data; manual entry is more reliable.
- Watch for the "Submit for Review" final step — hitting "Save" alone does not submit.
- Job requisition numbers are useful when following up with HR by email.