AI Software Engineer, Kernel Libraries

AI Software Engineer, Kernel Libraries - New College Grad 2026

$124–242k · Santa Clara, California, United States · Full Time · Remote Friendly
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is optimal for entry-level candidates.

The Brief

“AI Software Engineer, Kernel Libraries - New College Grad 2026 at NVIDIA. Skills: AI systems software, inference systems software stack, GPU kernel technologies, LLM inference. Responsibilities: innovating and developing new AI systems technologies for efficient inference; designing, implementing, and optimizing kernels for high-impact AI workloads.”

What You'll Achieve.

Build AI systems software that accelerates AI inference, including large language models, agents, and other high-impact AI workloads

Industry & Context.

AI

What They're Looking For.

Must Have

  • Master's degree in Computer Science, Electrical Engineering, or related field (or equivalent experience)
  • 2+ years of academic or industry experience with ML/DL systems development preferable
  • Experience in developing or using deep learning frameworks (e.g. PyTorch, JAX, TensorFlow, ONNX, etc.)
  • Python and C/C++ programming skills

Nice to Have

  • PhD preferred
  • Experience with inference engines and runtimes such as vLLM, SGLang, and MLC
  • Background in domain-specific compiler and library solutions for LLM inference and training (e.g. FlashInfer, Flash Attention)
  • Expertise in inference engines like vLLM and SGLang
  • Expertise in machine learning compilers (e.g. Apache TVM, MLIR)
  • Experience in GPU kernel development and performance optimizations (especially using CUDA C/C++, cuTile, Triton, or similar)
  • Open source project ownership or contributions

What You'll Do.

Innovating and developing new AI systems technologies for efficient inference

Designing, implementing, and optimizing kernels for high-impact AI workloads

Designing and implementing extensible abstractions for LLM serving engines

Building efficient just-in-time domain-specific compilers and runtimes

Developing libraries, code generators, and GPU kernel technologies for NVIDIA's hardware architecture

Designing and building things like new abstractions, efficient attention kernel implementations, new LLM inference runtime components, and kernel code generators to accelerate large language models, agents, and other high-impact AI workloads

How You'll Work.

Team & Collaboration

Collaborating closely with other engineers at NVIDIA across deep learning frameworks, libraries, kernels, and GPU arch teams

Full Job Description

We're looking for outstanding AI systems engineers to develop groundbreaking technologies in the inference systems software stack! We build innovative AI systems software to accelerate AI inference. As a member of the team, you'll develop libraries, code generators, and GPU kernel technologies for NVIDIA's hardware architecture. This means designing and building things like new abstractions, efficient attention kernel implementations, new LLM inference runtime components, and kernel code generators to accelerate large language models, agents, and other high-impact AI workloads.

What you'll be doing:

  • Innovating and developing new AI systems technologies for efficient inference
  • Designing, implementing, and optimizing kernels for high-impact AI workloads
  • Designing and implementing extensible abstractions for LLM serving engines
  • Building efficient just-in-time domain-specific compilers and runtimes
  • Collaborating closely with other engineers at NVIDIA across deep learning frameworks, libraries, kernels, and GPU arch teams
  • Contributing to open source communities like FlashInfer, vLLM, and SGLang

What we need to see:

  • Master's degree in Computer Science, Electrical Engineering, or related field (or equivalent experience); PhD preferred
  • 2+ years of academic or industry experience with ML/DL systems development preferable
  • Strong experience in developing or using deep learning frameworks (e.g. PyTorch, JAX, TensorFlow, ONNX, etc.) and ideally inference engines and runtimes such as vLLM, SGLang, and MLC
  • Strong Python and C/C++ programming skills

Ways to stand out from the crowd:

  • Background in domain-specific compiler and library solutions for LLM inference and training (e.g. FlashInfer, Flash Attention)
  • Expertise in inference engines like vLLM and SGLang
  • Expertise in machine learning compilers (e.g. Apache TVM, MLIR)
  • Strong experience in GPU kernel development and performance optimizations (especially using CUDA C/C++, cuTile, Trit

How to Apply on Workday

  • Workday has a multi-step form — save your progress after every section.
  • "Apply With LinkedIn" can fail or lose data; manual entry is more reliable.
  • Watch for the "Submit for Review" final step — hitting "Save" alone does not submit.
  • Job requisition numbers are useful when following up with HR by email.
