NVIDIA

SeniorDeveloperTechnologyEngineer-WindowsAIPlatform

Singapore FULL TIME

Market Sentiment

HIGH DEMAND

Neural analysis suggests this role is
optimal for Senior candidates.

The Brief

“Senior Developer Technology Engineer - Windows AI Platform at NVIDIA. Skills: AI, accelerated computing, GPU deployment, profiling, optimization, C/C++, Python, Windows operating system, LLM, GenAI, CUDA, NVIDIA's Nsight GPU profiling and debugging suite. Work closely with internal engineering and product teams and external app developers on solving local end-to-end AI GPU deployment challenges on the NVIDIA RTX AI platform. Apply powerful profiling and debugging tools for analyzing most demandi”

What You'll Achieve.

help them adopt groundbreaking advancements in AI and accelerated computing on NVIDIA RTX; make a significant contribution to the next era of enterprise and consumer AI; solving local end-to-end AI GPU deployment challenges; detect insufficient GPU utilization resulting in suboptimal runtime performance; give good guidance on efficient end-to-end AI deployment targeting optimal runtime performance; Improve Windows LLM & GenAI user experience on NVIDIA RTX; influence next generation GPU features

Industry & Context.

Problems you'll solve

solving local end-to-end AI GPU deployment challenges; detect insufficient GPU utilization resulting in suboptimal runtime performance; problem-solving skills

Eligibility Requirements

Some travel is required for conferences and for on-site visits with external partners

What They're Looking For.

Must Have

5+ years of professional experience in local GPU deployment, profiling and optimization, Bachelor's or Master's degree or equivalent experience in Computer Science, Engineering, or a related field, proficiency in C/C++, Python, software design, programming techniques, Familiarity with and development experience on the Windows operating system, Experience working with open-source LLM and GenAI software, Experience with CUDA and NVIDIA's Nsight GPU profiling and debugging suite, problem-solving skills, ability to work both independently and collaboratively in a fast-paced environment, Excellent interpersonal and communication skills, passion for keeping track with the latest advancements in AI technology

Nice to Have

Experience with GPU-accelerated AI inference driven by NVIDIA APIs, specifically cuDNN, CUTLASS, TensorRT, Confirmed expert knowledge in Vulkan and / or DX12, Detailed knowledge of the latest generation GPU architectures, Experience with AI deployment on NPUs and ARM architectures

What You'll Do.

Work closely with internal engineering and product teams and external app developers on solving local end-to-end AI GPU deployment challenges on the NVIDIA RTX AI platform

Apply powerful profiling and debugging tools for analyzing most demanding GPU-accelerated end-to-end AI applications to detect insufficient GPU utilization resulting in suboptimal runtime performance

Conduct hands-on trainings

develop sample code and host presentations to give good guidance on efficient end-to-end AI deployment targeting optimal runtime performance on NVIDIA ARM-based SoCs

Improve Windows LLM & GenAI user experience on NVIDIA RTX by working on feature and performance enhancements of OSS software

including but not limited to projects like GGML

Collaborate with GPU driver and architecture teams as well as NVIDIA research to influence next generation GPU features by providing real-world workflows and giving feedback on partner and customer needs

Providing technical leadership and mentorship to junior engineers

How You'll Work.

Team & Collaboration

Work closely with internal engineering and product teams and external app developers; Collaborate with GPU driver and architecture teams as well as NVIDIA research; Providing technical leadership and mentorship to junior engineers, encouraging an inclusive and high-performing team environment; ability to work both independently and collaboratively

Communication Scope

Excellent interpersonal and communication skills; host presentations

Full Job Description

At NVIDIA, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world. As a Developer Technology Engineer, you will be at the forefront of innovation, working with leading industry partners and exciting OSS projects to help them adopt groundbreaking advancements in AI and accelerated computing on NVIDIA RTX. This role offers an outstanding opportunity to collaborate with world-class talent and make a significant contribution to the next era of enterprise and consumer AI. **What you 'll be doing:** * Work closely with internal engineering and product teams and external app developers on solving local end-to-end AI GPU deployment challenges on the NVIDIA RTX AI platform. * Apply powerful profiling and debugging tools for analyzing most demanding GPU-accelerated end-to-end AI applications to detect insufficient GPU utilization resulting in suboptimal runtime performance. * Conduct hands-on trainings, develop sample code and host presentations to give good guidance on efficient end-to-end AI deployment targeting optimal runtime performance on NVIDIA ARM-based SoCs. * Improve Windows LLM & GenAI user experience on NVIDIA RTX by working on feature and performance enhancements of OSS software, including but not limited to projects like GGML, Llama.cpp, Ollama, ONNX Runtime. * Collaborate with GPU driver and architecture teams as well as NVIDIA research to influence next generation GPU features by providing real-world workflows and giving feedback on partner and customer needs. * Providing technical leadership and mentorship to junior engineers, encou

Free ATS check

Applying for this Senior Developer Technology Engineer - Windows AI Platform role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

Should you apply? AI reads your resume vs this job — match score, gaps to address, ATS keywords.

SKILL SIGNAL 39 detected · ranked by frequency

CUDA ×7

NVIDIA's Nsight GPU profiling and debugging suite ×7

C/C++ ×6

Python ×6

Windows operating system ×6

optimization ×5

cuDNN ×5

CUTLASS ×5

TensorRT ×5

Vulkan ×5

DX12 ×5

GPU architectures ×4

AI ×3

accelerated computing ×3

GenAI ×3

local GPU deployment ×3

open-source LLM and GenAI software ×3

AI deployment on NPUs and ARM architectures ×3

GPU deployment ×2

LLM ×2

NVIDIA RTX AI platform ×2

GGML ×2

Llama. cpp ×2

Ollama ×2

ONNX Runtime ×2

software design ×2

programming techniques ×2

GPU

NVIDIA ARM-based SoCs

Windows LLM

OSS software

GPU driver

BEHAVIOURAL

collaborationmentorshipinclusive and high-performing team environmentproblem-solving skillswork both independently and collaborativelyfast-paced environmentinterpersonal skillscommunication skillspassion for keeping track with the latest advancements in AI technologycreativeautonomouslove a challenge

Role Details

Seniority senior

Experience 5–10 yrs

Level Senior

Work Mode No

Type FULL TIME

Education Bachelor's or Master's degree or equivalent experience in Co

AI-Extracted Insights

Domain Areas

aiaccelerated-computingnvidia-rtx-ai-platformgpu-architecturesnpuarm-architectures

How to Apply on Workday

Workday has a multi-step form — save your progress after every section.
"Apply With LinkedIn" can fail or lose data; manual entry is more reliable.
Watch for the "Submit for Review" final step — hitting "Save" alone does not submit.
Job requisition numbers are useful when following up with HR by email.

ANONYMOUS · UNFILTERED

What do employees actually say about NVIDIA?

Real rants from real employees. Read before you apply.

Read Company Rants →