NVIDIA

SOCAIApplicationEngineer—AIServices,AgentsandKnowledgeSystems

Shanghai, China FULL TIME

Market Sentiment

HIGH DEMAND

Neural analysis suggests this role is
optimal for Mid candidates.

The Brief

“SOC AI Application Engineer — AI Services, Agents and Knowledge Systems at NVIDIA. Skills: AI Services, Agents, Knowledge Systems, LLM-backed services, RAG, LangChain, coding-agent, Python. Design, implement, and operate LLM-backed services: APIs, async jobs, streaming responses, and integration with internal tools and data sources. Build RAG and knowledge systems: chunking, embeddings, vector retrieval, reranking, access control, and quality/latency tuning”

What You'll Achieve.

boost HW execution team's work efficiency; shipping and operating AI services; evaluating and using modern frameworks and tools; Improve developer and engineer experience; Own reliability and perform evaluation

What They're Looking For.

Must Have

MS/PhD in CS, CE, EE, 2+ years of professional experience with a clear focus on AI application / AI service development (building products on top of LLMs, not only ad-hoc scripts), Python and experience shipping services (REST/gRPC, containers, basic cloud or on-prem deployment patterns as applicable), Hands-on use of LLM application frameworks (e. g. LangChain or equivalent) and RAG (vector DBs, retrieval design, evaluation), Familiarity with coding agents and IDE workflows (e. g. Claude Code-style usage) and frameworks (skills, templates, or internal “agent packs”), Solid software engineering habits: dependency management, configuration, testing, and clear interfaces for other teams, Excellent communication and ability to work with partners who are not AI specialists

Nice to Have

Hardware knowledge: RTL Coding Makefile Coding SOC Design know-how; Physical Design know-how; etc—enough to understand user context and data (no requirement to be a chip designer), Web development: lightweight UIs, internal portals, or full-stack slices (e. g. React/TypeScript, FastAPI + frontend) for AI features

What You'll Do.

and operate LLM-backed services: APIs

and integration with internal tools and data sources

Build RAG and knowledge systems: chunking

and quality/latency tuning

Apply agent and orchestration patterns with frameworks like LangChain (or comparable): tool use

and guardrails—aligned with how SOC Hardware team works

Improve developer and engineer experience with AI-assisted coding and repeatable “skills”: prompts

and small utilities that teams can run consistently (including patterns like Claude Code + structured skills)

Own reliability and perform evaluation: logging

regression tests for prompts/pipelines

and metrics for usefulness and safety on proprietary data

Co-work with Hardware engineers from Methodology

and Design teams to scope the problem

implementation (in multiple iterations)

and online production-ready features

How You'll Work.

Team & Collaboration

Co-work with Hardware engineers from Methodology, CAD, and Design teams; work with partners who are not AI specialists

Communication Scope

Excellent communication; ability to work with partners who are not AI specialists

Full Job Description

The NVIDIA System-On-Chip(SOC) Design team is looking for a top AI Engineer with curiosity about SOC design automation, RTL integration, and chip build and assembly now. If you are interested in using AI to upgrade the conventional SOC Design flow, come and join us. We need you to be passionate about AI+Hardware. You are expected to help us to build AI application-layer services which would boost HW execution team's work efficiency, includes : assistants, retrieval and Q workflow automation; and develop AI agent for SOC Design-related tasks. You will be shipping and operating AI services (APIs, orchestration, RAG, evaluation), evaluating and using modern frameworks and tools, such as LangChain (and similar stacks), RAG pipelines, and coding-agent / IDE-centric workflows (e.g. Claude Code-class assistants, reusable skills / playbooks for agents). **What you’ll be doing:** * Design, implement, and operate LLM-backed services: APIs, async jobs, streaming responses, and integration with internal tools and data sources. * Build RAG and knowledge systems: chunking, embeddings, vector retrieval, reranking, access control, and quality/latency tuning. * Apply agent and orchestration patterns with frameworks like LangChain (or comparable): tool use, multi-step plans, memory, and guardrails—aligned with how SOC Hardware team works. * Improve developer and engineer experience with AI-assisted coding and repeatable “skills”: prompts, procedures, and small utilities that teams can run consistently (including patterns like Claude Code + structured skills). * Own reliability and perform evaluation: logging, tracing, regression tests for prompts/pipelines, and metrics for usefulness and safety on proprietary data. * Co-work with Hardware engineers from Methodology, CAD, and Design teams to scope the problem, propose the solution, implementation (in multiple iterations), and online production-ready features. **What we need to see:** * MS/PhD in CS, CE, EE * 2+ years of professional e

Free ATS check

Applying for this SOC AI Application Engineer — AI Services, Agents and Knowledge Systems role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

Should you apply? AI reads your resume vs this job — match score, gaps to address, ATS keywords.

SKILL SIGNAL 70 detected · ranked by frequency

RAG ×7

LangChain ×7

Python ×6

vector DBs ×5

FastAPI ×5

containers ×4

Knowledge Systems ×3

LLM-backed services ×3

coding-agent ×3

REST/gRPC ×3

cloud or on-prem deployment patterns ×3

retrieval design ×3

evaluation ×3

coding agents ×3

IDE workflows ×3

skills ×3

templates ×3

agent packs ×3

dependency management ×3

configuration ×3

testing ×3

clear interfaces ×3

RTL Coding ×3

Makefile Coding ×3

SOC Design know-how ×3

Physical Design know-how ×3

React/TypeScript ×3

AI Services ×2

Agents ×2

Claude Code-style usage ×2

React ×2

TypeScript ×2

BEHAVIOURAL

curiositypassionExcellent communicationability to work with partners who are not AI specialists

Role Details

Seniority mid

Experience 2–5 yrs

Level Mid

Work Mode No

Type FULL TIME

Education MS/PhD in CS, CE, EE

AI-Extracted Insights

Domain Areas

soc-design-automationrtl-integrationchip-build-and-assemblyhardware-knowledgertl-codingmakefile-codingsoc-design-know-howphysical-design-know-how

How to Apply on Workday

Workday has a multi-step form — save your progress after every section.
"Apply With LinkedIn" can fail or lose data; manual entry is more reliable.
Watch for the "Submit for Review" final step — hitting "Save" alone does not submit.
Job requisition numbers are useful when following up with HR by email.

ANONYMOUS · UNFILTERED

What do employees actually say about NVIDIA?

Real rants from real employees. Read before you apply.

Read Company Rants →