NVIDIA

SOCAIApplicationEngineerAIServices,AgentsandKnowledgeSystems

Shanghai, China FULL TIME
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is
optimal for Mid candidates.

The Brief

“SOC AI Application Engineer — AI Services, Agents and Knowledge Systems at NVIDIA. Skills: AI Services, Agents, Knowledge Systems, LLM-backed services, RAG pipelines, coding-agent / IDE-centric workflows, LangChain. build AI application-layer services. assist HW execution team's work efficiency”

What You'll Achieve.

boost HW execution team's work efficiency; shipping and operating AI services; online production-ready features

What They're Looking For.

Must Have

MS/PhD in CS, CE, EE, 2+ years of professional experience with a clear focus on AI application / AI service development (building products on top of LLMs, not only ad-hoc scripts), Python and experience shipping services (REST/gRPC, containers, basic cloud or on-prem deployment patterns as applicable), Hands-on use of LLM application frameworks (e. g. LangChain or equivalent) and RAG (vector DBs, retrieval design, evaluation), Familiarity with coding agents and IDE workflows (e. g. Claude Code-style usage) and frameworks (skills, templates, or internal “agent packs”), Solid software engineering habits: dependency management, configuration, testing, and clear interfaces for other teams, Excellent communication and ability to work with partners who are not AI specialists

Nice to Have

Hardware knowledge: RTL Coding Makefile Coding SOC Design know-how; Physical Design know-how; etc—enough to understand user context and data (no requirement to be a chip designer), Web development: lightweight UIs, internal portals, or full-stack slices (e. g. React/TypeScript, FastAPI + frontend) for AI features

What You'll Do.

build AI application-layer services

assist HW execution team's work efficiency

retrieval and Q workflow

develop AI agent for SOC Design-related tasks

shipping and operating AI services (APIs

evaluating and using modern frameworks and tools

and operate LLM-backed services

Build RAG and knowledge systems

Apply agent and orchestration patterns with frameworks like LangChain (or comparable)

Improve developer and engineer experience with AI-assisted coding and repeatable “skills”

Own reliability and perform evaluation

Co-work with Hardware engineers from Methodology

and Design teams to scope the problem

implementation (in multiple iterations)

and online production-ready features

How You'll Work.

Team & Collaboration

Co-work with Hardware engineers from Methodology, CAD, and Design teams; work with partners who are not AI specialists

Communication Scope

Excellent communication; ability to work with partners who are not AI specialists

Full Job Description

The NVIDIA System-On-Chip(SOC) Design team is looking for a top AI Engineer with curiosity about SOC design automation, RTL integration, and chip build and assembly now. If you are interested in using AI to upgrade the conventional SOC Design flow, come and join us. We need you to be passionate about AI+Hardware. You are expected to help us to build AI application-layer services which would boost HW execution team's work efficiency, includes : assistants, retrieval and Q workflow automation; and develop AI agent for SOC Design-related tasks. You will be shipping and operating AI services (APIs, orchestration, RAG, evaluation), evaluating and using modern frameworks and tools, such as LangChain (and similar stacks), RAG pipelines, and coding-agent / IDE-centric workflows (e.g. Claude Code-class assistants, reusable skills / playbooks for agents). **What you’ll be doing:** * Design, implement, and operate LLM-backed services: APIs, async jobs, streaming responses, and integration with internal tools and data sources. * Build RAG and knowledge systems: chunking, embeddings, vector retrieval, reranking, access control, and quality/latency tuning. * Apply agent and orchestration patterns with frameworks like LangChain (or comparable): tool use, multi-step plans, memory, and guardrails—aligned with how SOC Hardware team works. * Improve developer and engineer experience with AI-assisted coding and repeatable “skills”: prompts, procedures, and small utilities that teams can run consistently (including patterns like Claude Code + structured skills). * Own reliability and perform evaluation: logging, tracing, regression tests for prompts/pipelines, and metrics for usefulness and safety on proprietary data. * Co-work with Hardware engineers from Methodology, CAD, and Design teams to scope the problem, propose the solution, implementation (in multiple iterations), and online production-ready features. **What we need to see:** * MS/PhD in CS, CE, EE * 2+ years of professional e

Free ATS check

Applying for this SOC AI Application Engineer — AI Services, Agents and Knowledge Systems role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

How to Apply on Workday

  • Workday has a multi-step form — save your progress after every section.
  • "Apply With LinkedIn" can fail or lose data; manual entry is more reliable.
  • Watch for the "Submit for Review" final step — hitting "Save" alone does not submit.
  • Job requisition numbers are useful when following up with HR by email.

ANONYMOUS · UNFILTERED

What do employees actually say about NVIDIA?

Real rants from real employees. Read before you apply.

Read Company Rants →