NVIDIA
SOCAIApplicationEngineer—AIServices,AgentsandKnowledgeSystems
Neural analysis suggests this role is
optimal for Mid candidates.
“SOC AI Application Engineer — AI Services, Agents and Knowledge Systems at NVIDIA. Skills: AI Services, Agents, Knowledge Systems, LLM-backed services, RAG, LangChain, coding-agent, Python. Design, implement, and operate LLM-backed services: APIs, async jobs, streaming responses, and integration with internal tools and data sources. Build RAG and knowledge systems: chunking, embeddings, vector retrieval, reranking, access control, and quality/latency tuning”
What You'll Achieve.
boost HW execution team's work efficiency; shipping and operating AI services; evaluating and using modern frameworks and tools; Improve developer and engineer experience; Own reliability and perform evaluation
What They're Looking For.
Must Have
MS/PhD in CS, CE, EE, 2+ years of professional experience with a clear focus on AI application / AI service development (building products on top of LLMs, not only ad-hoc scripts), Python and experience shipping services (REST/gRPC, containers, basic cloud or on-prem deployment patterns as applicable), Hands-on use of LLM application frameworks (e. g. LangChain or equivalent) and RAG (vector DBs, retrieval design, evaluation), Familiarity with coding agents and IDE workflows (e. g. Claude Code-style usage) and frameworks (skills, templates, or internal “agent packs”), Solid software engineering habits: dependency management, configuration, testing, and clear interfaces for other teams, Excellent communication and ability to work with partners who are not AI specialists
Nice to Have
Hardware knowledge: RTL Coding Makefile Coding SOC Design know-how; Physical Design know-how; etc—enough to understand user context and data (no requirement to be a chip designer), Web development: lightweight UIs, internal portals, or full-stack slices (e. g. React/TypeScript, FastAPI + frontend) for AI features
What You'll Do.
and operate LLM-backed services: APIs
and integration with internal tools and data sources
Build RAG and knowledge systems: chunking
and quality/latency tuning
Apply agent and orchestration patterns with frameworks like LangChain (or comparable): tool use
and guardrails—aligned with how SOC Hardware team works
Improve developer and engineer experience with AI-assisted coding and repeatable “skills”: prompts
and small utilities that teams can run consistently (including patterns like Claude Code + structured skills)
Own reliability and perform evaluation: logging
regression tests for prompts/pipelines
and metrics for usefulness and safety on proprietary data
Co-work with Hardware engineers from Methodology
and Design teams to scope the problem
implementation (in multiple iterations)
and online production-ready features
How You'll Work.
Team & Collaboration
Co-work with Hardware engineers from Methodology, CAD, and Design teams; work with partners who are not AI specialists
Communication Scope
Excellent communication; ability to work with partners who are not AI specialists
Full Job Description
The NVIDIA System-On-Chip(SOC) Design team is looking for a top AI Engineer with curiosity about SOC design automation, RTL integration, and chip build and assembly now. If you are interested in using AI to upgrade the conventional SOC Design flow, come and join us. We need you to be passionate about AI+Hardware. You are expected to help us to build AI application-layer services which would boost HW execution team's work efficiency, includes : assistants, retrieval and Q workflow automation; and develop AI agent for SOC Design-related tasks. You will be shipping and operating AI services (APIs, orchestration, RAG, evaluation), evaluating and using modern frameworks and tools, such as LangChain (and similar stacks), RAG pipelines, and coding-agent / IDE-centric workflows (e.g. Claude Code-class assistants, reusable skills / playbooks for agents). **What you’ll be doing:** * Design, implement, and operate LLM-backed services: APIs, async jobs, streaming responses, and integration with internal tools and data sources. * Build RAG and knowledge systems: chunking, embeddings, vector retrieval, reranking, access control, and quality/latency tuning. * Apply agent and orchestration patterns with frameworks like LangChain (or comparable): tool use, multi-step plans, memory, and guardrails—aligned with how SOC Hardware team works. * Improve developer and engineer experience with AI-assisted coding and repeatable “skills”: prompts, procedures, and small utilities that teams can run consistently (including patterns like Claude Code + structured skills). * Own reliability and perform evaluation: logging, tracing, regression tests for prompts/pipelines, and metrics for usefulness and safety on proprietary data. * Co-work with Hardware engineers from Methodology, CAD, and Design teams to scope the problem, propose the solution, implementation (in multiple iterations), and online production-ready features. **What we need to see:** * MS/PhD in CS, CE, EE * 2+ years of professional e
Applying for this SOC AI Application Engineer — AI Services, Agents and Knowledge Systems role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Workday
- Workday has a multi-step form — save your progress after every section.
- "Apply With LinkedIn" can fail or lose data; manual entry is more reliable.
- Watch for the "Submit for Review" final step — hitting "Save" alone does not submit.
- Job requisition numbers are useful when following up with HR by email.
ANONYMOUS · UNFILTERED
What do employees actually say about NVIDIA?
Real rants from real employees. Read before you apply.