NVIDIA
Cloud
SolutionsArchitect,CloudInferenceServices
Neural analysis suggests this role is
optimal for Senior candidates.
“Solutions Architect, Cloud Inference Services at NVIDIA. Skills: AI, neural network inference, agentic pipelines, LLMs, VLMs, NVIDIA AI technology platform. Work directly with our NCPs and their key customers to understand their technology and provide the best solutions. Develop and demonstrate solutions based on NVIDIA’s and open-source NLP and LLM technology and integrate them into agentic pipelines”
What You'll Achieve.
simplify its deployment to production
Industry & Context.
Driven with analytical and problem-solving skills
What They're Looking For.
Must Have
Master's or Ph. D. in Computer Science, Artificial Intelligence, or equivalent experience, 5+ years of industry and/or academic experience in fields related to machine learning, deep learning and/or data science with preference towards DNN inference, Work experience and knowledge of modern LLM, VLM, diffusion architectures with emphasis on MoE, Understanding of key libraries used for DNN inference (e. g. TRT-LLM, Dynamo, RedHat Inference Server) as well as agentic pipeline development, Excellent verbal, written communication, and technical presentation skills in English
Nice to Have
Experience working with inference of very large MoE architectures for NLP, CV, ASR or other, Experience using DevOps technologies such as Docker, Kubernetes, Singularity, etc., Understanding of HPC systems: data center design, high speed interconnect InfiniBand, Cluster Storage and Scheduling related design and/or management experience
What You'll Do.
Work directly with our NCPs and their key customers to understand their technology and provide the best solutions
Develop and demonstrate solutions based on NVIDIA’s and open-source NLP and LLM technology and integrate them into agentic pipelines
Perform in-depth analysis and optimisation to ensure the best performance on GPU based systems
Partner with Engineering
Product and Sales teams to develop
plan best suitable solutions for customers
Enable development and growth of product features through customer feedback and proof-of-concept evaluations
Build industry expertise and become a contributor in integrating NVIDIA technology into AI Cloud solutions and Enterprise Computing architectures
working on proof-of-concept demonstrations
leading the discussion with developers
product teams and key executives
encourage adoption of NVIDIA’s AI technology platform
simplify its deployment to production
How You'll Work.
Team & Collaboration
coordinate efforts between customers, corporate marketing, industry business development and engineering; Dynamically engaging with different roles within NVIDIA and with the NCP and other partner; Partner with Engineering, Product and Sales teams; sharing findings across the team
Communication Scope
Excellent verbal, written communication, and technical presentation skills in English
Process & Methodology
time-management and organization skills for coordinating multiple initiatives, priorities and implementations of new technology and products into very sophisticated projects
Full Job Description
NVIDIA’s Worldwide Field Operations (WWFO) team is looking for an AI focused Solution Architect with expertise in neural network inference and development/operation of agentic pipelines. A candidate with understanding of large scale DNN inference as well as end to end design of agentic utilities using tools such as NVIDIA NeMo Agent Toolkit, LangChain, LLamaIndex, Haystack, etc. . As a Solutions Architect in our team, you will have a customer facing technical role helping one or potentially a few leading NVIDIA Cloud Partners (NCPs) to integrate the NVIDIA AI stack, and other OpenSource GPU accelerated stacks and help them develop, deploy and support an E2E solution for AI services from Training to Post Training and Inference workloads. You will participate in projects that involve technologies like LLMs, VLMs, Physical-AI, Agentic Pipelines and others. We are looking for someone who always thinks about artificial intelligence, someone who can thrive in a fast paced, rapidly developing field, someone able to coordinate efforts between customers, corporate marketing, industry business development and engineering. Working across different projects and tasks and efficiently multi-tasking while keeping a customer-facing approach will be critical in this capacity. In this role, you will be the first line of technical expertise between NVIDIA and our partners and customers. Your duties include working on proof-of-concept demonstrations and leading the discussion with developers, product teams and key executives. You will encourage adoption of NVIDIA’s AI technology platform and simplify its deployment to production. Dynamically engaging with different roles within NVIDIA and with the NCP and other partner is a significant part of the Solutions Architect role and will give you experience with a range of technologies. ## ## What You’ll Be Doing: * Work directly with our NCPs and their key customers to understand their technology and provide the best solutions. * Develop and
Applying for this Solutions Architect, Cloud Inference Services role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Workday
- Workday has a multi-step form — save your progress after every section.
- "Apply With LinkedIn" can fail or lose data; manual entry is more reliable.
- Watch for the "Submit for Review" final step — hitting "Save" alone does not submit.
- Job requisition numbers are useful when following up with HR by email.
ANONYMOUS · UNFILTERED
What do employees actually say about NVIDIA?
Real rants from real employees. Read before you apply.