Llm Inference Infrastructure
in Mumbai
20,800 Llm Inference Infrastructure jobs in Mumbai are indexed on JobsGlitch directly from company ATS platforms, updated daily. Most in-demand skills in this market: LLM, Distributed systems, LLM Inference, Fine-tuning, Memory management. 20% remote-friendly, average advertised salary $12487k. Currently hiring: Endava, Amazon.com Services LLC, Talentuch, NVIDIA.
AI Engineer - AI/LLM (Backend)
Talentuch
SerbiaDirectSoftware Development Manager, LLM Inference Model Enablement, Neuron SDK
Annapurna Labs
Cupertino, California, USAOnsiteManagerDirectMember of Technical Staff
Fireworks AI
New York, New York, United StatesMidDirectSr. Software Engineer, Inference
CoreWeave
Warszawa, Masovian Voivodeship, PolandSeniorDirectApplied Scientist (LLM)
SQUAD
KyivRemoteSeniorDirectSoftware engineer -AI/ML, AWS Neuron Inference, AWS Neuron Inference
Annapurna Labs (U. S. ) Inc.
Seattle, Washington, USAOnsiteSeniorDirectAI Architect – Agentic Systems (LLM & Multi-Agent Solutions)
Endava
Cali, Valle del Cauca, COOnsitemidDirectAI Architect – Agentic Systems (LLM & Multi-Agent Solutions)
Endava
Córdoba, Cordoba, AROnsitemidDirectAI Architect – Agentic Systems (LLM & Multi-Agent Solutions)
Endava
Bogotá, Bogota D.C., COOnsitemidDirectSenior AI Engineer, Product Engineering
You.com
New York, New York, United StatesHybridSeniorDirectProduct Finance, Inference Capacity Lead
Anthropic
San Francisco, California, United StatesOnsiteLeadDirectInfrastructure / Cluster Engineer
Gimlet
San FranciscoOnsiteDirectAI Engineer - AI/LLM (Backend)
Talentuch
PolandDirectML Software Engineer, Data Plane
Annapurna Labs Ltd.
Tel Aviv-Yafo, Tel Aviv, ISROnsiteDirectSenior ML Engineer (Token Factory)
RomaniaFlexibleSeniorDirectSenior ML Engineer (Token Factory)
SwitzerlandFlexibleSeniorDirectSenior ML Engineer (Token Factory)
SpainFlexibleSeniorDirectStaff Software Engineer, Machine Learning Inference Platform
Stack AV
Pittsburgh/RemoteFlexibleSeniorDirectStaff Engineer, system design engineering
Sandisk
Milpitas, CA, United StatesOnsitemidDirectLead AI Engineer (FM Hosting, LLM Inference)
Capital One
New York, NYOnsiteLeadDirectLead AI Engineer (FM Hosting, LLM Inference)
Capital One
New York, NYOnsiteLeadDirectSenior Software Engineer, Machine Learning Inference Platform
Stack AV
Pittsburgh/RemoteRemoteSeniorDirectSenior Software Engineer — LLM Post-Training Platform
Snowflake
US-WA-BellevueOnsiteSeniorDirectAI Architect – Agentic Systems (LLM & Multi-Agent Solutions)
Endava
Rosario, Santa Fe, AROnsitemidDirectAI Architect – Agentic Systems (LLM & Multi-Agent Solutions)
Endava
Rosario, Santa Fe, AROnsitemidDirectAI Architect – Agentic Systems (LLM & Multi-Agent Solutions)
Endava
Cali, Valle del Cauca, COOnsitemidDirectAI Architect – Agentic Systems (LLM & Multi-Agent Solutions)
Endava
Buenos Aires, Buenos Aires, AROnsitemidDirectStaff Software Engineer, Inference
CoreWeave
Warszawa, Masovian Voivodeship, PolandSeniorDirectApplied Scientist (GenAI/LLM)
Amazon.com Services LLC
Seattle, Washington, USAOnsiteDirectEngineering Tech Lead
Unframe
Tel Aviv-Yafo, Tel Aviv District, IsraelOnsiteLeadDirectEngineering Manager, Model Inference
Abridge
SF OfficeManagerDirectEngineering Manager, Model Inference
Abridge
SF OfficeManagerDirectSenior Engineer, Inference Control Plane
DigitalOcean
Seattle MetroHybridSeniorDirectEngineering Tech Lead
Unframe
Tel Aviv-Yafo, Tel Aviv District, IsraelOnsiteLeadDirectAI Architect – Agentic Systems (LLM & Multi-Agent Solutions)
Endava
Monterrey, Nuevo León, MXOnsitemidDirectDeep Learning Architect, LLM Inference
NVIDIA
US, CA, Santa ClaraNoEntryDirectSoftware Development Engineer, Alexa AI Logistics for Infrastructure, Cost, & Efficiency
ADCI
Bengaluru, Karnataka, INDOnsiteMidDirectSoftware Development Engineer, Alexa AI Logistics for Infrastructure, Cost, & Efficiency
ADCI
Bengaluru, Karnataka, INDOnsiteMidDirectMachine Learning Engineer, Alexa AI
Amazon.com Services LLC
Boston, Massachusetts, USAOnsiteMidDirectSoftware Development Engineer, Alexa AI Logistics for Infrastructure, Cost, & Efficiency
ADCI
Bengaluru, Karnataka, INDOnsiteMidDirectSenior Software Development Engineer, Stores Foundational AI - Rufus
Amazon.com Services LLC
Palo Alto, California, USAOnsiteSeniorDirectAI Field Engineer
Fireworks AI
New York, New York, United StatesOnsiteDirectSr Software Dev Engineer, Machine Learning, Sponsored Products and Brands Ads Response Prediction
Amazon.com Services LLC
Palo Alto, California, USAOnsiteSeniorDirectSenior AI Engineer
Emergence
IndiaRemoteSeniorDirectSenior Machine Learning Engineer – LLM & ML Systems
SolarWinds
Bangalore OfficeOnsiteSeniorDirectPerformance Engineer, On-Device Inference
Sarvam
BengaluruMidDirectStaff + Senior Software Engineer, Inference
Anthropic
San Francisco, California, United StatesHybridSeniorDirectEngineering Manager, Inference Benchmarking
NVIDIA
US, CA, Santa ClaraLeadDirectEngineering Manager, Inference Benchmarking
NVIDIA
US, CA, Santa ClaraLeadDirectAI Engineer - AI/LLM (Backend)
Talentuch
RomaniaDirect
Free Resume Analysis
See how you match these Llm Inference Infrastructure roles
Upload once — get a skill match score for every job listed above
Common Questions
- How many llm inference infrastructure jobs in mumbai are available?
- JobsGlitch lists 20,800 open llm inference infrastructure jobs in mumbai sourced directly from company ATS platforms — not reposted from LinkedIn.
- Are these Llm Inference Infrastructure roles actually hiring in Mumbai?
- Yes — every listing is indexed directly from company career pages (Greenhouse, Lever, Workday, Ashby). These are not aggregated from other job boards, so they reflect live hiring intent.
- What skills do Llm Inference Infrastructure jobs in Mumbai require?
- Required skills vary by employer and seniority. Browse the listings above to see the specific requirements for each open role.
- How do I apply for llm inference infrastructure jobs in mumbai?
- Click any job listing to view the full description and apply directly on the company's career page. Upload your resume on JobsGlitch first to see your match score before applying.