Llm Inference Infrastructure
in Austin
20,759 Llm Inference Infrastructure jobs in Austin are indexed on JobsGlitch directly from company ATS platforms, updated daily. Most in-demand skills in this market: LLM, LLM Inference, Distributed systems, Memory management, Fine-tuning. 26% remote-friendly, average advertised salary $13823k. Currently hiring: Endava, NVIDIA, ADCI, Capital One.
AI Engineer - AI/LLM (Backend)
Talentuch
SerbiaDirectSoftware Development Manager, LLM Inference Model Enablement, Neuron SDK
Annapurna Labs
Cupertino, California, USAOnsiteManagerDirectMember of Technical Staff
Fireworks AI
New York, New York, United StatesMidDirectSoftware engineer -AI/ML, AWS Neuron Inference, AWS Neuron Inference
Annapurna Labs (U. S. ) Inc.
Seattle, Washington, USAOnsiteSeniorDirectApplied Scientist (LLM)
SQUAD
KyivRemoteSeniorDirectSr. Software Engineer, Inference
CoreWeave
Warszawa, Masovian Voivodeship, PolandSeniorDirectLead AI Engineer (FM Hosting, LLM Inference)
Capital One
New York, NYOnsiteLeadDirectLead AI Engineer (FM Hosting, LLM Inference)
Capital One
New York, NYOnsiteLeadDirectAI Architect – Agentic Systems (LLM & Multi-Agent Solutions)
Endava
Cali, Valle del Cauca, COOnsitemidDirectAI Architect – Agentic Systems (LLM & Multi-Agent Solutions)
Endava
Córdoba, Cordoba, AROnsitemidDirectAI Architect – Agentic Systems (LLM & Multi-Agent Solutions)
Endava
Bogotá, Bogota D.C., COOnsitemidDirectSenior AI Engineer, Product Engineering
You.com
New York, New York, United StatesHybridSeniorDirectEngineering Manager, Model Inference
Abridge
SF OfficeManagerDirectEngineering Manager, Model Inference
Abridge
SF OfficeManagerDirectProduct Finance, Inference Capacity Lead
Anthropic
San Francisco, California, United StatesOnsiteLeadDirectInfrastructure / Cluster Engineer
Gimlet
San FranciscoOnsiteDirectDeep Learning Architect, LLM Inference
NVIDIA
US, CA, Santa ClaraNoEntryDirectStaff Engineer, system design engineering
Sandisk
Milpitas, CA, United StatesOnsitemidDirectSoftware Development Engineer, Alexa AI Logistics for Infrastructure, Cost, & Efficiency
ADCI
Bengaluru, Karnataka, INDOnsiteMidDirectSoftware Development Engineer, Alexa AI Logistics for Infrastructure, Cost, & Efficiency
ADCI
Bengaluru, Karnataka, INDOnsiteMidDirectAI Engineer - AI/LLM (Backend)
Talentuch
PolandDirectML Software Engineer, Data Plane
Annapurna Labs Ltd.
Tel Aviv-Yafo, Tel Aviv, ISROnsiteDirectSenior ML Engineer (Token Factory)
RomaniaFlexibleSeniorDirectSenior ML Engineer (Token Factory)
SwitzerlandFlexibleSeniorDirectSenior ML Engineer (Token Factory)
SpainFlexibleSeniorDirectSoftware Development Engineer, Alexa AI Logistics for Infrastructure, Cost, & Efficiency
ADCI
Bengaluru, Karnataka, INDOnsiteMidDirectStaff Software Engineer, Machine Learning Inference Platform
Stack AV
Pittsburgh/RemoteFlexibleSeniorDirectEngineering Manager, Inference Benchmarking
NVIDIA
US, CA, Santa ClaraLeadDirectEngineering Manager, Inference Benchmarking
NVIDIA
US, CA, Santa ClaraLeadDirectSenior Software Engineer — LLM Post-Training Platform
Snowflake
US-WA-BellevueOnsiteSeniorDirectApplied Scientist (GenAI/LLM)
Amazon.com Services LLC
Seattle, Washington, USAOnsiteDirectAI Architect – Agentic Systems (LLM & Multi-Agent Solutions)
Endava
Rosario, Santa Fe, AROnsitemidDirectAI Architect – Agentic Systems (LLM & Multi-Agent Solutions)
Endava
Rosario, Santa Fe, AROnsitemidDirectAI Architect – Agentic Systems (LLM & Multi-Agent Solutions)
Endava
Cali, Valle del Cauca, COOnsitemidDirectSenior Software Engineer, Machine Learning Inference Platform
Stack AV
Pittsburgh/RemoteRemoteSeniorDirectAI Architect – Agentic Systems (LLM & Multi-Agent Solutions)
Endava
Buenos Aires, Buenos Aires, AROnsitemidDirectStaff Software Engineer, Inference
CoreWeave
Warszawa, Masovian Voivodeship, PolandSeniorDirectSenior ML Engineer - Kimchi (LLM Inference Optimization)
Cast AI
European Unionremote-firstSeniorDirectSenior ML Engineer - Kimchi (LLM Inference Optimization)
Cast AI
European Unionremote-firstSeniorDirectSenior Deep Learning Researcher, LLM Inference
NVIDIA
Israel, Tel AvivNoSeniorDirectSenior Performance Engineer - LLM Inference Frameworks
NVIDIA
Israel, YokneamHybridSeniorDirectSenior Engineer, Inference Control Plane
DigitalOcean
Seattle MetroHybridSeniorDirectSoftware Development Manager, Alexa AI Logistics for Infrastructure, Cost, & Efficiency
ADCI
Bengaluru, Karnataka, INDOnsiteManagerDirectSr. Software Dev Engineer, Alexa AI Logistics for Infrastructure, Cost, & Efficiency
ADCI
Bengaluru, Karnataka, INDOnsiteSeniorDirectEngineering Tech Lead
Unframe
Tel Aviv-Yafo, Tel Aviv District, IsraelOnsiteLeadDirectSenior Deep Learning Researcher, LLM Inference
NVIDIA
Israel, Tel AvivNoSeniorDirectEngineering Tech Lead
Unframe
Tel Aviv-Yafo, Tel Aviv District, IsraelOnsiteLeadDirectMachine Learning Engineer, Alexa AI
Amazon.com Services LLC
Boston, Massachusetts, USAOnsiteMidDirectLead AI Engineer (FM Hosting, LLM Inference)
Capital One
New York, NYNoLeadDirectAI Infrastructure Engineer
Swissquote
Gland, VD, CHOnsitemidDirect
Free Resume Analysis
See how you match these Llm Inference Infrastructure roles
Upload once — get a skill match score for every job listed above
Common Questions
- How many llm inference infrastructure jobs in austin are available?
- JobsGlitch lists 20,759 open llm inference infrastructure jobs in austin sourced directly from company ATS platforms — not reposted from LinkedIn.
- Are these Llm Inference Infrastructure roles actually hiring in Austin?
- Yes — every listing is indexed directly from company career pages (Greenhouse, Lever, Workday, Ashby). These are not aggregated from other job boards, so they reflect live hiring intent.
- What skills do Llm Inference Infrastructure jobs in Austin require?
- Required skills vary by employer and seniority. Browse the listings above to see the specific requirements for each open role.
- How do I apply for llm inference infrastructure jobs in austin?
- Click any job listing to view the full description and apply directly on the company's career page. Upload your resume on JobsGlitch first to see your match score before applying.