Home / Jobs / Gpu Inference
Jobs by Role

Gpu Inference
Jobs

No openings found right now · Updated daily

Active Gpu Inference roles are indexed directly from company ATS systems — Greenhouse, Lever, Workday, Ashby, and 15+ others. Advertised salaries average $4909k/year based on live listings. 28% of roles are remote-friendly. These listings don't come from other job boards — they're pulled from source, so many won't appear on LinkedIn, Indeed, or Glassdoor.

Open Roles

0

Avg Salary

$4909k

Remote-Friendly

28%

Added This Week

35

50 shown

GPU Performance Engineer

CoreWeave

Bellevue, WA Mid Direct
Apply →

GPU Engineer

KOG

Paris, France Hybrid Direct
Apply →

Staff Software Engineer, Inference

CoreWeave

Warszawa, Masovian Voivodeship, Poland Senior Direct
Apply →

Software engineer -AI/ML, AWS Neuron Inference, AWS Neuron Inference

Annapurna Labs (U. S. ) Inc.

Seattle, Washington, USA Onsite Senior Direct
Apply →

Senior Engineer, Inference Control Plane

DigitalOcean

Seattle Metro Hybrid Senior Direct
Apply →

Senior Performance Engineer, Discrete GPU

Sarvam

Bengaluru Senior Direct
Apply →

Performance Engineer, On-Device Inference

Sarvam

Bengaluru Mid Direct
Apply →

Distributed Training and Inference Engineer

Sciforium

San Francisco Senior Direct
Apply →

Research Intern, Inference (Fall 2026)

Together AI

San Francisco, California, United States Onsite Direct
Apply →

Technical Program Manager – GPU Infrastructure

Nscale

US Direct
Apply →

Presales Manager - Inference & Agentic AI

Paytm

Noida, Uttar Pradesh Onsite Manager Direct
Apply →

Senior Data Scientist, Causal Inference

Lyft

New York, New York, United States Hybrid Senior Direct
Apply →

Senior Data Scientist, Causal Inference

Lyft

New York, New York, United States Hybrid Senior Direct
Apply →

Senior Data Scientist, Causal Inference

Lyft

New York, New York, United States Hybrid Senior Direct
Apply →

Staff + Senior Software Engineer, Inference

Anthropic

San Francisco, California, United States Hybrid Senior Direct
Apply →

Staff Software Engineer, Machine Learning Inference Platform

Stack AV

Pittsburgh/Remote Flexible Senior Direct
Apply →

Senior Software Engineer, Machine Learning Inference Platform

Stack AV

Pittsburgh/Remote Remote Senior Direct
Apply →

Global Supply Manager, GPU/Accelerators (Starlink)

SpaceX

Austin, Texas, United States Onsite Manager Direct
Apply →

Systems Research Engineer Intern - GPU Programming (Fall 2026)

Together AI

San Francisco, California, United States Onsite Direct
Apply →

Solutions Architect (AI GPU Infrastructure / Data Centre Architecture)

Nscale

Singapore Onsite Senior Direct
Apply →

Software Development Engineer, AI/ML, AWS Neuron, Model Inference

Amazon.com Services LLC

Cupertino, California, USA Onsite Direct
Apply →

Software Development Manager, LLM Inference Model Enablement, Neuron SDK

Annapurna Labs

Cupertino, California, USA Onsite Manager Direct
Apply →

Principal Engineer - Systems for ML Inference and Training Optimization

Amazon Web Services Development Center Germany GmbH

Tübingen, Baden-Wurttemberg, DEU Onsite Senior Direct
Apply →

Software Development Engineer - AI/ML, Amazon Neuron, Multimodal Inference

Annapurna Labs (U. S. ) Inc.

Seattle, Washington, USA Onsite Direct
Apply →

Software Development Engineer - AI/ML, AWS Neuron, Multimodal Inference

Annapurna Labs (U. S. ) Inc.

Cupertino, California, USA Onsite Direct
Apply →

Member of Technical Staff — Model Optimization and Inference (New Grad)

Nuance Labs

Seattle Onsite Entry Direct
Apply →

Machine Learning Engineer II - Autonomous Driving & Inference Runtime

May Mobility

Anywhere, USA Remote Mid Direct
Apply →

Sr. Staff Observability Engineer (GPU Cloud & Telemetry Platform)

Coupang

Seoul, South Korea Onsite Senior Direct
Apply →

Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference

Annapurna Labs (U. S. ) Inc.

Seattle, Washington, USA Onsite Senior Direct
Apply →

Sr. Staff Observability Engineer (GPU Cloud & Telemetry Platform)

Coupang

Seoul, South Korea Senior Direct
Apply →

Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference

Annapurna Labs (U. S. ) Inc.

Cupertino, California, USA Onsite Senior Direct
Apply →

Senior Software Development Engineer - AI/ML, AWS Neuron, Multimodal Inference

Amazon.com Services LLC

Cupertino, California, USA Onsite Senior Direct
Apply →

Senior Applied Scientist - Systems for ML Inference and Training Optimization

Amazon Web Services Development Center Germany GmbH

Tübingen, Baden-Wurttemberg, DEU Onsite Senior Direct
Apply →

Principal Software Engineer, GPU Compute

Roblox

San Mateo, CA, United States Hybrid Senior Direct
Apply →

Senior Embedded GPU Software Engineer

CHAOS Industries

Los Angeles, California, United States Onsite Senior Direct
Apply →

GPU & ML Developer for Reconstruction and Simulation (EP-ALI-SC-2026-106-GRAP)

CERN

Geneva, GENEVA, CH Onsite entry Direct
Apply →

GPU & ML Developer for Reconstruction and Simulation (EP-ALI-SC-2026-106-GRAP)

CERN

Geneva, GENEVA, CH Onsite entry Direct
Apply →

GPU Architect

Graphcore

US - Milpitas Onsite Senior Direct
Apply →

Product Manager / Director – GPU & AI Services

Impossible Cloud

Hamburg Onsite Director Direct
Apply →

Member of Technical Staff — Model Optimization and Inference

Nuance Labs

Seattle Onsite Direct
Apply →

GPU Performance Engineer - Neural Reconstruction

NVIDIA

Canada, Remote Remote Direct
Apply →

AI Research Engineer, Inference

Hudson River Trading

New York, NY, United States Onsite Direct
Apply →

Verification Engineer - GPU Fullchip

NVIDIA

India, Bengaluru Hybrid Mid Direct
Apply →

Senior GPU Architect

NVIDIA

US, CA, Santa Clara Onsite Senior Direct
Apply →

Verification Engineer - GPU Fullchip

NVIDIA

India, Bengaluru Hybrid Mid Direct
Apply →

Senior GPU Architect

NVIDIA

US, CA, Santa Clara Onsite Senior Direct
Apply →

Software Engineer- BIS (Baseten Inference Stack)

Baseten

San Francisco Direct
Apply →

Software Engineer- BIS (Baseten Inference Stack)

Baseten

San Francisco Direct
Apply →

Senior Staff System Engineer, GPU Fleet

Coupang

Bengaluru Hybrid Senior Direct
Apply →

Staff + Sr. Software Engineer, Cloud Inference

Anthropic

San Francisco, California, United States Hybrid Senior Direct
Apply →

Common Questions

How many Gpu Inference jobs are available?
JobsGlitch lists active Gpu Inference jobs sourced daily from Greenhouse, Lever, Ashby, Workday, and other top ATS platforms.
What skills are required for Gpu Inference roles?
The most in-demand skills for Gpu Inference roles are Distributed systems, Kernel development, Low-level optimization, AI/ML, AWS Neuron. Requirements vary by seniority and company.
What is the average salary for a Gpu Inference?
The average salary for Gpu Inference roles on JobsGlitch is approximately $4909k/year. Compensation varies by location, seniority, and company.
Are there remote Gpu Inference jobs?
Yes — 28% of Gpu Inference jobs on JobsGlitch are remote-friendly. Browse remote Gpu Inference jobs at jobsglitch.com/jobs/remote/gpu-inference.