GPU Inference Jobs in New York

Active GPU inference roles in New York, indexed directly from company applicant tracking systems (ATS) rather than reposted from LinkedIn, Indeed, or Glassdoor. Upload your resume to see your match score against open positions.

GPU Engineer

KOG

Paris, France Hybrid Direct
Apply →

Staff Software Engineer, Machine Learning Inference Platform

Stack AV

Pittsburgh/Remote Flexible Senior Direct
Apply →

GPU Performance Engineer

CoreWeave

Bellevue, WA Mid Direct
Apply →

Staff Software Engineer, Inference

CoreWeave

Warszawa, Masovian Voivodeship, Poland Senior Direct
Apply →

Sr. Software Engineer, Inference

CoreWeave

Warszawa, Masovian Voivodeship, Poland Senior Direct
Apply →

Software Engineer - AI/ML, AWS Neuron Inference

Annapurna Labs (U.S.) Inc.

Seattle, Washington, USA Onsite Senior Direct
Apply →

Performance Engineer, On-Device Inference

Sarvam

Bengaluru Mid Direct
Apply →

Senior Engineer, Inference Control Plane

DigitalOcean

Seattle Metro Hybrid Senior Direct
Apply →

Distributed Training and Inference Engineer

Sciforium

San Francisco Senior Direct
Apply →

Senior Performance Engineer, Discrete GPU

Sarvam

Bengaluru Senior Direct
Apply →

Research Intern, Inference (Fall 2026)

Together AI

San Francisco, California, United States Onsite Direct
Apply →

Senior Applied Scientist - Systems for ML Inference and Training Optimization

Amazon Web Services Development Center Germany GmbH

Tübingen, Baden-Württemberg, DEU Onsite Senior Direct
Apply →

Member of Technical Staff — Model Optimization and Inference (New Grad)

Nuance Labs

Seattle Onsite Entry Direct
Apply →

Senior Data Scientist, Causal Inference

Lyft

New York, New York, United States Hybrid Senior Direct
Apply →

Senior Software Engineer, Machine Learning Inference Platform

Stack AV

Pittsburgh/Remote Remote Senior Direct
Apply →

Technical Program Manager – GPU Infrastructure

Nscale

US Direct
Apply →

Presales Manager - Inference & Agentic AI

Paytm

Noida, Uttar Pradesh Onsite Manager Direct
Apply →

Software Engineering Manager, GPU AI Infrastructure

Taipei, Taiwan (2F, No. 109, Section 5, Xinyi Rd, Ankang Village, Xinyi District, Taipei City 110) Onsite Manager Direct
Apply →

Staff + Senior Software Engineer, Inference

Anthropic

San Francisco, California, United States Hybrid Senior Direct
Apply →

Software Development Manager, LLM Inference Model Enablement, Neuron SDK

Annapurna Labs

Cupertino, California, USA Onsite Manager Direct
Apply →

Systems Research Engineer Intern - GPU Programming (Fall 2026)

Together AI

San Francisco, California, United States Onsite Direct
Apply →

Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference

Annapurna Labs (U.S.) Inc.

Seattle, Washington, USA Onsite Senior Direct
Apply →

Software Development Engineer, AI/ML, AWS Neuron, Model Inference

Amazon.com Services LLC

Cupertino, California, USA Onsite Direct
Apply →

Senior Software Development Engineer - AI/ML, AWS Neuron, Multimodal Inference

Amazon.com Services LLC

Cupertino, California, USA Onsite Senior Direct
Apply →

Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference

Annapurna Labs (U.S.) Inc.

Cupertino, California, USA Onsite Senior Direct
Apply →

Software Development Engineer - AI/ML, Amazon Neuron, Multimodal Inference

Annapurna Labs (U.S.) Inc.

Seattle, Washington, USA Onsite Direct
Apply →

Software Development Engineer - AI/ML, AWS Neuron, Multimodal Inference

Annapurna Labs (U.S.) Inc.

Cupertino, California, USA Onsite Direct
Apply →

Global Supply Manager, GPU/Accelerators (Starlink)

SpaceX

Austin, Texas, United States Onsite Manager Direct
Apply →

Solutions Architect (AI GPU Infrastructure / Data Centre Architecture)

Nscale

Singapore Onsite Senior Direct
Apply →

Machine Learning Engineer II - Autonomous Driving & Inference Runtime

May Mobility

Anywhere, USA Remote Mid Direct
Apply →

Principal Engineer - Systems for ML Inference and Training Optimization

Amazon Web Services Development Center Germany GmbH

Tübingen, Baden-Württemberg, DEU Onsite Senior Direct
Apply →

Sr. Staff Observability Engineer (GPU Cloud & Telemetry Platform)

Coupang

Seoul, South Korea Senior Direct
Apply →

Sr. Staff Observability Engineer (GPU Cloud & Telemetry Platform)

Coupang

Seoul, South Korea Onsite Senior Direct
Apply →

GPU & ML Developer for Reconstruction and Simulation (EP-ALI-SC-2026-106-GRAP)

CERN

Geneva, Geneva, CH Onsite Entry Direct
Apply →

Principal Software Engineer, GPU Compute

Roblox

San Mateo, CA, United States Hybrid Senior Direct
Apply →

Senior Embedded GPU Software Engineer

CHAOS Industries

Los Angeles, California, United States Onsite Senior Direct
Apply →

AI Research Engineer, Inference

Hudson River Trading

New York, NY, United States Onsite Direct
Apply →

Member of Technical Staff — Model Optimization and Inference

Nuance Labs

Seattle Onsite Direct
Apply →

Software Engineer - BIS (Baseten Inference Stack)

Baseten

San Francisco Direct
Apply →

GPU Architect

Graphcore

US - Milpitas Onsite Senior Direct
Apply →

Product Manager / Director – GPU & AI Services

Impossible Cloud

Hamburg Onsite Director Direct
Apply →

AI Inference Performance Engineer - New College Grad 2026

NVIDIA

US, CA, Santa Clara Remote Entry Direct
Apply →

GPU Performance Engineer - Neural Reconstruction

NVIDIA

Canada, Remote Remote Direct
Apply →

Verification Engineer - GPU Fullchip

NVIDIA

India, Bengaluru Hybrid Mid Direct
Apply →

Senior GPU Architect

NVIDIA

US, CA, Santa Clara Onsite Senior Direct
Apply →

Common Questions

How many GPU inference jobs in New York are available?
JobsGlitch lists active GPU inference jobs in New York sourced directly from company ATS platforms, not reposted from LinkedIn.

Are these GPU inference roles actually hiring in New York?
Yes. Every listing is indexed directly from company career pages (Greenhouse, Lever, Workday, Ashby). These are not aggregated from other job boards, so they reflect live hiring intent.

What skills do GPU inference jobs in New York require?
Required skills vary by employer and seniority. Browse the listings above to see the specific requirements for each open role.

How do I apply for GPU inference jobs in New York?
Click any job listing to view the full description and apply directly on the company's career page. To see your match score before applying, upload your resume on JobsGlitch first.