Lightning AI

AI

PlatformSupportEngineer

San Francisco, California, United States Remote Friendly
The Brief

“Platform Support Engineer at Lightning AI. Skills: ML systems, Cloud infrastructure, Kubernetes. Support engineers training models. Support engineers deploying inference systems”

What You'll Achieve.

Take ideas from research to production; Less friction; Experimentation; Training; Production inference; Security; Observability; Control; Scale GPU workloads; Improve reliability; Shape infrastructure

Industry & Context.

AI
Problems you'll solve

Help diagnose failures; Improve reliability; Guide customers through complex problems; Technical reasoning

What They're Looking For.

Must Have

Experience running machine learning workloads at scale, Understanding of ML systems, Understanding of cloud infrastructure, Understanding of Kubernetes, Experience supporting engineers training models, Experience supporting engineers deploying inference systems, Experience supporting engineers scaling GPU workloads

Nice to Have

Experience with Kubernetes scheduling, Experience with GPU orchestration, Experience with distributed PyTorch failures, Experience with inference latency, Experience with networking bottlenecks, Experience with storage performance, Experience with platform reliability

What You'll Do.

Support engineers training models

Support engineers deploying inference systems

Support engineers scaling GPU workloads

Diagnose and resolve distributed systems issues

Diagnose and resolve ML infrastructure issues

Act as technical advisor during incidents

Translate infrastructure issues to guidance

How You'll Work.

Team & Collaboration

Partner directly with customer engineering teams; Work Directly With ML Engineers

Communication Scope

Clear communication

Free ATS check

Applying for this Platform Support Engineer role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

How to Apply on Greenhouse

  • Create a Greenhouse profile before applying — it saves time across multiple applications.
  • Upload your resume as a PDF; the parser handles it better than Word.
  • Answer all knockout questions carefully — wrong answers auto-reject before a human sees you.
  • Enable email notifications to track application status in real time.

ANONYMOUS · UNFILTERED

What do employees actually say about Lightning AI?

Real rants from real employees. Read before you apply.

Read Company Rants →