Applied Compute

Technology

AIPlatformEngineer

$175–235k ~AI est. San Francisco, California, United States FULL TIME
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is
optimal for Senior candidates.

The Brief

“AI Platform Engineer at Applied Compute. Skills: AI Platform Engineering, Distributed Systems, ML Infrastructure, Cloud Infrastructure. Build orchestration systems for post-training. Build orchestration systems for evaluation”

Industry & Context.

Technology
Problems you'll solve

Reason about performance; Reason about scalability; Reason about fault tolerance; Reason about operational tradeoffs

What They're Looking For.

Must Have

5+ years experience building distributed systems, 5+ years experience building infrastructure platforms, 5+ years experience building ML infrastructure, 5+ years experience building large-scale backend services, Systems engineering fundamentals, Experience designing production systems, Experience operating production systems, Experience building orchestration systems, Experience building data pipelines, Experience building model serving infrastructure, Experience building large-scale platform services, Experience operating orchestration systems, Experience operating data pipelines, Experience operating model serving infrastructure, Experience operating large-scale platform services, Familiarity with containers, Familiarity with Kubernetes, Familiarity with infrastructure-as-code, Familiarity with modern deployment workflows, Understanding of security fundamentals, Ability to reason about performance, Ability to reason about scalability, Ability to reason about fault tolerance, Ability to reason about operational tradeoffs

Nice to Have

Experience with sandboxing technologies, Experience with isolation technologies, Experience with workflow orchestration systems, Experience building platforms deployed into customer-controlled cloud environments, Experience with ML infrastructure, Experience with model serving, Experience with distributed training, Experience with evaluation systems, Experience with GPU scheduling, Experience building developer platforms, Experience building internal tooling, Experience building systems that accelerate productivity

What You'll Do.

Build orchestration systems for post-training

Build orchestration systems for evaluation

Build orchestration systems for data generation

Build orchestration systems for continuous improvement

Build large-scale evaluation infrastructure

Measure model performance

Measure agent performance

Design model serving systems

Operate model serving systems

Deliver low-latency inference

Deliver reliable inference

Architect data infrastructure

Power model improvement

Develop secure execution environments

Design authentication

Design security controls

Build deployment systems

Build provisioning systems

Improve observability

Improve operational efficiency

Partner with applied researchers

Build infrastructure for production data

Build infrastructure for better models

Build infrastructure for evaluations

Build infrastructure for AI systems

How You'll Work.

Team & Collaboration

Partner closely with applied researchers

Full Job Description

THE ROLE At Applied Compute, our applied researchers work directly with enterprises to design, deploy, and continuously improve AI agents that solve real operational problems. As an AI Platform Engineer, you'll build the infrastructure that makes this possible. You'll own the foundational systems that power Applied Compute's post-training and agent infrastructure: large-scale evaluation pipelines, model serving systems, training orchestration, secure execution environments, and the deployment platform that brings continuously improving AI systems into customer environments. Your work will enable researchers to rapidly build, evaluate, and deploy production AI systems while meeting the security, reliability, and compliance requirements of large enterprises. What you'll do - Build orchestration systems for post-training, evaluation, data generation, and continuous improvement workflows - Build large-scale evaluation infrastructure that measures model and agent performance across customer deployments and research workflows - Design and operate model serving systems that deliver low-latency, reliable inference for production AI applications - Architect the data infrastructure that powers training, evaluation, observability, and model improvement across customer environments - Develop secure execution environments for agents, evaluations, and training workloads using microVMs, containers, and modern sandboxing technologies - Design authentication, authorization, audit logging, and security controls that enable AI systems to operate safely within enterprise environments - Build deployment and provisioning systems that allow continuously improving models and agents to run inside customer VPCs and cloud environments - Improve reliability, scalability, observability, and operational efficiency across serving, evaluation, and training infrastructure - Partner closely with applied researchers to build the infrastructure that turns production data into better models, evaluation

Free ATS check

Applying for this AI Platform Engineer role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

How to Apply on Ashby

  • Ashby is a fast modern ATS — most applications take under 3 minutes.
  • The resume parser is strong; verify parsed experience dates and job titles.
  • Custom screening questions are often scored algorithmically — answer completely.
  • Location field affects geo-based screening; use your actual metro area.

ANONYMOUS · UNFILTERED

What do employees actually say about Applied Compute?

Real rants from real employees. Read before you apply.

Read Company Rants →