Applied Compute
Technology
AIPlatformEngineer
Neural analysis suggests this role is
optimal for Senior candidates.
“AI Platform Engineer at Applied Compute. Skills: AI Platform Engineering, Distributed Systems, ML Infrastructure, Cloud Infrastructure. Build orchestration systems for post-training. Build orchestration systems for evaluation”
Industry & Context.
Reason about performance; Reason about scalability; Reason about fault tolerance; Reason about operational tradeoffs
What They're Looking For.
Must Have
5+ years experience building distributed systems, 5+ years experience building infrastructure platforms, 5+ years experience building ML infrastructure, 5+ years experience building large-scale backend services, Systems engineering fundamentals, Experience designing production systems, Experience operating production systems, Experience building orchestration systems, Experience building data pipelines, Experience building model serving infrastructure, Experience building large-scale platform services, Experience operating orchestration systems, Experience operating data pipelines, Experience operating model serving infrastructure, Experience operating large-scale platform services, Familiarity with containers, Familiarity with Kubernetes, Familiarity with infrastructure-as-code, Familiarity with modern deployment workflows, Understanding of security fundamentals, Ability to reason about performance, Ability to reason about scalability, Ability to reason about fault tolerance, Ability to reason about operational tradeoffs
Nice to Have
Experience with sandboxing technologies, Experience with isolation technologies, Experience with workflow orchestration systems, Experience building platforms deployed into customer-controlled cloud environments, Experience with ML infrastructure, Experience with model serving, Experience with distributed training, Experience with evaluation systems, Experience with GPU scheduling, Experience building developer platforms, Experience building internal tooling, Experience building systems that accelerate productivity
What You'll Do.
Build orchestration systems for post-training
Build orchestration systems for evaluation
Build orchestration systems for data generation
Build orchestration systems for continuous improvement
Build large-scale evaluation infrastructure
Measure model performance
Measure agent performance
Design model serving systems
Operate model serving systems
Deliver low-latency inference
Deliver reliable inference
Architect data infrastructure
Power model improvement
Develop secure execution environments
Design authentication
Design security controls
Build deployment systems
Build provisioning systems
Improve observability
Improve operational efficiency
Partner with applied researchers
Build infrastructure for production data
Build infrastructure for better models
Build infrastructure for evaluations
Build infrastructure for AI systems
How You'll Work.
Team & Collaboration
Partner closely with applied researchers
Full Job Description
THE ROLE At Applied Compute, our applied researchers work directly with enterprises to design, deploy, and continuously improve AI agents that solve real operational problems. As an AI Platform Engineer, you'll build the infrastructure that makes this possible. You'll own the foundational systems that power Applied Compute's post-training and agent infrastructure: large-scale evaluation pipelines, model serving systems, training orchestration, secure execution environments, and the deployment platform that brings continuously improving AI systems into customer environments. Your work will enable researchers to rapidly build, evaluate, and deploy production AI systems while meeting the security, reliability, and compliance requirements of large enterprises. What you'll do - Build orchestration systems for post-training, evaluation, data generation, and continuous improvement workflows - Build large-scale evaluation infrastructure that measures model and agent performance across customer deployments and research workflows - Design and operate model serving systems that deliver low-latency, reliable inference for production AI applications - Architect the data infrastructure that powers training, evaluation, observability, and model improvement across customer environments - Develop secure execution environments for agents, evaluations, and training workloads using microVMs, containers, and modern sandboxing technologies - Design authentication, authorization, audit logging, and security controls that enable AI systems to operate safely within enterprise environments - Build deployment and provisioning systems that allow continuously improving models and agents to run inside customer VPCs and cloud environments - Improve reliability, scalability, observability, and operational efficiency across serving, evaluation, and training infrastructure - Partner closely with applied researchers to build the infrastructure that turns production data into better models, evaluation
Applying for this AI Platform Engineer role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Ashby
- Ashby is a fast modern ATS — most applications take under 3 minutes.
- The resume parser is strong; verify parsed experience dates and job titles.
- Custom screening questions are often scored algorithmically — answer completely.
- Location field affects geo-based screening; use your actual metro area.
ANONYMOUS · UNFILTERED
What do employees actually say about Applied Compute?
Real rants from real employees. Read before you apply.