Unity Technologies
Technology
SeniorMachineLearningEngineer,MLInfrastructure
Neural analysis suggests this role is
optimal for Senior candidates.
“Senior Machine Learning Engineer, ML Infrastructure at Unity Technologies. Skills: ML Infrastructure, Online ML systems, Model serving, Inference platform. Design online inference infrastructure. Operate online inference infrastructure”
Industry & Context.
Systems thinking
What They're Looking For.
Must Have
Experience building production ML inference systems, Experience with model serving frameworks, Experience optimizing inference workloads, Experience with distributed systems, Experience with Kubernetes, Experience with autoscaling, Experience with service reliability, Experience with production observability, Programming skills in Python, Practical experience with production ML systems, Practical experience with high-scale services, Experience with PyTorch, Experience with modern model deployment workflows, Experience designing infrastructure for safe model rollout, Experience with canary testing, Experience with A/B experimentation, Experience with automated rollback, Systems thinking, Ability to reason about tradeoffs, Proven ability to lead technical direction, Proven ability to influence architectural decisions
Nice to Have
Relocation support not available, Work visa sponsorship not available
What You'll Do.
Design online inference infrastructure
Operate online inference infrastructure
Serve production ML models
Build model serving systems
Improve model serving systems
Optimize inference performance
Develop infrastructure for model deployment
Develop infrastructure for canary testing
Develop infrastructure for A/B experimentation
Develop infrastructure for traffic splitting
Develop infrastructure for rollback
Develop infrastructure for production validation
Improve observability of ML systems
Build self-healing capabilities
Build autoscaling capabilities
Partner with ML engineers
Support faster model iteration
Improve reliability of model serving workflows
Improve reproducibility of model serving workflows
Lead architectural improvements
How You'll Work.
Team & Collaboration
Work with ML engineers; Work with platform teams; Work with product stakeholders
Full Job Description
The opportunity Unity Vector builds ML infrastructure that powers real-time prediction, experimentation, attribution, and AI-driven decision-making across the company. Our online ML systems serve production models at scale, supporting low-latency inference, large-scale experimentation, model deployment and optimization, feature processing, and business-critical decisioning. As model complexity, traffic volume, and experimentation velocity continue to grow, our inference platform must remain reliable, scalable, observable, and cost-efficient. To support this growth, we need strong technical ownership to evolve the online ML infrastructure that enables ML teams to safely deploy, validate, and operate production models at scale. The Role We are seeking a senior/staff ML engineer to design and evolve Unity Vector’s online model inference platform. This role focuses on building reliable infrastructure for serving machine learning models in production, optimizing inference performance, and enabling safe, efficient experimentation across high-traffic online systems. You will work closely with ML engineers, platform teams, and product stakeholders to ensure models can be deployed, scaled, monitored, and iterated on efficiently. You will play a key role in shaping how models are packaged, served, validated, monitored, and optimized in production environments. This role requires strong systems thinking, deep experience with production ML infrastructure, and the ability to drive architectural improvements across teams. What you'll be doing Design and operate large-scale online inference infrastructure that serves production ML models with low latency and high reliability. Build and improve model serving systems using technologies such as PyTorch, Triton Inference Server, Kubernetes, GKE, Ray, or similar distributed serving frameworks. Optimize inference performance through batching, model compilation, GPU/CPU utilization improvements, request scheduling, and runtime-level tuni
Applying for this Senior Machine Learning Engineer, ML Infrastructure role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
ANONYMOUS · UNFILTERED
What do employees actually say about Unity Technologies?
Real rants from real employees. Read before you apply.