Unity Technologies

Technology

SeniorMachineLearningEngineer,MLInfrastructure

$550–950k ~AI est. Shanghai, China

Market Sentiment

HIGH DEMAND

Neural analysis suggests this role is
optimal for Senior candidates.

The Brief

“Senior Machine Learning Engineer, ML Infrastructure at Unity Technologies. Skills: ML Infrastructure, Online ML systems, Model serving, Inference platform. Design online inference infrastructure. Operate online inference infrastructure”

Industry & Context.

Technology

Problems you'll solve

Systems thinking

What They're Looking For.

Must Have

Experience building production ML inference systems, Experience with model serving frameworks, Experience optimizing inference workloads, Experience with distributed systems, Experience with Kubernetes, Experience with autoscaling, Experience with service reliability, Experience with production observability, Programming skills in Python, Practical experience with production ML systems, Practical experience with high-scale services, Experience with PyTorch, Experience with modern model deployment workflows, Experience designing infrastructure for safe model rollout, Experience with canary testing, Experience with A/B experimentation, Experience with automated rollback, Systems thinking, Ability to reason about tradeoffs, Proven ability to lead technical direction, Proven ability to influence architectural decisions

Nice to Have

Relocation support not available, Work visa sponsorship not available

What You'll Do.

Design online inference infrastructure

Operate online inference infrastructure

Serve production ML models

Build model serving systems

Improve model serving systems

Optimize inference performance

Develop infrastructure for model deployment

Develop infrastructure for canary testing

Develop infrastructure for A/B experimentation

Develop infrastructure for traffic splitting

Develop infrastructure for rollback

Develop infrastructure for production validation

Improve observability of ML systems

Build self-healing capabilities

Build autoscaling capabilities

Partner with ML engineers

Support faster model iteration

Improve reliability of model serving workflows

Improve reproducibility of model serving workflows

Lead architectural improvements

How You'll Work.

Team & Collaboration

Work with ML engineers; Work with platform teams; Work with product stakeholders

Full Job Description

The opportunity Unity Vector builds ML infrastructure that powers real-time prediction, experimentation, attribution, and AI-driven decision-making across the company. Our online ML systems serve production models at scale, supporting low-latency inference, large-scale experimentation, model deployment and optimization, feature processing, and business-critical decisioning. As model complexity, traffic volume, and experimentation velocity continue to grow, our inference platform must remain reliable, scalable, observable, and cost-efficient. To support this growth, we need strong technical ownership to evolve the online ML infrastructure that enables ML teams to safely deploy, validate, and operate production models at scale. The Role We are seeking a senior/staff ML engineer to design and evolve Unity Vector’s online model inference platform. This role focuses on building reliable infrastructure for serving machine learning models in production, optimizing inference performance, and enabling safe, efficient experimentation across high-traffic online systems. You will work closely with ML engineers, platform teams, and product stakeholders to ensure models can be deployed, scaled, monitored, and iterated on efficiently. You will play a key role in shaping how models are packaged, served, validated, monitored, and optimized in production environments. This role requires strong systems thinking, deep experience with production ML infrastructure, and the ability to drive architectural improvements across teams. What you'll be doing Design and operate large-scale online inference infrastructure that serves production ML models with low latency and high reliability. Build and improve model serving systems using technologies such as PyTorch, Triton Inference Server, Kubernetes, GKE, Ray, or similar distributed serving frameworks. Optimize inference performance through batching, model compilation, GPU/CPU utilization improvements, request scheduling, and runtime-level tuni

Free ATS check

Applying for this Senior Machine Learning Engineer, ML Infrastructure role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

Should you apply? AI reads your resume vs this job — match score, gaps to address, ATS keywords.

SKILL SIGNAL 37 detected · ranked by frequency

Inference optimization ×3

Model compilation ×3

GPU utilization ×3

CPU utilization ×3

Request scheduling ×3

Runtime tuning ×3

Batching ×3

Quantization ×3

GPU acceleration ×3

GPU kernel optimization ×3

Caching ×3

Service reliability ×3

Production observability ×3

Model packaging ×3

Artifact validation ×3

Compatibility testing ×3

Deployment automation ×3

ML Infrastructure ×2

Online ML systems ×2

Model serving ×2

Inference platform ×2

Kubernetes ×2

GKE ×2

TensorFlow Serving ×2

TorchServe ×2

Ray Serve ×2

Python

PyTorch

Triton Inference Server

Ray

Systems thinking

Inference performance optimization

Role Details

Experience 5–10 yrs

Level Senior

Work Mode Onsite

Category ai-&-machine-learning

Salary Band 200k+

AI-Extracted Insights

Domain Areas

online-ml-systemsreal-time-predictionml-infrastructurelow-latency-inferencelarge-scale-experimentationmodel-deploymentmodel-optimizationfeature-processing

ANONYMOUS · UNFILTERED

What do employees actually say about Unity Technologies?

Real rants from real employees. Read before you apply.

Read Company Rants →