Unity Technologies

Technology

Staff Machine Learning Engineer, ML Infrastructure - Offline

$750–1200k ~AI est. Shanghai, China

Market Sentiment

HIGH DEMAND

Neural analysis suggests this role is optimal for Senior candidates.

The Brief

“Staff Machine Learning Engineer, ML Infrastructure - Offline at Unity Technologies. Skills: ML Infrastructure, Offline ML platform, Distributed training. Design data pipelines. Operate data pipelines”

Industry & Context.

Technology

Problems you'll solve

Systems thinking

Eligibility Requirements

No relocation support, No work visa sponsorship

What They're Looking For.

Must Have

Experience building large-scale ML pipelines, Experience with distributed computing frameworks, Programming skills in Python, Experience working with large-scale distributed workloads, Experience with modern data infrastructure, Systems thinking, Proven ability to lead technical direction, Sufficient knowledge of English

Nice to Have

Familiarity with the Ray ecosystem, Experience building infrastructure for training data generation, Experience building infrastructure for dataset preparation, Experience building infrastructure for ML feature pipelines, Deep experience designing and operating production-grade data pipelines

What You'll Do.

Design data pipelines

Operate data pipelines

Generate training datasets

Develop infrastructure for distributed training

Integrate ML pipelines with orchestration systems

Improve reproducibility of ML pipelines

Improve observability of ML pipelines

Optimize performance across compute systems

Optimize resource utilization

Partner with ML engineers

Enable large-scale experimentation

Enable model iteration

Lead architectural improvements

How You'll Work.

Team & Collaboration

ML engineers; Platform teams

Full Job Description

The opportunity

Unity Vector builds an offline ML platform that powers insight, experimentation, attribution, and AI-driven decision-making across the company. Our systems operate at scale across batch and streaming data, supporting analytics, product intelligence, machine learning pipelines, and business operations. As data volume and complexity grow, our platform also supports large-scale model training, feature generation, and experimentation workflows that power production ML systems. To support this growth, we need strong technical ownership to ensure our ML pipelines remain reliable, scalable, and architecturally sound.

The Role

We are seeking a senior ML engineer to design and evolve the large-scale offline platform. This role focuses on building reliable infrastructure for generating training datasets, orchestrating ML workflows, and enabling efficient, distributed model training at scale. You will work closely with ML engineers and platform teams to ensure our pipelines can efficiently handle growing data volumes and increasingly complex training workloads. You will play a key role in shaping how datasets for model training are prepared, validated, and delivered to distributed training systems, while ensuring the reliability, scalability, and performance of our offline ML platform.

What you'll be doing

Design and operate large-scale data pipelines that generate training datasets used for machine learning training and experimentation

Develop infrastructure that supports distributed training workflows using technologies such as PyTorch, Ray Data, and Ray Train

Integrate ML pipelines with workflow orchestration systems (e.g., Flyte, Airflow, or similar) to enable reliable multi-stage training workflows

Improve reproducibility and observability of ML pipelines through dataset validation, monitoring, and automated testing

Optimize performance and resource utilization across distributed compute systems used for data processing and model trainin

SKILL SIGNAL 28 detected · ranked by frequency

Distributed training ×3

Distributed computing ×3

Data processing ×3

Model training ×3

Feature generation ×3

Experimentation workflows ×3

ML Infrastructure ×2

Offline ML platform ×2

Python

PyTorch

Ray Data

Ray Train

Ray

Spark

Flink

Data lakes

Data warehouses

Streaming platforms

ML pipelines

Data pipelines

Dataset validation

Performance optimization

Resource utilization

Model iteration

Architectural improvements

Cost efficiency

Flyte

Airflow

Role Details

Experience 5–10 yrs

Level Senior

Work Mode Onsite

Category ai-&-machine-learning

Salary Band 200k+

AI-Extracted Insights

Domain Areas

offline-ml-platform, distributed-systems, batch-data, streaming-data, machine-learning-pipelines, model-training, feature-generation, experimentation-workflows
