Humanoid

Technology

NeuralNetworkPerformanceEngineer

£75–110k ~AI est. London, United Kingdom FULL TIME Remote Friendly

Market Sentiment

HIGH DEMAND

Neural analysis suggests this role is
optimal for Mid candidates.

The Brief

“Neural Network Performance Engineer at Humanoid. Skills: Neural Network Performance, Deep Learning Systems, GPU Architecture. Analyze performance bottlenecks. Improve model architecture”

Industry & Context.

Technology

Problems you'll solve

Analyze performance bottlenecks; Debug numerics

What They're Looking For.

Must Have

3+ years building deep-learning systems, 1+ years experience on performance, Excellent understanding of GPU architecture, Python + PyTorch/JAX

Nice to Have

Robotics or autonomous driving experience, Open source code showcasing ability, Publications at ICLR/ICML/NeurIPS, Equivalent open-source contributions, Familiarity with VLM or VLA models

What You'll Do.

Analyze performance bottlenecks

Improve model architecture

Make model run efficiently

Implement custom kernels

Quantize model with minimal loss

How You'll Work.

Team & Collaboration

VLA team

Communication Scope

Communicate trade-offs crisply

Full Job Description

Here at Humanoid, we believe in a future where robots amplify human potential. That’s why we’ve set out on a mission to build the world’s most capable, commercially-scalable, and safe humanoid robots. We’re bringing that mission to life with HMND‑01 Alpha - our rapidly developed humanoid platform now running in real industrial pilots - and we’re growing the team to take it even further. ABOUT THE ROLE We're hiring a Neural Network Performance Engineer to join our VLA team based in London. In this role, you will work on all aspects of running capable neural-network based control policies at a high rate with minimal latency, both on cloud hardware and onboard. Your work will be critical to delivering smooth robot motions while reacting to environment changes as quickly as possible. WHAT YOU'LL DO - Analyze performance bottlenecks of a particular model architecture and come up with potential improvements. - Make the model run on a new hardware (e.g. NVIDIA Thor) efficiently. - Implement custom kernels to reduce memory throughput requirements where it matters. - Quantize a model with minimal loss of quality. - Suggest and implement changes of model architecture that will enable better performance characteristics without sacrificing model capabilities. WHAT WE'RE LOOKING FOR - 3+ years building deep‑learning systems (industry or research) with shipped models or published artifacts to show for it. - 1+ years experience working on performance of neural network inference (analyzing bottlenecks, writing custom kernels, quantizing models, fighting deep learning compilers). - Excellent understanding of GPU architecture and why some models run faster than others. - Strong Python + PyTorch/JAX; you can profile, debug numerics, and write maintainable research code. - You document experiments clearly and communicate trade‑offs crisply. Nice to have: - Robotics or autonomous driving experience. - Open source code showcasing your ability to improve inference performance. - Publicati

Free ATS check

Applying for this Neural Network Performance Engineer role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

Should you apply? AI reads your resume vs this job — match score, gaps to address, ATS keywords.

SKILL SIGNAL 18 detected · ranked by frequency

Performance tuning ×3

Kernel programming ×3

Model optimization ×3

Numerics debugging ×3

Neural Network Performance ×2

Deep Learning Systems ×2

GPU Architecture ×2

PyTorch ×2

JAX ×2

Python

Deep Learning

Neural Networks

GPU

Performance analysis

System optimization

Kernel implementation

Model quantization

Architecture modification

Role Details

Seniority Mid

Experience 3–5 yrs

Level Mid

Work Mode Onsite

Type FULL TIME

Category software

Salary Band 75k-100k

AI-Extracted Insights

Domain Areas

neural-network-inferencedeep-learning-compilersvision-language-modelsvision-language-action-models

How to Apply on Ashby

Ashby is a fast modern ATS — most applications take under 3 minutes.
The resume parser is strong; verify parsed experience dates and job titles.
Custom screening questions are often scored algorithmically — answer completely.
Location field affects geo-based screening; use your actual metro area.

ANONYMOUS · UNFILTERED

What do employees actually say about Humanoid?

Real rants from real employees. Read before you apply.

Read Company Rants →