Humanoid

Technology

NeuralNetworkPerformanceEngineer

£75–110k ~AI est. London, United Kingdom FULL TIME Remote Friendly
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is
optimal for Mid candidates.

The Brief

“Neural Network Performance Engineer at Humanoid. Skills: Neural Network Performance, Deep Learning Systems, GPU Architecture. Analyze performance bottlenecks. Improve model architecture”

Industry & Context.

Technology
Problems you'll solve

Analyze performance bottlenecks; Debug numerics

What They're Looking For.

Must Have

3+ years building deep-learning systems, 1+ years experience on performance, Excellent understanding of GPU architecture, Python + PyTorch/JAX

Nice to Have

Robotics or autonomous driving experience, Open source code showcasing ability, Publications at ICLR/ICML/NeurIPS, Equivalent open-source contributions, Familiarity with VLM or VLA models

What You'll Do.

Analyze performance bottlenecks

Improve model architecture

Make model run efficiently

Implement custom kernels

Quantize model with minimal loss

How You'll Work.

Team & Collaboration

VLA team

Communication Scope

Communicate trade-offs crisply

Full Job Description

Here at Humanoid, we believe in a future where robots amplify human potential. That’s why we’ve set out on a mission to build the world’s most capable, commercially-scalable, and safe humanoid robots. We’re bringing that mission to life with HMND‑01 Alpha - our rapidly developed humanoid platform now running in real industrial pilots - and we’re growing the team to take it even further. ABOUT THE ROLE We're hiring a Neural Network Performance Engineer to join our VLA team based in London. In this role, you will work on all aspects of running capable neural-network based control policies at a high rate with minimal latency, both on cloud hardware and onboard. Your work will be critical to delivering smooth robot motions while reacting to environment changes as quickly as possible. WHAT YOU'LL DO - Analyze performance bottlenecks of a particular model architecture and come up with potential improvements. - Make the model run on a new hardware (e.g. NVIDIA Thor) efficiently. - Implement custom kernels to reduce memory throughput requirements where it matters. - Quantize a model with minimal loss of quality. - Suggest and implement changes of model architecture that will enable better performance characteristics without sacrificing model capabilities. WHAT WE'RE LOOKING FOR - 3+ years building deep‑learning systems (industry or research) with shipped models or published artifacts to show for it. - 1+ years experience working on performance of neural network inference (analyzing bottlenecks, writing custom kernels, quantizing models, fighting deep learning compilers). - Excellent understanding of GPU architecture and why some models run faster than others. - Strong Python + PyTorch/JAX; you can profile, debug numerics, and write maintainable research code. - You document experiments clearly and communicate trade‑offs crisply. Nice to have: - Robotics or autonomous driving experience. - Open source code showcasing your ability to improve inference performance. - Publicati

Free ATS check

Applying for this Neural Network Performance Engineer role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

How to Apply on Ashby

  • Ashby is a fast modern ATS — most applications take under 3 minutes.
  • The resume parser is strong; verify parsed experience dates and job titles.
  • Custom screening questions are often scored algorithmically — answer completely.
  • Location field affects geo-based screening; use your actual metro area.

ANONYMOUS · UNFILTERED

What do employees actually say about Humanoid?

Real rants from real employees. Read before you apply.

Read Company Rants →