Handshake

SeniorEngineeringManager,ReinforcementLearningEnvironments(RLE)

$230–310k San Francisco, California, United States FULL TIME

Market Sentiment

HIGH DEMAND

Neural analysis suggests this role is
optimal for Senior candidates.

The Brief

“Senior Engineering Manager, Reinforcement Learning Environments (RLE) at Handshake. Skills: Engineering management, RL environments, Platform development. Lead, hire, and develop a high-performing team. Own the RLE roadmap and execution”

What You'll Achieve.

RLE becomes default platform; New domains launch quickly; Environment reliability trusted; Data quality trusted; Platform measurably improves task completion

Industry & Context.

Problems you'll solve

Execution in ambiguity; Deliver in fast-moving, unclear problem spaces

What They're Looking For.

Must Have

3+ years managing teams, 5+ years hands-on engineering experience

Nice to Have

Experience with RL training infrastructure, Simulation systems experience, Evaluation platforms experience, Human-in-the-loop systems experience, Operations-heavy, tech-enabled environment experience, Familiarity with AWS/GCP, Familiarity with APIs, Familiarity with Docker, Familiarity with modern stacks, TypeScript/Node experience, React experience, Experience building systems used by applied ML or AI research teams, Experience leading senior managing an (or equivalent scope) is a plus

What You'll Do.

and develop a high-performing team

Own the RLE roadmap and execution

Drive architecture for scalable systems

plug-and-play domains

Raise the bar on reliability

Create a culture of ownership

How You'll Work.

Team & Collaboration

Partner with Research; Partner with Product; Partner with Operations; Align cross-functionally

Process & Methodology

Roadmap execution

Full Job Description

ABOUT HANDSHAKE Handshake was founded on a simple belief that everyone deserves a path to a great career, regardless of where they went to school or who they know. Today, we power 25 million job seekers, 1 million+ employers, and 1,600 educational institutions. In 2025, we started Handshake AI and built the fastest-growing AI data business in history. We work directly with frontier AI lab researchers to create evaluations, publish benchmarks, and push the boundary of data. We’ve grown from $0 to ~$1B run rate and pay ~$60M to over 30K individuals every month. Why join Handshake now: - Shape how every career evolves in the AI economy, at global scale, with impact your friends, family and peers can see and feel - Partner hand-in-hand with world-class AI labs, Fortune 500 partners and the world’s top educational institutions - Work together with engineers, scientists, operators, and more from Palantir, Meta, Scale AI, and former YC founders - Build a massive, fast-growing business with billions in revenue About Handshake AI Human data is the core infrastructure to AI advancement. Frontier AI labs currently improve model capabilities with various data-intensive post-training techniques. We believe that data spend for AI training will increase by 3-5x in the next few years and continue for much longer as models take on new domains. Handshake AI supports all of the frontier AI labs, working on their most complex data at the largest scale. ABOUT THE ROLE We’re hiring a Senior Engineering Manager to lead our Reinforcement Learning Environments (RLE) team - the group building the interactive sandboxes where frontier models learn to complete real work. RLE environments simulate end-to-end workflows across domains like software engineering, finance, and legal research, with realistic tools, constraints, and feedback loops. The platform generates high-signal interaction data researchers use to train and evaluate models for task completion, quality, and robustness. This is a h

Free ATS check

Applying for this Senior Engineering Manager, Reinforcement Learning Environments (RLE) role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

How to Apply on Ashby

Ashby is a fast modern ATS — most applications take under 3 minutes.
The resume parser is strong; verify parsed experience dates and job titles.
Custom screening questions are often scored algorithmically — answer completely.
Location field affects geo-based screening; use your actual metro area.

ANONYMOUS · UNFILTERED

What do employees actually say about Handshake?

Real rants from real employees. Read before you apply.

Read Company Rants →