Skydio

Drones

PhDAutonomyEngineerIntern-Planning&Controls(ReinforcementLearning)

Zurich, Switzerland INTERNSHIP

Market Sentiment

HIGH DEMAND

Neural analysis suggests this role is
optimal for Entry candidates.

The Brief

“PhD Autonomy Engineer Intern - Planning & Controls (Reinforcement Learning) at Skydio. Skills: Reinforcement Learning, Autonomous flight, Robotics. Develop reinforcement learning policies. Deploy reinforcement learning policies”

What You'll Achieve.

make Skydio aircraft plan, navigate, and control themselves more intelligently; safely, reliably, and efficiently; robust obstacle avoidance; dynamic goal-seeking; smooth, agile flight; tight safety envelopes; mission constraints; safety validate on real drones weekly; intuitive control handoffs; minimal comms; make a difference in the real world; dependable deployment in human environments; close the loop from research to field ops

Industry & Context.

Drones

Problems you'll solve

Navigation & avoidance in the wild; RL-augmented planning; Sim → Real at scale; Human-in-the-loop shared control; Fleet & multi-agent

What They're Looking For.

Must Have

PhD student in Robotics, Machine Learning, Controls, or related field, fundamentals in RL, control theory, and motion comfort with safety/robustness concepts, Proficient in Python (PyTorch/JAX/Ray RLlib), at least one of C++ or CUDA, Hands-on experience with robotics simulation (Isaac Lab/MuJoCo/PyBullet), sim2real techniques, Experience training/deploying policies for navigation, manipulation, or locomotion on real robots or autonomous vehicles

Nice to Have

Publications (CoRL, ICRA, IROS, RSS, NeurIPS), Experience with onboard inference optimization (TensorRT, quantization, sparsity), Familiarity with modern policy learning beyond vanilla RL: diffusion policies, ILC, offline RL, model-based RL, Experience with multi-agent RL or distributed training

What You'll Do.

Develop reinforcement learning policies

Deploy reinforcement learning policies

Train policies for navigation

Train policies for avoidance

Fuse learned cost shaping

Build scalable datasets

Learn assistive policies

Explore decentralized coordination

Flight test on aircraft

Flight test on edge compute

How You'll Work.

Team & Collaboration

Collaborate with perception teams; Collaborate with mapping teams; Collaborate with controls teams; Collaborate with product teams

Full Job Description

Skydio is the leading US drone company and the world leader in autonomous flight, the key technology for the future of drones and aerial mobility. The Skydio team combines deep expertise in artificial intelligence, best-in-class hardware and software product development, operational excellence, and customer obsession to empower a broader, more diverse audience of drone users, from utility inspectors https://www.skydio.com/solutions/energy-and-utilities to first responders https://www.skydio.com/solutions/public-safety, soldiers in battlefield scenarios https://www.skydio.com/solutions/national-security/tactical-isr, and beyond https://www.skydio.com/solutions. About the role: Skydio builds the world’s most advanced autonomous drones used across inspection, public safety, defense, cinematography, and more. Your research won’t languish in a paper—it will fly, shaping how pilots and operators complete real missions in complex environments. Develop and deploy reinforcement learning (and adjacent policy-learning methods) that make Skydio aircraft plan, navigate, and control themselves more intelligently—safely, reliably, and efficiently—across our ecosystem: handheld apps, ground control, cloud autonomy services, and fleet workflows. HOW YOU'LL MAKE AN IMPACT: - Navigation & avoidance in the wild: Train policies that adapt online to cluttered 3D scenes (forests, bridges, urban canyons), complementing our geometric stack for robust obstacle avoidance and dynamic goal-seeking. - RL-augmented planning: Fuse learned cost shaping / value functions with trajectory optimization for smooth, agile flight with tight safety envelopes and mission constraints. - Sim → Real at scale: Build scalable datasets and training loops with Isaac Lab, domain randomization, residual learning, and safety filters; validate on real drones weekly. - Human-in-the-loop shared control: Learn assistive policies that blend pilot intent, autonomy priors, and uncertainty-aware behaviors for intuitive cont

Free ATS check

Applying for this PhD Autonomy Engineer Intern - Planning & Controls (Reinforcement Learning) role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

Should you apply? AI reads your resume vs this job — match score, gaps to address, ATS keywords.

SKILL SIGNAL 59 detected · ranked by frequency

Reinforcement Learning ×6

Python ×5

PyTorch ×5

JAX ×5

Ray RLlib ×5

CUDA ×5

Isaac Lab ×5

MuJoCo ×5

PyBullet ×5

TensorRT ×5

Robotics Simulation ×4

Sim2real ×4

Quantization ×4

Sparsity ×4

Diffusion policies ×4

ILC ×4

Offline RL ×4

Model-based RL ×4

Multi-agent RL ×4

Distributed training ×4

Autonomous flight ×3

Control Theory ×3

Motion Comfort ×3

Robotics ×2

Policy Learning

Aerial mobility

Artificial intelligence

Hardware product development

Software product development

Operational excellence

Customer obsession

Drone technology

Role Details

Experience 0–2 yrs

Level Entry

Type INTERNSHIP

Education PhD

Category autonomy

AI-Extracted Insights

Domain Areas

autonomous-dronesroboticsmachine-learningcontrolsreinforcement-learningcontrol-theorymotion-comfortrobotics-simulation

How to Apply on Ashby

Ashby is a fast modern ATS — most applications take under 3 minutes.
The resume parser is strong; verify parsed experience dates and job titles.
Custom screening questions are often scored algorithmically — answer completely.
Location field affects geo-based screening; use your actual metro area.

ANONYMOUS · UNFILTERED

What do employees actually say about Skydio?

Real rants from real employees. Read before you apply.

Read Company Rants →