NBCUniversal

media and entertainment

Forward-Deployed RL Engineer

Montréal, Quebec, Canada FULL TIME
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is optimal for mid-level candidates.

The Brief

“Forward-Deployed RL Engineer at NBCUniversal. Skills: Reinforcement Learning, Simulation Environments, Reward Engineering, Algorithm Implementation, Sim-to-Real Strategy. The role covers the design of robust simulation environments, reward structures, and policy architectures, and building and maintaining high-fidelity 2D/3D simulation environments.”

What You'll Achieve.

Ensure models perform reliably in real-world scenarios

Industry & Context.

Media and entertainment

Problems You'll Solve

Debugging non-deterministic agent behaviors

Eligibility Requirements

Must be willing to travel for work related business, if necessary

What They're Looking For.

Must Have

  • Graduate degree (Master’s or PhD) in Robotics, Computer Science, AI, or a related field with a focus on Reinforcement Learning, Imitation Learning, or other Online Machine Learning fields.
  • Proven experience as an RL Engineer or Research Engineer in a fast-paced environment.
  • Prior experience in industries with complex multi-disciplinary teams such as robotics, smart grids, precision agriculture, game development, or aerospace.
  • Fluency with Python, Git, and the Unix shell.
  • Deep familiarity with frameworks like Ray RLlib, Stable Baselines3, or CleanRL.
  • Experience with physics engines (MuJoCo, Bullet) or 3D game engines.
  • Familiarity with collaborative tools such as Jira/Confluence, Slack, a Git server, and an experiment tracking framework.
  • Must be legally authorized to work in Canada.

Nice to Have

  • Essential for understanding Markov Decision Processes (MDPs) and gradient-based optimization.
  • Critical for debugging non-deterministic agent behaviors and ensuring environment parity.

What You'll Do.

Design robust simulation environments, reward structures, and policy architectures

Build and maintain high-fidelity 2D/3D simulation environments

Design and tune complex reward functions

Develop and optimize RL algorithms

Analyze the 'reality gap' and implement domain randomization or adaptation techniques

How You'll Work.

Team & Collaboration

Work with partner ML and Annotation engineers and TPMs to spec out data, simulation, and training requirements.

Full Job Description

NBCUniversal is one of the world's leading media and entertainment companies. We create world-class content, which we distribute across our portfolio of film, television, and streaming, and bring to life through our global theme park destinations, consumer products, and experiences.

We own and operate leading entertainment and news brands, including NBC, NBC News, NBC Sports, Telemundo, NBC Local Stations, Bravo, and Peacock, our premium ad-supported streaming service. We produce and distribute premier filmed entertainment and programming through our powerhouse film and television studios, including Universal Pictures, DreamWorks Animation, and Focus Features, and the four global television studios under the Universal Studio Group banner. We also operate industry-leading theme parks and experiences around the world through Universal Destinations & Experiences, including Universal Orlando Resort, home to Universal Epic Universe, and Universal Studios Hollywood.

NBCUniversal is a subsidiary of Comcast Corporation. Visit www.nbcuniversal.com for more information.

Our impact is rooted in improving the communities where our employees, customers, and audiences live and work. We have a rich tradition of giving back and ensuring our employees have the opportunity to serve their communities. We champion an inclusive culture and strive to attract and develop a talented workforce to create and deliver a wide range of content reflecting our world.


How to Apply on SmartRecruiters

  • SmartRecruiters often includes a video screening step — check camera and mic permissions.
  • Link your GitHub or portfolio directly in the profile section for technical roles.
  • Applications may be reviewed by AI scoring before reaching a recruiter — use keywords from the job description.
