Recraft

Technology

ML Data Engineer

£70–100k ~AI est. London, United Kingdom FULL TIME
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is optimal for Mid+ candidates.

The Brief

“ML Data Engineer at Recraft. Skills: Data pipelines, Kubernetes, Unstructured data. Develop data-ingestion pipelines. Prepare large-scale image datasets”

What You'll Achieve.

Ship datasets that move model quality forward

Industry & Context.

Technology

Problems you'll solve

Root cause analysis

What They're Looking For.

Must Have

Python production-ready code, Hands-on Kubernetes experience, Unstructured data experience, Data-ingestion tools experience, S3/object storage experience, English B2+

Nice to Have

Familiarity with ML workflows, Experience with image quality scoring, DAG/workflow visualization tooling, DevOps fluency

What You'll Do.

Develop data-ingestion pipelines

Prepare large-scale image datasets

Source datasets from public sources

Filter data for quality

Operate data-pipeline framework

Improve data-pipeline framework

Work with S3 object storage

Add tooling around pipelines

Visualize pipeline progress

Add metrics to pipelines

Add alerts to pipelines

Collaborate with ML engineers

Align datasets with training needs

Accelerate experimentation

How You'll Work.

Team & Collaboration

ML engineers

Full Job Description

ABOUT US

Founded in the US in 2022 and now based in London, UK, Recraft is an AI tool for professional designers, illustrators, and marketers, setting a new standard for excellence in image generation. We designed a tool that lets creators quickly generate and iterate original images, vector art, illustrations, icons, and 3D graphics with AI. Over 3 million users across 190+ countries have produced hundreds of millions of images using Recraft, and we're just getting started.

Join a universe of professional opportunities, develop and support large-scale projects, and shape the future of creativity. We are committed to making Recraft an essential, daily tool for every designer and setting the industry standard. Our mission is to ensure that creators can fully control their creative process with AI, providing them with innovative tools to turn ideas into reality. If you’re passionate about pushing the boundaries of AI, we want you on board!

JOB DESCRIPTION

At Recraft, we’re building the next generation of generative models across images and text. We’re looking for an ML Data Engineer to scale our data pipelines for unstructured data (primarily images) and keep our training flows fast, reliable, and repeatable. You’ll design and operate high-throughput ingestion and preprocessing on Kubernetes, evolve our internal data-pipeline framework, and work hand-in-hand with ML engineers to ship datasets that move model quality forward.

KEY RESPONSIBILITIES

- Develop and maintain data-ingestion pipelines to source and prepare large-scale image (and occasional text/HTML) datasets from open, publicly accessible, and permitted sources.
- Own the end-to-end flow: raw data → quality/beauty/relevance filtering → dedup/validation → ready-to-train artifacts.
- Operate and improve our Kubernetes-based data-pipeline framework (distributed jobs, retries, monitoring, automation).
- Work with S3-style object storage: efficient layouts, lifecycle, throughput, and cost awareness.
- Add tooling ar


How to Apply on Ashby

  • Ashby is a fast modern ATS — most applications take under 3 minutes.
  • The resume parser is strong; verify parsed experience dates and job titles.
  • Custom screening questions are often scored algorithmically — answer completely.
  • Location field affects geo-based screening; use your actual metro area.
