Preply
Ed-Tech
SeniorIIDataEngineer-DataIngestionandEnrichmentteam
Neural analysis suggests this role is
optimal for Senior candidates.
“Senior II Data Engineer - Data Ingestion and Enrichment team at Preply. Skills: Data ingestion, Data enrichment, Data lake architecture, Batch and streaming data pipelines, Data quality, Observability, Reliability engineering, Technical leadership. Own and drive technical vision for the data layer. Design, build, and own Preply’s data lake”
What You'll Achieve.
Ensure features, datasets, and pipelines are production-ready, observable, and reusable; Treat trust, correctness, and predictability as first-class features of the platform; Catch issues before they propagate; Ensure data failures are visible, diagnosable, and recoverable; Enable teams to onboard new data sources independently within clear guardrails
Industry & Context.
Exceptional problem-solving skills; Dive deep into disparities between numbers and stories, unlocking meaningful insights
What They're Looking For.
Must Have
Solid experience working in platform or data engineering teams (or equivalent impact) with evidence of leading multi-stakeholder deliveries, Familiarity with cloud platforms (AWS/GCP or equivalent) and modern DevOps practices, Hands-on experience designing and implementing real-time and batch data processing infrastructures using modern frameworks like Spark, Flink, Spark streaming, Kafka, Debezium, etc., Expertise with orchestration tools such as Airflow, dbt, or similar, Exceptional problem-solving skills paired with a proactive, innovative mindset focused on continuous improvement, Communication and cross-functional collaboration skills (English level B2+)
Nice to Have
Proven track record in scaling data infrastructures within fast-growing startups, Terraform/Kubernetes for data tooling, SQL proficiency
What You'll Do.
Own and drive technical vision for the data layer
and own Preply’s data lake
Ensure datasets have clear ownership
and quality expectations
Develop and operate scalable
reliable batch and streaming ingestion pipelines
Design raw -> standardized -> consumption layers
Define and implement data contracts
Build enrichment logic
Support historical tracking
point-in-time correctness
and dataset versioning
Instrument ingestion pipelines with observability metrics
and incident response
Apply consistent access control
and privacy protections
Contribute to standardized ingestion templates
Improve discoverability
How You'll Work.
Team & Collaboration
Work closely with ML Platform, Applied/Data Scientists, Analytics Engineering, and Product squads; Drive cross-functional initiatives involving stakeholders from different functional areas and different levels of seniority; Collaborate with Product, Backend, Analytics, and ML partners; Promote shared ownership of data quality and platform standards; Foster a culture where teams move fast together under common data contracts and principles; Prioritize collaboration, inclusion, and the success of the team over personal ambitions
Communication Scope
English level B2+
Process & Methodology
Leading multi-stakeholder deliveries
Full Job Description
WE POWER PEOPLE’S PROGRESS. At Preply, we’re all about creating life-changing learning experiences. We help people discover the magic of the perfect tutor, craft a personalised learning journey, and stay motivated to keep growing. Our approach is human-led, tech-enabled - and it’s creating real impact. We’ve just reached unicorn status with a $150M Series D, accelerating our vision to transform education through human-led, AI-enhanced learning. Today, 100,000+ tutors teach 90+ languages to learners in 180 countries - and we’re only getting started. As a category-defining company, we’re shaping what the future of learning looks like at global scale. Every Preply lesson sparks change, fuels ambition, and drives progress that matters. Joining Preply means helping define the future of education at global scale, and building something that truly matters for millions of people, every day. MEET THE TEAM! At Preply, the Data ingestion and enrichment team provides a single, trusted, and scalable data foundation. The team ensures that all analytics, machine learning, and product features are built on unified, governed, and production-grade data assets in Preply’s Lake House, including the extraction, normalization, and generation of structured data from Preply’s unstructured assets, forming a durable data moat for AI-driven products. As a Senior II Data Engineer in the Data Ingestion and Enrichment team, you will own and drive technical vision for the data layer that powers both Preply’s analytics, machine learning, and product. You will work closely with ML Platform, Applied/Data Scientists, Analytics Engineering, and Product squads to ensure that features, datasets, and pipelines are production-ready, observable, and reusable across the company. This role combines hands-on engineering with technical leadership. You will drive cross-functional initiatives involving stakeholders from different functional areas and different levels of seniority. WHAT YOU’LL BE DOING: Build tru
Applying for this Senior II Data Engineer - Data Ingestion and Enrichment team role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Ashby
- Ashby is a fast modern ATS — most applications take under 3 minutes.
- The resume parser is strong; verify parsed experience dates and job titles.
- Custom screening questions are often scored algorithmically — answer completely.
- Location field affects geo-based screening; use your actual metro area.
ANONYMOUS · UNFILTERED
What do employees actually say about Preply?
Real rants from real employees. Read before you apply.