Datadog

AI

SeniorSoftwareEngineer-AnalyticsDataPlatformLakehouse

$130–300k Boston, Massachusetts, United States Remote Friendly
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is
optimal for Senior candidates.

The Brief

“Senior Software Engineer - Analytics Data Platform Lakehouse at Datadog. Skills: Apache Iceberg, Trino, Apache Spark, large-scale distributed data systems, Kubernetes. Design, build, and operate core components of our lakehouse platform. Apache Iceberg table management (data compaction, data layout optimization, materialized view scheduling…)”

What You'll Achieve.

power data engineers, applied AI, and product teams; managing millions of tables on their behalf; simplifying operations from maintenance and observability to governance, for both internal and customer-facing use cases; reliably run thousands of pipelines per day against our lakehouse; define the roadmap for our lakehouse architecture; shape how Datadog manages analytic data at scale

Industry & Context.

AI
Problems you'll solve

identify query performance bottlenecks; contribute fixes back to upstream open-source projects

What They're Looking For.

Must Have

BS/MS/PhD in Computer Science, Engineering, or a related field, or equivalent professional experience, deep, production-grade experience with one or more of Apache Iceberg, Trino, or Apache Spark, built or operated large-scale distributed data systems, solid grasp of query planning, columnar file formats (Parquet, ORC), and table format internals (snapshots, manifests, partition evolution), fluent in Java, Scala or Go and comfortable with Python for pipeline tooling, experience deploying and running data infrastructure on Kubernetes in cloud environments

Nice to Have

significant open-source contributions: merged PRs, committer status, or PMC membership on projects

What You'll Do.

and operate core components of our lakehouse platform

Apache Iceberg table management (data compaction

data layout optimization

materialized view scheduling…)

Drive adoption of open table formats across internal teams

owning the integration of Trino

Spark and other query engines (DuckDB

Puppygraph…) with our Iceberg-based at petabyte scale

Build observability for managed iceberg tables

contribute fixes back to upstream open-source projects (Iceberg

Open Lineage) where relevant

Build self-serve tooling and abstractions that allow data engineering teams to reliably run thousands of pipelines per day against our lakehouse

How You'll Work.

Team & Collaboration

Drive adoption of open table formats across internal teams; Collaborate with data engineers, analysts, and infrastructure teams to define the roadmap for our lakehouse architecture and shape how Datadog manages analytic data at scale

Full Job Description

Analytics Data Platform Lakehouse team builds and operates the foundations that power data engineers, applied AI, and product teams—managing millions of tables on their behalf and simplifying operations from maintenance and observability to governance, for both internal and customer-facing use cases. If you're excited by the intersection of petabyte data processing scale, open-source query engines, and building platforms with real product stakes, this is the team for you. At Datadog, we place value in our office culture - the relationships and collaboration it builds and the creativity it brings to the table. We operate as a hybrid workplace to ensure our Datadogs can create a work-life harmony that best fits them. What You’ll Do: Design, build, and operate core components of our lakehouse platform, including Apache Iceberg table management (data compaction, data layout optimization, materialized view scheduling…) and Iceberg catalog Drive adoption of open table formats across internal teams, owning the integration of Trino, Spark and other query engines (DuckDB, Puppygraph…) with our Iceberg-based at petabyte scale Build observability for managed iceberg tables, to identify query performance bottlenecks, cost drivers and contribute fixes back to upstream open-source projects (Iceberg, Trino, Spark, Open Lineage) where relevant Build self-serve tooling and abstractions that allow data engineering teams to reliably run thousands of pipelines per day against our lakehouse Collaborate with data engineers, analysts, and infrastructure teams to define the roadmap for our lakehouse architecture and shape how Datadog manages analytic data at scale Who You Are: You have a BS/MS/PhD in Computer Science, Engineering, or a related field, or equivalent professional experience You have deep, production-grade experience with one or more of Apache Iceberg, Trino, or Apache Spark, ideally demonstrated through significant open-source contributions: merged PRs, committer status, or P

Free ATS check

Applying for this Senior Software Engineer - Analytics Data Platform Lakehouse role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

ANONYMOUS · UNFILTERED

What do employees actually say about Datadog?

Real rants from real employees. Read before you apply.

Read Company Rants →