Datadog

Technology

Staff GenAI Engineer - Application Performance Monitoring (APM)

$245–355k ~AI est.

San Francisco, California, United States; New York, New York, United States; Chicago, Illinois, United States

Remote Friendly
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is optimal for Senior candidates.

The Brief

“Staff GenAI Engineer - Application Performance Monitoring (APM) at Datadog. Skills: GenAI, Machine Learning, Model deployment. Drive GenAI/machine learning projects. Build GenAI/ML models”

Industry & Context.

Technology
Problems you'll solve

Solve ambiguous challenges

What They're Looking For.

Must Have

BS/MS/PhD in scientific field

10+ years engineering experience

Experience acting as technical lead

Proven track record leading GenAI/ML initiatives

Significant experience in model deployment

Significant experience in model development

Significant experience in model training

Significant experience in model fine-tuning

Significant experience in model evaluation

Ability to drive initiatives across teams

Ability to solve ambiguous challenges

Nice to Have

Product-minded ML engineer

What You'll Do.

Drive GenAI/machine learning projects

Build GenAI/ML models

Benchmark GenAI/ML models

Collaborate to build tools

Build automated investigation tools

Build automated triaging tools

Influence product direction

Guide teams through ambiguity

Guide teams through scaling challenges

Guide teams through evolving requirements

Influence engineering culture

How You'll Work.

Team & Collaboration

Cross-functional teams; APM organization

Full Job Description

We’re looking for a Staff Software Engineer with deep experience in GenAI/ML to join Datadog’s Application Performance Monitoring (APM) team. APM is a product which provides deep visibility into applications, enabling users to identify performance bottlenecks, troubleshoot issues, and optimize services. With distributed tracing, profiling, out-of-the-box dashboards, and seamless correlation with other telemetry data, Datadog APM provides some of the deepest and most structured visibility into the health and performance of applications. This context sets us up for an opportunity to be the world leaders in agentic investigations and incident troubleshooting.

You’ll act as a technical leader within the APM group, focused on agentic workflows. You’ll lead efforts to design, train, evaluate, and deploy GenAI/ML models at scale. We’re looking for a product-minded ML engineer with strong technical expertise, excellent communication skills, and a track record of driving impactful initiatives end to end.

At Datadog, we place value in our office culture - the relationships and collaboration it builds and the creativity it brings to the table. We operate as a hybrid workplace to ensure our Datadogs can create a work-life harmony that best fits them.

What You’ll Do:

Act as a technical leader within the APM organization, driving GenAI/machine learning projects from concept to production.

Build and benchmark GenAI/ML models using state-of-the-art techniques.

Collaborate with cross-functional teams to build automated investigation and triaging tools.

Influence product direction by bringing a strong product mindset to your work, always advocating for the end user.

Guide teams through ambiguity, scaling challenges, and evolving requirements with clear technical direction.

Actively mentor engineers and influence engineering culture through leadership in design reviews, technical talks, and working groups.

Who You Are:

You have a BS/MS/PhD in a scientific field or equivalent experienc
