Twenty

national security

Staff Data Engineer

New York, New York, United States FULL TIME

Market Sentiment

HIGH DEMAND

Neural analysis suggests this role is optimal for Senior candidates.

The Brief

“Staff Data Engineer at Twenty. Skills: Data lake architecture, ETL pipeline development, Schema and index design, Data modeling, Production operations. Own the data infrastructure that powers Twenty’s cyber operations applications and capabilities. Build a durable, high-performance data lake and the pipelines, schemas, and query patterns that make petabyte-scale datasets usable and economical”

What You'll Achieve.

Deliver game-changing outcomes that directly impact national security; Make petabyte-scale datasets usable and economical; Drive real missions; Scale what we can support and ship; Ship improvements with clear performance wins

Industry & Context.

national security

Problems you'll solve

Identify bottlenecks in storage/compute/query layers

Eligibility Requirements

U.S. citizen; eligibility to obtain a U.S. Government security clearance may be required

What They're Looking For.

Must Have

8+ years of experience in data engineering and/or data architecture, Mastery-level expertise building ETL pipelines and operating them in production, Deep experience with data lake architecture and systems used to query data lakes, Schema and index design skills, including partitioning, indexing, and clustering strategies, Experience with column-oriented databases in production environments, Built data systems from scratch (not only maintained existing platforms), Proven leadership experience mentoring engineers and driving technical initiatives, U.S. citizen and can meet the role’s security requirements

Nice to Have

Experience with key-value datastores, Worked with streaming and message queue systems, Experience with graph database technologies, Worked with internet/networking datasets (e.g., scan data, DNS, netflow, certificates), Experience supporting analysts or operational users with high-stakes data needs

What You'll Do.

Own the data infrastructure that powers Twenty’s cyber operations applications and capabilities

Build a durable, high-performance data lake and the pipelines, schemas, and query patterns that make petabyte-scale datasets usable and economical

Partner closely with engineers and intelligence analysts to turn messy, high-volume operational data into reliable, well-modeled systems that drive real missions

Lead technical initiatives and mentor other engineers as we scale what we can support and ship

Lead the development and operation of a data lake for cyber operations and intelligence data

and indexes that make complex datasets performant and cost-effective to query

Partner with engineers and intelligence analysts to define query patterns and data products for mission use cases

Build and evolve ETL pipelines that are observable and resilient to upstream change

Drive technical initiatives end-to-end, from architecture decisions through production rollout and iteration

Establish best practices for data quality and operational ownership across the platform

Mentor engineers on data modeling and production-grade pipeline design

Identify bottlenecks in storage/compute/query layers and ship improvements with clear performance wins

How You'll Work.

Team & Collaboration

Partner closely with engineers and intelligence analysts; Collaborate tightly across roles, especially with engineers and analysts who need fast, correct answers

Process & Methodology

Drive technical initiatives end-to-end

Full Job Description

ABOUT THE COMPANY

At Twenty, we're taking on one of the most critical challenges of our time: defending democracies in the digital age. We develop revolutionary technologies that operate at the intersection of the cyber and electromagnetic domains, where the speed of operations exceeds human sensing and complexity transcends conventional boundaries. Our team doesn't just solve problems – we deliver game-changing outcomes that directly impact national security. We're pragmatic optimists who understand that while our mission of protecting America and its allies is challenging, success is possible.

ROLE SUMMARY

You will own the data infrastructure that powers Twenty’s cyber operations applications and capabilities. This role is about building a durable, high-performance data lake and the pipelines, schemas, and query patterns that make petabyte-scale datasets usable and economical. You’ll partner closely with engineers and intelligence analysts to turn messy, high-volume operational data into reliable, well-modeled systems that drive real missions. You’ll also lead technical initiatives and mentor other engineers as we scale what we can support and ship.

WHO YOU ARE

- You think in systems: data modeling, storage formats, compute engines, and access patterns all have to fit together.
- You’re opinionated about schema and index design, and you can explain tradeoffs clearly.
- You default to measurable reliability: data quality, lineage, repeatability, and operational excellence.
- You’re comfortable working with ambiguous datasets and evolving requirements without lowering standards.
- You collaborate tightly across roles, especially with engineers and analysts who need fast, correct answers.
- You take leadership seriously: mentoring others, raising the bar, and driving initiatives to completion.
- You’re motivated by national security outcomes and want your work to matter in the real world.

WHAT YOU’LL DO

- Lead the development and operation of a data lake for cyber ope

SKILL SIGNAL 55 detected · ranked by frequency

Data lake architecture ×5

Data modeling ×3

Data pipeline development ×3

Query pattern definition ×3

ETL pipeline construction ×3

Data system architecture ×3

Database operations ×3

Storage optimization ×3

Compute optimization ×3

Query optimization ×3

ETL pipeline development ×2

Schema and index design ×2

Production operations ×2

Apache Iceberg ×2

Delta Lake ×2

Apache Hive ×2

Trino ×2

Presto ×2

AWS Athena ×2

Apache Spark ×2

ClickHouse ×2

Amazon Redshift ×2

Google BigQuery ×2

Airflow ×2

AWS Glue ×2

NiFi ×2

ClickPipe ×2

Kafka ×2

RabbitMQ ×2

NATS ×2

AWS Kinesis ×2

Neo4j ×2

BEHAVIOURAL

Systems thinking, Collaboration, Leadership, Mentoring, Pragmatic optimism, Comfort working with ambiguous datasets, Comfort with evolving requirements

Role Details

Experience 8–10 yrs

Level Senior

Type FULL TIME

Category swe

AI-Extracted Insights

Domain Areas

cyber-operations, intelligence-data, national-security-outcomes, digital-age-democracy-defense, cyber-and-electromagnetic-domains

How to Apply on Ashby

Ashby is a fast modern ATS — most applications take under 3 minutes.
The resume parser is strong; verify parsed experience dates and job titles.
Custom screening questions are often scored algorithmically — answer completely.
Location field affects geo-based screening; use your actual metro area.
