Twenty
national security
StaffDataEngineer
Neural analysis suggests this role is
optimal for Senior candidates.
“Staff Data Engineer at Twenty. Skills: Data lake architecture, ETL pipeline development, Schema and index design, Data modeling, Production operations. Own the data infrastructure that powers Twenty’s cyber operations applications and capabilities. Build a durable, high-performance data lake and the pipelines, schemas, and query patterns that make petabyte-scale datasets usable and economical”
What You'll Achieve.
Deliver game-changing outcomes that directly impact national security; Make petabyte-scale datasets usable and economical; Drive real missions; Scale what we can support and ship; Ship improvements with clear performance wins
Industry & Context.
Identify bottlenecks in storage/compute/query layers
U. S. citizen, Eligibility to obtain a U. S. Government security clearance may be required
What They're Looking For.
Must Have
8+ years of experience in data engineering and/or data architecture, Mastery-level expertise building ETL pipelines and operating them in production, Deep experience with data lake architecture and systems used to query data lakes, Schema and index design skills, including partitioning, indexing, and clustering strategies, Experience with column-oriented databases in production environments, Built data systems from scratch (not only maintained existing platforms), Proven leadership experience mentoring engineers and driving technical initiatives, U. S. citizen and can meet the role’s security requirements
Nice to Have
Experience with key-value datastores, Worked with streaming and message queue systems, Experience with graph database technologies, Worked with internet/networking datasets (e. g. , scan data, DNS, netflow, certificates), Experience supporting analysts or operational users with high-stakes data needs
What You'll Do.
Own the data infrastructure that powers Twenty’s cyber operations applications and capabilities
high-performance data lake and the pipelines
and query patterns that make petabyte-scale datasets usable and economical
Partner closely with engineers and intelligence analysts to turn messy
high-volume operational data into reliable
well-modeled systems that drive real missions
Lead technical initiatives and mentor other engineers as we scale what we can support and ship
Lead the development and operation of a data lake for cyber operations and intelligence data
and indexes that make complex datasets performant and cost-effective to query
Partner with engineers and intelligence analysts to define query patterns and data products for mission use cases
Build and evolve ETL pipelines that are observable
and resilient to upstream change
Drive technical initiatives end-to-end
from architecture decisions through production rollout and iteration
Establish best practices for data quality
and operational ownership across the platform
Mentor engineers on data modeling
and production-grade pipeline design
Identify bottlenecks in storage/compute/query layers and ship improvements with clear performance wins
How You'll Work.
Team & Collaboration
Partner closely with engineers and intelligence analysts; Collaborate tightly across roles, especially with engineers and analysts who need fast, correct answers
Process & Methodology
Drive technical initiatives end-to-end
Full Job Description
ABOUT THE COMPANY At Twenty, we're taking on one of the most critical challenges of our time: defending democracies in the digital age. We develop revolutionary technologies that operate at the intersection of the cyber and electromagnetic domains, where the speed of operations exceeds human sensing and complexity transcends conventional boundaries. Our team doesn't just solve problems – we deliver game-changing outcomes that directly impact national security. We're pragmatic optimists who understand that while our mission of protecting America and its allies is challenging, success is possible. ROLE SUMMARY You will own the data infrastructure that powers Twenty’s cyber operations applications and capabilities. This role is about building a durable, high-performance data lake and the pipelines, schemas, and query patterns that make petabyte-scale datasets usable and economical. You’ll partner closely with engineers and intelligence analysts to turn messy, high-volume operational data into reliable, well-modeled systems that drive real missions. You’ll also lead technical initiatives and mentor other engineers as we scale what we can support and ship. WHO YOU ARE - You think in systems: data modeling, storage formats, compute engines, and access patterns all have to fit together. - You’re opinionated about schema and index design, and you can explain tradeoffs clearly. - You default to measurable reliability: data quality, lineage, repeatability, and operational excellence. - You’re comfortable working with ambiguous datasets and evolving requirements without lowering standards. - You collaborate tightly across roles, especially with engineers and analysts who need fast, correct answers. - You take leadership seriously—mentoring others, raising the bar, and driving initiatives to completion. - You’re motivated by national security outcomes and want your work to matter in the real world. WHAT YOU’LL DO - Lead the development and operation of a data lake for cyber ope
Applying for this Staff Data Engineer role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Ashby
- Ashby is a fast modern ATS — most applications take under 3 minutes.
- The resume parser is strong; verify parsed experience dates and job titles.
- Custom screening questions are often scored algorithmically — answer completely.
- Location field affects geo-based screening; use your actual metro area.
ANONYMOUS · UNFILTERED
What do employees actually say about Twenty?
Real rants from real employees. Read before you apply.