DataHub
AI & Data Context Platform
SoftwareEngineer-Ingestion&Integrations
Neural analysis suggests this role is
optimal for Mid candidates.
“Software Engineer - Ingestion & Integrations at DataHub. Skills: Python, Ingestion framework, Connectors, Cloud native environment. Enhance the Python-based ingestion framework to support ingesting usage statistics, lineage, and operational metadata from systems like Snowflake, Redshift, Kafka, & more!. Build connectors for major systems in the modern data and ML stacks”
What You'll Achieve.
Accelerate time-to-value from their data investments; Ensure AI system reliability; Implement unified governance; Enable AI & data to work together and bring order to data chaos
Industry & Context.
What They're Looking For.
Must Have
3-7 years of engineering experience, Expertise in Python, Knowledge of distributed systems, Ability to design for scale and fault tolerance
Nice to Have
Familiarity with tools in the modern data and ML ecosystem
What You'll Do.
Enhance the Python-based ingestion framework to support ingesting usage statistics
and operational metadata from systems like Snowflake
Build connectors for major systems in the modern data and ML stacks
Enable the ingestion framework to run in a cloud native environment
Full Job Description
DataHub is an AI & Data Context Platform adopted by over 3,000 enterprises, including Apple, CVS Health, Netflix, and Visa. Innovated jointly with a thriving open-source community of 13,000+ members, DataHub's metadata graph provides in-depth context of AI and data assets with best-in-class scalability and extensibility. The company's enterprise SaaS offering, DataHub Cloud, delivers a fully managed solution with AI-powered discovery, observability, and governance capabilities. Organizations rely on DataHub solutions to accelerate time-to-value from their data investments, ensure AI system reliability, and implement unified governance, enabling AI & data to work together and bring order to data chaos. The company's enterprise SaaS offering, DataHub Cloud, delivers a fully managed solution with AI-powered discovery, observability, and governance capabilities. Organizations rely on DataHub solutions to accelerate time-to-value from their data investments, ensure AI system reliability, and implement unified governance, enabling AI & data to work together and bring order to data chaos. In this role, you will Enhance the Python-based ingestion framework to support ingesting usage statistics, lineage, and operational metadata from systems like Snowflake, Redshift, Kafka, & more! Build connectors for major systems in the modern data and ML stacks Enable the ingestion framework to run in a cloud native environment Requirements: 3-7 years of engineering experience Expertise in Python Familiarity with tools in the modern data and ML ecosystem Knowledge of distributed systems Ability to design for scale and fault tolerance Benefits and Perks We invest in people so they can do their best work and enjoy doing it. Our benefits reflect the way we build: practical, thoughtful, and designed to support long-term growth. Competitive compensation We offer salaries that reflect your skills, experience, and the impact you make. You bring value—we make sure you're recognized for it. Equit
Applying for this Software Engineer - Ingestion & Integrations role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Greenhouse
- Create a Greenhouse profile before applying — it saves time across multiple applications.
- Upload your resume as a PDF; the parser handles it better than Word.
- Answer all knockout questions carefully — wrong answers auto-reject before a human sees you.
- Enable email notifications to track application status in real time.
ANONYMOUS · UNFILTERED
What do employees actually say about DataHub?
Real rants from real employees. Read before you apply.