Apna
Internet
Lead/StaffDataEngineer-DataPlatform
Neural analysis suggests this role is
optimal for Lead candidates.
“Lead / Staff Data Engineer - Data Platform at Apna. Skills: Data Platform Engineering, Lakehouse Architecture, Data Pipelines, Workflow Orchestration. Build scalable data pipelines. Design lakehouse architecture”
What You'll Achieve.
Critical datasets reliable; Critical datasets discoverable; Critical datasets trusted; Pipelines observable; Pipelines recoverable; Clear SLAs; Query performance improves; Data freshness issues reduce; Data quality issues reduce; Teams build faster; Platform scales with growth
Industry & Context.
Debugging; Root cause analysis
What They're Looking For.
Must Have
5-7 Years of Experience, data engineering experience, Apache Airflow experience, Presto / Trino knowledge, Apache Hudi concepts knowledge, distributed data processing knowledge, reliable ETL / ELT pipelines design, SQL skills, data architectures understanding, data modeling experience, Python, Java, or Scala programming, reason about trade-offs, debugging and production ownership
Nice to Have
Kafka, Spark, Flink experience, Hive, Iceberg, Delta Lake experience, BigQuery experience, building internal data platforms, data quality frameworks experience, ML feature pipelines exposure, feature stores exposure, metadata management experience, data catalogs experience, lineage experience, governance experience, AWS, GCP, or Azure experience, privacy, compliance, PII handling understanding, access control understanding
What You'll Do.
Build scalable data pipelines
Design lakehouse architecture
Improve query engines
Build orchestration workflows
Create reusable data models
Create curated datasets
Create reliable data marts
Improve data platform reliability
Implement data quality checks
Optimize query performance
Optimize pipeline costs
Partner with product teams
Partner with analytics teams
Partner with ML teams
Partner with backend teams
Drive engineering standards
Mentor data engineers
Influence architecture decisions
How You'll Work.
Team & Collaboration
Partnering with product teams; Partnering with analytics teams; Partnering with ML teams; Partnering with backend teams
Full Job Description
**Company:** Apna **Team:** Data Platform / Engineering **Location:** Bangalore **Experience** : 5-7 Years of Experience **Why Join Apna** At Apna, data is central to how we build products, understand users, improve employer outcomes, power recommendations, and scale decision-making. This role gives you the opportunity to build the backbone of Apna’s data platform and influence how data is used across the company. You will work on real-world, high-scale problems across jobs, users, employers, communities, matching, growth, and AI-driven systems. **About the Role** Apna is looking for a **Lead / Staff Data Engineer** to build and scale our core data platform. This role will work on large-scale data pipelines, lakehouse architecture, query platforms, workflow orchestration, and data reliability systems that power analytics, product intelligence, machine learning, business dashboards, experimentation, and operational decision-making across Apna. We are looking for someone who can think deeply about **data architecture** , design reliable pipelines, improve data quality, and help build a platform that can scale with Apna’s growth. **What You’ll Own:** You will be responsible for designing, building, and operating critical parts of Apna’s data platform, including: * Building scalable batch and near-real-time data pipelines across product, business, growth, and ML use cases. * Designing and improving our lakehouse architecture using technologies like**Apache Hudi**. * Working with query engines such as**Presto / Trino** for large-scale analytical workloads. * Building and maintaining orchestration workflows using**Apache Airflow**. * Creating reusable data models, curated datasets, and reliable data marts for analytics and product teams. * Improving data platform reliability, observability, SLA tracking, lineage, and data quality checks. * Optimizing storage, compute, query performance, and pipeline costs. * Partnering with product, analytics, ML, and backend engineering
Applying for this Lead / Staff Data Engineer - Data Platform role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
ANONYMOUS · UNFILTERED
What do employees actually say about Apna?
Real rants from real employees. Read before you apply.