Apna

Internet

Lead / Staff Data Engineer - Data Platform

₹35–55L (AI estimate) | Bengaluru, Karnataka, India
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is optimal for Lead candidates.

The Brief

“Lead / Staff Data Engineer - Data Platform at Apna. Skills: Data Platform Engineering, Lakehouse Architecture, Data Pipelines, Workflow Orchestration. Build scalable data pipelines. Design lakehouse architecture.”

What You'll Achieve.

Critical datasets are reliable, discoverable, and trusted; pipelines are observable and recoverable; SLAs are clear; query performance improves; data freshness and data quality issues decrease; teams build faster; the platform scales with growth

Industry & Context.

Internet
Problems you'll solve

Debugging; Root cause analysis

What They're Looking For.

Must Have

5-7 years of data engineering experience; Apache Airflow experience; Presto / Trino knowledge; knowledge of Apache Hudi concepts; distributed data processing knowledge; design of reliable ETL / ELT pipelines; SQL skills; understanding of data architectures; data modeling experience; programming in Python, Java, or Scala; ability to reason about trade-offs; debugging and production ownership

Nice to Have

Kafka, Spark, Flink experience; Hive, Iceberg, Delta Lake experience; BigQuery experience; building internal data platforms; data quality frameworks experience; ML feature pipelines exposure; feature stores exposure; metadata management experience; data catalogs experience; lineage experience; governance experience; AWS, GCP, or Azure experience; understanding of privacy, compliance, and PII handling; access control understanding

What You'll Do.

Build scalable data pipelines

Design lakehouse architecture

Improve query engines

Build orchestration workflows

Create reusable data models

Create curated datasets

Create reliable data marts

Improve data platform reliability

Implement data quality checks

Optimize query performance

Optimize pipeline costs

Partner with product teams

Partner with analytics teams

Partner with ML teams

Partner with backend teams

Drive engineering standards

Mentor data engineers

Influence architecture decisions

How You'll Work.

Team & Collaboration

Partnering with product teams; Partnering with analytics teams; Partnering with ML teams; Partnering with backend teams

Full Job Description

**Company:** Apna

**Team:** Data Platform / Engineering

**Location:** Bangalore

**Experience:** 5-7 Years of Experience

**Why Join Apna**

At Apna, data is central to how we build products, understand users, improve employer outcomes, power recommendations, and scale decision-making. This role gives you the opportunity to build the backbone of Apna’s data platform and influence how data is used across the company. You will work on real-world, high-scale problems across jobs, users, employers, communities, matching, growth, and AI-driven systems.

**About the Role**

Apna is looking for a **Lead / Staff Data Engineer** to build and scale our core data platform. This role will work on large-scale data pipelines, lakehouse architecture, query platforms, workflow orchestration, and data reliability systems that power analytics, product intelligence, machine learning, business dashboards, experimentation, and operational decision-making across Apna.

We are looking for someone who can think deeply about **data architecture**, design reliable pipelines, improve data quality, and help build a platform that can scale with Apna’s growth.

**What You’ll Own:**

You will be responsible for designing, building, and operating critical parts of Apna’s data platform, including:

* Building scalable batch and near-real-time data pipelines across product, business, growth, and ML use cases.
* Designing and improving our lakehouse architecture using technologies like **Apache Hudi**.
* Working with query engines such as **Presto / Trino** for large-scale analytical workloads.
* Building and maintaining orchestration workflows using **Apache Airflow**.
* Creating reusable data models, curated datasets, and reliable data marts for analytics and product teams.
* Improving data platform reliability, observability, SLA tracking, lineage, and data quality checks.
* Optimizing storage, compute, query performance, and pipeline costs.
* Partnering with product, analytics, ML, and backend engineering
