Company

AssociateDataEngineer

Madurai, India FULL TIME

Market Sentiment

HIGH DEMAND

Neural analysis suggests this role is
optimal for Mid candidates.

The Brief

“Associate Data Engineer. Skills: Databricks, PySpark, Python, ETL/ELT pipelines, SQL, Delta Lake. Develop, test, and maintain data pipelines. Ingest, transform, and process structured and semi-structured data”

Industry & Context.

Problems you'll solve

analytical, problem-solving, and troubleshooting skills

Eligibility Requirements

This position may require working during later shifts (evening or night) depending on business needs.

What They're Looking For.

Must Have

Bachelor’s degree in Computer Science, Data Engineering, Information Systems, Engineering, Mathematics, or a related field, or equivalent practical experience, Hands-on experience with Python for data processing, scripting, and automation, working knowledge of PySpark and distributed data processing concepts, Proven hands-on experience using Databricks for data engineering, including notebooks, clusters, jobs, workflows, Delta tables, and performance optimization, Ability to build, maintain, and troubleshoot scalable ETL/ELT pipelines in Databricks, Experience working with Delta Lake and lakehouse architecture concepts, Working knowledge of SQL for querying, transforming, and validating data, Ability to work with structured and semi-structured data formats such as CSV, JSON, Parquet, and Delta, Understanding of data engineering concepts such as ETL/ELT, data pipelines, data lakes, data warehouses, batch processing, and data quality, Basic understanding of AI and machine learning concepts, including features, training datasets, model inputs/outputs, and model evaluation basics, Experience supporting data preparation or feature engineering for AI/ML use cases, Familiarity with cloud-based data platforms, preferably AWS, Azure, or GCP, Understanding of Git or other version control tools, analytical, problem-solving, and troubleshooting skills, Good communication skills and ability to work collaboratively with technical and non-technical stakeholders, Willingness to learn new tools, technologies, and data engineering best practices

Nice to Have

Exposure to Delta Lake, Unity Catalog, or Lakehouse architecture, Experience with workflow orchestration tools or Databricks Jobs, Familiarity with CI/CD practices for data engineering projects, Exposure to machine learning workflows using MLflow, scikit-learn, or similar tools, Experience with Tableau, Power BI, or similar data visualization tools to create dashboards, support reporting needs, validate datasets, and perform exploratory analysis, Understanding of data governance, security, and access control concepts, Experience working in an Agile/Scrum environment

What You'll Do.

and maintain data pipelines

and process structured and semi-structured data

Support the development of scalable ETL/ELT workflows

Perform data cleansing

Optimize Spark jobs and Databricks notebooks

Create and maintain documentation

Assist in troubleshooting pipeline failures

and performance bottlenecks

Follow best practices for version control

Support basic AI/ML data preparation activities

Monitor scheduled jobs and workflows

How You'll Work.

Team & Collaboration

Work with data engineers, analysts, and data scientists to understand data requirements and deliver reliable datasets; Collaborate with cross-functional teams in an Agile or iterative development environment; Ability to work collaboratively with technical and non-technical stakeholders

Communication Scope

Good communication skills

Full Job Description

## **Career Category** Information Systems ## ## **Job Description** **Roles & Responsibilities** * Develop, test, and maintain data pipelines using Databricks, PySpark, and Python. * Ingest, transform, and process structured and semi-structured data from multiple sources. * Support the development of scalable ETL/ELT workflows for analytics, reporting, and machine learning use cases. * Work with data engineers, analysts, and data scientists to understand data requirements and deliver reliable datasets. * Perform data cleansing, validation, and quality checks to ensure accuracy and consistency. * Optimize Spark jobs and Databricks notebooks for performance, reliability, and cost efficiency. * Create and maintain documentation for data pipelines, workflows, data definitions, and processes. * Assist in troubleshooting pipeline failures, data issues, and performance bottlenecks. * Follow best practices for version control, code quality, testing, and deployment. * Support basic AI/ML data preparation activities, including feature engineering, dataset creation, and model input preparation. * Monitor scheduled jobs and workflows to ensure timely and successful data delivery. * Collaborate with cross-functional teams in an Agile or iterative development environment. **Basic Qualifications and Experience** * 2-6 years of experience with Bachelor’s degree in Computer Science, Data Engineering, Information Systems, Engineering, Mathematics, or a related field, or equivalent practical experience ### Must-Have Qualifications * Bachelor’s degree in Computer Science, Data Engineering, Information Systems, Engineering, Mathematics, or a related field, or equivalent practical experience. * Hands-on experience with Python for data processing, scripting, and automation. * Strong working knowledge of PySpark and distributed data processing concepts. * Proven hands-on experience using Databricks for data engineering, including notebooks, clusters, jobs, workflows, Delta tables, and per

Free ATS check

Applying for this Associate Data Engineer role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

Should you apply? AI reads your resume vs this job — match score, gaps to address, ATS keywords.

SKILL SIGNAL 43 detected · ranked by frequency

ETL/ELT ×6

Databricks ×4

PySpark ×4

Python ×4

SQL ×4

Delta Lake ×4

data engineering ×4

data processing ×3

scripting ×3

automation ×3

distributed data processing ×3

notebooks ×3

clusters ×3

jobs ×3

workflows ×3

performance optimization ×3

querying ×3

transforming ×3

validating data ×3

working with structured and semi-structured data formats ×3

AI and machine learning concepts ×3

version control ×3

Git ×2

Spark

AWS

Azure

GCP

MLflow

scikit-learn

Tableau

Power BI

data pipelines

BEHAVIOURAL

analyticalproblem-solvingtroubleshootingcollaborativewillingness to learn

Role Details

Seniority mid

Experience 2–6 yrs

Level Mid

Work Mode No

Type FULL TIME

Education Bachelor’s degree

AI-Extracted Insights

Domain Areas

data-engineering-conceptsai-and-machine-learning-conceptslakehouse-architecturedata-governancesecurityaccess-control

How to Apply on Workday

Workday has a multi-step form — save your progress after every section.
"Apply With LinkedIn" can fail or lose data; manual entry is more reliable.
Watch for the "Submit for Review" final step — hitting "Save" alone does not submit.
Job requisition numbers are useful when following up with HR by email.

ANONYMOUS · UNFILTERED

What do employees actually say about this company?

Real rants from real employees. Read before you apply.

Read Company Rants →