Dodge Construction Network
Tech / AI / Software
LeadDataSoftwareEngineer
Neural analysis suggests this role is
optimal for Lead candidates.
“Lead Data Software Engineer at Dodge Construction Network. Skills: data platform, data lakes, Lakehouse architectures, data warehouses, delta lakes, AWS, Databricks, Delta Lake, Redshift, Apache Spark, Python, ETL pipelines, ML integration, LLM integration, AI-Assisted Development, DevOps, CI/CD, Automation. Design and implement data lake, Lakehouse, and data warehouse architectures. Build and maintain scalable ETL pipelines”
What You'll Achieve.
enable data science teams; integrating ML and LLM solutions; building scalable, event-driven data platforms; improve code quality; reduce cycle time; increase sprint velocity; meet scalability, governance, and compliance requirements
Industry & Context.
What They're Looking For.
Must Have
8+ years of experience in data engineering, cloud architectures, and ML/AI integrations, Hands-on experience with Databricks, Delta Lake, AWS Redshift, and modern data Lakehouse solutions, Demonstrated use of AI development tools to improve personal and team productivity, AWS certifications (Solutions Architect, or equivalent)
Nice to Have
Qualified candidates should be based in or near Kochi and able to work from our Kochi office as part of a hybrid schedule.
What You'll Do.
Design and implement data lake
and data warehouse architectures
Build and maintain scalable ETL pipelines
Develop data ingestion
and enrichment workflows
Optimize data storage and partitioning strategies
Implement real-time and batch data processing frameworks
Leverage serverless computing and containerized compute
Integrate machine learning and large language model solutions into production data pipelines
Utilize AWS SageMaker
and Redshift ML to support AI/ML workloads
Apply MLOps frameworks to manage model deployment
and retraining at scale
Incorporate AI coding tools into daily development workflows
and validate AI-generated code
Integrate AI tools into CI/CD pipelines
Track and report on AI tool ROI
Model responsible AI-assisted development practices
Uphold and advance DevOps best practices
Containerize and orchestrate data workloads
Drive automated testing integration into DevOps pipelines
Monitor system health and data platform observability
How You'll Work.
Team & Collaboration
Partner with data science, AI/ML, and business analytics teams to drive data-driven innovation across the organization; Communicate technical concepts clearly to engineering peers, data science stakeholders, and executive leadership
Communication Scope
Communicate technical concepts clearly to engineering peers, data science stakeholders, and executive leadership
Full Job Description
Dodge Construction Network (Dodge) is looking for a Lead Data Software Engineer with expertise in data platform, data lakes, Lakehouse architectures, data warehouses, and delta lakes to drive our modern data infrastructure. This role will focus on enabling data science teams, integrating ML and LLM solutions, and building scalable, event-driven data platforms using cutting-edge AWS services. This is a full-time position and reports directly to the VP, Data Innovation & AI. _**Preferred Location**_ Qualified candidates should be based in or near Kochi and able to work from our Kochi office as part of a hybrid schedule. _**Essential Functions**_ **Data Engineering & Data Architectures** * Design and implement data lake, Lakehouse, and data warehouse architectures leveraging AWS Data Lake Formation, Redshift, and Delta Lakes * Build and maintain scalable ETL pipelines using AWS Glue, Apache Spark, Databricks, and EMR * Develop data ingestion, transformation, and enrichment workflows using Python, Spark, and SQL * Optimize data storage and partitioning strategies (Parquet, Delta, Iceberg) for performance and cost efficiency * Implement real-time and batch data processing frameworks to support analytics and AI-driven use cases * Leverage serverless computing (AWS Lambda, Fargate) and containerized compute (ECS, EKS, Kubernetes) to scale data workloads **ML & LLM Integration** * Integrate machine learning (ML) and large language model (LLM) solutions into production data pipelines * Utilize AWS SageMaker, Databricks ML, Bedrock, and Redshift ML to support AI/ML workloads * Apply MLOps frameworks to manage model deployment, monitoring, and retraining at scale **AI-Assisted Development** * Incorporate AI coding tools (such as Claude Code, GitHub Copilot, Cursor, or equivalent) into daily development workflows to accelerate delivery * Effectively prompt, review, and validate AI-generated code across Python, SQL, and Spark workloads * Integrate AI tools into CI/CD pipelines t
Applying for this Lead Data Software Engineer role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
ANONYMOUS · UNFILTERED
What do employees actually say about Dodge Construction Network?
Real rants from real employees. Read before you apply.