Innodata Inc.

Technology

DataEngineer

$110–150k ~AI est. United States Remote Friendly

Market Sentiment

HIGH DEMAND

Neural analysis suggests this role is
optimal for Mid+ candidates.

The Brief

“Data Engineer at Innodata Inc.. Skills: Data warehousing, Data lakes, ETL, AI/ML pipelines. Design data-driven solutions on GCP. Build ETL scripts”

What You'll Achieve.

Build AI systems at scale; Provide data for AI builders; Provide evaluation frameworks for AI; Provide human expertise for AI

Industry & Context.

Technology

Problems you'll solve

Query optimization; ETL optimization; Pipeline optimization

What They're Looking For.

Must Have

Advanced proficiency in SQL, Advanced proficiency in Python, Experience building ETL/ELT pipelines, Knowledge of data warehouse architectures, Knowledge of data lake architectures

Nice to Have

Familiarity with supply chain data, Familiarity with data center operations data, Experience with ML Engineering, Experience with data visualization tools, Experience with MLOps practices, Hands-on expertise with GCP services, Hands-on expertise with LookerI

What You'll Do.

Design data-driven solutions on GCP

Develop data pipelines

Optimize data pipelines

Extend data solutions

Provide pipelines for AI solutions

Ensure data governance

Optimize query performance

Optimize ETL processes

Optimize pipeline reliability

How You'll Work.

Team & Collaboration

Partner with AI/ML teams; Partner with supply chain teams; Partner with real estate teams

Full Job Description

Innodata (Nasdaq: INOD) is a global data engineering company. We believe that data and Artificial Intelligence (AI) are inextricably linked. Our mission is to enable the responsible advancement of artificial intelligence by providing the data, evaluation frameworks, and human expertise required to build AI systems that can be trusted at scale. We provide a range of transferable solutions, platforms, and services for Generative AI / AI builders and adopters. In every relationship, we honor our 36+ year legacy delivering the highest quality data and outstanding outcomes for our customers. Scope of the Role: We are seeking a Data Engineer to design and build enterprise data warehouses, data lakes, and pipelines that power data-driven decision-making for data center supply chain and real estate operations. This role is responsible for creating scalable, secure, and optimized ETL infrastructure on GCP/AWS, while enabling advanced AI/ML use cases such as RAG, copilots, and agentic AI for predictive analytics and workflow automation. What You’ll Own: Design and implement data-driven solutions on GCP including BigQuery, Cloud Storage, Dataflow, Pub/Sub, and Looker/BI. Build ETL scripts using SQL and Python to extract, clean, and transform structured and unstructured data from ERP, procurement, logistics, and facility management systems. Develop and optimize data pipelines for ingestion, transformation, and loading into enterprise data lakes and warehouses. Build and extend end-to-end data and BI solutions, spanning extraction, storage, transformation, and visualization layers. Partner with supply chain, real estate, and AI/ML teams to provide pipelines for AI solutions (e.g., RAG ingestion, Copilot integration, multi-agent workflows). Ensure data governance, lineage, and compliance across supply chain datasets. Continuously optimize query performance, ETL processes, and pipeline reliability. You’ll Thrive in This Role If You Have: Advanced proficiency in SQL (complex querie

Free ATS check

Applying for this Data Engineer role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

Should you apply? AI reads your resume vs this job — match score, gaps to address, ATS keywords.

SKILL SIGNAL 34 detected · ranked by frequency

Data warehousing ×5

Data lakes ×5

ETL ×3

ETL infrastructure ×3

Predictive analytics ×3

Workflow automation ×3

Data ingestion ×3

Data transformation ×3

Data visualization ×3

AI/ML pipelines ×2

BigQuery ×2

Cloud Storage ×2

Dataflow ×2

Pub/Sub ×2

LookerI ×2

SQL

Python

GCP

ELT

RAG

Vector DB

Embeddings

Data engineering

Scripting

APIs

Data governance

Data lineage

Compliance

Query optimization

Pipeline reliability

Role Details

Seniority Mid

Work Mode Remote

Category data

Salary Band 100k-150k

AI-Extracted Insights

Domain Areas

generative-aiai-systemsdata-center-operationssupply-chain-operations

How to Apply on Greenhouse

Create a Greenhouse profile before applying — it saves time across multiple applications.
Upload your resume as a PDF; the parser handles it better than Word.
Answer all knockout questions carefully — wrong answers auto-reject before a human sees you.
Enable email notifications to track application status in real time.

ANONYMOUS · UNFILTERED

What do employees actually say about Innodata Inc.?

Real rants from real employees. Read before you apply.

Read Company Rants →