Veeva Systems

life sciences

Data Engineer - Link Key People

Berlin, Germany · Full Time · Remote Friendly
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is optimal for Mid+ candidates.

The Brief

“Data Engineer - Link Key People at Veeva Systems. Skills: Data Engineering, Data Pipelines, Lakehouse Architecture, Cloud-native software, LLM systems. Responsible for the life cycle of the data that defines the healthcare landscape. Design, build, and maintain the robust data pipelines required to ingest and process global Healthcare Organization (HCO) data”

What You'll Achieve.

Accelerate drug development; Significantly improve patient outcomes; Deliver immense value by combining highest quality data with state-of-the-art software; Ensure data integrity, scalability, and seamless delivery to downstream stakeholders; Meet the rapidly changing demands of the market; Enhance both productivity and the product's capabilities

Industry & Context.

life sciences

What They're Looking For.

Must Have

  • Experience with Python and Apache Spark/PySpark
  • Expertise in building cloud-native software within AWS or GCP
  • Background in designing and maintaining modern architectures, specifically Data Lakes, Lakehouses, and Warehouses (Delta Lake, Redshift)
  • Experience operating LLM systems in production, including third-party model providers, human/data feedback loops, and multi-model traffic orchestration
  • Driving technical execution within Agile environments
  • English communication skills

Nice to Have

A proactive interest in using AI tools to streamline development and solve complex data problems

What You'll Do.

Responsible for the life cycle of the data that defines the healthcare landscape

Design, build, and maintain the robust data pipelines required to ingest and process global Healthcare Organization (HCO) data

Serve as a key architect in managing the complex hierarchical relationships of over 4 million entities, ensuring data integrity, scalability, and seamless delivery to downstream stakeholders

Architect the “Data DNA” used by global biopharmas to make data-driven decisions

Enable global AI initiatives within high-stakes, high-impact environments

Design PySpark pipelines and collaborate on ML models to integrate diverse data into robust Lakehouse architecture

Maintain end-to-end HCO data pipelines

Refine data structures and processing logic to meet the rapidly changing demands of the market

Apply solid principles and clean patterns to data engineering tasks

Advance the long-term architectural roadmap

Validate high quality and availability of HCO deliveries

Govern and optimize the underlying infrastructure and performance metrics from day one

Embrace AI technologies to enhance both productivity and product capabilities

How You'll Work.

Team & Collaboration

Collaborate on ML models; Align with global stakeholders

Communication Scope

English communication skills

Process & Methodology

Agile environments

Full Job Description

Description

Veeva Systems is a mission-driven organization and pioneer in industry cloud, helping life sciences companies bring therapies to patients faster. As one of the fastest-growing SaaS companies in history, we surpassed $3B in revenue in our last fiscal year with extensive growth potential ahead.

At the heart of Veeva are our values: Do the Right Thing, Customer Success, Employee Success, and Speed. We're not just any public company – we made history in 2021 by becoming a public benefit corporation (PBC), legally bound to balancing the interests of customers, employees, society, and investors.

As a Work Anywhere company, we support your flexibility to work from home or in the office, so you can thrive in your ideal environment.

Join us in transforming the life sciences industry, committed to making a positive impact on its customers, employees, and communities.

The Role

At Veeva Link, we're building the intelligence layer for life sciences, creating connected data applications that accelerate drug development and significantly improve patient outcomes. Our core belief: combining the highest quality data with state-of-the-art software delivers immense value.

We believe execution matters most. Progress comes from speed, accuracy, and quality in what we build every day. Our engineering approach emphasizes clear product definitions leading to meticulous technical designs. We leverage a modern tech stack to deliver inherently reliable applications and a great user experience.

As a Data Engineer, you will be responsible for the life cycle of the data that defines the healthcare landscape. You will design, build, and maintain the robust data pipelines required to ingest and process global Healthcare Organization (HCO) data. You will be a key architect in managing the complex hierarchical relationships of over 4 million entities, ensuring data integrity, scalability, and seamless delivery to downstream stakeholders.

We are an AI-forward team. We activ


How to Apply on Lever

  • Lever uses a streamlined one-page form — apply in under 5 minutes.
  • LinkedIn import works well; review parsed data before submitting.
  • The cover letter field is optional but visible to reviewers — use it to differentiate.
  • Referral codes from employees can significantly boost visibility of your application.
