Veeva Systems
life sciences
Data Engineer - Link Key People
Neural analysis suggests this role is optimal for Mid+ candidates.
“Data Engineer - Link Key People at Veeva Systems. Skills: Data Engineering, Data Pipelines, Lakehouse Architecture, Cloud-native software, LLM systems. Responsible for the life cycle of the data that defines the healthcare landscape. Design, build, and maintain the robust data pipelines required to ingest and process global Healthcare Organization (HCO) data”
What You'll Achieve.
Accelerate drug development; Significantly improve patient outcomes; Deliver immense value by combining highest quality data with state-of-the-art software; Ensure data integrity, scalability, and seamless delivery to downstream stakeholders; Meet the rapidly changing demands of the market; Enhance both productivity and the product's capabilities
What They're Looking For.
Must Have
Experience with Python and Apache Spark/PySpark; Expertise in building cloud-native software within AWS or GCP; Background in designing and maintaining modern architectures, specifically Data Lakes, Lakehouses, and Warehouses (DeltaLake, Redshift); Experience operating LLM systems in production, including third-party model providers, human/data feedback loops, and multi-model traffic orchestration; Driving technical execution within Agile environments; English communication skills
Nice to Have
A proactive interest in using AI tools to streamline development and solve complex data problems
What You'll Do.
Responsible for the life cycle of the data that defines the healthcare landscape
Design, build, and maintain the robust data pipelines required to ingest and process global Healthcare Organization (HCO) data
Serve as a key architect in managing the complex hierarchical relationships of over 4 million entities, ensuring data integrity, scalability, and seamless delivery to downstream stakeholders
Architect the “Data DNA” used by global biopharmas to make data-driven decisions
Enable global AI initiatives within high-stakes, high-impact environments
Design PySpark pipelines and collaborate on ML models to integrate diverse data into robust Lakehouse architecture
Maintain end-to-end HCO data pipelines
Refine data structures and processing logic to meet the rapidly changing demands of the market
Deploy solid principles and clean patterns to data engineering tasks
Advance the long-term architectural roadmap
Validate high quality and availability of HCO deliveries
Govern and optimize the underlying infrastructure and performance metrics from day one
Embrace AI technologies to enhance both productivity and product capabilities
How You'll Work.
Team & Collaboration
Collaborate on ML models; Align with global stakeholders
Communication Scope
English communication skills
Process & Methodology
Agile environments
Full Job Description
## Description
Veeva Systems is a mission-driven organization and pioneer in industry cloud, helping life sciences companies bring therapies to patients faster. As one of the fastest-growing SaaS companies in history, we surpassed $3B in revenue in our last fiscal year with extensive growth potential ahead.
At the heart of Veeva are our values: Do the Right Thing, Customer Success, Employee Success, and Speed. We're not just any public company – we made history in 2021 by becoming a public benefit corporation (PBC), legally bound to balancing the interests of customers, employees, society, and investors.
As a Work Anywhere company, we support your flexibility to work from home or in the office, so you can thrive in your ideal environment.
Join us in transforming the life sciences industry, committed to making a positive impact on its customers, employees, and communities.
The Role
At Veeva Link, we're building the intelligence layer for life sciences, creating connected data applications that accelerate drug development and significantly improve patient outcomes. Our core belief: combining the highest quality data with state-of-the-art software delivers immense value.
We believe execution matters most. Progress comes from speed, accuracy, and quality in what we build every day. Our engineering approach emphasizes clear product definitions leading to meticulous technical designs. We leverage a modern tech stack to deliver inherently reliable applications and a great user experience.
As a Data Engineer, you will be responsible for the life cycle of the data that defines the healthcare landscape. You will design, build, and maintain the robust data pipelines required to ingest and process global Healthcare Organization (HCO) data. You will be a key architect in managing the complex hierarchical relationships of over 4 million entities, ensuring data integrity, scalability, and seamless delivery to downstream stakeholders.
We are an AI-forward team. We activ
How to Apply on Lever
- Lever uses a streamlined one-page form — apply in under 5 minutes.
- LinkedIn import works well; review parsed data before submitting.
- The cover letter field is optional but visible to reviewers — use it to differentiate.
- Referral codes from employees can significantly boost visibility of your application.