Wellcome Sanger Institute

Human Health

DataEngineer(SeniororPrincipal)

£50–73k Hinxton, United Kingdom FULL TIME Remote Friendly
The Brief

“Data Engineer (Senior or Principal) at Wellcome Sanger Institute. Skills: Data Platform, Data Lakehouse, Data Integration, Data Analysis, Python, SQL, dbt, Prefect, Trino, Apache Spark, Kubernetes. Develop, maintain and operate our data platform. Work on a Data Integration and Analysis platform underpinned by a Data Lakehouse (DLH)”

What You'll Achieve.

Improve human health; Understand life on Earth; Shape the future to enable or deliver life-changing science; Solve some of humanity’s greatest challenges; Enable robust, reproducible analyses linking climate and demographic variables with health outcomes; Enable interdisciplinary research by ensuring that data is well-structured, discoverable, and reproducible; Support scientists to generate new insights from integrated datasets

Industry & Context.

Human Health
Problems you'll solve

Translating often-complex scientific and data requirements into robust technical solutions

What They're Looking For.

Must Have

Proficiency in Python, Proficiency in SQL, Data transformation practices, Data modelling, Warehousing paradigms (e.g. ELT, Star schemas), Modern data platform architectures (e.g. data lakes or lakehouses), Distributed query or processing engines (e.g. Trino, Spark, Presto), Object storage systems (e.g. S3-compatible systems such as MinIO), Workflow orchestration tools (e.g. Prefect, Airflow), Containerisation and orchestration (e.g. Docker, Kubernetes), CI/CD (e.g. Gitlab CI, Github Actions)

Nice to Have

Technical leadership, with the ability to define and drive architectural decisions across complex data ecosystems, Ownership and accountability for quality and reliability, Designing, developing and operating data platforms at scale, Line management, mentoring and coaching

What You'll Do.

maintain and operate our data platform

Work on a Data Integration and Analysis platform underpinned by a Data Lakehouse (DLH)

Delivery of a DLH-based data integration and analysis platform

Ingesting and transforming a wide range of data types (including e. g. geospatial and climate data

along with genomic data)

Ensure the platform meets scientific needs while remaining scalable

Define and drive architectural decisions across complex data ecosystems (Principal)

develop and operate data platforms at scale (Principal)

How You'll Work.

Team & Collaboration

Work closely with data engineers, bioinformaticians, and scientists; Collaborate with international partners; Facilitated collaborative opportunities and team discussions on campus (Hybrid)

Communication Scope

Communicate effectively with both technical and non-technical stakeholders

Process & Methodology

Ownership and accountability for quality and reliability (Principal)

Free ATS check

Applying for this Data Engineer (Senior or Principal) role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

How to Apply on Workday

  • Workday has a multi-step form — save your progress after every section.
  • "Apply With LinkedIn" can fail or lose data; manual entry is more reliable.
  • Watch for the "Submit for Review" final step — hitting "Save" alone does not submit.
  • Job requisition numbers are useful when following up with HR by email.

ANONYMOUS · UNFILTERED

What do employees actually say about Wellcome Sanger Institute?

Real rants from real employees. Read before you apply.

Read Company Rants →