Wellcome Sanger Institute
Human Health
Data Engineer (Senior or Principal)
“Data Engineer (Senior or Principal) at Wellcome Sanger Institute. Skills: Data Platform, Data Lakehouse, Data Integration, Data Analysis, Python, SQL, dbt, Prefect, Trino, Apache Spark, Kubernetes. Develop, maintain and operate our data platform. Work on a Data Integration and Analysis platform underpinned by a Data Lakehouse (DLH)”
What You'll Achieve.
Improve human health; Understand life on Earth; Shape the future to enable or deliver life-changing science; Solve some of humanity’s greatest challenges; Enable robust, reproducible analyses linking climate and demographic variables with health outcomes; Enable interdisciplinary research by ensuring that data is well-structured, discoverable, and reproducible; Support scientists to generate new insights from integrated datasets
Industry & Context.
Translating often-complex scientific and data requirements into robust technical solutions
What They're Looking For.
Must Have
Proficiency in Python, Proficiency in SQL, Data transformation practices, Data modelling, Warehousing paradigms (e.g. ELT, Star schemas), Modern data platform architectures (e.g. data lakes or lakehouses), Distributed query or processing engines (e.g. Trino, Spark, Presto), Object storage systems (e.g. S3-compatible systems such as MinIO), Workflow orchestration tools (e.g. Prefect, Airflow), Containerisation and orchestration (e.g. Docker, Kubernetes), CI/CD (e.g. Gitlab CI, Github Actions)
Nice to Have
Technical leadership, with the ability to define and drive architectural decisions across complex data ecosystems; Ownership and accountability for quality and reliability; Designing, developing and operating data platforms at scale; Line management, mentoring and coaching
What You'll Do.
Develop, maintain and operate our data platform
Work on a Data Integration and Analysis platform underpinned by a Data Lakehouse (DLH)
Delivery of a DLH-based data integration and analysis platform
Ingesting and transforming a wide range of data types (including e.g. geospatial and climate data along with genomic data)
Ensure the platform meets scientific needs while remaining scalable
Define and drive architectural decisions across complex data ecosystems (Principal)
Develop and operate data platforms at scale (Principal)
How You'll Work.
Team & Collaboration
Work closely with data engineers, bioinformaticians, and scientists; Collaborate with international partners; Facilitated collaborative opportunities and team discussions on campus (Hybrid)
Communication Scope
Communicate effectively with both technical and non-technical stakeholders
Process & Methodology
Ownership and accountability for quality and reliability (Principal)
How to Apply on Workday
- Workday has a multi-step form — save your progress after every section.
- "Apply With LinkedIn" can fail or lose data; manual entry is more reliable.
- Watch for the "Submit for Review" final step — hitting "Save" alone does not submit.
- Job requisition numbers are useful when following up with HR by email.