PwC
Technology
DataEngineer
Neural analysis suggests this role is
optimal for Mid+ candidates.
“Data Engineer at PwC. Skills: Data engineering, Python, Spark, SQL. Design data pipelines. Implement data pipelines”
Industry & Context.
Debug data issues; Design logical data flows; Design efficient data flows
What They're Looking For.
Must Have
2 years of relevant professional experience, Python programming skills, SQL proficiency, English at B2 level or higher
Nice to Have
Experience with other programming languages, Experience with cloud data platforms, Azure experience, AWS experience, GCP experience, PhD preferred
What You'll Do.
Design data pipelines
Implement data pipelines
Maintain data pipelines
Develop ETL/ELT processes
Build data warehouses
Manage data warehouses
Ensure datasets are clean
Ensure datasets are consistent
Ensure datasets are suitable for ML
Optimize data processing jobs
Monitor data pipelines
Understand client data landscape
Understand client requirements
Translate business needs
Provide expert guidance
Implement data quality checks
Implement validation frameworks
Stay at forefront of technologies
Improve existing solutions
Support junior team members
How You'll Work.
Team & Collaboration
Data scientists; ML engineers; Junior team members
Communication Scope
Client guidance
Full Job Description
**Job Description & Summary** **PwC** is a global network of more than **370,000 professionals in 149 countries** that turns challenges into opportunities. We create innovative solutions in audit, consulting, tax and technology, combining knowledge from all over the world. Join PwC’s pioneering data & AI team and help build the data foundations behind impactful AI solutions. We’re growing quickly due to a strong pipeline of client work, and we’re hiring across **multiple seniority levels** — from junior to senior Data Engineers and Data Scientists. Across our projects, **Python is the core skill** we expect. Depending on your strengths, you may focus more on **data engineering** (pipelines, platforms, SQL) or **data architecture** (designing scalable data solutions). Many roles also include “full-solver” flexibility — contributing where needed, including automation, integration work, or enabling AI/GenAI use cases on modern platforms (including Microsoft technologies where relevant). **Key responsibilities:** * **Data Pipeline Design & Development: **Design, implement, and maintain scalable data pipelines and ETL/ELT processes, primarily using Python and Spark (PySpark), to ingest, transform, and deliver data from various sources into analytics and ML platforms. * **Data Modelling & Warehousing: **Design and optimize data models (e.g. star/snowflake schemas), build and manage data warehouses and data lakes, and ensure data structures support reporting, analytics, and ML use cases. * **Data Preparation for ML:** Collaborate closely with data scientists and ML engineers to understand data requirements, implement robust preprocessing and feature engineering steps, and ensure datasets are clean, consistent, and suitable for machine learning models. * **Performance & Reliability: **Optimize data processing jobs and SQL queries for performance and cost efficiency, monitor data pipelines in production, and ensure reliability, scalability, and adherence to SLAs. * **Client
Applying for this Data Engineer role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Workday
- Workday has a multi-step form — save your progress after every section.
- "Apply With LinkedIn" can fail or lose data; manual entry is more reliable.
- Watch for the "Submit for Review" final step — hitting "Save" alone does not submit.
- Job requisition numbers are useful when following up with HR by email.
ANONYMOUS · UNFILTERED
What do employees actually say about PwC?
Real rants from real employees. Read before you apply.