Company
Technology
SeniorSoftwareEngineer-DataPlatform
Neural analysis suggests this role is
optimal for Senior candidates.
“Senior Software Engineer - Data Platform. Skills: Data Platform Engineering, Distributed Systems, Data Lakehouse, Cloud Infrastructure. Design data ingestion pipelines. Build data ingestion pipelines”
Industry & Context.
Systems thinking
What They're Looking For.
Must Have
4+ years professional software engineering, 4+ years building/operating large-scale data platforms, Apache Spark experience, AWS cloud infrastructure experience, Proficiency in Go, Python, Scala, or Java, Systems thinking ability, Excellent collaboration skills
Nice to Have
Databricks experience, Delta Lake experience, Iceberg experience, Data ingestion/replication tools experience, Infrastructure-as-Code tools experience, Orchestration tools experience, Containerization technologies experience, Experience building internal platforms, Experience building developer tooling, Experience building foundational infrastructure
What You'll Do.
Design data ingestion pipelines
Build data ingestion pipelines
Operate data ingestion pipelines
Design data replication pipelines
Build data replication pipelines
Operate data replication pipelines
Develop distributed data processing infrastructure
Maintain distributed data processing infrastructure
Improve platform reliability
Improve platform observability
Improve platform scalability
Improve platform security
Improve developer experience
Build internal libraries
Build internal developer tooling
Enable efficient data access
Enable secure data access
Enable scalable data access
Contribute to data lake architecture
Contribute to metadata systems architecture
Collaborate with infrastructure teams
Collaborate with product engineering teams
Collaborate with data science teams
Collaborate with security teams
Collaborate with analytics teams
Deliver production-grade data solutions
Define long-term data platform strategy
Support next-generation data products
How You'll Work.
Team & Collaboration
Cross-functional stakeholders; Engineers; Data scientists; Analysts; Product teams; Infrastructure teams; Product engineering teams; Security teams; Analytics teams
Full Job Description
## Accountabilities Design, build, and operate large-scale data ingestion and replication pipelines from production systems (e.g., databases, APIs, and event-driven services) into a centralized data lakehouse. Develop and maintain distributed data processing infrastructure at petabyte scale using technologies such as Apache Spark, Databricks, and lakehouse frameworks. Improve platform reliability, observability, scalability, security, and developer experience across core data infrastructure systems. Build internal libraries, APIs, and developer tooling (using languages such as Go and Python) to enable efficient, secure, and scalable data access across engineering teams. Contribute to the architecture and evolution of data lake and metadata systems, including catalogs, orchestration frameworks, and storage layers. Collaborate closely with infrastructure, product engineering, data science, security, and analytics teams to deliver robust, production-grade data solutions. Participate in defining long-term data platform strategy, including support for AI workloads, global scale, privacy, and next-generation data products. Requirements: 4+ years of professional software engineering experience in production environments. 4+ years of experience building or operating large-scale data platforms, distributed systems, or data lake/lakehouse infrastructure. Strong hands-on experience with Apache Spark or comparable distributed data processing frameworks. Experience working with AWS cloud infrastructure, including services such as S3, RDS, DynamoDB, SQS, Kinesis, and Lambda. Proficiency in at least one production programming language such as Go, Python, Scala, or Java. Strong systems thinking with the ability to design scalable, reliable, and observable distributed systems. Excellent collaboration skills with cross-functional stakeholders including engineers, data scientists, analysts, and product teams. Nice to have: experience with Databricks, Delta Lake, Iceberg, or Hudi; data
Applying for this Senior Software Engineer - Data Platform role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Lever
- Lever uses a streamlined one-page form — apply in under 5 minutes.
- LinkedIn import works well; review parsed data before submitting.
- The cover letter field is optional but visible to reviewers — use it to differentiate.
- Referral codes from employees can significantly boost visibility of your application.
ANONYMOUS · UNFILTERED
What do employees actually say about this company?
Real rants from real employees. Read before you apply.