Bluefish AI
AI marketing
SeniorDataEngineer
Neural analysis suggests this role is
optimal for Senior candidates.
“Senior Data Engineer at Bluefish AI. Skills: Data Engineering, Data Pipelines, Data Lake Architecture, AWS, Python. Design, build, and maintain scalable data pipelines that ingest, transform, and validate large volumes of data across multiple sources and channels. Improve the scalability, reliability, and performance of our data pipelines to support rapidly growing workloads and new data streams”
Industry & Context.
find creative solutions; resolve performance bottlenecks
What They're Looking For.
Must Have
experience building and operating scalable data pipelines in production environments, Hands-on experience working with Data Lakes or Data Warehouses (e. g. , AWS Athena or similar technologies), Experience with data transformation and modeling, experience working with AWS, Experience using Infrastructure-as-Code tools to manage cloud infrastructure, Proficiency in Python for data processing and automation, Experience working with distributed systems and managing large-scale data workflows, Experience implementing monitoring, observability, and incident response practices for data systems
Nice to Have
Experience working with large-scale web scraping or external data ingestion systems, Experience supporting systems with rapidly increasing traffic or data volume
What You'll Do.
and maintain scalable data pipelines that ingest
and validate large volumes of data across multiple sources and channels
Improve the scalability
and performance of our data pipelines to support rapidly growing workloads and new data streams
Contribute to the design and implementation of our Data Lake architecture
enabling reliable data storage and reuse across teams
Manage and optimize data ingestion workflows
including data collected from web scrapers
Monitor pipeline health
investigate incidents
and implement improvements to increase system reliability and observability
Support the onboarding and integration of new AI channels and data sources into the platform
Identify and resolve performance bottlenecks in distributed systems
including rate limiting
and throughput constraints
Continuously evaluate and improve our data platform to support the company’s rapid growth and evolving product needs
How You'll Work.
Team & Collaboration
work closely with engineering, product, and go-to-market teams; act as a trusted technical partner across teams; Collaborate with teams across the organization to ensure data generated by different systems can be reused effectively for analytics and business intelligence; Advise engineering and product teams on data architecture, data quality, and best practices for managing scalable data workflows
Communication Scope
fostering open communication
Full Job Description
About the Position As a Senior Data Engineer, you’ll play a key role in building and scaling the data infrastructure that powers our AI-driven platform. You’ll be responsible for designing, implementing, and optimizing reliable and scalable data pipelines that process large volumes of structured and unstructured data, from synthetic LLM prompts to large-scale web-scraped datasets, across a growing AWS-based data ecosystem. This role is focused on enabling rapid scale. Our data volume and traffic are increasing quickly as we expand to new AI channels and data sources, and we need robust, production-grade data systems that can keep pace with that growth. You’ll work closely with engineering, product, and go-to-market teams to ensure data is reliable, observable, and reusable across the organization. A core part of the role will be shaping the evolution of our data platform, including contributing to the design and implementation of our Data Lake architecture. You’ll help ensure our pipelines can handle increasing load, maintain high data quality, and support new product capabilities as we scale. You’ll also act as a trusted technical partner across teams, helping establish data best practices, improving operational reliability, and enabling teams to use data effectively in both product and business contexts. This role is remote in Germany. What You’ll Be Doing Design, build, and maintain scalable data pipelines that ingest, transform, and validate large volumes of data across multiple sources and channels. Improve the scalability, reliability, and performance of our data pipelines to support rapidly growing workloads and new data streams. Contribute to the design and implementation of our Data Lake architecture, enabling reliable data storage and reuse across teams. Manage and optimize data ingestion workflows, including data collected from web scrapers, third-party vendors, and internal systems. Monitor pipeline health, investigate incidents, and implement improvemen
Applying for this Senior Data Engineer role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Greenhouse
- Create a Greenhouse profile before applying — it saves time across multiple applications.
- Upload your resume as a PDF; the parser handles it better than Word.
- Answer all knockout questions carefully — wrong answers auto-reject before a human sees you.
- Enable email notifications to track application status in real time.
ANONYMOUS · UNFILTERED
What do employees actually say about Bluefish AI?
Real rants from real employees. Read before you apply.