Cortea
Technology
SoftwareEngineer,Data&AIPlatform(m/f/x)
Neural analysis suggests this role is
optimal for Mid+ candidates.
“Software Engineer, Data & AI Platform (m/f/x) at Cortea. Skills: LLM agents, Data infrastructure, Evaluation, Observability. Build data infrastructure. Build evaluation infrastructure”
Industry & Context.
Reason from first principles; Identify failure modes; Identify quality regressions; Identify latency issues; Identify reliability gaps; Identify cost optimization opportunities
What They're Looking For.
Must Have
Python backend engineering experience, SQL skills, Deployed systems in cloud, Practical experience designing data pipelines, Comfortable with analytical databases, Understand system design, Work in complex systems, Senior-level engineering judgment
Nice to Have
Building infrastructure around LLM-based products, Working with production traces, Building internal platforms, Using workflow orchestration systems, Familiarity with audit, Experience in early-stage startup
What You'll Do.
Build data infrastructure
Build evaluation infrastructure
Build observability infrastructure
Build production code
Design infrastructure
Work inside backend systems
Improve AI agent quality
Improve AI agent cost
Improve AI agent reliability
Improve AI agent performance
Build evaluation systems
Create automated quality gates
Identify failure modes
Identify quality regressions
Identify latency issues
Identify reliability gaps
Identify cost optimization opportunities
Work with columnar data stores
Build data retention mechanisms
Build replay mechanisms
Create observability tooling
Improve existing agents
How You'll Work.
Team & Collaboration
Backend engineering; Data engineering; AI infrastructure; LLM operations
Communication Scope
Communicate trade-offs
Full Job Description
ABOUT US We’re Cortea, a Berlin startup transforming audits with AI. Manual, document-heavy audits waste expert time while demand keeps rising. Our AI-powered software and specialized AI agents remove the repetitive work so auditors can focus on judgment. Backed by top-tier VCs with >10m funding, with a working product and paying customers, we’re rapidly scaling. We value first-principles thinking, speed, trust, and kindness. We build side by side in our Berlin office. YOUR ROLE We are looking for an Engineer with strong data engineering and AI systems experience to build the data, evaluation, and observability foundation for production-grade LLM agents used in complex audit workflows. This role sits at the intersection of backend engineering, data engineering, AI infrastructure, and LLM operations. You will work hands-on in our backend and agent architecture, building the systems that help us evaluate, monitor, debug, optimize, and continuously improve AI agents in production. This is not a traditional analytics, BI, or dashboarding role. You should expect to write production code, design infrastructure, work inside backend systems, and directly improve the quality, cost, reliability, and performance of LLM-based agents. WHAT YOU’LL DO You will help building and operating the technical infrastructure around our AI agents, with a focus on data infrastructure, evaluation, observability, and optimization. Your work will include: - Building online and offline evaluation systems for LLM agents, including pipelines that use golden datasets, ground-truth data, human review workflows, and experiment results. - Creating automated quality gates so changes to prompts, context, models, or agent logic can be tested before reaching production. - Analyzing large volumes of agent traces and executions to identify failure modes, quality regressions, latency issues, reliability gaps, and cost optimization opportunities. - Working with columnar data stores and analytical databases su
Applying for this Software Engineer, Data & AI Platform (m/f/x) role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Ashby
- Ashby is a fast modern ATS — most applications take under 3 minutes.
- The resume parser is strong; verify parsed experience dates and job titles.
- Custom screening questions are often scored algorithmically — answer completely.
- Location field affects geo-based screening; use your actual metro area.
ANONYMOUS · UNFILTERED
What do employees actually say about Cortea?
Real rants from real employees. Read before you apply.