Cortea

Technology

SoftwareEngineer,Data&AIPlatform(m/f/x)

€85–115k ~AI est. Berlin, Germany FULL TIME

Market Sentiment

HIGH DEMAND

Neural analysis suggests this role is
optimal for Mid+ candidates.

The Brief

“Software Engineer, Data & AI Platform (m/f/x) at Cortea. Skills: LLM agents, Data infrastructure, Evaluation, Observability. Build data infrastructure. Build evaluation infrastructure”

Industry & Context.

Technology

Problems you'll solve

Reason from first principles; Identify failure modes; Identify quality regressions; Identify latency issues; Identify reliability gaps; Identify cost optimization opportunities

What They're Looking For.

Must Have

Python backend engineering experience, SQL skills, Deployed systems in cloud, Practical experience designing data pipelines, Comfortable with analytical databases, Understand system design, Work in complex systems, Senior-level engineering judgment

Nice to Have

Building infrastructure around LLM-based products, Working with production traces, Building internal platforms, Using workflow orchestration systems, Familiarity with audit, Experience in early-stage startup

What You'll Do.

Build data infrastructure

Build evaluation infrastructure

Build observability infrastructure

Build production code

Design infrastructure

Work inside backend systems

Improve AI agent quality

Improve AI agent cost

Improve AI agent reliability

Improve AI agent performance

Build evaluation systems

Create automated quality gates

Identify failure modes

Identify quality regressions

Identify latency issues

Identify reliability gaps

Identify cost optimization opportunities

Work with columnar data stores

Build data retention mechanisms

Build replay mechanisms

Create observability tooling

Improve existing agents

How You'll Work.

Team & Collaboration

Backend engineering; Data engineering; AI infrastructure; LLM operations

Communication Scope

Communicate trade-offs

Full Job Description

ABOUT US We’re Cortea, a Berlin startup transforming audits with AI. Manual, document-heavy audits waste expert time while demand keeps rising. Our AI-powered software and specialized AI agents remove the repetitive work so auditors can focus on judgment. Backed by top-tier VCs with >10m funding, with a working product and paying customers, we’re rapidly scaling. We value first-principles thinking, speed, trust, and kindness. We build side by side in our Berlin office. YOUR ROLE We are looking for an Engineer with strong data engineering and AI systems experience to build the data, evaluation, and observability foundation for production-grade LLM agents used in complex audit workflows. This role sits at the intersection of backend engineering, data engineering, AI infrastructure, and LLM operations. You will work hands-on in our backend and agent architecture, building the systems that help us evaluate, monitor, debug, optimize, and continuously improve AI agents in production. This is not a traditional analytics, BI, or dashboarding role. You should expect to write production code, design infrastructure, work inside backend systems, and directly improve the quality, cost, reliability, and performance of LLM-based agents. WHAT YOU’LL DO You will help building and operating the technical infrastructure around our AI agents, with a focus on data infrastructure, evaluation, observability, and optimization. Your work will include: - Building online and offline evaluation systems for LLM agents, including pipelines that use golden datasets, ground-truth data, human review workflows, and experiment results. - Creating automated quality gates so changes to prompts, context, models, or agent logic can be tested before reaching production. - Analyzing large volumes of agent traces and executions to identify failure modes, quality regressions, latency issues, reliability gaps, and cost optimization opportunities. - Working with columnar data stores and analytical databases su

Free ATS check

Applying for this Software Engineer, Data & AI Platform (m/f/x) role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

Should you apply? AI reads your resume vs this job — match score, gaps to address, ATS keywords.

SKILL SIGNAL 50 detected · ranked by frequency

Data infrastructure ×5

Observability ×5

Data engineering ×3

AI systems ×3

Backend engineering ×3

Evaluation systems ×3

Optimization ×3

Quality gates ×3

Prompt engineering ×3

Model selection ×3

Context windows ×3

Reasoning tokens ×3

Data retention ×3

Replay mechanisms ×3

Agent traces ×3

Agent executions ×3

Failure modes ×3

Quality regressions ×3

Latency issues ×3

Reliability gaps ×3

Cost optimization ×3

Columnar data stores ×3

Analytical databases ×3

High-volume event data ×3

Production agent behaviour ×3

Observability tooling ×3

Core backend architecture ×3

Agent architecture ×3

Retrieval systems ×3

LLM agents ×2

Evaluation ×2

GCP ×2

Role Details

Seniority Senior

Work Mode Onsite

Type FULL TIME

Category software

Salary Band 75k-100k

AI-Extracted Insights

Domain Areas

llm-agentsaudit-workflowsproduction-grade-llmcomplex-audit-workflowsbackend-systemsai-infrastructurellm-operationsdata-pipelines

How to Apply on Ashby

Ashby is a fast modern ATS — most applications take under 3 minutes.
The resume parser is strong; verify parsed experience dates and job titles.
Custom screening questions are often scored algorithmically — answer completely.
Location field affects geo-based screening; use your actual metro area.

ANONYMOUS · UNFILTERED

What do employees actually say about Cortea?

Real rants from real employees. Read before you apply.

Read Company Rants →