Mindrift

Freelance Agent Evaluation Engineer

València, Valencian Community, Spain · Part Time · Remote Friendly
The Brief

“Freelance Agent Evaluation Engineer at Mindrift. Skills: Python, Software Development, Testing, AI Evaluation. Create challenging tasks. Build virtual companies.”

What You'll Achieve.

Evaluate AI coding agents. Make sure tasks are solvable and the evaluation is fair: tests accept all correct solutions, reject incorrect ones, and don't break on good ones. Catch real problems and don't miss bad solutions.
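As a rough illustration of what "accept all correct solutions, reject incorrect ones" can mean in practice, here is a minimal sketch. The task (remove duplicates while keeping first-seen order), the case list, and every function name are hypothetical and are not taken from Mindrift's actual tasks. The check looks only at observable behavior, so two differently written correct solutions both pass and a plausible-looking wrong one fails.

```python
from typing import Callable, List

Solution = Callable[[List[int]], List[int]]

# Hypothetical task: remove duplicates while keeping first-seen order.
# Each case is (input, expected output); the last one is the edge case
# that separates order-preserving solutions from order-destroying ones.
CASES = [
    ([], []),
    ([1, 1, 1], [1]),
    ([3, 1, 3, 2, 1], [3, 1, 2]),
]


def passes(solution: Solution) -> bool:
    """Check behavior only, never implementation details."""
    for given, expected in CASES:
        # Pass a copy so a solution that mutates its input cannot corrupt CASES.
        if solution(list(given)) != expected:
            return False
    return True


def loop_solution(items: List[int]) -> List[int]:
    """Correct: explicit loop with a 'seen' set."""
    seen, out = set(), []
    for x in items:
        if x not in seen:
            seen.add(x)
            out.append(x)
    return out


def dict_solution(items: List[int]) -> List[int]:
    """Correct: dicts preserve insertion order (Python 3.7+)."""
    return list(dict.fromkeys(items))


def sorted_solution(items: List[int]) -> List[int]:
    """Incorrect: removes duplicates but loses the original order."""
    return sorted(set(items))
```

A test that compared source code or demanded a specific helper would wrongly reject one of the two correct solutions; a test with only the first two cases would wrongly accept the incorrect one.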

Industry & Context.

Problems you'll solve

Reasoning about code; analyzing agent performance; designing edge cases and adversarial scenarios

Eligibility Requirements

Submit your CV in English and indicate your English proficiency level

What They're Looking For.

Must Have

Degree in Computer Science, Software Engineering, or related fields

5+ years in software development

Python, FastAPI, pytest, async/await, subprocess, file operations

Full-stack development, React-based interfaces, JavaScript/TypeScript, robust back-end systems

Writing tests

Docker containers, infrastructure tools, Postgres, Kafka, Redis

CI/CD understanding, GitHub Actions

English proficiency - B2

Nice to Have

Expert-level skill in every item listed under Must Have

What You'll Do.

Create challenging tasks

Build virtual companies

Assemble and calibrate tasks

Design tasks in isolated environments

Write tests for solutions

Iterate with AI agent on tests

Review code written by agents

Analyze agent performance

Design adversarial scenarios

Iterate based on feedback

How You'll Work.

Team & Collaboration

Work with expert QA reviewers

Communication Scope

English proficiency - B2
