Mindrift

AI

FreelanceAgentEvaluationEngineer

Hyderabad, Telangana, India PART TIME
The Brief

“Freelance Agent Evaluation Engineer at Mindrift. Skills: Python, Software Development, Test Automation, AI Evaluation. Create challenging tasks. Define evaluation criteria”

What You'll Achieve.

Tasks meet acceptance criteria; Tasks submitted by deadline

Industry & Context.

AI
Problems you'll solve

reasoning about code; understand where models fail

What They're Looking For.

Must Have

5+ years in software development, Python, FastAPI, pytest, async/await, subprocess, file operations, full-stack development, React-based interfaces, JavaScript/TypeScript, robust back-end systems, writing tests, Docker containers, CI/CD understanding, English proficiency - B2

Nice to Have

infrastructure tools, Postgres, Kafka, Redis, GitHub Actions as a user

What You'll Do.

Create challenging tasks

Define evaluation criteria

Build virtual companies

Assemble and calibrate tasks

Design tasks in isolated environments

Write tests for solutions

Iterate with AI agent on tests

Review code written by agents

Analyze agent failures

Design adversarial scenarios

Iterate based on feedback

How You'll Work.

Team & Collaboration

Work with AI agent; Feedback from expert QA reviewers

Communication Scope

English proficiency - B2

Free ATS check

Applying for this Freelance Agent Evaluation Engineer role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

ANONYMOUS · UNFILTERED

What do employees actually say about Mindrift?

Real rants from real employees. Read before you apply.

Read Company Rants →