talentpluto
AI infrastructure
ResearchEngineer
Neural analysis suggests this role is
optimal for Mid+ candidates.
“Research Engineer at talentpluto. Skills: Data quality, Applied ML operations, Quality assurance. Define quality standards. Enforce quality standards”
Industry & Context.
Problem-solving
What They're Looking For.
Must Have
Proficiency with Python, Experience in Linux environments, Experience with Docker, Experience with large-scale datasets, Problem-solving skills, Evidence of rapid learning, Ability to operate independently, Clear written communication skills, Clear verbal communication skills
Nice to Have
Experience building data validation pipelines, Experience building human-in-the-loop review systems, Familiarity with training-data failure modes, Familiarity with techniques to detect inconsistencies, Comfort designing QA metrics, Comfort designing QA experiments, Comfort designing QA processes, Familiarity with modern AI tooling, Familiarity with LLM capabilities
What You'll Do.
Define quality standards
Enforce quality standards
Build tooling for datasets
Build workflows for datasets
Audit supplier-generated datasets
Evaluate human-in-the-loop review
Implement human-in-the-loop review
Partner with data suppliers
Provide actionable feedback
Improve data generation processes
Integrate QA learnings
Reduce inconsistencies
Improve documentation
How You'll Work.
Team & Collaboration
Internal stakeholders; External data suppliers; Across time zones
Communication Scope
Written communication; Verbal communication
Full Job Description
**Location:** San Francisco Bay Area **Work model:** On-site (some team members are remote, but this role is currently on-site) **Industry:** AI infrastructure / Reinforcement Learning (RL) training data & evaluations **Compensation:** Competitive (range not provided) + benefits (medical/dental/vision coverage, meals, 401(k), commuter benefits, wellness perk) ### About the Company (our partner) Our partner is a fast-growing, venture-backed AI infrastructure company building the tooling and workflows that power reinforcement learning (RL) training data and evaluation for frontier AI agents. Their platform is used by advanced AI teams across large enterprises and high-growth startups, and they’re scaling quickly to meet strong customer demand. The team is small, highly technical, and execution-focused, with a culture that values ownership, speed, and craftsmanship. ### The Opportunity Our partner is hiring a **Research Engineer** to help scale the quality assurance (QA) systems behind training data generated through their infrastructure. This role sits at the intersection of data quality, tooling, and applied ML operations: you’ll build the standards, pipelines, and feedback loops that ensure datasets are reliable, consistent, and ready for training and evaluation. You’ll work closely with internal stakeholders and external data suppliers to diagnose quality issues, improve workflows, and continuously fold QA learnings back into the platform. If you enjoy building systems that make high-quality data scalable—and want to do it in a high-ownership, fast-paced environment—this role is a strong fit. ### Responsibilities * Define and enforce quality standards for training datasets used for RL training and evaluation * Build tooling and workflows to audit supplier-generated datasets, including sampling strategies, validation pipelines (rule-based and model-assisted), and feedback loops * Evaluate and implement human-in-the-loop review workflows where beneficial to improve q
Applying for this Research Engineer role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
ANONYMOUS · UNFILTERED
What do employees actually say about talentpluto?
Real rants from real employees. Read before you apply.