Company
Technology
ComputerSciencePhD
Neural analysis suggests this role is
optimal for Lead candidates.
“Computer Science PhD. Skills: AI-generated outputs evaluation, technical prompt design, AI/ML domain expertise, analytical reasoning, written English. Evaluate AI-generated outputs for factual accuracy, technical depth, reasoning quality, and relevance. Design, craft, and answer advanced technical prompts related to AI, ML, and adjacent scientific fields. Compare and rank multiple AI-generated responses. Identify hallucinations, errors, inconsistencies, and weaknesses in model reasoning and prov”
What You'll Achieve.
Contribute insights and evaluations that directly improve model training, alignment, and performance. Influence model performance and alignment at scale.
Industry & Context.
identify hallucinations; errors; inconsistencies; weaknesses in model reasoning
Location: India
What They're Looking For.
Must Have
PhD in Computer Science, Machine Learning, Artificial Intelligence, Statistics, Applied Mathematics, or a closely related field. Background in machine learning concepts including model training, evaluation, optimization, and real-world deployment. Prior experience as an ML Engineer, AI Researcher, Applied Scientist, Data Scientist, or similar technical role. Excellent analytical reasoning skills with attention to detail and scientific rigor. Ability to clearly articulate complex technical concepts in fluent written English. Understanding of AI subfields such as NLP, computer vision, reinforcement learning, or data science methodologies. Demonstrated ability to critically evaluate technical outputs and identify subtle errors or inconsistencies.
What You'll Do.
Evaluate AI-generated outputs for factual accuracy, technical depth, reasoning quality, and relevance.
Design, craft, and answer advanced technical prompts related to AI, ML, and adjacent scientific fields.
Compare and rank multiple AI-generated responses.
Identify hallucinations, errors, inconsistencies, and weaknesses in model reasoning and provide structured feedback.
Contribute insights and evaluations that directly improve model training, alignment, and performance.
Apply deep domain expertise to assess complex topics in machine learning, statistics, optimization, and related areas.
Support continuous improvement of AI systems through high-quality analytical and written feedback.
How You'll Work.
Team & Collaboration
Comfortable working independently in a remote, flexible, and research-oriented environment.
Communication Scope
articulate complex technical concepts; fluent written English
Full Job Description
## Accountabilities Evaluate AI-generated outputs for factual accuracy, technical depth, reasoning quality, and relevance across machine learning and AI domains Design, craft, and answer advanced technical prompts related to AI, ML, and adjacent scientific fields Compare and rank multiple AI-generated responses based on correctness, completeness, clarity, and logical consistency Identify hallucinations, errors, inconsistencies, and weaknesses in model reasoning and provide structured feedback Contribute insights and evaluations that directly improve model training, alignment, and performance Apply deep domain expertise to assess complex topics in machine learning, statistics, optimization, and related areas Support continuous improvement of AI systems through high-quality analytical and written feedback Requirements: PhD in Computer Science, Machine Learning, Artificial Intelligence, Statistics, Applied Mathematics, or a closely related field Strong background in machine learning concepts including model training, evaluation, optimization, and real-world deployment Prior experience as an ML Engineer, AI Researcher, Applied Scientist, Data Scientist, or similar technical role Excellent analytical reasoning skills with strong attention to detail and scientific rigor Ability to clearly articulate complex technical concepts in fluent written English Strong understanding of AI subfields such as NLP, computer vision, reinforcement learning, or data science methodologies Demonstrated ability to critically evaluate technical outputs and identify subtle errors or inconsistencies Comfortable working independently in a remote, flexible, and research-oriented environment Benefits: Highly competitive compensation of USD 150 per hour Fully remote and flexible work arrangement Opportunity to directly contribute to the improvement of advanced AI systems Exposure to state-of-the-art machine learning research and evaluation workflows Flexible workload suitable for academic or profess
Applying for this Computer Science PhD role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Lever
- Lever uses a streamlined one-page form — apply in under 5 minutes.
- LinkedIn import works well; review parsed data before submitting.
- The cover letter field is optional but visible to reviewers — use it to differentiate.
- Referral codes from employees can significantly boost visibility of your application.
ANONYMOUS · UNFILTERED
What do employees actually say about this company?
Real rants from real employees. Read before you apply.