Artificial Analysis

Technology

MemberofTechnicalStaff

$145–195k ~AI est. San Francisco, California, United States FULL TIME
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is
optimal for Mid+ candidates.

The Brief

“Member of Technical Staff at Artificial Analysis. Skills: AI Benchmarking, AI Strategy, AI Technologies. Structure AI evaluation projects. Design AI evaluation projects”

What You'll Achieve.

Set standard for AI measurement; Shape AI understanding; Support AI strategy; Enhance AI benchmarking platform; Be leading AI benchmarking company

Industry & Context.

Technology
Problems you'll solve

Analytical skills; Critical thinking skills; Structuring ambiguous problems

What They're Looking For.

Must Have

Analytical and critical thinking skills, Proficiency in Python, Data analysis proficiency, Demonstrable interest in frontier AI, Knowledge of frontier AI

Nice to Have

Experience in data analytics division, Experience within AI by McKinsey, Experience within BCG X / Gamma, Experience within QuantumBlack

What You'll Do.

Structure AI evaluation projects

Design AI evaluation projects

Execute AI evaluation projects

Develop AI evaluation methodologies

Develop AI evaluation datasets

Drive report development

Develop data visualizations

Communicate complex AI concepts

Collaborate with AI companies

Support AI benchmarking

Identify platform enhancement opportunities

Embrace AI-native workflow

Contribute to company strategy

Drive large initiatives

How You'll Work.

Team & Collaboration

Cross-functional teams; Work with founders; Across the full team

Communication Scope

Communicate findings; Communicate complex AI concepts

Full Job Description

ABOUT ARTIFICIAL ANALYSIS Artificial Analysis is the leading independent AI benchmarking company. We support labs, engineers and enterprises to understand AI capabilities and make critical decisions about their AI strategies. We are the go-to authority for understanding AI, from AI labs and enterprises to media, investors, and policymakers. Our benchmarks don't just measure the cutting edge of AI, they are actively shaping the frontier. Our benchmarks and analysis are trusted by hundreds of thousands of users and are the go-to reference for leading AI labs including OpenAI, Google, Meta, NVIDIA and Anthropic, and major publications including the Wall Street Journal, Bloomberg, the Financial Times and The Economist. We are a team of 35+, on track to triple by year end, backed by Nat Friedman (Github, Meta), Daniel Gross (SSI), Andrew Ng (Google Brain, DeepLearning.ai http://DeepLearning.ai, Amazon), Adam D'Angelo (Quora, Poe, OpenAI), Clem Delangue (Hugging Face) and other industry leaders. THE OPPORTUNITY Our benchmarks and analysis are what the industry turns to when they need to understand AI capabilities, from AI labs and enterprises to media, investors, and policymakers. This role puts you at the forefront of the AI frontier — you won't just observe the cutting edge of AI, your work will define what cutting edge means. We're hiring Members of Technical Staff to design the evaluations that set the standard for how AI is measured, produce analysis that shapes how companies and the broader industry understand AI, and work directly with the leading AI labs and enterprises who rely on our insights. You'll develop new benchmarking methodologies, manage relationships with some of the most important AI labs and enterprise customers in the world, and help drive the product direction of our platform. The bar for success is becoming a world expert in modern AI technologies. This is a unique combination of product, research, technical, and client-facing work, suited to high

Free ATS check

Applying for this Member of Technical Staff role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

How to Apply on Ashby

  • Ashby is a fast modern ATS — most applications take under 3 minutes.
  • The resume parser is strong; verify parsed experience dates and job titles.
  • Custom screening questions are often scored algorithmically — answer completely.
  • Location field affects geo-based screening; use your actual metro area.

ANONYMOUS · UNFILTERED

What do employees actually say about Artificial Analysis?

Real rants from real employees. Read before you apply.

Read Company Rants →