Rox

Applied AI

FoundingAppliedResearchEngineer

San Francisco, California, United States FULL TIME

Market Sentiment

HIGH DEMAND

Neural analysis suggests this role is
optimal for Mid+ candidates.

The Brief

“Founding Applied Research Engineer at Rox. Skills: Applied AI, Agent evaluation, ML systems. Design research programs. Run research programs”

What You'll Achieve.

Understand Rox's architecture; Understand production problems; Understand research gaps; Share opinions; Run experiments; Inform how we build; Something worked on is in production; Define research agenda; Build unique systems

Industry & Context.

Applied AI

Problems you'll solve

Where agents fail in practice; Where standard approaches break down

What They're Looking For.

Must Have

Ability to ship things that make it into production, Move fast, Research instincts

Nice to Have

PhD is not required

What You'll Do.

Design research programs

Run research programs

Build evaluation frameworks

Work on context systems

Translate findings into infrastructure

Define Rox Research direction

How You'll Work.

Team & Collaboration

Work alongside elite and competitive engineering minds

Full Job Description

WHY THIS ROLE EXISTS Foundation models are commoditizing. Defensibility comes from specialized models, proprietary training signals, and evaluation ownership. Every applied AI company we benchmark against like Decagon, Harvey, Sierra, Cursor has already moved. The window to claim frontier applied AI for revenue is closing in the next few months. Rox is in market. We run agents against enterprise data at scale, every day. We see exactly where research meets production and where the data is dirty, state is changing, and being wrong costs (a lot of) money. The Applied Research team exists to close that gap permanently. WHAT THIS TEAM WORKS ON Four problems we care about right now: Cost-efficient inference for Clever Columns. Distill a Rox-trained model from frontier teachers so per-account enrichment runs at 1/20th the cost without quality loss. Ships first. Doesn't require trajectory attribution. Signal classification across the public knowledge graph. A small, fast classifier that distinguishes genuine buying signals from noise across the news, jobs, and filings corpus we already ingest at scale. Powers Recommended Next Moves and Auto Prospecting. Cleanest data subset. Personalization grounding and hallucination detection. A reward model that catches fabricated prospect context in Sequences in real time. This is the most underrated production failure mode in outbound AI. Trained on cross-customer consensus edits. Sequencing policy under sparse, delayed rewards. Offline-to-online RL on multi-touch trajectories with intermediate signals as proxies for terminal outcomes. Long-horizon flagship. Hard. [Depends on trajectory instrumentation in progress with Platform Eng.] These are not benchmark problems. They have real SLAs and real customers depending on them. WHAT YOU'LL DO - Design and run research programs tied directly to the four above. - Build evaluation frameworks that measure trajectory quality, not just final output, because most eval infrastructure measures end

Free ATS check

Applying for this Founding Applied Research Engineer role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

Should you apply? AI reads your resume vs this job — match score, gaps to address, ATS keywords.

SKILL SIGNAL 27 detected · ranked by frequency

ML systems ×6

Agent evaluation ×5

Behavioral retrieval-augmented generation ×4

Knowledge graph RL ×4

Post-training ×4

Model adaptation ×4

Applied AI ×3

Enterprise data ×3

Agents ×2

Foundation models

Specialized models

Proprietary training signals

Evaluation ownership

Cost-efficient inference

Model distillation

Signal classification

Knowledge graph

Personalization grounding

Hallucination detection

Reward model

Sequencing policy

Offline-to-online RL

Agent memory

Retrieval

Context systems

Production impact

Research agenda

Role Details

Type FULL TIME

Category applied-ai

AI-Extracted Insights

Domain Areas

applied-aienterprise-datafoundation-modelsspecialized-modelsproprietary-training-signalsevaluation-ownershipagentscost-efficient-inference

How to Apply on Ashby

Ashby is a fast modern ATS — most applications take under 3 minutes.
The resume parser is strong; verify parsed experience dates and job titles.
Custom screening questions are often scored algorithmically — answer completely.
Location field affects geo-based screening; use your actual metro area.

ANONYMOUS · UNFILTERED

What do employees actually say about Rox?

Real rants from real employees. Read before you apply.

Read Company Rants →