Rox
Applied AI
FoundingAppliedResearchEngineer
Neural analysis suggests this role is
optimal for Mid+ candidates.
“Founding Applied Research Engineer at Rox. Skills: Applied AI, Agent evaluation, ML systems. Design research programs. Run research programs”
What You'll Achieve.
Understand Rox's architecture; Understand production problems; Understand research gaps; Share opinions; Run experiments; Inform how we build; Something worked on is in production; Define research agenda; Build unique systems
Industry & Context.
Where agents fail in practice; Where standard approaches break down
What They're Looking For.
Must Have
Ability to ship things that make it into production, Move fast, Research instincts
Nice to Have
PhD is not required
What You'll Do.
Design research programs
Run research programs
Build evaluation frameworks
Work on context systems
Translate findings into infrastructure
Define Rox Research direction
How You'll Work.
Team & Collaboration
Work alongside elite and competitive engineering minds
Full Job Description
WHY THIS ROLE EXISTS Foundation models are commoditizing. Defensibility comes from specialized models, proprietary training signals, and evaluation ownership. Every applied AI company we benchmark against like Decagon, Harvey, Sierra, Cursor has already moved. The window to claim frontier applied AI for revenue is closing in the next few months. Rox is in market. We run agents against enterprise data at scale, every day. We see exactly where research meets production and where the data is dirty, state is changing, and being wrong costs (a lot of) money. The Applied Research team exists to close that gap permanently. WHAT THIS TEAM WORKS ON Four problems we care about right now: Cost-efficient inference for Clever Columns. Distill a Rox-trained model from frontier teachers so per-account enrichment runs at 1/20th the cost without quality loss. Ships first. Doesn't require trajectory attribution. Signal classification across the public knowledge graph. A small, fast classifier that distinguishes genuine buying signals from noise across the news, jobs, and filings corpus we already ingest at scale. Powers Recommended Next Moves and Auto Prospecting. Cleanest data subset. Personalization grounding and hallucination detection. A reward model that catches fabricated prospect context in Sequences in real time. This is the most underrated production failure mode in outbound AI. Trained on cross-customer consensus edits. Sequencing policy under sparse, delayed rewards. Offline-to-online RL on multi-touch trajectories with intermediate signals as proxies for terminal outcomes. Long-horizon flagship. Hard. [Depends on trajectory instrumentation in progress with Platform Eng.] These are not benchmark problems. They have real SLAs and real customers depending on them. WHAT YOU'LL DO - Design and run research programs tied directly to the four above. - Build evaluation frameworks that measure trajectory quality, not just final output, because most eval infrastructure measures end
Applying for this Founding Applied Research Engineer role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Ashby
- Ashby is a fast modern ATS — most applications take under 3 minutes.
- The resume parser is strong; verify parsed experience dates and job titles.
- Custom screening questions are often scored algorithmically — answer completely.
- Location field affects geo-based screening; use your actual metro area.
ANONYMOUS · UNFILTERED
What do employees actually say about Rox?
Real rants from real employees. Read before you apply.