Robots and Pencils
StaffTestingDeveloper
Neural analysis suggests this role is
optimal for Staff candidates.
“Staff Testing Developer at Robots and Pencils. Skills: Test automation infrastructure, E2E and integration testing, Serverless/microservices testing, LLM/Agentic system testing, CI/CD, AWS. Write test code. Design and scale test automation infrastructure”
What You'll Achieve.
Ship exceptional, not perfect
Industry & Context.
Problem-solving skills; Sound judgment in ambiguous technical territory; Resourceful when the budget, timeline, or team is tight; Constraints don’t slow you down. They sharpen you.
What They're Looking For.
Must Have
Hands-on experience with AWS services including Lambda, DynamoDB, SQS, S3, Deep expertise with PyTest and Python-native testing frameworks, Experience writing and maintaining E2E and integration tests for event-driven, serverless, or microservices architectures, Experience building or validating agentic or LLM-based comfort with evals, output consistency testing, and hallucination/accuracy validation, CI/CD expertise, with experience owning quality gates in delivery pipelines (e. g. GitHub Actions), Working knowledge of AI safety and responsible AI principles as they apply to validating LLM behavior, prompt injection defenses, and PII handling in test data, Demonstrated ability to work independently, drive architectural recommendations, and deliver with minimal supervision, Demonstrable usage of AI-forward tools such as Claude Code and Cursor, problem-solving skills and sound judgment in ambiguous technical territory
Nice to Have
CDK experience a plus, Familiarity with DynamoDB single-table design and the specific challenges of testing against it
What You'll Do.
Design and scale test automation infrastructure
Write and maintain E2E and integration tests
Build or validate agentic or LLM-based systems
output consistency testing
and hallucination/accuracy validation
Own quality gates in delivery pipelines
Validate LLM behavior
prompt injection defenses
and PII handling in test data
Drive architectural recommendations
Deliver with minimal supervision
How You'll Work.
Team & Collaboration
Back teammates when a decision costs something; No handoffs, no finger-pointing; Work with people who care as much as you do; Push each other to do better work
Communication Scope
Direct feedback
Process & Methodology
Drive architectural recommendations, Deliver with minimal supervision, Honor commitments
Full Job Description
Company Overview Robots this role writes test code, not just test plans Hands-on experience with AWS services including Lambda, DynamoDB, SQS, S3, and EventBridge; CDK experience a strong plus Deep expertise with PyTest and Python-native testing frameworks, with a track record of designing and scaling test automation infrastructure Experience writing and maintaining E2E and integration tests for event-driven, serverless, or microservices architectures Familiarity with DynamoDB single-table design and the specific challenges of testing against it Experience building or validating agentic or LLM-based systems; comfort with evals, output consistency testing, and hallucination/accuracy validation Strong CI/CD expertise, with experience owning quality gates in delivery pipelines (e. g. GitHub Actions) Working knowledge of AI safety and responsible AI principles as they apply to validating LLM behavior, prompt injection defenses, and PII handling in test data Demonstrated ability to work independently, drive architectural recommendations, and deliver with minimal supervision Demonstrable usage of AI-forward tools such as Claude Code and Cursor Strong problem-solving skills and sound judgment in ambiguous technical territory You’ll Do Well Here if You Are A doer. You see something broken and fix it. You’d rather move on clarity than wait for certainty. A fast learner who knows you don’t know everything. The AI landscape changes weekly. You’re senior enough to know better and curious enough to keep learning anyway. Direct in a way that makes the work better. You give honest feedback. You’d rather have the hard conversation than blow smoke. Obsessed with craft. You know genius is in the details. You ship exceptional, not perfect, and you don’t put your name on work you wouldn’t stand behind. Built for ownership. You honor commitments, admit mistakes fast, and back your teammates when a decision costs something. No handoffs, no finger-pointing. All in. You treat clients’ busi
Applying for this Staff Testing Developer role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Greenhouse
- Create a Greenhouse profile before applying — it saves time across multiple applications.
- Upload your resume as a PDF; the parser handles it better than Word.
- Answer all knockout questions carefully — wrong answers auto-reject before a human sees you.
- Enable email notifications to track application status in real time.
ANONYMOUS · UNFILTERED
What do employees actually say about Robots and Pencils?
Real rants from real employees. Read before you apply.