Titan

AI software for banks

QAEngineer

United States FULL TIME Remote Friendly

Market Sentiment

HIGH DEMAND

Neural analysis suggests this role is
optimal for Senior candidates.

The Brief

“QA Engineer at Titan. Skills: AI Evaluation, Test Coverage, Compliance. design and execute evaluation framework. write assertions”

What You'll Achieve.

diagnostic of current test coverage shared with engineering leadership; evaluation framework running against at least one AI-powered workflow; quality gates live in CI/CD; regression baselines established for model behavior; SOC 2 test artifacts documented and audit-ready; test suite running on every release without manual intervention; function is staffed; coverage scales with every product release; quality is a first-class input to every deployment decision

Industry & Context.

AI software for banks

Problems you'll solve

trace a failure from the application layer to infrastructure

Eligibility Requirements

Occasional travel to client sites

What They're Looking For.

Must Have

Seven or more years in software QA engineering, at least two years personally testing AI or ML systems, written test cases against LLM outputs, built evaluation pipelines from scratch, fluent in Python, built automated suites using pytest, Playwright, or Selenium, hands-on experience with RAGAS, DeepEval, LangSmith, or comparable evaluation tooling, trace a failure from the application layer to infrastructure, integrated QA gates into CI/CD pipelines, owned the process end to end

Nice to Have

Experience in fintech, banking, or another regulated environment, Familiarity with document processing pipelines, multi-agent architectures, RAG validation, observability tooling such as Arize or Langfuse

What You'll Do.

design and execute evaluation framework

define behavioral contracts

own regression baselines

build tooling for AI evaluation

write and maintain automated test suite

own performance and load testing

set up and enforce quality gates

write reproduction cases

produce test artifacts

produce process documentation

work with Forward Deployed Engineering

How You'll Work.

Team & Collaboration

triage bugs alongside engineering; work directly with Forward Deployed Engineering

Full Job Description

ABOUT TITAN Titan builds AI software for banks: purpose-built small language models, a banking ontology, and AI bankers that financial institutions can trust. Our models outperform general-purpose LLMs by 30 to 80 percent on banking tasks. Customers include community banks, credit unions, and large regional and super-regional institutions. We are backed by leading fintech investors and operate under the compliance, audit, and model-risk standards that banking requires. WHY THIS ROLE EXISTS Titan is scaling from a handful of live banking customers to thirty, then to hundreds. Right now, there is no formal QA function. There is no evaluation framework, no regression baseline, no quality gate in CI/CD. A QA failure at a bank is not a user experience problem. It is an operational and regulatory risk. This role exists because that gap has to close before the customer count grows. This is a hands-on, individual-contributor role first. You are coming in to do the work: write the test cases, build the evaluation framework, set up CI/CD gates, and triage bugs alongside engineering. The function gets built because you build it yourself. Once the practice is stable and documented, you bring in QE engineers to scale it. WHAT YOU OWN AI Evaluation. You personally design and execute the evaluation framework for LLM and agentic AI outputs across Foundry, Agent Builder, and client-deployed instances. You write the assertions, define the behavioral contracts, and own regression baselines for model behavior. Standard QA methods break down here: you cannot write a deterministic assertion for whether an AI accurately summarized a 200-page loan agreement. You need to think in distributions and confidence intervals, and you need to build tooling that does too. Test Coverage. You write and maintain the automated test suite: end-to-end, integration, and regression coverage for backend APIs, document ingestion pipelines, AI inference workflows, and frontend surfaces. You own performance and

Free ATS check

Applying for this QA Engineer role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

Should you apply? AI reads your resume vs this job — match score, gaps to address, ATS keywords.

SKILL SIGNAL 38 detected · ranked by frequency

AI Evaluation ×3

Compliance ×3

automated test suite ×3

end-to-end testing ×3

integration testing ×3

regression testing ×3

backend APIs ×3

document ingestion pipelines ×3

AI inference workflows ×3

frontend surfaces ×3

performance testing ×3

load testing ×3

latency-sensitive inference paths ×3

quality gates ×3

bug triage ×3

reproduction case ×3

regression test ×3

Test Coverage ×2

pytest ×2

Playwright ×2

Selenium ×2

RAGAS ×2

DeepEval ×2

LangSmith ×2

Python

Azure

REST APIs

CI/CD

banking ontology

model-risk standards

audit

regulatory risk

Role Details

Experience 7–10 yrs

Level Senior

Work Mode Remote (US)

Type FULL TIME

Category engineering

AI-Extracted Insights

Domain Areas

ai-software-for-bankssmall-language-modelsbanking-ontologyai-bankersfinancial-institutionsgeneral-purpose-llmsbanking-taskscommunity-banks

Certifications

SOC 2 Type II

How to Apply on Ashby

Ashby is a fast modern ATS — most applications take under 3 minutes.
The resume parser is strong; verify parsed experience dates and job titles.
Custom screening questions are often scored algorithmically — answer completely.
Location field affects geo-based screening; use your actual metro area.

ANONYMOUS · UNFILTERED

What do employees actually say about Titan?

Real rants from real employees. Read before you apply.

Read Company Rants →