Titan

AI software for banks

QA Engineer

United States FULL TIME Remote Friendly
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is optimal for Senior candidates.

The Brief

“QA Engineer at Titan. Skills: AI Evaluation, Test Coverage, Compliance. Design and execute the evaluation framework; write the assertions.”

What You'll Achieve.

  • Diagnostic of current test coverage shared with engineering leadership
  • Evaluation framework running against at least one AI-powered workflow
  • Quality gates live in CI/CD
  • Regression baselines established for model behavior
  • SOC 2 test artifacts documented and audit-ready
  • Test suite running on every release without manual intervention
  • QA function is staffed
  • Coverage scales with every product release
  • Quality is a first-class input to every deployment decision

Industry & Context.

AI software for banks
Problems you'll solve

trace a failure from the application layer to infrastructure

Eligibility Requirements

Occasional travel to client sites

What They're Looking For.

Must Have

  • Seven or more years in software QA engineering
  • At least two years personally testing AI or ML systems
  • Has written test cases against LLM outputs
  • Has built evaluation pipelines from scratch
  • Fluent in Python
  • Has built automated suites using pytest, Playwright, or Selenium
  • Hands-on experience with RAGAS, DeepEval, LangSmith, or comparable evaluation tooling
  • Can trace a failure from the application layer to infrastructure
  • Has integrated QA gates into CI/CD pipelines
  • Has owned the process end to end

Nice to Have

Experience in fintech, banking, or another regulated environment; familiarity with document processing pipelines, multi-agent architectures, RAG validation, and observability tooling such as Arize or Langfuse

What You'll Do.

design and execute evaluation framework

define behavioral contracts

own regression baselines

build tooling for AI evaluation

write and maintain automated test suite

own performance and load testing

set up and enforce quality gates

write reproduction cases

produce test artifacts

produce process documentation

work with Forward Deployed Engineering
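
The full description notes that a deterministic assertion cannot tell you whether an AI accurately summarized a long document, and asks for thinking in distributions and confidence intervals. As a rough illustration only, and not Titan's actual tooling, a regression gate for a non-deterministic workflow might grade many sampled outputs pass/fail and then check the lower bound of a Wilson score interval against a baseline. The function names, the 95% z-value, and the baseline figures below are all hypothetical.

```python
import math


def wilson_lower_bound(passes: int, trials: int, z: float = 1.96) -> float:
    """Lower bound of the Wilson score interval for an observed pass rate.

    z=1.96 corresponds to a two-sided 95% interval (an assumed default).
    """
    if trials <= 0:
        raise ValueError("trials must be positive")
    p_hat = passes / trials
    denom = 1 + z**2 / trials
    centre = p_hat + z**2 / (2 * trials)
    margin = z * math.sqrt(p_hat * (1 - p_hat) / trials + z**2 / (4 * trials**2))
    return (centre - margin) / denom


def passes_quality_gate(graded: list[bool], baseline: float, z: float = 1.96) -> bool:
    """Hypothetical gate: pass only if the interval's lower bound clears the baseline."""
    return wilson_lower_bound(sum(graded), len(graded), z) >= baseline


# 90 of 100 graded outputs passed: the lower bound is about 0.826.
large_run = [True] * 90 + [False] * 10
# 9 of 10 passed: same observed rate, but the lower bound drops to about 0.596.
small_run = [True] * 9 + [False] * 1

print(passes_quality_gate(large_run, baseline=0.80))  # True
print(passes_quality_gate(small_run, baseline=0.80))  # False
```

The two runs have the same observed pass rate, yet only the larger one clears the gate. The point of the sketch is that the assertion is on what the sample supports statistically, not on any single output.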

How You'll Work.

Team & Collaboration

triage bugs alongside engineering; work directly with Forward Deployed Engineering

Full Job Description

ABOUT TITAN

Titan builds AI software for banks: purpose-built small language models, a banking ontology, and AI bankers that financial institutions can trust. Our models outperform general-purpose LLMs by 30 to 80 percent on banking tasks. Customers include community banks, credit unions, and large regional and super-regional institutions. We are backed by leading fintech investors and operate under the compliance, audit, and model-risk standards that banking requires.

WHY THIS ROLE EXISTS

Titan is scaling from a handful of live banking customers to thirty, then to hundreds. Right now, there is no formal QA function. There is no evaluation framework, no regression baseline, no quality gate in CI/CD. A QA failure at a bank is not a user experience problem. It is an operational and regulatory risk. This role exists because that gap has to close before the customer count grows.

This is a hands-on, individual-contributor role first. You are coming in to do the work: write the test cases, build the evaluation framework, set up CI/CD gates, and triage bugs alongside engineering. The function gets built because you build it yourself. Once the practice is stable and documented, you bring in QE engineers to scale it.

WHAT YOU OWN

AI Evaluation. You personally design and execute the evaluation framework for LLM and agentic AI outputs across Foundry, Agent Builder, and client-deployed instances. You write the assertions, define the behavioral contracts, and own regression baselines for model behavior. Standard QA methods break down here: you cannot write a deterministic assertion for whether an AI accurately summarized a 200-page loan agreement. You need to think in distributions and confidence intervals, and you need to build tooling that does too.

Test Coverage. You write and maintain the automated test suite: end-to-end, integration, and regression coverage for backend APIs, document ingestion pipelines, AI inference workflows, and frontend surfaces. You own performance and

How to Apply on Ashby

  • Ashby is a fast modern ATS — most applications take under 3 minutes.
  • The resume parser is strong; verify parsed experience dates and job titles.
  • Custom screening questions are often scored algorithmically — answer completely.
  • Location field affects geo-based screening; use your actual metro area.
