Payward

FinTech

Site Reliability Engineer - AI Agents

$96–192k · United States · Full Time · Remote Friendly
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is optimal for Mid+ candidates.

The Brief

“Site Reliability Engineer - AI Agents at Payward. Skills: Site Reliability Engineering, AI Infrastructure, Platform Engineering, MLOps. Design infrastructure layer. Build infrastructure layer.”

Industry & Context.

FinTech

Eligibility Requirements

On-call rotations

What They're Looking For.

Must Have

  • 5+ years of experience as an SRE, infrastructure, or platform engineer
  • Hands-on ML infrastructure experience
  • Experience building developer platforms
  • Proficiency with Terraform
  • Experience with Kubernetes
  • Solid understanding of AWS
  • Scripting skills (bash/shell)
  • Proficiency in Python
  • Experience designing observability
  • Experience implementing incident response
  • Collaboration skills

Nice to Have

  • Experience with agent-based systems
  • Familiarity with agent orchestration frameworks
  • Background in data infrastructure
  • Experience with CI/CD pipelines
  • Exposure to evaluation frameworks
  • Experience in 0→1 environments
  • Experience building SDKs
  • Experience with Cloudflare platform

What You'll Do.

Design infrastructure layer

Build infrastructure layer

Operate infrastructure layer

Design platform services

Develop platform services

Develop self-service capabilities

Manage compute infrastructure

Manage orchestration infrastructure

Manage serving infrastructure

Implement incident response

Build CI/CD pipelines

Implement failure handling

Implement recovery patterns

Collaborate with AI teams

Collaborate with Data Engineering

Manage containerized workloads

Implement access controls

Implement security best practices

Document architecture

Document best practices

How You'll Work.

Team & Collaboration

AI teams; Data Engineering teams; Product-facing teams; Engineering teams

Full Job Description

BUILDING THE FUTURE OF OPEN FINANCE

Payward - the parent company behind Kraken, NinjaTrader, Breakout, xStocks, Payward Services and CF Benchmarks - has spent the last 15 years building one of the most modern and globally accessible financial infrastructure platforms in the industry, built to advance an open, global financial system.

Before you apply, we encourage you to explore our culture page https://www.kraken.com/culture to understand what drives us and how we work.

THE TEAM

Founded in 2011, Kraken is one of the world's longest-standing crypto platforms, trusted by over 10 million individuals and institutions across the globe. It offers spot trading, margin, futures, staking, and OTC services, with products built for both individual investors and institutional clients.

The AI Infrastructure team sits within the Data organization and is responsible for building, operating, and scaling the systems that power AI agents in production, both internal tools and external-facing products. Working closely with the AI and Agent Systems teams, this group ensures that the orchestration, execution, and model-serving layers underpinning agentic workflows are reliable, observable, and built to scale.

This team operates at the intersection of data infrastructure and applied AI, a space that moves fast and demands engineers who can bring production discipline to emerging technology. You'll partner across Data Engineering, ML, and product-facing teams to harden agent infrastructure and keep it running at the standards our users expect.

Importantly, this is a platform engineering team. Beyond operating infrastructure, the team is responsible for building the APIs, SDKs, and platform capabilities that enable AI, Data, and Engineering teams to safely and efficiently consume agent infrastructure as a service. Success in this role requires thinking beyond infrastructure operations and toward developer experience, platform adoption, and long-term scalability.

THE OPPORTUNITY - Design


How to Apply on Ashby

  • Ashby is a fast, modern ATS; most applications take under 3 minutes.
  • The resume parser is strong; verify parsed experience dates and job titles.
  • Custom screening questions are often scored algorithmically — answer completely.
  • Location field affects geo-based screening; use your actual metro area.
