Payward
FinTech
SiteReliabilityEngineer-AIAgents
Neural analysis suggests this role is
optimal for Mid+ candidates.
“Site Reliability Engineer - AI Agents at Payward. Skills: Site Reliability Engineering, AI Infrastructure, Platform Engineering, MLOps. Design infrastructure layer. Build infrastructure layer”
Industry & Context.
On-call rotations
What They're Looking For.
Must Have
5+ years experience, Hands-on ML infrastructure experience, Experience building developer platforms, Understanding of platform engineering, Proficiency with Terraform, Experience with Kubernetes, Solid understanding of AWS, Scripting skills (bash/shell), Proficiency in Python, Experience designing observability systems, Experience implementing incident response, Collaboration skills
Nice to Have
Experience building agent infrastructure, Familiarity with agent orchestration frameworks, Background in data infrastructure, Experience with CI/CD pipelines, Exposure to evaluation frameworks, Experience in 0→1 environments, Experience building SDKs, Experience with Cloudflare ecosystem
What You'll Do.
Design infrastructure layer
Build infrastructure layer
Operate infrastructure layer
Design platform services
Develop platform services
Develop self-service capabilities
Manage compute infrastructure
Manage orchestration infrastructure
Manage serving infrastructure
Implement incident response
Build CI/CD pipelines
Implement failure handling
Implement recovery patterns
Collaborate with AI teams
Collaborate with Data Engineering
Manage containerized workloads
Implement access controls
Implement security best practices
Document architecture
Document best practices
How You'll Work.
Team & Collaboration
AI teams; Data Engineering teams; Product-facing teams; Agent Systems teams
Full Job Description
BUILDING THE FUTURE OF OPEN FINANCE Payward - the parent company behind Kraken, NinjaTrader, Breakout, xStocks, Payward Services and CF Benchmarks - has spent the last 15 years building one of the most modern and globally accessible financial infrastructure platforms in the industry, built to advance an open, global financial system. Before you apply, we encourage you to explore our culture page https://www.kraken.com/culture to understand what drives us and how we work. THE TEAM Founded in 2011, Kraken is one of the world's longest-standing crypto platforms, trusted by over 10 million individuals and institutions across the globe. It offers spot trading, margin, futures, staking, and OTC services, with products built for both individual investors and institutional clients. The AI Infrastructure team sits within the Data organization and is responsible for building, operating, and scaling the systems that power AI agents in production — both internal tools and external-facing products. Working closely with the AI and Agent Systems teams, this group ensures that the orchestration, execution, and model-serving layers underpinning agentic workflows are reliable, observable, and built to scale. This team operates at the intersection of data infrastructure and applied AI — a space that moves fast and demands engineers who can bring production discipline to emerging technology. You'll partner across Data Engineering, ML, and product-facing teams to harden agent infrastructure and keep it running at the standards our users expect. Importantly, this is a platform engineering team. Beyond operating infrastructure, the team is responsible for building the APIs, SDKs, and platform capabilities that enable AI, Data, and Engineering teams to safely and efficiently consume agent infrastructure as a service. Success in this role requires thinking beyond infrastructure operations and toward developer experience, platform adoption, and long-term scalability. THE OPPORTUNITY - Design
Applying for this Site Reliability Engineer - AI Agents role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Ashby
- Ashby is a fast modern ATS — most applications take under 3 minutes.
- The resume parser is strong; verify parsed experience dates and job titles.
- Custom screening questions are often scored algorithmically — answer completely.
- Location field affects geo-based screening; use your actual metro area.
ANONYMOUS · UNFILTERED
What do employees actually say about Payward?
Real rants from real employees. Read before you apply.