Alpaca
Financial Services
SiteReliabilityEngineer
Neural analysis suggests this role is
optimal for Mid candidates.
“Site Reliability Engineer at Alpaca. Skills: Kubernetes, PostgreSQL, Observability, GitOps. Operate production day-to-day. Oncall”
What You'll Achieve.
Keep brokerage platform reliable; Keep brokerage platform observable; Keep brokerage platform operable; Leveling up database reliability posture
Industry & Context.
Structured debugging
Oncall
What They're Looking For.
Must Have
4+ years in SRE, DevOps, Platform/Infrastructure, or backend engineering with significant production operations ownership, Hands-on experience operating production services on Kubernetes, Shipping infrastructure as code in a GitOps workflow, Solid working knowledge of PostgreSQL in production, Cloud networking fundamentals, Comfort debugging cross-service connectivity, Comfortable with a modern observability stack, Proficient with Linux at the operator level, Practiced in incident response, Calm under pressure, Structured debugging, Postmortems that drive change, Working proficiency in Go or Python, Written and verbal communication, Genuine interest in databases, Growing your PostgreSQL/DBA expertise
Nice to Have
Deeper PostgreSQL experience, Large clusters at OLTP load, Online migrations on big tables, HA/DR ownership, Connection pooling at scale, Change-data-capture pipelines, Typed SQL access layers in Go, Production experience with messaging systems at scale, Security & compliance experience in a regulated environment, Familiarity with trading, brokerage, or other regulated fintech domains
What You'll Do.
Operate production day-to-day
Own reliability practice
Define and refine SLIs/SLOs
Help product teams live within error budgets
Strengthen observability
Ship infrastructure through code
Look after PostgreSQL
Mentor engineers on reliability
Mentor engineers on database fundamentals
How You'll Work.
Team & Collaboration
Work with product teams; Code review; Design review; Pairing
Communication Scope
Written communication; Verbal communication
Full Job Description
Who We Are: Alpaca is a US-headquartered self-clearing broker-dealer and brokerage infrastructure for stocks, ETFs, options, crypto, fixed income, 24/5 trading, and more. Our recent Series D funding round brought our total investment to over $320 million, fueling our ambitious vision. Amongst our subsidiaries, Alpaca is a licensed financial services company, serving hundreds of financial institutions across 40 countries with our institutional-grade APIs. This includes broker-dealers, investment advisors, wealth managers, hedge funds, and crypto exchanges, totalling over 9 million brokerage accounts. Our global team is a diverse group of experienced engineers, traders, and brokerage professionals who are working to achieve our mission of opening financial services to everyone on the planet. We're deeply committed to open-source contributions and fostering a vibrant community, continuously enhancing our award-winning, developer-friendly API and the robust infrastructure behind it. Alpaca is proudly backed by top-tier global investors, including Portage Ventures, Spark Capital, Tribe Capital, Social Leverage, Horizons Ventures, Unbound, SBI Group, Derayah Financial, Elefund, and Y Combinator. Our Team Members: We're a dynamic team of 380+ globally distributed members who thrive working from our favorite places around the world, with teammates spanning the USA, Canada, Japan, Hungary, Nigeria, Brazil, the UK, and beyond! We're searching for passionate individuals eager to contribute to Alpaca's rapid growth. If you align with our core values—Stay Curious, Have Empathy, and Be Accountable—and are ready to make a significant impact, we encourage you to apply. Your Role: As a Site Reliability Engineer at Alpaca, you'll help keep our brokerage platform reliable, observable, and operable as we grow - working across our cloud infrastructure, Kubernetes platform, observability stack, messaging layer, and data layer. We're especially interested in candidates with strong Postgre
Applying for this Site Reliability Engineer role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Greenhouse
- Create a Greenhouse profile before applying — it saves time across multiple applications.
- Upload your resume as a PDF; the parser handles it better than Word.
- Answer all knockout questions carefully — wrong answers auto-reject before a human sees you.
- Enable email notifications to track application status in real time.
ANONYMOUS · UNFILTERED
What do employees actually say about Alpaca?
Real rants from real employees. Read before you apply.