Coinbase
SeniorSiteReliabilityEngineer,CoreAIInfrastructure
Neural analysis suggests this role is
optimal for Senior candidates.
“Senior Site Reliability Engineer, Core AI Infrastructure at Coinbase. Skills: Site Reliability Engineering, AI Infrastructure, Cloud Automation, Kubernetes. Own reliability of AI infrastructure. Own monitoring of AI infrastructure”
What You'll Achieve.
Improve deployment velocity; Improve workflow efficiency; Improve cost; Improve quality
Industry & Context.
Root cause analysis; Troubleshooting
On-call support, Quarterly surges
What They're Looking For.
Must Have
5+ years automating cloud infrastructure, 5+ years supporting cloud infrastructure, 5+ years supporting network environments, Hands-on use of IaC tools, Deploying containerized workloads, Managing containerized workloads, Troubleshooting containerized workloads, Proficiency in scripting language, Proficiency in programming language, Version control workflows using Git, Leading incident response, Root cause analysis, Blameless retros, Measurable reliability improvements, Utilizes generative AI responsibly, Maintain human oversight, Deliver business-ready outputs, Drive workflow efficiency improvements, Drive cost improvements, Drive quality improvements
Nice to Have
Expertise with Linux, Expertise with Bash, Expertise with Ruby, Expertise with Python, Expertise with Go, Automating EC2 deployment, Automating containers deployment, Terraform, Network security fundamentals, Experience managing log aggregation, Experience leveraging log aggregation, Experience in highly regulated environment, Experience in fast-paced company, Experience in high-growth company, Experience in Remote-first IT environment
What You'll Do.
Own reliability of AI infrastructure
Own monitoring of AI infrastructure
Own incident response lifecycle
On-call support for AWS
Build automation and tooling
Streamline operational IT workflows
Eliminate manual tasks
Improve deployment velocity
Extend CI/CD frameworks
Integrate surveillance tooling
Strengthen observability standards
Strengthen documentation standards
Implement monitoring solutions
Maintain technical documentation
Develop full-stack applications
Power internal AI products
How You'll Work.
Team & Collaboration
Partner with Infrastructure team; Partner with Security team; Partner with Compliance team
Communication Scope
Technical documentation
Process & Methodology
CI/CD frameworks
Full Job Description
Ready to do the most impactful work of your career? At Coinbase, we are uncompromising on our mission to increase economic freedom. The bar is high, the environment is intense, and we like it that way. This isn't a place for complacency, it’s a place to be pushed past your perceived limits. If you're ready to build the future of finance alongside people who refuse to settle for "good enough," you belong here. Coinbase is a remote-first, but not remote-only company. Expect to get together quarterly for intense in-person working sessions called “surges.” learn more about working at Coinbase. You'll join a high-performing team of engineers driving AI transformation at Coinbase as a Senior Site Reliability Engineer on the IT Operations team. This team builds and scales the infrastructure powering Coinbase's AI products, with direct exposure to senior leadership in a fast-paced, incubator-style environment. You'll own the reliability and automation of critical AI infrastructure, ensuring our systems are resilient, observable, and secure at scale. What you’ll be doing (ie. job duties): Own the reliability, monitoring, and incident response lifecycle for AI infrastructure services, including on-call support for AWS deployment pipelines, root cause analysis, and blameless retros. Build automation and tooling to streamline operational IT workflows, eliminate manual tasks, and improve deployment velocity across CI/CD frameworks and Kubernetes environments. Partner with the Coinbase Infrastructure team to extend CI/CD frameworks supporting IT services and enterprise network platforms, and with Security and Compliance to integrate surveillance tooling into deployment pipelines. Strengthen observability and documentation standards across IT engineering by defining metrics, implementing monitoring solutions, and maintaining technical documentation that sets a standard of excellence. Develop full-stack applications that power internal AI products and infrastructure with Go or Pyth
Applying for this Senior Site Reliability Engineer, Core AI Infrastructure role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
ANONYMOUS · UNFILTERED
What do employees actually say about Coinbase?
Real rants from real employees. Read before you apply.