Anthropic

Technology

Staff + Sr. Software Engineer, Cloud Inference

$320–485k
San Francisco, California, United States
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is optimal for Senior candidates.

The Brief

“Staff + Sr. Software Engineer, Cloud Inference at Anthropic. Skills: cloud inference, backend services, distributed systems. Design and build backend services and infrastructure.”

What You'll Achieve.

Increase service scale; Accelerate new model launches; Accelerate new feature launches; Meet safety standards; Meet performance standards; Meet security standards

Industry & Context.

Technology
Problems you'll solve

Root cause analysis; Troubleshooting

What They're Looking For.

Must Have

Significant software engineering experience; Background in distributed systems; Experience building/operating services on AWS, GCP, or Azure; Exposure to Kubernetes or Infrastructure as Code; Experience working with external partners

Nice to Have

Direct experience working with CSPs; Hands-on capacity management experience; Solid understanding of multi-region deployments; Proficiency in Python or Rust

What You'll Do.

Design backend services and infrastructure

Build backend services and infrastructure

Own backend services and infrastructure

Work cross-functionally with teams

Work with CSP partners

Stand up serving stack

Resolve operational issues

Influence provider roadmaps

Build CI/CD automation systems

Evolve CI/CD automation systems

Ship new model versions

Design interfaces across CSPs

Design tooling abstractions across CSPs

Enable cost-effective inference management

Reduce per-platform complexity

Contribute to capacity planning

Contribute to autoscaling strategies

Contribute to workload routing strategies

Analyze observability data

Identify performance bottlenecks

Identify cost anomalies

How You'll Work.

Team & Collaboration

Cross-functional teams; Internal teams; External partners; CSP partners

Full Job Description

About Anthropic

Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

About the role

The Cloud Inference team scales and optimizes Claude to serve the massive audiences of developers and enterprise companies across AWS, GCP, Azure, and future cloud service providers (CSPs). We own the end-to-end product of Claude on each cloud platform, from API integration and intelligent request routing to inference execution, capacity management, and day-to-day operations. Our engineers are extremely high leverage: we simultaneously drive multiple major revenue streams while optimizing one of Anthropic's most precious resources: compute.

As we expand to more cloud platforms, the complexity of managing inference efficiently across providers with different hardware, networking stacks, and operational models grows significantly. We need product-minded backend engineers who can navigate these platform differences, design the services and abstractions that work across providers, and make architectural decisions that keep us reliable and cost-effective at massive scale.

Your work will increase the scale at which our services operate, accelerate our ability to reliably launch new frontier models and innovative features to customers across all platforms, and ensure our LLMs meet rigorous safety, performance, and security standards.

Key responsibilities

  • Design, build, and own backend services and infrastructure that serve Claude across multiple CSPs, accounting for differences in compute hardware, networking, APIs, and operational models
  • Work cross-functionally with internal inference, product API, systems, and security teams, among others, and with CSP partners to stand up the full serving stack on new cloud platforms, resol

How to Apply on Greenhouse

  • Create a Greenhouse profile before applying — it saves time across multiple applications.
  • Upload your resume as a PDF; the parser handles it better than Word.
  • Answer all knockout questions carefully — wrong answers auto-reject before a human sees you.
  • Enable email notifications to track application status in real time.
