CloudZero

SaaS

CloudOpsEngineer

Boston, Massachusetts, United States FULL TIME
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is
optimal for Senior candidates.

The Brief

“CloudOps Engineer at CloudZero. Skills: Infrastructure as Code, Observability, Automation, AWS, Python. Own the performance, reliability, and observability of CloudZero's infrastructure. Design and maintain Pulumi modules. Instrument systems for quick failure surfacing and data-driven debugging. Automate deployments, scaling, and backups. Partner with Product Engineering to design resilient services, review architectures, and build deployment pipelines. Optimize for cost efficiency.”

What You'll Achieve.

Empower engineering teams to ship features that help customers understand and optimize their cloud spend. Ensure stability and automatic recovery of the platform. Be exemplars of efficient cloud usage.

Industry & Context.

SaaS
Problems you'll solve

distributed systems; system design; operational problems

Eligibility Requirements

This is real infrastructure work at real scale, not a ticket-closing role or a console-clicking job.

What They're Looking For.

Must Have

3 to 5+ years of experience building and operating distributed systems in AWS. Experience with frontier AI models such as Claude, Codex, or Gemini. Proven ability to debug production issues under pressure. Values thoughtful, reliable system design over reactive hero efforts. Documentation habits to support long-term team clarity and system stability. Ability to clearly explain complex technical issues to non-technical stakeholders. Excited to take ownership of infrastructure and solve operational challenges at scale.

What You'll Do.

Own the performance, reliability, and observability of CloudZero's infrastructure.

Design and maintain Pulumi modules.

Instrument systems for quick failure surfacing and data-driven debugging.

Automate deployments, scaling, and backups.

Partner with Product Engineering to design resilient services, review architectures, and build deployment pipelines.

Optimize for cost efficiency.

How You'll Work.

Team & Collaboration

Partner with Product Engineering to help teams design resilient services, review architectures for operational complexity, and build deployment pipelines that enable safe and fast shipping.

Communication Scope

explain complex technical issues to non-technical stakeholders

Full Job Description

ABOUT THE ROLE CloudZero is growing fast. Our customer base is expanding, the data challenges we're solving are getting more complex, and the platform is scaling to match. As a CloudOps Engineer you'll be a force multiplier for our engineering organization, owning the performance, reliability, and observability of CloudZero's infrastructure and empowering teams to ship features that help customers understand and optimize their cloud spend. This is real infrastructure work at real scale, not a ticket-closing role or a console-clicking job. CloudZero processes billions of events daily across AWS, Azure, and GCP. Our customers rely on real-time, accurate cost data to make business-critical decisions, and any instability in our system impacts their planning. Built entirely on a unique serverless architecture with no EC2s or containers, our platform demands infrastructure that scales gracefully, fails predictably, and recovers automatically. If you thrive on hard operational problems, care deeply about reliability and performance, and want to see your work matter to customers in direct and measurable ways, this role was built for you. WHAT YOU'LL DO Infrastructure as Code - Design and maintain Pulumi modules that provision reliable, cost-efficient cloud resources - Own infrastructure end to end with no clicking through consoles Observability - Instrument systems so that failures surface quickly and debugging happens with data, not guesswork - Build observability into everything so you know about problems before customers do Automation - Automate deployments, scaling, backups, and limit changes; if humans are doing it repeatedly, build a system to do it instead - Balance automation intelligently, building solutions to real problems rather than automating for its own sake Partner with Product Engineering - Help teams design resilient services, review architectures for operational complexity, and build deployment pipelines that enable safe and fast shipping - Optimize for c

Free ATS check

Applying for this CloudOps Engineer role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

How to Apply on Ashby

  • Ashby is a fast modern ATS — most applications take under 3 minutes.
  • The resume parser is strong; verify parsed experience dates and job titles.
  • Custom screening questions are often scored algorithmically — answer completely.
  • Location field affects geo-based screening; use your actual metro area.

ANONYMOUS · UNFILTERED

What do employees actually say about CloudZero?

Real rants from real employees. Read before you apply.

Read Company Rants →