Backblaze

Object Storage

SiteReliabilityEngineerII

Remote - Bangalore Remote Friendly

Market Sentiment

HIGH DEMAND

Neural analysis suggests this role is
optimal for Mid candidates.

The Brief

“Site Reliability Engineer II at Backblaze. Skills: Site Reliability, Automation, Observability, Incident Response. Support service availability. Monitor service health”

What You'll Achieve.

Ensure stability, scalability, and reliability of services and infrastructure; Keep customer-facing systems performing at their best; Reduce manual intervention and toil; Improve efficiency; Reduce manual effort; Support resilient system design and operations; Assist in capacity planning; Assist in disaster recovery exercises; Track SLA performance; Grow a reliability-minded engineering culture; Identify recurring issues; Propose long-term improvements; Promote reliability-focused practices

Industry & Context.

Object Storage

Problems you'll solve

Problem-solving skills

Eligibility Requirements

Participate in on-call rotations

What They're Looking For.

Must Have

Solid Linux systems administration and troubleshooting skills, Familiarity with service reliability concepts - monitoring, alerting, incident response, and root cause analysis, Proficiency in at least one scripting language (Python, Bash, or Go), Understanding of containers (Kubernetes, Docker) and microservices concepts, Knowledge of incident response and operational best practices, Ability to work independently, take ownership, and drive projects from problem discovery through resolution

Nice to Have

Experience in a SaaS, service provider, or distributed systems environment, Familiarity with ITIL/OSS practices and SLO/SLA’s problem-solving skills and willingness to learn new technologies, Experience with cloud platforms (AWS, GCP, or Azure)

What You'll Do.

Support service availability

Monitor service health

Participate in on-call rotations

Contribute to monitoring frameworks

Work with CI/CD pipelines

Write scripts for reliability

How You'll Work.

Team & Collaboration

Collaborate with engineering, product, and operations teams; Partner with engineering, product, and operations teams; Work with vendors and service providers

Process & Methodology

Drive projects from problem discovery through resolution

Full Job Description

About Backblaze Backblaze is the object storage leader in the open cloud movement, fueling customer success with cloud storage built purposefully to unlock budgets, unburden administrators, and unleash innovators. Together with our partners, we’re helping customers break free from the restrictive, overpriced legacy solutions that hold them back, and blaze forward with the full power of the open cloud in their hands. Founded in 2007, we scaled the business with less than $3 million in outside funding until 2021, when we did a traditional IPO on the Nasdaq stock exchange. Today, Backblaze generates over $100m in revenue and is the leading specialized storage cloud - managing over three billion gigabytes of data storage for 500K+ customers in 175+ countries, including businesses, developers, IT professionals, and individuals. About the Role We are seeking a Site Reliability Engineer II (SRE II) to help ensure the stability, scalability, and reliability of our services and infrastructure. This role focuses on building automation, maintaining observability, and supporting incident response to keep customer-facing systems performing at their best. The SRE will collaborate with engineering, product, and operations teams to embed reliability practices into day-to-day development and operations while contributing to tools and processes that improve efficiency and reduce manual effort. Key Responsibilities Service Reliability & Operations Support the availability and durability of critical services across production environments. Monitor service health using SLIs, SLOs, and error budgets, and escalate issues when thresholds are at risk. Participate in on-call rotations, incident response, and post-incident reviews to drive service improvements. Follow established ITIL/OSS processes (incident, change, problem, and capacity management). Automation & Tooling Develop automation for common operational tasks, reducing manual intervention and toil. Contribute to monitoring, logging,

Free ATS check

Applying for this Site Reliability Engineer II role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

Should you apply? AI reads your resume vs this job — match score, gaps to address, ATS keywords.

SKILL SIGNAL 30 detected · ranked by frequency

Incident Response ×5

Linux systems administration ×3

Scripting ×3

Containerization ×3

Microservices ×3

Root cause analysis ×3

Monitoring ×3

Alerting ×3

Site Reliability ×2

Automation ×2

Observability ×2

Kubernetes ×2

Docker ×2

Prometheus ×2

Grafana ×2

Catchpoint ×2

ELK ×2

Terraform ×2

Ansible ×2

Jenkins ×2

Linux

Python

Bash

AWS

GCP

Azure

Service Reliability

Systems Engineering

Operations

Role Details

Experience 2–4 yrs

Level Mid

Work Mode Remote

Education Bachelor's degree in Computer Science, Engineering, or relat

Category production-systems

AI-Extracted Insights

Domain Areas

object-storageopen-cloudcloud-storagedistributed-systems

How to Apply on Greenhouse

Create a Greenhouse profile before applying — it saves time across multiple applications.
Upload your resume as a PDF; the parser handles it better than Word.
Answer all knockout questions carefully — wrong answers auto-reject before a human sees you.
Enable email notifications to track application status in real time.

ANONYMOUS · UNFILTERED

What do employees actually say about Backblaze?

Real rants from real employees. Read before you apply.

Read Company Rants →