Elastic

Technology

SiteReliabilityEngineer(HostedInfra)

$150–220k ~AI est. United States
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is
optimal for Mid+ candidates.

The Brief

“Site Reliability Engineer (Hosted Infra) at Elastic. Skills: Site Reliability Engineering, Cloud Infrastructure, Automation. Engineer software to automate systems. Build internal tools and services”

What You'll Achieve.

Eliminate toil; Improve reliability; Drive incident prevention; Meet growing demand

Industry & Context.

Technology
Problems you'll solve

Root cause analysis; Systems thinking; Debugging

Eligibility Requirements

SRE on-call rotation

What They're Looking For.

Must Have

Experience building software with Golang, Comfortable reviewing code, Production experience operating large-scale cloud compute, Deep experience with Linux systems, Proficiency working with containerized workloads, Customer-first, systems-thinking approach, Comfortable working across time zones, Contribute clear and maintainable documentation, Communicate project status regularly

Nice to Have

Production experience with Terraform, Production experience with Puppet, Production experience with Ansible, Production experience with Argo CD, Production experience with Argo Workflows, Production experience with CUE, Production experience with Docker, Production experience with Kubernetes, Production experience with Ubuntu, Production experience with Ubuntu Live Patch, Experience being on-call, Experience using observability tools, Hands-on experience engineering solutions with Elastic Stack

What You'll Do.

Engineer software to automate systems

Build internal tools and services

Optimize reliability of hosts

Optimize lifecycle of hosts

Strengthen observability posture

Craft alerting systems

Craft monitoring systems

Scale global infrastructure

Evolve infrastructure management processes

Contribute to code reviews

Be mentored by teammates

Participate in SRE on-call rotation

Participate in postmortems

Champion reliability improvements

How You'll Work.

Team & Collaboration

Cross-functional teams; Teammates

Communication Scope

Communicate project status; Flag blockers

Process & Methodology

Planning

Full Job Description

Elastic, the Search AI Company, enables everyone to find the answers they need in real time, using all their data, at scale — unleashing the potential of businesses and people. The Elastic Search AI Platform, used by more than 50% of the Fortune 500, brings together the precision of search and the intelligence of AI to enable everyone to accelerate the results that matter. By taking advantage of all structured and unstructured data — securing and protecting private information more effectively — Elastic’s complete, cloud-based solutions for search, security, and observability help organizations deliver on the promise of AI. What is the role We are Cloud Infrastructure SREs that integrate, scale, and evolve multi-cloud infrastructure across 4 Cloud Service Providers, 70+ globally distributed regions, and tens of thousands of hosts to power Elastic Cloud. We tackle hard problems at scale through automation, Infrastructure as Code (IaC), configuration management, and purpose-built software that eliminates toil and improves reliability. We're also a team that grows people as well as systems. If that challenge genuinely excites you, we'd love to hear from you. What you will be doing Engineering software to automate large-scale systems — building internal tools and services, not just running scripts. Optimizing the reliability and lifecycle of hosts across multiple cloud providers. Strengthening our observability posture — crafting alerting and monitoring systems that drive incident prevention over incident response. Scaling global infrastructure and evolving the infrastructure management processes to meet growing demand. Contributing to code reviews, sharing your work, planning what we need to do next, and both mentoring and being mentored by teammates. Being part of a balanced SRE on-call rotation: responding to incidents, improving runbooks, participating in postmortems, and championing reliability improvements. What you bring Experience building software with Golang.

Free ATS check

Applying for this Site Reliability Engineer (Hosted Infra) role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

ANONYMOUS · UNFILTERED

What do employees actually say about Elastic?

Real rants from real employees. Read before you apply.

Read Company Rants →