MLabs

Software Development

SRE(Terminal)

$175–230k ~AI est. New York, United States; London, United Kingdom FULL TIME
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is
optimal for Senior candidates.

The Brief

“SRE (Terminal) at MLabs. Skills: Site Reliability Engineering, Cloud infrastructure, Incident response, Automation. Design cloud infrastructure. Scale cloud infrastructure”

Industry & Context.

Software Development
Problems you'll solve

Pragmatic problem solving

Eligibility Requirements

On-call rotation

What They're Looking For.

Must Have

Deep expertise in infrastructure-as-code, Network topology expertise, High-availability architecture expertise, System internals expertise, Experience building foundational infrastructure, Experience running high-availability environments, Advanced proficiency with AWS, Advanced proficiency with GCP, Advanced proficiency with Kubernetes

Nice to Have

Infrastructure security hardening experience, IAM architecture experience, Compliance mapping experience, High-throughput data backbone experience, Low-latency data backbone experience, Event streaming systems experience, Web3/crypto infrastructure patterns understanding, Comfort operating within Web3/crypto infrastructure

What You'll Do.

Design cloud infrastructure

Scale cloud infrastructure

Maintain cloud infrastructure

Lead incident response

Participate in on-call rotations

Write automation code

Exercise judgment on system risks

Raise engineering bar

Implement rigorous standards

Implement modern tooling

Provide technical mentorship

How You'll Work.

Team & Collaboration

Cross-functional teams

Full Job Description

**Location: **New York, United States • London, United Kingdom (Office) **On-Site | Full-time** **Compensation: Competitive Compensation** Our client is a high-growth software development organization and a key contributor to one of the largest and fastest-growing decentralized crypto social networks globally. The platform has achieved massive scale, generating significant revenue and global attention since its inception. To support this rapid expansion and ensure the continuous uptime of its high-stakes, high-throughput environment, our client is seeking a battle-tested **Site Reliability Engineering (SRE) Expert**. This individual will be handed ambiguous, critical infrastructure challenges and will be trusted to navigate them end-to-end—scoping solutions, making sound architectural trade-offs, and executing with precision. **Key Responsibilities** * **Own Foundation & Architecture:** Design, scale, and maintain highly available, multi-region, or active-active cloud infrastructure patterns. * **Incident Response & Reliability:** Lead critical incident response efforts, participate in real on-call rotations, and drive comprehensive, blameless post-mortems to continuously harden the system. * **Automation & Tooling:** Write clean, production-grade automation code (Python, Go, or similar) for infrastructure tooling, operators, and seamless systems integration. * **Risk & Security Management:** Exercise sharp judgment regarding system risks, balancing rapid deployment velocity with robust infrastructure safety and stability. * **Operational Excellence:** Raise the engineering and operational bar across the organization through the implementation of rigorous standards, modern tooling, and technical mentorship. **Requirements** * **Core SRE & Infrastructure Focus:** Deep expertise in infrastructure-as-code (Terraform/OpenTofu), network topology, high-availability architecture, and system internals. * **Proven Track Record:** Experience building foundational infrastructu

Free ATS check

Applying for this SRE (Terminal) role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

ANONYMOUS · UNFILTERED

What do employees actually say about MLabs?

Real rants from real employees. Read before you apply.

Read Company Rants →