MLabs
Software Development
SRE (Terminal)
Neural analysis suggests this role is optimal for Senior candidates.
“SRE (Terminal) at MLabs. Skills: Site Reliability Engineering, Cloud infrastructure, Incident response, Automation. Design cloud infrastructure. Scale cloud infrastructure.”
Industry & Context.
Pragmatic problem solving
On-call rotation
What They're Looking For.
Must Have
Deep expertise in infrastructure-as-code, Network topology expertise, High-availability architecture expertise, System internals expertise, Experience building foundational infrastructure, Experience running high-availability environments, Advanced proficiency with AWS, Advanced proficiency with GCP, Advanced proficiency with Kubernetes
Nice to Have
Infrastructure security hardening experience, IAM architecture experience, Compliance mapping experience, High-throughput data backbone experience, Low-latency data backbone experience, Event streaming systems experience, Web3/crypto infrastructure patterns understanding, Comfort operating within Web3/crypto infrastructure
What You'll Do.
Design cloud infrastructure
Scale cloud infrastructure
Maintain cloud infrastructure
Lead incident response
Participate in on-call rotations
Write automation code
Exercise judgment on system risks
Raise engineering bar
Implement rigorous standards
Implement modern tooling
Provide technical mentorship
How You'll Work.
Team & Collaboration
Cross-functional teams
Full Job Description
**Location:** New York, United States • London, United Kingdom (Office)
**On-Site | Full-time**
**Compensation: Competitive Compensation**

Our client is a high-growth software development organization and a key contributor to one of the largest and fastest-growing decentralized crypto social networks globally. The platform has achieved massive scale, generating significant revenue and global attention since its inception.

To support this rapid expansion and ensure the continuous uptime of its high-stakes, high-throughput environment, our client is seeking a battle-tested **Site Reliability Engineering (SRE) Expert**. This individual will be handed ambiguous, critical infrastructure challenges and will be trusted to navigate them end-to-end: scoping solutions, making sound architectural trade-offs, and executing with precision.

**Key Responsibilities**

* **Own Foundation & Architecture:** Design, scale, and maintain highly available, multi-region, or active-active cloud infrastructure patterns.
* **Incident Response & Reliability:** Lead critical incident response efforts, participate in real on-call rotations, and drive comprehensive, blameless post-mortems to continuously harden the system.
* **Automation & Tooling:** Write clean, production-grade automation code (Python, Go, or similar) for infrastructure tooling, operators, and seamless systems integration.
* **Risk & Security Management:** Exercise sharp judgment regarding system risks, balancing rapid deployment velocity with robust infrastructure safety and stability.
* **Operational Excellence:** Raise the engineering and operational bar across the organization through the implementation of rigorous standards, modern tooling, and technical mentorship.

**Requirements**

* **Core SRE & Infrastructure Focus:** Deep expertise in infrastructure-as-code (Terraform/OpenTofu), network topology, high-availability architecture, and system internals.
* **Proven Track Record:** Experience building foundational infrastructu