MLabs

DevOps/InfrastructureEngineer

$100–130k United States FULL TIME Remote Friendly
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is
optimal for Mid+ candidates.

The Brief

“DevOps / Infrastructure Engineer at MLabs. Skills: DevOps, Infrastructure Engineering, Cloud Architecture, Network Security. Architect, provision, and scale user agent fleet. Ensure isolated, secure, and predictable containerized process”

What You'll Achieve.

Ensure isolated, secure, predictable containerized process; Guarantee graceful state retention; Preserve live in-flight transaction states

Industry & Context.

Problems you'll solve

Root cause analysis; Troubleshooting; Incident resolution

Eligibility Requirements

On-call rotations

What They're Looking For.

Must Have

Proven professional experience deploying, monitoring, and scaling complex architectures in production utilizing Railway, or equivalent containerized platform-as-a-service frameworks, In-depth technical mastery of Amazon Web Services (AWS), Practical expertise spanning Virtual Private Clouds (VPC), Identity & Access Management (IAM), Secrets Manager, and elastic scaling frameworks (ECS / AWS Lambda), Demonstrated experience implementing Tailscale within a high-security production environment, Distinct competence configuring Tailscale Access Control Lists (ACLs), complex subnet routing, and ephemeral node lifecycles, Mastery of Docker containerization, Mastery of comprehensive CI/CD deployment pipelines, Mastery of modern Infrastructure-as-Code (IaC) paradigms, Technical familiarity with blockchain mechanics, smart contract interactions, or web3 infrastructure paradigms, Proven professional history managing environments where system stability impacts critical financial outcomes, Total comfort managing on-call duties and live incident response

Nice to Have

Direct experience deploying, managing, and monitoring Large Language Model (LLM) or autonomous AI agent fleets at multi-tenant scale, Prior exposure to quantitative trading systems, high-frequency execution runtimes, or deep integrations with platforms such as Hyperliquid

What You'll Do.

and scale user agent fleet

and predictable containerized process

Track costs and manage lifecycle hooks

and harden private overlay networks

Link user agents securely with MCP servers and

Design and construct zero-touch deployment pipeline

Enable automated provisioning of containers

and maintain monitoring

Guarantee graceful state retention

Preserve live in-flight transaction states

Oversee system health

Participate in incident response

Participate in on-call rotations

Full Job Description

**Location: Remote - EST timezone** **Remote | Full-time** **Compensation: $100K - $130K** We are hiring on behalf of our client who is seeking an exceptional, production-proven Infrastructure & DevOps Engineer to take absolute ownership of the deployment, secure networking, architectural lifecycle, and overall reliability of this distributed agent fleet from day one. The client is engineering a sophisticated infrastructure designed to launch a highly distributed fleet of managed, single-tenant personal artificial intelligence (AI) trading agents. Operating non-stop, these isolated processes execute high-frequency, complex financial workflows natively on blockchain infrastructure, dedicated exclusively to individual user portfolios. **Key Responsibilities** * **Fleet Orchestration & Scaling:** Architect, provision, and scale the core user agent fleet across a hybrid Railway and AWS ecosystem, ensuring each user retains an isolated, secure, and predictable containerized process with optimized cost tracking and precise lifecycle hooks. * **Secure Network Engineering:** Establish, manage, and continuously harden private overlay networks using Tailscale in production, linking disparate user agents securely with core Model Context Protocol (MCP) servers and the underlying live trading runtimes. * **Automated User Provisioning:** Design and construct an end-to-end, zero-touch deployment pipeline utilizing advanced infrastructure-as-code and CI/CD best practices, enabling seamless, single-click automated provisioning of containers, secrets management, and environmental configurations for new users. * **Operational Resilience & SRE:** Define, build, and maintain comprehensive monitoring, telemetry, alerting, and automated incident response frameworks to guarantee graceful state retention, preserving live in-flight transaction states across sudden host restarts, scheduled key rotations, or regional cloud outages. * **Incident Management:** Oversee system health and participa

Free ATS check

Applying for this DevOps / Infrastructure Engineer role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

ANONYMOUS · UNFILTERED

What do employees actually say about MLabs?

Real rants from real employees. Read before you apply.

Read Company Rants →