Crusoe

AI Infrastructure

StaffNetworkEngineer,Operations

$195–235k San Francisco, California, United States FULL TIME
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is
optimal for Staff candidates.

The Brief

“Staff Network Engineer, Operations at Crusoe. Skills: Network operations, Incident response, Automation. Own uptime across global network infrastructure. Lead end-to-end response for network events”

What You'll Achieve.

Keep hyperscale AI infrastructure running; Affect availability of AI workloads

Industry & Context.

AI Infrastructure
Problems you'll solve

Root cause analysis; Troubleshooting

Eligibility Requirements

On-call responsibility, Escalation point during critical events

What They're Looking For.

Must Have

8+ years production network engineering, Hands-on observability/monitoring tools experience, Experience operating RDMA/RoCE lossless fabrics, Expert BGP, EVPN-VXLAN, IS-IS, OSPF, MPLS, QoS, TCP/IP knowledge, Proficiency Arista (EOS) and Juniper (Junos) platforms, Python proficiency for automation, Comfort operating large device fleets, On-call responsibility, Bachelor's degree in CS/EE or equivalent experience

Nice to Have

NVIDIA/Mellanox networking platforms experience, Kentik or Arbor familiarity, Experience defining SLIs/SLOs, Experience operating 10K+ device fleets, Background contributing to post-incident learning programs

What You'll Do.

Own uptime across global network infrastructure

Lead end-to-end response for network events

Contribute to end-to-end response

Mitigate network events

Communicate with stakeholders

Drive RCAs for production incidents

Identify systemic issues

Author remediation plans

Improve network monitoring stack

Maintain escalation playbooks

Automate remediation workflows

Define network reliability metrics

Track service level objectives

Provide technical guidance to engineers

Contribute to operational excellence culture

Contribute to continuous learning culture

How You'll Work.

Team & Collaboration

Partner with Architecture teams; Partner with SRE teams

Full Job Description

Crusoe is on a mission to accelerate the abundance of energy and intelligence. As the only vertically integrated AI infrastructure company built from the ground up, we own and operate each layer of the stack — from electrons to tokens — to power the world's most ambitious AI workloads. When you join Crusoe, you join a team that is building the future, faster. We're in the midst of the greatest industrial revolution of our time. The demand for AI compute is boundless, and power is a bottleneck. We're solving that — with an energy-first approach that makes AI infrastructure better for the world and faster for the people innovating with AI. We're looking for problem-solving, opportunity-finding teammates with a sense of urgency, who believe in the scale of our ambition and thrive on a path not fully paved — people who want to grow their careers alongside a team of experts across energy, manufacturing, data center construction, and cloud services. If you want to do the most meaningful work of your career, help our customers and partners advance their AI strategies, and be part of a high-performing team that believes in each other, come build with us at Crusoe. About This Role: Crusoe Cloud is seeking a Staff Network Operations Engineer to help own production reliability across our global network infrastructure, including edge, backbone, data center fabric, and GPU cluster interconnects. This is a hands-on production ownership role focused on incident response, root cause analysis, and operational excellence initiatives that keep our hyperscale AI infrastructure running at scale. Your work will directly affect the availability of AI workloads running across thousands of GPUs worldwide. The ideal candidate is a seasoned network engineer with deep operational experience in large-scale environments who thrives in high-pressure situations and takes pride in keeping systems healthy. You'll contribute to defining SLIs and SLOs, improving observability tooling, building automat

Free ATS check

Applying for this Staff Network Engineer, Operations role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

How to Apply on Ashby

  • Ashby is a fast modern ATS — most applications take under 3 minutes.
  • The resume parser is strong; verify parsed experience dates and job titles.
  • Custom screening questions are often scored algorithmically — answer completely.
  • Location field affects geo-based screening; use your actual metro area.

ANONYMOUS · UNFILTERED

What do employees actually say about Crusoe?

Real rants from real employees. Read before you apply.

Read Company Rants →