Crusoe
Technology
SeniorProductionEngineer
Neural analysis suggests this role is
optimal for Senior candidates.
“Senior Production Engineer at Crusoe. Skills: Distributed systems, AI infrastructure, LLM workloads. Design managed AI services. Operate managed AI services”
Industry & Context.
Problem-solving
What They're Looking For.
Must Have
Software engineering background, Production-grade systems experience, Distributed systems design experience, Large language models experience, SLI/SLO definition experience, Monitoring systems building experience, Performance/reliability improvement experience, Fault-tolerant systems design experience, Automated testing strategies experience, Proficiency in Python, Proficiency in Go, Proficiency in Java, Proficiency in C++, Kubernetes familiarity
Nice to Have
Experience scaling inference workloads, Experience scaling training workloads
What You'll Do.
Design managed AI services
Operate managed AI services
Build automation tooling
Build reliability tooling
Support inference services
Optimize training clusters
Optimize inference clusters
Automate observability
Build performance tuning
Investigate reliability issues
Resolve reliability issues
Contribute to architecture
How You'll Work.
Team & Collaboration
AI teams; Platform teams; Infrastructure teams
Full Job Description
Crusoe is on a mission to accelerate the abundance of energy and intelligence. As the only vertically integrated AI infrastructure company built from the ground up, we own and operate each layer of the stack — from electrons to tokens — to power the world's most ambitious AI workloads. When you join Crusoe, you join a team that is building the future, faster. We're in the midst of the greatest industrial revolution of our time. The demand for AI compute is boundless, and power is a bottleneck. We're solving that — with an energy-first approach that makes AI infrastructure better for the world and faster for the people innovating with AI. We're looking for problem-solving, opportunity-finding teammates with a sense of urgency, who believe in the scale of our ambition and thrive on a path not fully paved — people who want to grow their careers alongside a team of experts across energy, manufacturing, data center construction, and cloud services. If you want to do the most meaningful work of your career, help our customers and partners advance their AI strategies, and be part of a high-performing team that believes in each other, come build with us at Crusoe. About the Role: At Crusoe, our Production Engineering team ensures the reliability and scalability of Crusoe’s AI-optimized cloud platform. We’re looking for a Senior Production Engineer with a strong background in distributed systems and hands-on experience with large language models to help us build and operate managed AI services at scale. This role is central to delivering highly available, performant, and cost-efficient AI infrastructure that powers compute-intensive, latency-sensitive workloads for our customers. What You’ll Work On: - Design and operate reliable managed AI services with a focus on serving and scaling LLM workloads - Build automation and reliability tooling to support distributed AI pipelines and inference services - Define, measure, and improve SLIs/SLOs across AI workloads to ensure perfor
Applying for this Senior Production Engineer role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Ashby
- Ashby is a fast modern ATS — most applications take under 3 minutes.
- The resume parser is strong; verify parsed experience dates and job titles.
- Custom screening questions are often scored algorithmically — answer completely.
- Location field affects geo-based screening; use your actual metro area.
ANONYMOUS · UNFILTERED
What do employees actually say about Crusoe?
Real rants from real employees. Read before you apply.