Mirantis
Technology
AIInfrastructure&PlatformOperationsEngineer
Neural analysis suggests this role is
optimal for mid candidates.
“AI Infrastructure & Platform Operations Engineer at Mirantis. Skills: AI infrastructure, Kubernetes, Platform operations, NVIDIA GPUs. Monitor AI infrastructure platforms. Operate AI infrastructure platforms”
Industry & Context.
Troubleshooting; Root cause analysis; Performance analysis; Availability analysis; Reliability analysis
Shift-based operational environment
What They're Looking For.
Must Have
3+ years experience, Linux administration, Networking concepts, Kubernetes in production, Support production infrastructure, Analytical skills, Problem-solving skills, Structured operational processes, Incident management processes, Excellent communication skills, Excellent collaboration skills, Shift-based operational environment
Nice to Have
NVIDIA GPU infrastructure, Accelerated computing platforms, InfiniBand networking, NVIDIA UFM, Kubernetes platform operations, AI infrastructure environments, HPC environments, Site Reliability Engineering, Platform Engineering, Observability platforms, Infrastructure automation technologies, Infrastructure-as-Code practices, Large-scale distributed systems, Production platforms
What You'll Do.
Monitor AI infrastructure platforms
Operate AI infrastructure platforms
Support AI infrastructure platforms
Investigate infrastructure incidents
Resolve infrastructure incidents
Investigate networking incidents
Resolve networking incidents
Investigate hardware incidents
Resolve hardware incidents
Investigate platform incidents
Resolve platform incidents
Support NVIDIA GPU infrastructure
Support platform services
Monitor Kubernetes environments
Troubleshoot Kubernetes environments
Investigate performance issues
Investigate availability issues
Investigate reliability issues
Collaborate with engineering teams
Collaborate with hardware vendors
Collaborate with datacenter personnel
Collaborate with service delivery teams
Participate in incident response
Participate in root cause analysis
Participate in operational improvement
Contribute to monitoring improvements
Contribute to observability improvements
Contribute to automation improvements
Contribute to operational process improvements
Maintain operational documentation
Maintain knowledge articles
How You'll Work.
Team & Collaboration
Engineering teams; Hardware vendors; Datacenter personnel; Service delivery teams
Full Job Description
About Mirantis Mirantis is the Kubernetes-native AI infrastructure company, enabling organizations to build and operate scalable, secure, and sovereign infrastructure for modern AI, machine learning, and data-intensive applications. By combining open source innovation with deep expertise in Kubernetes orchestration, Mirantis empowers platform engineering teams to deliver composable, production-ready developer platforms across any environment—on-premises, in the cloud, at the edge, or in sovereign data centers. As enterprises navigate the growing complexity of AI-driven workloads, Mirantis delivers the automation, GPU orchestration, and policy-driven control needed to manage infrastructure with confidence and agility. Committed to open standards and freedom from lock-in, Mirantis ensures that customers retain full control of their infrastructure strategy. We are building a European AI Infrastructure & Platform Operations team responsible for operating large-scale AI infrastructure environments powered by NVIDIA GPUs, high-performance networking, Kubernetes, and next-generation platform technologies. The team is responsible for ensuring the availability, performance, and operational stability of critical AI infrastructure platforms deployed across multiple datacenters. Working at the intersection of infrastructure, networking, and platform operations, you will help support the environments that power modern AI workloads. This is an opportunity to work with some of the latest technologies in AI infrastructure while contributing to the evolution of AI-powered operational services through platforms such as k0rdent AI. Responsibilities: * Monitor, operate, and support production AI infrastructure platforms. * Investigate and resolve infrastructure, networking, hardware, and platform-related incidents. * Support NVIDIA GPU infrastructure and associated platform services. * Monitor and troubleshoot Kubernetes-based environments. * Investigate performance, availability, and
Applying for this AI Infrastructure & Platform Operations Engineer role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on SmartRecruiters
- SmartRecruiters often includes a video screening step — check camera and mic permissions.
- Link your GitHub or portfolio directly in the profile section for technical roles.
- Applications may be reviewed by AI scoring before reaching a recruiter — use keywords from the job description.
ANONYMOUS · UNFILTERED
What do employees actually say about Mirantis?
Real rants from real employees. Read before you apply.