Mirantis
Technology
L2DatacenterSupportEngineer
Neural analysis suggests this role is
optimal for entry candidates.
“L2 Datacenter Support Engineer at Mirantis. Skills: InfiniBand fabrics, GPU clusters, Kubernetes IaaS, Bare metal lifecycle, Infrastructure support. Troubleshoot InfiniBand fabrics. Maintain InfiniBand fabrics”
What You'll Achieve.
High reliability InfiniBand; High performance InfiniBand; High reliability GPU infrastructure; High performance GPU infrastructure; Scalable bare metal provisioning; Efficient bare metal provisioning
Industry & Context.
Troubleshooting; Root cause analysis
What They're Looking For.
Must Have
3–6+ years infrastructure operations, 3–6+ years datacenter engineering, 3–6+ years cloud platforms, Linux systems expertise, Bare metal provisioning systems experience, Bare metal lifecycle management experience, InfiniBand networking experience, InfiniBand troubleshooting experience, InfiniBand performance experience, InfiniBand fabric management experience, IPAM/DCIM tools experience, NetBox experience, Ethernet network configuration experience, Ethernet validation experience, Datacenter networking understanding, Datacenter storage understanding, Datacenter hardware architecture understanding, Kubernetes production environments knowledge, Hardware troubleshooting skills, Distributed systems troubleshooting skills
Nice to Have
NVIDIA GPU platforms experience, Accelerated computing infrastructure experience, Automation tools familiarity, OpenStack exposure, Observability stacks familiarity
What You'll Do.
Troubleshoot InfiniBand fabrics
Maintain InfiniBand fabrics
Tune InfiniBand performance
Validate InfiniBand topology
Act as escalation point
Maintain infrastructure modeling
Maintain source-of-truth data
Manage InfiniBand fabric
Troubleshoot high-performance interconnects
Optimize high-performance interconnects
Diagnose issues across GPU servers
Diagnose issues across networking
Diagnose issues across storage
Diagnose issues across Kubernetes
Perform hardware diagnostics
Perform system-level diagnostics
Support Kubernetes platform stability
Support networking issues
Support scheduling issues
Contribute to automation
Automate provisioning workflows
Automate operational workflows
Lead incident response
Perform root cause analysis
Implement post-incident improvements
Collaborate with vendors
Collaborate with engineering teams
Support infrastructure upgrades
Manage capacity expansion
How You'll Work.
Team & Collaboration
Engineering teams; Internal teams; Vendor collaboration
Full Job Description
Mirantis helps organizations ship code faster on public and private clouds. The company provides a public cloud experience on any infrastructure from the data center to the edge. With Lens and the Mirantis Cloud Native Platform, Mirantis empowers a new breed of Kubernetes developers by removing infrastructure and operations complexity and providing one cohesive cloud experience for complete app and devops portability, a single pane of glass, and automated full-stack lifecycle management with continuous updates. Mirantis serves many of the world’s leading enterprises, including Adobe, DocuSign, Liberty Mutual, PayPal, Reliance Jio, Societe Generale, Splunk, and Volkswagen. Learn more at [www.mirantis.com](http://www.mirantis.com/). We are looking for an experienced L2 Engineer to operate and support high-performance AI infrastructure platforms, including NVIDIA GPU clusters, InfiniBand fabrics, and Kubernetes-based IaaS environments. This role focuses on deep infrastructure expertise, ensuring performance, scalability, and reliability of the platform layer that powers AI workloads — without being responsible for the workloads themselves. You will play a key role in bare metal lifecycle management, advanced InfiniBand troubleshooting, and platform stability, working closely with engineering teams to operate cutting-edge infrastructure at scale. Key responsibilities: * Troubleshoot and maintain InfiniBand fabrics, including performance tuning, link issues, and topology validation. * Act as the escalation point for L1 for complex infrastructure and hardware issues. * Own and maintain accurate infrastructure modeling, IPAM, and source-of-truth data in NetBox. * Own InfiniBand fabric management and advanced troubleshooting, utilizing Verity for configuration, monitoring, and optimization of high-performance interconnects. * Diagnose and resolve issues across GPU servers, networking, storage, and Kubernetes platforms. * Perform deep hardware and system-level diagnostics (G
Applying for this L2 Datacenter Support Engineer role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on SmartRecruiters
- SmartRecruiters often includes a video screening step — check camera and mic permissions.
- Link your GitHub or portfolio directly in the profile section for technical roles.
- Applications may be reviewed by AI scoring before reaching a recruiter — use keywords from the job description.
ANONYMOUS · UNFILTERED
What do employees actually say about Mirantis?
Real rants from real employees. Read before you apply.