Vultr
Cloud Infrastructure
RMASystemsEngineer
Neural analysis suggests this role is
optimal for Mid candidates.
“RMA Systems Engineer at Vultr. Skills: hardware and system level failures analysis, GPU/CPU hardware failure analysis, advanced troubleshooting, vendor management, RMA process improvement. Analyze GPU/CPU hardware failures by evaluating system logs, performance data, firmware behavior, and workload interactions to determine root cause. Perform advanced troubleshooting across hardware, firmware, and operating system layers to validate faults and identify appropriate remediation paths”
What You'll Achieve.
ensure system reliability; continuous improvement of Vultr’s RMA processes
Industry & Context.
analyze, diagnose, and resolve complex hardware and system level failures; systems-level troubleshooting expertise; ability to analyze non-routine technical issues; determine root cause; make informed decisions on repair, replacement, or escalation paths; Ability to analyze technical issues, apply judgment, and determine appropriate resolution paths
What They're Looking For.
Must Have
3–5 years of experience managing RMA processes or hardware logistics in a technical environment, Experience using Jira to manage, track, and document technical issues, RMA cases, and vendor escalations, including maintaining accurate and audit-ready case records, Experience diagnosing and troubleshooting complex hardware and system-level issues in a data center or infrastructure environment, Experience in RMA workflows, hardware lifecycle management, or infrastructure support, Hands-on experience with NVIDIA and/or AMD GPU technologies, Familiarity with analyzing system logs, firmware behavior, and performance metrics to determine root cause, Experience working with hardware vendors and managing technical escalations or support cases, understanding of server hardware, system architecture, and data center operations, Ability to analyze technical issues, apply judgment, and determine appropriate resolution paths, written and verbal communication skills, particularly in documenting technical findings and collaborating with cross-functional teams, Experience working with microcloud or distributed infrastructure environments, including understanding of system architecture and hardware integration, Experience supporting or analyzing systems within data center environments, including hardware lifecycle, performance, and reliability considerations
Nice to Have
Experience with tools such as JIRA, Confluence, and vendor support portals
What You'll Do.
Analyze GPU/CPU hardware failures by evaluating system logs
and workload interactions to determine root cause
Perform advanced troubleshooting across hardware
and operating system layers to validate faults and identify appropriate remediation paths
Serve as a technical liaison with hardware vendors
providing detailed diagnostic data
interpreting vendor responses
and influencing resolution strategies
Determine and coordinate on-site hardware remediation activities based on technical analysis
including guiding vendors and internal teams on appropriate repair or replacement actions
Evaluate software and firmware versions to identify compatibility issues or contributing factors to system failures
Validate and certify system stability and readiness prior to returning hardware to production environments
Identify recurring failure patterns and recommend improvements to RMA workflows
and hardware lifecycle management
Create and maintain detailed technical documentation
including failure analysis
troubleshooting methodologies
and resolution outcomes
Manage and contribute to technical case documentation within vendor portals
and alignment with diagnostic findings
How You'll Work.
Team & Collaboration
partners closely with Infrastructure, Engineering, and Vendor teams; collaborating with cross-functional teams
Communication Scope
written and verbal communication skills; documenting technical findings; collaborating with cross-functional teams
Full Job Description
WHO WE ARE Vultr is on a mission to make high-performance cloud infrastructure easy to use, affordable, and locally accessible for enterprises and AI innovators around the world. With 32 global cloud data center locations, Vultr is trusted by hundreds of thousands of active customers across 185 countries for its flexible, scalable, global Cloud Compute, Cloud GPU, Bare Metal, and Cloud Storage solutions. In December 2024 Vultr announced an equity financing at a $3.5 billion valuation. Founded by David Aninowsky and self-funded for over a decade, Vultr has grown to become the world’s largest privately-held cloud infrastructure company. VULTR CARES - Medical Insurance stipend paid annually - 9 Company-Paid Holidays - Generous Leave Policy + 1 month paid sabbatical every 5 years + Anniversary Bonus each year - First year remote office setup + reimbursement per quarter each subsequent year for new equipment - Professional Development Reimbursement - Internet reimbursement - Fitness membership reimbursement - Company paid Wellable subscription JOIN VULTR Vultr is seeking a highly skilled and experienced RMA Systems Engineer to analyze, diagnose, and resolve complex hardware and system level failures across our cloud infrastructure platform. This role is responsible for evaluating failure patterns across GPU, CPU, and server environments, determining appropriate remediation strategies, and improving how Vultr manages hardware lifecycle and vendor interactions. The ideal candidate brings strong systems-level troubleshooting expertise and the ability to analyze non-routine technical issues, determine root cause, and make informed decisions on repair, replacement, or escalation paths. This is a highly visible role that partners closely with Infrastructure, Engineering, and Vendor teams to ensure system reliability and continuous improvement of Vultr’s RMA processes. It will also require the ability to adapt to the latest technology, help drive resolutions for never before
Applying for this RMA Systems Engineer role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Ashby
- Ashby is a fast modern ATS — most applications take under 3 minutes.
- The resume parser is strong; verify parsed experience dates and job titles.
- Custom screening questions are often scored algorithmically — answer completely.
- Location field affects geo-based screening; use your actual metro area.
ANONYMOUS · UNFILTERED
What do employees actually say about Vultr?
Real rants from real employees. Read before you apply.