Vultr

Cloud Infrastructure

RMA Systems Engineer

Chennai, India FULL TIME

Market Sentiment

HIGH DEMAND

Neural analysis suggests this role is optimal for Mid candidates.

The Brief

“RMA Systems Engineer at Vultr. Skills: hardware and system level failures analysis, GPU/CPU hardware failure analysis, advanced troubleshooting, vendor management, RMA process improvement. Analyze GPU/CPU hardware failures by evaluating system logs, performance data, firmware behavior, and workload interactions to determine root cause. Perform advanced troubleshooting across hardware, firmware, and operating system layers to validate faults and identify appropriate remediation paths”

What You'll Achieve.

ensure system reliability; continuous improvement of Vultr’s RMA processes

Industry & Context.

Cloud Infrastructure

Problems you'll solve

analyze, diagnose, and resolve complex hardware and system level failures; systems-level troubleshooting expertise; ability to analyze non-routine technical issues; determine root cause; make informed decisions on repair, replacement, or escalation paths; Ability to analyze technical issues, apply judgment, and determine appropriate resolution paths

What They're Looking For.

Must Have

3–5 years of experience managing RMA processes or hardware logistics in a technical environment

Experience using Jira to manage, track, and document technical issues, RMA cases, and vendor escalations, including maintaining accurate and audit-ready case records

Experience diagnosing and troubleshooting complex hardware and system-level issues in a data center or infrastructure environment

Experience in RMA workflows, hardware lifecycle management, or infrastructure support

Hands-on experience with NVIDIA and/or AMD GPU technologies

Familiarity with analyzing system logs, firmware behavior, and performance metrics to determine root cause

Experience working with hardware vendors and managing technical escalations or support cases

Understanding of server hardware, system architecture, and data center operations

Ability to analyze technical issues, apply judgment, and determine appropriate resolution paths

Written and verbal communication skills, particularly in documenting technical findings and collaborating with cross-functional teams

Experience working with microcloud or distributed infrastructure environments, including understanding of system architecture and hardware integration

Experience supporting or analyzing systems within data center environments, including hardware lifecycle, performance, and reliability considerations

Nice to Have

Experience with tools such as JIRA, Confluence, and vendor support portals

What You'll Do.

Analyze GPU/CPU hardware failures by evaluating system logs, performance data, firmware behavior, and workload interactions to determine root cause

Perform advanced troubleshooting across hardware, firmware, and operating system layers to validate faults and identify appropriate remediation paths

Serve as a technical liaison with hardware vendors, providing detailed diagnostic data, interpreting vendor responses, and influencing resolution strategies

Determine and coordinate on-site hardware remediation activities based on technical analysis, including guiding vendors and internal teams on appropriate repair or replacement actions

Evaluate software and firmware versions to identify compatibility issues or contributing factors to system failures

Validate and certify system stability and readiness prior to returning hardware to production environments

Identify recurring failure patterns and recommend improvements to RMA workflows, vendor processes, and hardware lifecycle management

Create and maintain detailed technical documentation, including failure analysis, troubleshooting methodologies, and resolution outcomes

Manage and contribute to technical case documentation within vendor portals, ensuring accuracy, completeness, and alignment with diagnostic findings

How You'll Work.

Team & Collaboration

partners closely with Infrastructure, Engineering, and Vendor teams; collaborating with cross-functional teams

Communication Scope

written and verbal communication skills; documenting technical findings; collaborating with cross-functional teams

Full Job Description

WHO WE ARE

Vultr is on a mission to make high-performance cloud infrastructure easy to use, affordable, and locally accessible for enterprises and AI innovators around the world. With 32 global cloud data center locations, Vultr is trusted by hundreds of thousands of active customers across 185 countries for its flexible, scalable, global Cloud Compute, Cloud GPU, Bare Metal, and Cloud Storage solutions. In December 2024 Vultr announced an equity financing at a $3.5 billion valuation. Founded by David Aninowsky and self-funded for over a decade, Vultr has grown to become the world’s largest privately-held cloud infrastructure company.

VULTR CARES

- Medical Insurance stipend paid annually
- 9 Company-Paid Holidays
- Generous Leave Policy + 1 month paid sabbatical every 5 years + Anniversary Bonus each year
- First year remote office setup + reimbursement per quarter each subsequent year for new equipment
- Professional Development Reimbursement
- Internet reimbursement
- Fitness membership reimbursement
- Company paid Wellable subscription

JOIN VULTR

Vultr is seeking a highly skilled and experienced RMA Systems Engineer to analyze, diagnose, and resolve complex hardware and system level failures across our cloud infrastructure platform. This role is responsible for evaluating failure patterns across GPU, CPU, and server environments, determining appropriate remediation strategies, and improving how Vultr manages hardware lifecycle and vendor interactions.

The ideal candidate brings strong systems-level troubleshooting expertise and the ability to analyze non-routine technical issues, determine root cause, and make informed decisions on repair, replacement, or escalation paths. This is a highly visible role that partners closely with Infrastructure, Engineering, and Vendor teams to ensure system reliability and continuous improvement of Vultr’s RMA processes. It will also require the ability to adapt to the latest technology, help drive resolutions for never before

SKILL SIGNAL 49 detected · ranked by frequency

analyze, diagnose, and resolve complex hardware and system level failures ×3

evaluating failure patterns ×3

determining appropriate remediation strategies ×3

systems-level troubleshooting ×3

analyze non-routine technical issues ×3

determine root cause ×3

make informed decisions on repair, replacement, or escalation paths ×3

analyze GPU/CPU hardware failures ×3

evaluating system logs ×3

performance data ×3

firmware behavior ×3

workload interactions ×3

Perform advanced troubleshooting across hardware, firmware, and operating system layers ×3

validate faults ×3

identify appropriate remediation paths ×3

interpreting vendor responses ×3

influencing resolution strategies ×3

Determine and coordinate on-site hardware remediation activities ×3

guiding vendors and internal teams on appropriate repair or replacement actions ×3

Evaluate software and firmware versions ×3

identify compatibility issues or contributing factors to system failures ×3

Validate and certify system stability and readiness ×3

Identify recurring failure patterns ×3

recommend improvements to RMA workflows, vendor processes, and hardware lifecycle management ×3

Manage and contribute to technical case documentation within vendor portals ×3

ensuring accuracy, completeness, and alignment with diagnostic findings ×3

hardware and system level failures analysis ×2

GPU/CPU hardware failure analysis ×2

advanced troubleshooting ×2

vendor management ×2

RMA process improvement ×2

GPU

BEHAVIOURAL

adapt to the latest technology; drive resolutions for never before seen issues; collaborating with cross-functional teams

Role Details

Experience 3–5 yrs

Level Mid

Type FULL TIME

Education Bachelor's degree in Business, Supply Chain, Engineering, or

Category infrastructure-operations

AI-Extracted Insights

Domain Areas

cloud-infrastructure-platform; gpu; cpu; and-server-environments; microcloud-or-distributed-infrastructure-environments; data-center-environments

How to Apply on Ashby

Ashby is a fast modern ATS — most applications take under 3 minutes.
The resume parser is strong; verify parsed experience dates and job titles.
Custom screening questions are often scored algorithmically — answer completely.
Location field affects geo-based screening; use your actual metro area.
