Amazon Data Services, Inc.

Technology

HardwareReliabilityEngineer,InfrastructureReliability&Quality

$80–160k Herndon, Virginia, United States FULL TIME

Market Sentiment

HIGH DEMAND

Neural analysis suggests this role is
optimal for Mid+ candidates.

The Brief

“Hardware Reliability Engineer, Infrastructure Reliability & Quality at Amazon Data Services, Inc.. Skills: Reliability Engineering, Risk assessment, Root cause analysis. Drive reliability risk identification. Drive reliability assessment”

What You'll Achieve.

Improve datacenter availability; Improve datacenter security

Industry & Context.

Technology

Problems you'll solve

Root cause analysis; Troubleshooting; Problem solving

Eligibility Requirements

Travel within US, International travel

What They're Looking For.

Must Have

2+ years of root cause analysis, Bachelor's or Master's degree, 4+ years of Reliability Engineering work experience, 2+ years experience with accelerated life testing, 2+ years experience with stress analysis, 2+ years experience with finite element analysis

Nice to Have

Experience in data center engineering, 7+ years of work experience in reliability risk identification, 7+ years of work experience in reliability risk assessment, Experience with proactive reliability approaches, Experience with effective reliability approaches, Proven experience in working with external design partners, Proven experience in working with manufacturing supply chain partners, Excellent verbal communication skills, Excellent written communication skills

What You'll Do.

Drive reliability risk identification

Drive reliability assessment

Drive reliability mitigation

Perform root cause analysis

Drive continuous improvements

Improve datacenter availability

Improve datacenter security

Drive product specification

Drive risk identification plan

Develop analytical approaches

Develop empirical approaches

Implement analytical approaches

Implement empirical approaches

Carry out lifecycle environmental analysis

Carry out operational stress analysis

Identify overstress weaknesses

Identify fatigue-related weaknesses

Evaluate product design risks

Evaluate product reliability risks

Assess manufacture process issues

Drive critical component identification

Establish critical to quality metrics

Establish critical to reliability metrics

Develop datacenter system level reliability model

Perform reliability quantification

Perform risk analysis

Monitor product performance

Drive root cause analysis

Drive corrective actions

Drive preventive actions

Drive vendor auditing

Drive quarterly review process

How You'll Work.

Team & Collaboration

Internal partners; Outside partners; Suppliers; Cross-functional teams

Communication Scope

Verbal communication; Written communication

Process & Methodology

Program management

Full Job Description

AWS Infrastructure Services owns the design, planning, delivery, and operation of all AWS global infrastructure. In other words, we’re the people who keep the cloud running. We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment that ensure our customers have continual access to the innovation they rely on. We work on the most challenging problems, with thousands of variables impacting the supply chain — and we’re looking for talented people who want to help. You’ll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers, and other vital roles. You’ll collaborate with people across AWS to help us deliver the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers. And you’ll experience an inclusive culture that welcomes bold ideas and empowers you to own As an Infrastructure Reliability Engineer you will be proactively driving the reliability risk identification, assessment and mitigation for datacenter infrastructure & Security equipment (Example: Air Handling Units, LV Generator, Liquid Cooling, Power Gen, Chillers etc.). You will also be responsible for root cause analysis of critical equipment failures and drive the continuous improvements to improve datacenter availability & security for AWS customers. You will work closely with both internal and outside partners including suppliers to drive key aspects of product specification, risk identification plan and execution. You must be ownership minded, independent, action and results oriented to succeed in an open collaborative environment. The candidate should have experience in using Physics-of-Failure based approach to develop and implement both analytical and empirical approaches for product quality/reliability risk identification and assessment during product design, manufacture as well as deployment stages. The i

Free ATS check

Applying for this Hardware Reliability Engineer, Infrastructure Reliability & Quality role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

Should you apply? AI reads your resume vs this job — match score, gaps to address, ATS keywords.

SKILL SIGNAL 47 detected · ranked by frequency

Reliability block diagram ×4

Statistical modeling ×4

Data analytics ×4

Risk assessment ×3

Analytical approaches ×3

Empirical approaches ×3

Thermal stress analysis ×3

Electrical stress analysis ×3

Chemical stress analysis ×3

Mechanical stress analysis ×3

Overstress analysis ×3

Fatigue analysis ×3

Test data analysis ×3

Field data analysis ×3

Critical component identification ×3

Reliability metrics ×3

Reliability Engineering ×2

Root cause analysis ×2

Physics-of-Failure

Statistical techniques

Statistical models

System reliability engineering

Risk identification

Product quality

Reliability risk

Product specification

Lifecycle environmental analysis

Operational stress analysis

Electronics manufacture process

Component identification

Vendor selection

Vendor qualification

BEHAVIOURAL

Ownership mindedIndependentAction orientedResults orientedOpen collaborative environment

Role Details

Work Mode Onsite

Type FULL TIME

Salary Band 75k-100k

AI-Extracted Insights

Domain Areas

datacenter-infrastructuresecurity-equipmentair-handling-unitslv-generatorliquid-coolingpower-generationchillersphysics-of-failure

ANONYMOUS · UNFILTERED

What do employees actually say about Amazon Data Services, Inc.?

Real rants from real employees. Read before you apply.

Read Company Rants →