Amazon Data Services, Inc.

Technology

Sr.HardwareReliabilityEngineer,InfrastructureReliability&Quality

$136–185k Herndon, Virginia, United States FULL TIME

Market Sentiment

HIGH DEMAND

Neural analysis suggests this role is
optimal for Senior candidates.

The Brief

“Sr. Hardware Reliability Engineer, Infrastructure Reliability & Quality at Amazon Data Services, Inc.. Skills: Hardware reliability, Failure analysis, Reliability testing. Design and implement hardware reliability testing plans. Perform failure analysis on hardware components”

What You'll Achieve.

Improve hardware reliability; Reduce hardware failures; Ensure data center availability

Industry & Context.

Technology

Problems you'll solve

Root cause analysis; Troubleshooting

What They're Looking For.

Must Have

Bachelor's degree or equivalent practical experience, 5+ years of experience in hardware reliability engineering, Experience with failure analysis, Experience with reliability testing methodologies, Experience with statistical analysis

Nice to Have

Master's degree or PhD in a relevant field, Experience with semiconductor devices, Experience with data centers, Experience with cloud infrastructure, Experience with Python or other scripting languages

What You'll Do.

Design and implement hardware reliability testing plans

Perform failure analysis on hardware components

Develop and maintain reliability models

Identify and mitigate reliability risks

Collaborate with design and manufacturing teams

Analyze field data to identify trends

Develop and implement corrective actions

Document findings and recommendations

How You'll Work.

Team & Collaboration

Design teams; Manufacturing teams; Cross-functional teams

Communication Scope

Technical documentation; Present findings

Full Job Description

As an Infrastructure Reliability Engineer specializing in Power Generation, you will be proactively driving the reliability risk identification, assessment, and mitigation for datacenter LV & MV generator systems. You will be responsible for root cause analysis of critical generator failures and drive continuous improvements to enhance datacenter availability for AWS customers. You will work closely with both internal teams and external partners including generator OEMs, fuel system suppliers, and service providers to drive key aspects of product specification, risk identification, and execution. You must be ownership minded, independent, action and results oriented to succeed in an open collaborative environment. The candidate should have experience applying Physics-of-Failure (PoF) based approaches to develop and implement both analytical and empirical methods for generator quality and reliability risk identification across design, manufacture, and deployment stages. The candidate should be able to drive AWS application-specific requirements for lifecycle environmental and operational stress analysis of generator systems. The candidate should be capable of evaluating not only generator design quality and reliability risks, but also have the skills and experience in assessing manufacturing process related quality issues for generator components and assemblies. Knowledge of statistical techniques and models is required to analyze generator test data and field performance data to identify trends and drive proactive risk mitigation. At the component level, the candidate will lead critical component identification for generator systems and define the associated vendor selection and qualification requirements. The candidate will be expected to use knowledge of production process capability and system-level performance requirements to establish critical-to-quality and critical-to-reliability metrics for generator components and subsystems. At the system level, the candid

Free ATS check

Applying for this Sr. Hardware Reliability Engineer, Infrastructure Reliability & Quality role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

ANONYMOUS · UNFILTERED

What do employees actually say about Amazon Data Services, Inc.?

Real rants from real employees. Read before you apply.

Read Company Rants →