Amazon Data Services, Inc.

Data Centers

Sr.HardwareReliabilityEngineer,InfrastructureReliability&Quality

$136–185k Herndon, Virginia, United States FULL TIME
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is
optimal for Senior candidates.

The Brief

“Sr. Hardware Reliability Engineer, Infrastructure Reliability & Quality at Amazon Data Services, Inc.. Skills: Hardware Reliability Engineering, Generator systems, Datacenter availability. Drive reliability risk identification. Assess reliability risk”

What You'll Achieve.

Enhance datacenter availability; Drive continuous improvement

Industry & Context.

Data Centers
Problems you'll solve

Problem analysis; Problem solving; Root cause analysis

Eligibility Requirements

Travel within US, Travel internationally

What They're Looking For.

Must Have

8+ years Reliability Engineering experience, 3+ years accelerated life testing, 3+ years stress analysis, 3+ years finite element analysis, Bachelor's or Master's degree, Experience in mission critical facilities

Nice to Have

10+ years reliability risk identification, Experience with power generation equipment, Experience with diesel generators, Experience with gas generators, Experience with rotating machinery, Experience with proactive reliability approaches, Experience with external supply chain partners

What You'll Do.

Drive reliability risk identification

Assess reliability risk

Mitigate reliability risk

Perform root cause analysis

Drive continuous improvements

Enhance datacenter availability

Work with internal teams

Work with external partners

Drive product specification

Develop analytical methods

Develop empirical methods

Implement analytical methods

Implement empirical methods

Drive AWS application requirements

Evaluate generator design quality

Evaluate generator reliability risks

Assess manufacturing process quality

Analyze generator test data

Analyze field performance data

Drive proactive risk mitigation

Lead critical component identification

Define vendor selection requirements

Define vendor qualification requirements

Establish critical-to-quality metrics

Establish critical-to-reliability metrics

Develop datacenter system reliability models

Monitor generator fleet performance

Lead root cause analysis

Drive corrective actions

Drive preventive actions

Conduct business reviews

Drive continuous improvement

Drive DFR methodology

Design-in reliability

Drive reliability qualification

Oversee factory testing

Guide root cause analysis

Support root cause analysis

Ensure highest standards

Analyze internal reliability data

Develop end of life strategy

How You'll Work.

Team & Collaboration

Internal teams; External partners; Generator OEMs; Fuel system suppliers; Service providers; Cross-functional teams

Communication Scope

Verbal communication; Written communication

Process & Methodology

Program management

Full Job Description

As an Infrastructure Reliability Engineer specializing in Power Generation, you will be proactively driving the reliability risk identification, assessment, and mitigation for datacenter LV & MV generator systems. You will be responsible for root cause analysis of critical generator failures and drive continuous improvements to enhance datacenter availability for AWS customers. You will work closely with both internal teams and external partners including generator OEMs, fuel system suppliers, and service providers to drive key aspects of product specification, risk identification, and execution. You must be ownership minded, independent, action and results oriented to succeed in an open collaborative environment. The candidate should have experience applying Physics-of-Failure (PoF) based approaches to develop and implement both analytical and empirical methods for generator quality and reliability risk identification across design, manufacture, and deployment stages. The candidate should be able to drive AWS application-specific requirements for lifecycle environmental and operational stress analysis of generator systems. The candidate should be capable of evaluating not only generator design quality and reliability risks, but also have the skills and experience in assessing manufacturing process related quality issues for generator components and assemblies. Knowledge of statistical techniques and models is required to analyze generator test data and field performance data to identify trends and drive proactive risk mitigation. At the component level, the candidate will lead critical component identification for generator systems and define the associated vendor selection and qualification requirements. The candidate will be expected to use knowledge of production process capability and system-level performance requirements to establish critical-to-quality and critical-to-reliability metrics for generator components and subsystems. At the system level, the candid

Free ATS check

Applying for this Sr. Hardware Reliability Engineer, Infrastructure Reliability & Quality role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

ANONYMOUS · UNFILTERED

What do employees actually say about Amazon Data Services, Inc.?

Real rants from real employees. Read before you apply.

Read Company Rants →