Amazon Data Services, Inc.
Data Centers
Sr.HardwareReliabilityEngineer,InfrastructureReliability&Quality
Neural analysis suggests this role is
optimal for Senior candidates.
“Sr. Hardware Reliability Engineer, Infrastructure Reliability & Quality at Amazon Data Services, Inc.. Skills: Hardware Reliability Engineering, Generator systems, Datacenter availability. Drive reliability risk identification. Assess reliability risk”
What You'll Achieve.
Enhance datacenter availability; Drive continuous improvement
Industry & Context.
Problem analysis; Problem solving; Root cause analysis
Travel within US, Travel internationally
What They're Looking For.
Must Have
8+ years Reliability Engineering experience, 3+ years accelerated life testing, 3+ years stress analysis, 3+ years finite element analysis, Bachelor's or Master's degree, Experience in mission critical facilities
Nice to Have
10+ years reliability risk identification, Experience with power generation equipment, Experience with diesel generators, Experience with gas generators, Experience with rotating machinery, Experience with proactive reliability approaches, Experience with external supply chain partners
What You'll Do.
Drive reliability risk identification
Assess reliability risk
Mitigate reliability risk
Perform root cause analysis
Drive continuous improvements
Enhance datacenter availability
Work with internal teams
Work with external partners
Drive product specification
Develop analytical methods
Develop empirical methods
Implement analytical methods
Implement empirical methods
Drive AWS application requirements
Evaluate generator design quality
Evaluate generator reliability risks
Assess manufacturing process quality
Analyze generator test data
Analyze field performance data
Drive proactive risk mitigation
Lead critical component identification
Define vendor selection requirements
Define vendor qualification requirements
Establish critical-to-quality metrics
Establish critical-to-reliability metrics
Develop datacenter system reliability models
Monitor generator fleet performance
Lead root cause analysis
Drive corrective actions
Drive preventive actions
Conduct business reviews
Drive continuous improvement
Drive DFR methodology
Design-in reliability
Drive reliability qualification
Oversee factory testing
Guide root cause analysis
Support root cause analysis
Ensure highest standards
Analyze internal reliability data
Develop end of life strategy
How You'll Work.
Team & Collaboration
Internal teams; External partners; Generator OEMs; Fuel system suppliers; Service providers; Cross-functional teams
Communication Scope
Verbal communication; Written communication
Process & Methodology
Program management
Full Job Description
As an Infrastructure Reliability Engineer specializing in Power Generation, you will be proactively driving the reliability risk identification, assessment, and mitigation for datacenter LV & MV generator systems. You will be responsible for root cause analysis of critical generator failures and drive continuous improvements to enhance datacenter availability for AWS customers. You will work closely with both internal teams and external partners including generator OEMs, fuel system suppliers, and service providers to drive key aspects of product specification, risk identification, and execution. You must be ownership minded, independent, action and results oriented to succeed in an open collaborative environment. The candidate should have experience applying Physics-of-Failure (PoF) based approaches to develop and implement both analytical and empirical methods for generator quality and reliability risk identification across design, manufacture, and deployment stages. The candidate should be able to drive AWS application-specific requirements for lifecycle environmental and operational stress analysis of generator systems. The candidate should be capable of evaluating not only generator design quality and reliability risks, but also have the skills and experience in assessing manufacturing process related quality issues for generator components and assemblies. Knowledge of statistical techniques and models is required to analyze generator test data and field performance data to identify trends and drive proactive risk mitigation. At the component level, the candidate will lead critical component identification for generator systems and define the associated vendor selection and qualification requirements. The candidate will be expected to use knowledge of production process capability and system-level performance requirements to establish critical-to-quality and critical-to-reliability metrics for generator components and subsystems. At the system level, the candid
Applying for this Sr. Hardware Reliability Engineer, Infrastructure Reliability & Quality role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
ANONYMOUS · UNFILTERED
What do employees actually say about Amazon Data Services, Inc.?
Real rants from real employees. Read before you apply.