Amazon Data Services, Inc.
Technology
Sr.HardwareReliabilityEngineer,InfrastructureReliability&Quality
Neural analysis suggests this role is
optimal for Senior candidates.
“Sr. Hardware Reliability Engineer, Infrastructure Reliability & Quality at Amazon Data Services, Inc.. Skills: Hardware reliability, Failure analysis, Reliability testing. Design and implement hardware reliability testing plans. Perform failure analysis on hardware components”
What You'll Achieve.
Improve hardware reliability; Reduce hardware failures; Ensure data center availability
Industry & Context.
Root cause analysis; Troubleshooting
What They're Looking For.
Must Have
Bachelor's degree or equivalent practical experience, 5+ years of experience in hardware reliability engineering, Experience with failure analysis, Experience with reliability testing methodologies, Experience with statistical analysis
Nice to Have
Master's degree or PhD in a relevant field, Experience with semiconductor devices, Experience with data centers, Experience with cloud infrastructure, Experience with Python or other scripting languages
What You'll Do.
Design and implement hardware reliability testing plans
Perform failure analysis on hardware components
Develop and maintain reliability models
Identify and mitigate reliability risks
Collaborate with design and manufacturing teams
Analyze field data to identify trends
Develop and implement corrective actions
Document findings and recommendations
How You'll Work.
Team & Collaboration
Design teams; Manufacturing teams; Cross-functional teams
Communication Scope
Technical documentation; Present findings
Full Job Description
As an Infrastructure Reliability Engineer specializing in Power Generation, you will be proactively driving the reliability risk identification, assessment, and mitigation for datacenter LV & MV generator systems. You will be responsible for root cause analysis of critical generator failures and drive continuous improvements to enhance datacenter availability for AWS customers. You will work closely with both internal teams and external partners including generator OEMs, fuel system suppliers, and service providers to drive key aspects of product specification, risk identification, and execution. You must be ownership minded, independent, action and results oriented to succeed in an open collaborative environment. The candidate should have experience applying Physics-of-Failure (PoF) based approaches to develop and implement both analytical and empirical methods for generator quality and reliability risk identification across design, manufacture, and deployment stages. The candidate should be able to drive AWS application-specific requirements for lifecycle environmental and operational stress analysis of generator systems. The candidate should be capable of evaluating not only generator design quality and reliability risks, but also have the skills and experience in assessing manufacturing process related quality issues for generator components and assemblies. Knowledge of statistical techniques and models is required to analyze generator test data and field performance data to identify trends and drive proactive risk mitigation. At the component level, the candidate will lead critical component identification for generator systems and define the associated vendor selection and qualification requirements. The candidate will be expected to use knowledge of production process capability and system-level performance requirements to establish critical-to-quality and critical-to-reliability metrics for generator components and subsystems. At the system level, the candid
Applying for this Sr. Hardware Reliability Engineer, Infrastructure Reliability & Quality role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
ANONYMOUS · UNFILTERED
What do employees actually say about Amazon Data Services, Inc.?
Real rants from real employees. Read before you apply.