Amazon Data Services, Inc.
Technology
Sr. Hardware Reliability Engineer, Infrastructure Reliability & Quality
Neural analysis suggests this role is optimal for Senior candidates.
“Sr. Hardware Reliability Engineer, Infrastructure Reliability & Quality at Amazon Data Services, Inc. Skills: Hardware Reliability Engineering, Infrastructure Reliability, Quality Assurance. Drive reliability risk identification. Assess reliability risk”
What You'll Achieve.
Improve datacenter availability
Industry & Context.
Problem analysis; Problem solving; Root cause analysis
Travel within US, Travel internationally
What They're Looking For.
Must Have
8+ years Reliability Engineering work experience
3+ years accelerated life testing
3+ years stress analysis
3+ years finite element analysis
Bachelor's or Master's degree in Reliability Engineering, Physics, Electrical, Mechanical or Materials Engineering, or a related field
Experience in industrial or commercial engineering in mission critical facilities
Experience in data centers
Experience in power generation
Experience in oil and gas facilities
Nice to Have
10+ years reliability risk identification and assessment
Experience with proactive and effective reliability approaches
Proven experience working with external design and manufacturing supply chain partners
What You'll Do.
Drive reliability risk identification
Assess reliability risk
Mitigate reliability risk
Perform root cause analysis of failures
Drive continuous improvements
Improve datacenter availability
Work with internal partners
Work with outside partners
Drive product specification
Drive risk identification plan execution
Develop analytical approaches
Implement analytical approaches
Develop empirical approaches
Implement empirical approaches
Assess product quality risks
Assess reliability risks
Assess electronics manufacture process issues
Drive critical component identification
Establish critical to quality metrics
Establish reliability metrics
Develop datacenter system level reliability model
Perform reliability quantification
Perform risk analysis
Optimize datacenter configuration
Monitor product performance in field
Drive root cause analysis of critical failures
Drive corrective actions
Drive preventive actions
Drive effective vendor auditing
Drive quarterly review process
Drive DFR methodology
Design-in reliability in new product designs
Drive reliability qualification of equipment
Drive quality qualification of equipment
Oversee factory testing
Guide root cause analysis
Support root cause analysis
Validate RCA conclusions
Ensure highest standards in testing
Ensure highest standards in remediation
Make recommendations about maintenance
Make recommendations about equipment replacement
Provide feedback to sourcing teams
Provide feedback to procurement teams
Evaluate vendor performance
Analyze internal reliability data
Create metrics to drive reliability
Develop end of life strategy
How You'll Work.
Team & Collaboration
Internal partners; Outside partners; Supply chain partners; Cross-functional teams
Communication Scope
Verbal communication; Written communication
Process & Methodology
Program management
Full Job Description
As an Infrastructure Reliability Engineer, you will proactively drive reliability risk identification, assessment, and mitigation for datacenter infrastructure equipment (for example: Air Handling Units, LV Generators, MV Transformers, LV SWGR, Breakers, UPS, Chillers, etc.). You will also be responsible for root cause analysis of critical equipment failures and will drive continuous improvements to increase datacenter availability for AWS customers. You will work closely with both internal and outside partners, including suppliers, to drive key aspects of product specification and of risk identification planning and execution. You must be ownership-minded, independent, and action- and results-oriented to succeed in an open, collaborative environment.
The candidate should have experience using a Physics-of-Failure based approach to develop and implement both analytical and empirical approaches for product quality/reliability risk identification and assessment during the product design, manufacture, and deployment stages. The individual should be able to drive AWS application-specific requirements in carrying out risk analysis driven by both lifecycle environmental and operational stresses, including thermal, electrical, chemical, and mechanical stresses, so as to identify overstress and fatigue-related product weaknesses. The candidate should be capable of evaluating product design quality/reliability risks and should also have the skills and experience to assess quality/reliability issues related to the electronics manufacturing process. Knowledge of statistical techniques and models is required to analyze both test and field data.
At the component level, the individual will drive critical component identification and the associated vendor selection and qualification requirements. The candidate will be expected to use knowledge of process capability for electronic component production, as well as system-level performance requirements, to establish critical-to-quality and reliability metrics. At t