Amazon Web Services, Inc.
Cloud Computing
InfrastructureReliabilityEngineer
Neural analysis suggests this role is
optimal for Mid+ candidates.
“Infrastructure Reliability Engineer at Amazon Web Services, Inc.. Skills: Infrastructure Reliability, Risk assessment, Root cause analysis, Continuous improvement. Drive reliability risk identification. Assess reliability risk”
What You'll Achieve.
Improve datacenter availability
Industry & Context.
Problem analysis; Problem solving
Travel within US, International travel
What They're Looking For.
Must Have
4+ years industrial/commercial engineering in mission critical facilities, 4+ years commissioning experience, Bachelor's degree in Electrical Engineering or related field, Experience researching new designs, technologies, construction methods for data center equipment and facilities
Nice to Have
Professional Engineer License, Experience with building codes and regulations, Experience carrying design concepts through exploration, development, and into deployment or mass production, Experience reading, interpreting, and creating construction drawings, specifications, and submittal documents
What You'll Do.
Drive reliability risk identification
Assess reliability risk
Mitigate reliability risk for datacenter infrastructure equipment
Perform root cause analysis of critical equipment failures
Drive continuous improvements to improve datacenter availability
Work closely with internal and outside partners
Drive product specification
Drive risk identification plan and execution
Develop and implement analytical approaches for product quality/reliability
Develop and implement empirical approaches for product quality/reliability
Carry out AWS application-specific requirements
Carry out lifecycle environmental risk analysis
Carry out operational stress driven risk analysis
Identify overstress and fatigue-related product weaknesses
Evaluate product design quality/reliability risks
Assess electronics manufacture process related quality/reliability issues
Drive critical component identification
Establish critical to quality and reliability metrics
Establish reliability metrics
Develop datacenter system level reliability model
Develop related reliability quantification
Develop risk analysis for datacenter configuration optimization
Monitor product performance in the field
Drive corrective actions for critical failures
Drive preventive actions for critical failures
Drive effective vendor auditing
Drive quarterly review process
How You'll Work.
Team & Collaboration
Internal partners; Outside partners; Suppliers; Software engineers; Hardware engineers; Network engineers; Supply chain specialists; Security experts; Operations managers
Communication Scope
Vendor management
Process & Methodology
Program management
Full Job Description
AWS Infrastructure Services owns the design, planning, delivery, and operation of all AWS global infrastructure. In other words, we’re the people who keep the cloud running. We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment that ensure our customers have continual access to the innovation they rely on. We work on the most challenging problems, with thousands of variables impacting the supply chain — and we’re looking for talented people who want to help. You’ll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers, and other vital roles. You’ll collaborate with people across AWS to help us deliver the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers. And you’ll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion. As an Infrastructure Reliability Engineer you will be proactively driving the reliability risk identification, assessment and mitigation for datacenter infrastructure equipment (Example: LV Generator, MV Transformers, LV SWGR, Breakers, UPS, HV Transformers). You will also be responsible for root cause analysis of critical equipment failures and drive the continuous improvements to improve datacenter availability for AWS customers. You will work closely with both internal and outside partners including suppliers to drive key aspects of product specification, risk identification plan and execution. You must be ownership minded, independent, action and results oriented to succeed in an open collaborative environment. The candidate should have experience in using Physics-of-Failure based approach to develop and implement both analytical and empirical approaches for product quality/reliability risk identification and assessment during product design, manufacture as well as deployment stages. The individ
Applying for this Infrastructure Reliability Engineer role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
ANONYMOUS · UNFILTERED
What do employees actually say about Amazon Web Services, Inc.?
Real rants from real employees. Read before you apply.