Amazon Data Services, Inc.

Cloud Computing

Sr.InfrastructureReliabilityEngineer

$60–185k Herndon, Virginia, United States FULL TIME
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is
optimal for Senior candidates.

The Brief

“Sr. Infrastructure Reliability Engineer at Amazon Data Services, Inc.. Skills: Infrastructure Reliability, Risk assessment, Root cause analysis. Drive reliability risk identification. Assess reliability risk”

What You'll Achieve.

Improve datacenter availability

Industry & Context.

Cloud Computing
Problems you'll solve

Problem analysis; Problem solving

Eligibility Requirements

Travel within US, International travel

What They're Looking For.

Must Have

Bachelor's degree in Engineering, 10+ years Reliability Engineering experience, 3+ years failure analysis experience, 3+ years accelerated life testing experience

Nice to Have

Master's or Ph.D. in related field, 10+ years reliability risk assessment experience, Experience with proactive reliability approaches, Experience working with supply chain partners, Familiarity with data center equipment reliability

What You'll Do.

Drive reliability risk identification

Assess reliability risk

Mitigate reliability risk

Perform root cause analysis

Drive continuous improvements

Improve datacenter availability

Work with internal partners

Work with outside partners

Drive product specification

Drive risk identification plan

Develop analytical approaches

Implement analytical approaches

Develop empirical approaches

Implement empirical approaches

Assess product quality risks

Assess reliability risks

Assess manufacture quality risks

Assess manufacture reliability risks

Evaluate product design risks

Assess electronics manufacture process issues

Drive critical component identification

Drive vendor selection requirements

Drive vendor qualification requirements

Establish critical to quality metrics

Establish reliability metrics

Develop datacenter system reliability model

Perform reliability quantification

Perform risk analysis

Monitor product performance

Drive corrective actions

Drive preventive actions

Drive vendor auditing

Drive quarterly review process

How You'll Work.

Team & Collaboration

Collaborate with AWS; Work with suppliers

Process & Methodology

Program management

Full Job Description

AWS Infrastructure Services owns the design, planning, delivery, and operation of all AWS global infrastructure. In other words, we’re the people who keep the cloud running. We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment that ensure our customers have continual access to the innovation they rely on. We work on the most challenging problems, with thousands of variables impacting the supply chain — and we’re looking for talented people who want to help. You’ll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers, and other vital roles. You’ll collaborate with people across AWS to help us deliver the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers. And you’ll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion. As a Senior Infrastructure Reliability Engineer you will be proactively driving the reliability risk identification, assessment and mitigation for datacenter infrastructure equipment (Example: LV Generator, MV Transformers, LV SWGR, Breakers, UPS, HV Transformers, In-rack Power shelf etc.). You will also be responsible for root cause analysis of critical equipment failures and drive the continuous improvements to improve datacenter availability for AWS customers. You will work closely with both internal and outside partners including suppliers to drive key aspects of product specification, risk identification plan and execution. You must be ownership minded, independent, action and results oriented to succeed in an open collaborative environment. The candidate should have experience in using Physics-of-Failure based approach to develop and implement both analytical and empirical approaches for product quality/reliability risk identification and assessment during product design, manufacture as well a

Free ATS check

Applying for this Sr. Infrastructure Reliability Engineer role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

ANONYMOUS · UNFILTERED

What do employees actually say about Amazon Data Services, Inc.?

Real rants from real employees. Read before you apply.

Read Company Rants →