Amazon.com Services LLC

Technology

SeniorProductManger-Tech,InfrastructureReliability

$151–205k Austin, Texas, United States FULL TIME
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is
optimal for Senior candidates.

The Brief

“Senior Product Manger - Tech, Infrastructure Reliability at Amazon.com Services LLC. Skills: Product roadmap, AI-powered platform, Infrastructure reliability. Own and drive multi-year product roadmap. Define vision, strategy, and success metrics”

What You'll Achieve.

Improve mean time to resolve; Reduce lost labor hours; Improve first page resolution; Measure auto-detection rate; Measure false positive rate; Measure consolidation accuracy; Measure remediation success rate

Industry & Context.

Technology
Problems you'll solve

Root cause analysis; Troubleshooting; Data-driven decision making

What They're Looking For.

Must Have

Bachelor's degree, Experience owning/driving roadmap strategy and definition, Experience with feature delivery and tradeoffs of a product, Experience contributing to engineering discussions around technology decisions and strategy related to a product, Experience managing technical products or online services, Experience in representing and advocating for a variety of critical customers and stakeholders during executive-level prioritization and planning

Nice to Have

Experience in using analytical tools, Experience in building and driving adoption of new tools

What You'll Do.

Own and drive multi-year product roadmap

Write code and deliver proof-of-concepts

Prototype multi-agent reasoning pipelines

Explore anomaly detection approaches

Stress-test LLM prompt chains

Shape platform failure detection and reasoning

Engage with data scientists on model architecture

Define AI reasoning techniques application

Define multi-agent architecture

Define agent roles and communication protocols

Translate operational and technical requirements

Create prioritized backlog

Make tradeoffs between feature depth

Serve as voice of Incident Managers

Advocate for Operations Control Center stakeholders

Define and track business case

Secure continued investment

Establish performance measurement mechanisms

Iterate rapidly based on data

Drive cross-functional alignment

Ensure platform orchestration model adoption

Lead executive-level reviews

Communicate path from detection improvements to readiness

How You'll Work.

Team & Collaboration

Cross-functional alignment; Executive-level reviews

Communication Scope

Executive presentations; Stakeholder communication

Process & Methodology

Roadmap strategy, Feature delivery, Prioritization, Tradeoffs

Full Job Description

Join Amazon's Fulfillment Technologies & Robotics (FTR) team to spearhead the product vision for a platform that ensures Amazon's fulfillment network never stops — even as we move toward fully self-governing, zero-touch operations. You'll own the roadmap for an AI-powered infrastructure reliability platform that prevents, detects, and resolves incidents across thousands of fulfillment sites globally. This is a rare opportunity for a technically deep product leader who can write code, deliver proof-of-concepts, and engage as a peer with data scientists and engineers. You will shape how LLMs, multi-agent systems, and machine learning are applied to one of the most operationally critical platforms Amazon has ever built — and your hands-on technical contributions will directly accelerate the team's ability to move from idea to production. Key job responsibilities - Own and drive the multi-year product roadmap for the Infrastructure Reliability AI-Ops platform, spanning three strategic programs: zero-touch incident resolution, associate-directed work tooling, and predictive failure prevention. This means defining the vision, strategy, and success metrics for AI-powered progressive detection, incident consolidation, self-governing remediation orchestration, and cross-domain observability capabilities that serve thousands of fulfillment sites globally. - Go beyond traditional product management by writing code and delivering working proof-of-concepts that validate technical hypotheses before committing engineering resources. Whether prototyping a multi-agent reasoning pipeline, exploring a new anomaly detection approach, or stress-testing an LLM prompt chain against real incident data, you will use your technical skills to compress the distance between idea and validated direction. - Bring deep knowledge of machine learning fundamentals and apply that knowledge to shape how the platform detects, consolidates, and reasons about failures. You will engage meaningfully with da

Free ATS check

Applying for this Senior Product Manger - Tech, Infrastructure Reliability role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

ANONYMOUS · UNFILTERED

What do employees actually say about Amazon.com Services LLC?

Real rants from real employees. Read before you apply.

Read Company Rants →