Amazon.com Services LLC

Technology

SoftwareDevelopmentManager,DataCenter-GenAI

$184–250k Bellevue, Washington, United States FULL TIME
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is
optimal for Manager candidates.

The Brief

“Software Development Manager, Data Center - GenAI at Amazon.com Services LLC. Skills: Agentic AI, Generative AI, Distributed systems, Full-stack development. Lead and mentor SDEs. Build and operate Agentic AI Platform”

What You'll Achieve.

Deliver highest standards for quality; Deliver highest standards for reliability; Provide infinite capacity; Lowest possible cost

Industry & Context.

Technology
Problems you'll solve

Troubleshooting; Root cause analysis

What They're Looking For.

Must Have

3+ years engineering team management, 7+ years working directly within engineering teams, Knowledge of engineering practices and patterns for the full software/hardware/networks development life cycle, Experience partnering with product or program management teams, 3+ years developing large-scale, multi-tiered distributed software systems

Nice to Have

Experience delivering products against plan in a fast-paced, multi-disciplined, distributed-responsibility and often ambiguous environment, Experience in recruiting, hiring, mentoring/coaching and managing teams of Software Engineers, Knowledge of ML, NLP, Information Retrieval and Analytics, Experience working with fast-moving, high-performance teams, Experience leading teams building AI/ML or generative AI systems in production

What You'll Do.

Build and operate Agentic AI Platform

Foster culture of ownership

Foster culture of innovation

Foster culture of operational excellence

Own end-to-end technical roadmap

Balance investments across agentic AI capabilities

Balance investments across platform infrastructure

Balance investments across frontend experiences

Balance investments across search/knowledge systems

Drive architecture of agentic AI systems

Deliver agentic AI systems

Implement prompt engineering

Develop agent frameworks

Implement tool-calling patterns

Implement multi-agent orchestration

Lead development of full-stack serverless solutions

Deliver scalable platform capabilities

Own design of search and knowledge systems

Implement vector embeddings

Implement hybrid retrieval

Implement document processing pipelines

Implement semantic chunking

Define evaluation frameworks for agentic AI

Implement guardrails for agentic AI

Implement safety mechanisms for agentic AI

Ensure reliable platform behavior

Ensure trustworthy platform behavior

Build platform primitives

Build reusable components

Enable teams to build AI-powered capabilities

Partner with data center operations

Partner with controls engineering

Partner with product management

Partner with peer engineering teams

Identify high-impact use cases

Translate use cases into platform features

Establish engineering excellence

Enforce engineering excellence

Design CI/CD pipeline

Implement progressive deployment

Implement synthetic monitoring

Implement observability

Conduct operational readiness reviews

Own performance management

Own career development

Build diverse pipeline of engineers

Communicate platform strategy

Communicate project status

Communicate business impact to senior leadership

Drive alignment on priorities

Drive alignment on resource allocation

How You'll Work.

Team & Collaboration

Cross-functional stakeholders; Peer engineering teams; Data center operations; Controls engineering; Product management

Communication Scope

Communicate strategy; Communicate roadmaps; Communicate business impact

Process & Methodology

Roadmap planning, Strategic roadmap

Full Job Description

AWS Infrastructure Services owns the design, planning, delivery, and operation of all AWS global infrastructure. In other words, we’re the people who keep the cloud running. We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment that ensure our customers have continual access to the innovation they rely on. We work on the most challenging problems, with thousands of variables impacting the supply chain — and we’re looking for talented people who want to help. You’ll join a diverse team of design engineers, quality/reliability engineers, supply chain specialists, field engineers, and other vital roles. You’ll collaborate with people across AWS to help us deliver the highest standards for quality and reliability while providing seemingly infinite capacity at the lowest possible cost for our customers. And you’ll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion. You'll join a team of Software Development Engineers building an agentic AI platform that serves a broad customer base of design engineers, quality/reliability engineers, supply chain specialists, field engineers, and other vital roles across AWS data center operations. You'll collaborate with people across AWS to help us deliver the highest standards for quality and reliability while providing seemingly infinite capacity at the lowest possible cost for our customers. And you'll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion. As the Software Development Manager for the Data Center Agentic AI Platform team, you will lead a team of Software Development Engineers building AWS data center's agentic GenAI platform that powers AI-assisted operations across the global data center infrastructure. You will own the technical vision and strategic roadmap for the platform, driving investments across agentic AI systems, full-stack engineering, search and knowledge systems

Free ATS check

Applying for this Software Development Manager, Data Center - GenAI role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

ANONYMOUS · UNFILTERED

What do employees actually say about Amazon.com Services LLC?

Real rants from real employees. Read before you apply.

Read Company Rants →