Amazon Data Services, Inc.
Technology
SrHardwareDevelopmentEngineer,HighPerformanceAI&MLServers
Neural analysis suggests this role is
optimal for Senior candidates.
“Sr Hardware Development Engineer, High Performance AI & ML Servers at Amazon Data Services, Inc.”
Industry & Context.
Full Job Description
Do you want to shape the future of AI? Join the team building the foundation of the world’s most advanced cloud for AI training and inference — where multi-billion-parameter models come to life at scale. Here, you’ll design, deliver, and operate next-generation infrastructure that powers breakthrough innovation in AI/ML and HPC workloads. If you’re passionate about pushing the limits of performance, efficiency, and scalability in the cloud, this is your opportunity to build the systems that define what’s next for AWS — and for the entire AI industry. You’ll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers, and other vital roles. You’ll collaborate with people across AWS to help us deliver the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers. And you’ll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion. Key job responsibilities - Lead technical solutions for complex high performance server and/or accelerator server and rack system architectural challenges - Own end-to-end system reliability, proactively identifying and resolving deficiencies before customer impact - Design and implement solutions to address system-level issues at large scale - Decompose complex server system problems (testability, reliability, diagnostics) into deliverable tasks and features - Apply expertise across hardware, software, system design, x86 architecture, processes, and operations - Collaborate with hardware, software, manufacturing, supply chain and product management teams - Develop and implement diagnostic tools and monitoring solutions for production systems - Debug complex system failures in time sensitive settings A day in the life Your day to day responsibilities will include interfacing with our internal and external customers to understand project requirements and faci
Applying for this Sr Hardware Development Engineer, High Performance AI & ML Servers role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
ANONYMOUS · UNFILTERED
What do employees actually say about Amazon Data Services, Inc.?
Real rants from real employees. Read before you apply.