Amazon.com Services LLC
Technology
Software Dev Engineer II, Stores Foundational AI - SFAI
Neural analysis suggests this role is optimal for mid-level candidates.
“Software Dev Engineer II, Stores Foundational AI -SFAI at Amazon.com Services LLC. Skills: Generative AI, Large language models, Training infrastructure, Reinforcement learning. Develop generative AI for shopping. Design training system”
Industry & Context.
Eliminate bottlenecks; Troubleshooting
What They're Looking For.
Must Have
3+ years of software development experience, 2+ years of system design experience, Experience programming in at least one language, Knowledge of ML fundamentals, Knowledge of LLM fundamentals
Nice to Have
Knowledge of ML frameworks, Knowledge of system performance, Knowledge of memory management, Knowledge of parallel computing
What You'll Do.
Develop generative AI for shopping
Design training system
Implement training system
Collaborate to improve training efficiency
Collaborate to improve training reliability
Design data infrastructure
Implement data infrastructure
Handle data ingestion
Learn state-of-the-art technologies
Adopt state-of-the-art algorithms
Build RL post-training pipelines
Improve RL training stability
Optimize RL post-training efficiency
Eliminate bottlenecks
Build observability systems
Track training dynamics
Track experiment progress
Unblock research progress
Contribute to system design
Contribute to technical roadmap
How You'll Work.
Team & Collaboration
Applied scientists; Engineers; Research scientists; Cross-functionally
Process & Methodology
Roadmap planning
Full Job Description
We’re working to improve shopping on Amazon using the capabilities of large language models (LLMs), and are searching for pioneers who are passionate about technology, innovation, and customer experience and are ready to make a lasting impact on the industry. You'll be working with talented scientists and engineers to innovate on behalf of our customers. If you're fired up about being part of a dynamic, driven team, then this is your moment to join us on this exciting journey!
Key job responsibilities
In this role you will leverage both your engineering and machine learning background to help develop generative AI for shopping. On a day-to-day basis, you will:
- Design and implement a stable and efficient training system for model training and reinforcement learning that scales to various model sizes and architectures.
- Collaborate with other talented applied scientists and engineers to improve training efficiency and reliability, accelerating innovation.
- Design and implement scalable data infrastructure that handles Amazon-scale data ingestion, processing, and delivery across different training and evaluation stages.
- Quickly learn and adopt state-of-the-art technologies and algorithms in the field of generative AI.
A day in the life
On any given day, you may:
- Design and build end-to-end RL post-training pipelines (rollout → reward → optimization) at cluster scale
- Improve RL training stability (PPO / GRPO / RLOO) by monitoring and tuning key metrics such as reward, KL divergence, and policy stability
- Optimize RL post-training efficiency (GPU utilization, batching, sequence packing, async rollouts)
- Partner with research scientists to translate new RL algorithms into scalable, production-ready systems
- Profile and eliminate bottlenecks across compute, networking, and storage
- Build observability systems for training dynamics, system health, and experiment tracking
- Collaborate cross-functionally to run experiments, iterat