Amazon Web Services, Inc.

Technology

MachineLearningEngineer

$129–224k Mountain View, California, United States FULL TIME

Market Sentiment

HIGH DEMAND

Neural analysis suggests this role is
optimal for Mid+ candidates.

The Brief

“Machine Learning Engineer at Amazon Web Services, Inc.. Skills: Machine Learning, Large Language Models, Generative AI, AWS. Design distributed training pipelines. Implement distributed training pipelines”

What You'll Achieve.

Deploy customized generative AI applications; Enhance performance and efficiency; Deliver next-generation AI solutions

Industry & Context.

Technology

Problems you'll solve

Optimize models for performance; Optimize models for efficiency

What They're Looking For.

Must Have

Bachelor's degree in Computer Science, 2+ years professional software development, 2+ years system design or architecture, Hands-on deep learning experience, Hands-on machine learning experience, Hands-on generative AI technology

Nice to Have

Experience training ML systems, Experience deploying ML systems, 2+ years full SDLC experience, Experience with ML library or framework

What You'll Do.

Design distributed training pipelines

Implement distributed training pipelines

Adapt LLMs for new languages

Adapt LLMs for domains

Adapt LLMs for vision applications

Optimize AI models for deployment

Develop custom kernels

Interact with enterprise customers

Interact with foundational model providers

Co-develop tailored generative AI solutions

How You'll Work.

Team & Collaboration

Multidisciplinary team; Customer collaboration; Foundational model providers

Full Job Description

The Generative AI Innovation Center at AWS empowers customers to harness state of the art AI technologies for transformative business opportunities. Our multidisciplinary team of strategists, scientists, engineers, and architects collaborates with customers across industries to fine-tune and deploy customized generative AI applications at scale. Additionally, we work closely with foundational model providers to optimize AI models for Amazon Silicon, enhancing performance and efficiency. As an SDE on our team, you will drive the development of custom Large Language Models (LLMs) across languages, domains, and modalities. You will be responsible for fine-tuning state-of-the-art LLMs for diverse use cases while optimizing models for high-performance deployment on AWS’s custom AI accelerators. This role offers an opportunity to innovate at the forefront of AI, tackling end-to-end LLM training pipelines at massive scale and delivering next-generation AI solutions for top AWS clients. Key job responsibilities - Large-Scale Training Pipelines: Design and implement distributed training pipelines for LLMs using tools such as Fully Sharded Data Parallel (FSDP) and DeepSpeed, ensuring scalability and efficiency - LLM Customization & Fine-Tuning: Adapt LLMs for new languages, domains, and vision applications through continued pre-training, fine-tuning, and Reinforcement Learning with Human Feedback (RLHF) - Model Optimization on AWS Silicon: Optimize AI models for deployment on AWS Inferentia and Trainium, leveraging the AWS Neuron SDK and developing custom kernels for enhanced performance - Customer Collaboration: Interact with enterprise customers and foundational model providers to understand their business and technical challenges, co-developing tailored generative AI solutions About the team ABOUT AWS: Diverse Experiences Amazon values diverse experiences. Even if you do not meet all of the preferred qualifications and skills listed in the job description, we encourage c

Free ATS check

Applying for this Machine Learning Engineer role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

ANONYMOUS · UNFILTERED

What do employees actually say about Amazon Web Services, Inc.?

Real rants from real employees. Read before you apply.

Read Company Rants →