Annapurna Labs (U. S. ) Inc.

Software Development, Cloud Computing

SeniorSoftwareEngineer-AI/ML,AWSNeuronInference

$100–227k Seattle, Washington, United States FULL TIME

Market Sentiment

HIGH DEMAND

Neural analysis suggests this role is
optimal for Senior candidates.

The Brief

“Senior Software Engineer - AI/ML, AWS Neuron Inference at Annapurna Labs (U. S. ) Inc.. Skills: AI/ML, LLM Inference, AWS Neuron. Develop core building blocks of LLM Inference. Optimize core building blocks of LLM Inference”

Industry & Context.

Software Development, Cloud Computing

Problems you'll solve

Performance optimization; Accuracy optimization

What They're Looking For.

Must Have

5+ years full software development life cycle, Bachelor's degree in computer science, 5+ years programming Java, C++, or C#, Object-oriented design experience, Fundamentals of Machine learning models, Model architecture, training, inference lifecycles, Work experience on model performance optimizations

Nice to Have

Master's degree in computer science, Hands-on experience with PyTorch or Jax, Developing and deploying LLMs in production on GPUs, Neuron, TPU or other AI acceleration hardware

What You'll Do.

Develop core building blocks of LLM Inference

Optimize core building blocks of LLM Inference

Adapt latest research in LLM optimization

Extract best performance from models

How You'll Work.

Team & Collaboration

Work across teams; Work across organizations; Work side by side with chip architects; Work side by side with compiler engineers; Work side by side with runtime engineers

Full Job Description

AWS Neuron is the complete software stack for the AWS Inferentia and Trainium cloud-scale machine learning accelerators. This role is for a senior software engineer in the Machine Learning Inference Applications team. This role is responsible for development and performance optimization of core building blocks of LLM Inference - Attention, MLP, Quantization, Speculative Decoding, Mixture of Experts, etc. The team works side by side with chip architects, compiler engineers and runtime engineers to deliver performance and accuracy on Neuron devices across a range of models. Key job responsibilities Responsibilities of this role include adapting latest research in LLM optimization to Neuron chips to extract best performance from both open source as well as internally developed models. Working across teams and organizations is key. About the team Our team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. We care about your career growth and strive to assign projects that help our team members develop your engineering expertise so you feel empowered to take on more complex tasks in the future. Basic Qualifications: - 5+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience - Bachelor's degree in computer science or equivalent - 5+ years of programming using a modern programming language such as Java, C++, or C#, including object-oriented design experience - Fundamentals of Machine learning models, their architecture, training and inference lifecycles along with work experience on some optimizations for improving the model performance. Preferred Qualifications: - Master's degree in computer science or equivalent - Hands-on experience with PyTorch or Ja

Free ATS check

Applying for this Senior Software Engineer - AI/ML, AWS Neuron Inference role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

ANONYMOUS · UNFILTERED

What do employees actually say about Annapurna Labs (U. S. ) Inc.?

Real rants from real employees. Read before you apply.

Read Company Rants →