Annapurna Labs (U. S. ) Inc.

Technology

NeuronRuntimeSoftwareDevelopmentEngineer,NeuronRuntime

$144–194k Seattle, Washington, United States FULL TIME

Market Sentiment

HIGH DEMAND

Neural analysis suggests this role is
optimal for Mid+ candidates.

The Brief

“Neuron Runtime Software Development Engineer, Neuron Runtime at Annapurna Labs (U. S. ) Inc.. Skills: Runtime libraries, Driver development, AI accelerators. Develop runtime libraries. Maintain runtime libraries”

What You'll Achieve.

Optimize AI workloads; Improve performance

Industry & Context.

Technology

Problems you'll solve

Performance bottlenecks; Troubleshooting

Eligibility Requirements

On-call

What They're Looking For.

Must Have

3+ years software development experience, 2+ years system design or architecture experience, Experience programming at least one language

Nice to Have

3+ years full SDLC experience, Bachelor's degree in computer science

What You'll Do.

Develop runtime libraries

Maintain runtime libraries

Design Neuron Runtime

Develop Neuron Runtime

Deploy Neuron Runtime

Optimize ML frameworks

Manage development life cycle

Generate key information

Support multiple frameworks

Architect distributed systems

Build distributed systems

Operate distributed systems

Own services end-to-end

Define product directions

Deliver products to customers

How You'll Work.

Team & Collaboration

Cross-functional teams; Executive leadership; Senior management; Technical leaders

Communication Scope

Technical communication

Process & Methodology

Full software development life cycle

Full Job Description

AWS Neuron is the complete software stack for the AWS Inferentia and Trainium cloud-scale machine learning accelerators and the servers that use them. As the Software Development Engineer for the Neuron Runtime Team, you will be responsible for working alongside a team of engineers to develop and maintain high-performance runtime libraries and drivers for machine learning applications and AI accelerators. You will work on design, development, and deployment of Neuron Runtime and other Neuron components. The profiler plays a crucial role to internal and external customers in optimizing AI workloads across hardware platforms such as Trainium and Inferentia devices, by providing deep insights into performance bottlenecks and system behavior. Improving performance of ML Kernels and ML Frameworks. In this role, you will manage the full development life cycle of the Neuron Runtime, ensuring scalability, reliability, and usability. You will collaborate with cross-functional teams to ensure that the our C++ compiler generates key information so customers can understand and optimize the performance of our custom hardware. Additionally, you will drive innovations that allow the profiler to support multiple frameworks, such as PyTorch, JAX, and XLA. A successful candidate will have experience in architecting, building, and operating distributed systems with a focus on high availability and fault tolerance, Hands-on experience with AWS services (e.g., EC2, ECS, CloudWatch, S3, Lambda) in production environments and track record in Owning services end-to-end including deployment, monitoring, alarming, on-call, and post-incident review. A day in the life You will work with the executive leadership and other senior management and technical leaders to define product directions and deliver them to customers. We build massive-scale distributed training and inference solutions. This organization builds the full stack of software, servers and chips to accelerate at the highest scale. A

Free ATS check

Applying for this Neuron Runtime Software Development Engineer, Neuron Runtime role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

ANONYMOUS · UNFILTERED

What do employees actually say about Annapurna Labs (U. S. ) Inc.?

Real rants from real employees. Read before you apply.

Read Company Rants →