Annapurna Labs Ltd.

Technology

Senior Software Development Engineer (AWS ML), Machine Learning Israel (MLIL), FLOW sub-team (Fleet Lifecycle & Operational Workflows)

$450–650k (AI estimate) · Tel Aviv-Yafo, Tel Aviv, Israel · Full time
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is optimal for Senior candidates.

The Brief

“Senior Software Development Engineer (AWS ML), Machine Learning Israel (MLIL), FLOW sub-team (Fleet Lifecycle & Operational Workflows) at Annapurna Labs Ltd. Skills: ML accelerator servers, hardware validation, fleet-scale operations. Lead architecture and implementation of hardware validation and diagnostic software. Drive technical direction for PCIe validation.”

Industry & Context.

Technology
Problems you'll solve

Debugging; Root-cause analysis

What They're Looking For.

Must Have

Bachelor's degree or above in Computer Science, Computer Engineering, Electrical Engineering, or related fields; at least 8 years of professional software development experience

Nice to Have

Experience with hardware bring-up, ASIC/FPGA validation, or manufacturing test development; proficiency in scripting languages (Python, Lua) for test automation and data analysis; track record of cross-team influence and delivering results through others; experience building data pipelines, ETL systems, or fleet-scale monitoring/dashboarding; demonstrated project-management experience leading multiple R&D initiatives in parallel; experience with PCIe or high-speed interconnect validation and debugging

What You'll Do.

Lead architecture and implementation of hardware validation and diagnostic software

Drive technical direction for PCIe validation

Drive technical direction for power/thermal diagnostics

Drive technical direction for stress-testing frameworks

Own subsystems end-to-end

Work with Hardware teams

Work with Manufacturing teams

Create coordinated software packages

Debug and root-cause complex hardware/software interaction failures

Drive root-cause to closure

Build and maintain data pipelines and dashboards

Build and maintain monitoring systems

Define best practices

Raise the bar for the team

Lead multiple development initiatives

Balance schedule, risk, and technical quality

How You'll Work.

Team & Collaboration

Cross-team influence; Cross-team collaboration sessions; Coordinate software delivery timelines

Process & Methodology

Project management, R&D initiatives

Full Job Description

Annapurna Labs designs silicon and software that accelerates innovation. Our custom chips, accelerators, and software stacks enable us to take on technical challenges that have never been seen before, and deliver results that help our customers change the world.

The MLIL FLOW team is looking for a Senior Software Development Engineer to lead the design and delivery of systems software for our next-generation ML accelerator servers. We build production software to validate, initialize, monitor, and qualify these servers, from first silicon through fleet-scale deployment. We work on the physical systems that execute ML workloads: server bring-up, hardware diagnostics, interconnect validation, power/thermal monitoring, and fleet-scale operations are our bread and butter.

Key job responsibilities

• Lead the architecture and implementation of hardware validation and diagnostic software for new ML acceleration platforms.
• Drive technical direction for PCIe validation, power/thermal diagnostics, and stress-testing frameworks that run across manufacturing, vetting, and production environments.
• Own subsystems end-to-end: from design through implementation, testing, deployment, and operational excellence at fleet scale.
• Work with Hardware, Manufacturing, and EC2 teams to create coordinated software packages that enable both qualification and rapid deployment.
• Debug and root-cause complex hardware/software interaction failures on first silicon and production fleet returns; drive root-cause to closure.
• Build and maintain data pipelines, dashboards, and monitoring systems for fleet health and performance benchmarking.
• Mentor engineers, define best practices, drive design reviews, and raise the bar for the team.
• Lead multiple development initiatives in parallel, balancing schedule, risk, and technical quality across a fast-moving hardware program.

A day in the life

You'll start your day reviewing telemetry data from overnight fleet validation runs, identifying patterns
