NVIDIA

Ethernet Switching

Software Manager, Infra Tools AI Team

Raanana, Israel FULL TIME

The Brief

“Software Manager, Infra Tools AI Team at NVIDIA. Skills: Infrastructure Engineering, Tools Engineering, AI/LLM, SONiC Network OS, Leadership. Own and drive the engineering infrastructure that powers the full product development lifecycle, from development environments and CI pipelines through regression, code coverage, and test efficiency. Apply cutting-edge AI and LLM capabilities to transform how we analyze failures, generate test coverage, and accelerate product quality.”

Industry & Context.

Ethernet Switching

Problems you'll solve

analytical and problem-solving skills; root cause clustering; anomaly detection

What They're Looking For.

Must Have

  • B.Sc. degree or equivalent experience in Engineering/Computer Science/related field
  • 8+ overall years of software engineering experience, with at least 3 years of experience in a leadership role, managing software development teams
  • Proven ability to lead technical teams: hiring, mentoring, technical roadmapping, and cross-team influence
  • Experience developing software testing tools and test infrastructure
  • Python programming experience building production-quality automation frameworks and tooling
  • Demonstrated experience designing and operating CI/CD systems at scale (Jenkins, GitLab CI, GitHub Actions, or equivalent)
  • Hands-on experience with LLMs or AI-assisted developer tooling: building, integrating, or productizing AI capabilities in an engineering workflow
  • Analytical and problem-solving skills with a bias toward measurable outcomes and data-driven decisions

Nice to Have

  • Deep Linux expertise: system internals, networking stack, process management, and scripting
  • Prior experience building LLM-powered test analysis pipelines or AI-enhanced DevOps tooling in a real production environment
  • Knowledge of networking protocols and hardware: Ethernet switching, L2/L3 protocols, QoS, VLANs, high-performance data center networking
  • Experience with code coverage instrumentation in large-scale C/Python codebases and using coverage data for test prioritization
  • Track record of measurably improving regression runtime, test reliability, or CI throughput in a complex embedded or systems software environment

What You'll Do.

  • Own and drive the engineering infrastructure that powers the full product development lifecycle, from development environments and CI pipelines through regression, code coverage, and test efficiency
  • Apply cutting-edge AI and LLM capabilities to transform how we analyze failures, generate test coverage, and accelerate product quality
  • Lead and mentor a team of infrastructure and tooling engineers; set technical direction, define priorities, and grow team capabilities
  • Design, build, and maintain scalable infrastructure for development, integration, and test environments supporting SONiC OS
  • Architect and deliver LLM-based tools for intelligent regression analysis: failure classification, root cause clustering, anomaly detection, and test flakiness prediction
  • Lead efforts to reduce regression runtime through parallelization, smart test selection, and dependency-aware scheduling
  • Develop deep technical knowledge of SONiC Network OS internals, including its subsystem architecture, SAI/ASIC abstraction layer, and management plane

How You'll Work.

Team & Collaboration

cross-team influence

Process & Methodology

define priorities

Full Job Description

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology and amazing people.

We are now seeking a highly motivated Infrastructure, Tools & AI Engineering Manager to join our Ethernet Switching group, working on SONiC Network OS. In this role, you will own and drive the engineering infrastructure that powers the full product development lifecycle, from development environments and CI pipelines through regression, code coverage, and test efficiency. You will apply cutting-edge AI and LLM capabilities to transform how we analyze failures, generate test coverage, and accelerate product quality.

**What you’ll be doing:**

* Lead and mentor a team of infrastructure and tooling engineers; set technical direction, define priorities, and grow team capabilities
* Design, build, and maintain scalable infrastructure for development, integration, and test environments supporting SONiC OS
* Architect and deliver LLM-based tools for intelligent regression analysis: failure classification, root cause clustering, anomaly detection, and test flakiness prediction
* Lead efforts to reduce regression runtime through parallelization, smart test selection, and dependency-aware scheduling
* Develop deep technical knowledge of SONiC Network OS internals, including its subsystem architecture, SAI/ASIC abstraction layer, and management plane

**What we need to see:**

* B.Sc. degree or equivalent experience in Engineering/Computer Science/related field
* 8+ overall years of software engineering experience, with at least 3 years of experience in a leadership role, managing software development teams
* Proven ability to lead technical teams: hiring, mentoring, technical roadmapping, and cross-team influence
* Experience developing software testing tools and test infrastructure
* Strong Python programming skills; experience building production-quality automation framewo


How to Apply on Workday

  • Workday has a multi-step form — save your progress after every section.
  • "Apply With LinkedIn" can fail or lose data; manual entry is more reliable.
  • Watch for the "Submit for Review" final step — hitting "Save" alone does not submit.
  • Job requisition numbers are useful when following up with HR by email.
