Nvidia
Infrastructure
SeniorSoftwareEngineer,DataCenterWorkloads
Neural analysis suggests this role is
optimal for Senior candidates.
“Senior Software Engineer, Data Center Workloads at Nvidia. Skills: software-driven characterization workflows, AI workloads, rack-scale systems, power optimization, performance optimization, drive behavior analysis, system-level analysis, automation framework development, data collection, telemetry, validation, correlation, analysis, cross-stack debugging, system bring-up, Python, C/C++, Linux. Develop and run software tools, automation, and workloads to characterize power, performance, and driv”
Industry & Context.
debugging and problem-solving skills; ability to work across multiple engineering disciplines; identify bottlenecks, anomalies, and optimization opportunities
What They're Looking For.
Must Have
B. Sc. or M. Sc. in Computer Science, Electrical Engineering, or a related field, 5+ years of software engineering experience, preferably in system software, infrastructure, validation, or performance-focused environments, programming skills in Python and at least one system-level language such as C/C++, Experience developing automation and test infrastructure for complex hardware/software systems, Hands-on experience running, debugging, or optimizing AI, HPC, or large-scale system workloads, Good understanding of system-level architecture, including interactions across hardware, firmware, drivers, operating systems, and application layers, Experience working in Linux environments and with scripting, telemetry, logging, and data analysis tools, debugging and problem-solving skills, with the ability to work across multiple engineering disciplines, Good communication skills and the ability to drive technical work in a fast-paced, cross-functional environment
Nice to Have
Experience with NVIDIA platforms, GPU systems, or rack-scale AI infrastructure, Background in power, thermal, performance, or storage/drive characterization, Experience with workload automation, cluster orchestration, or lab infrastructure, Familiarity with AI benchmarks, training/inference workloads, and system stress methodologies, Experience in post-silicon validation, production testing, or system bring-up
What You'll Do.
Develop and run software tools
and workloads to characterize power
and drive behavior across NVIDIA rack-scale systems
Execute AI and system-level workloads to stress and evaluate behavior across the stack
Build automated frameworks for data collection
and analysis of characterization results
Investigate system behavior under different workloads and operating conditions to identify bottlenecks
and optimization opportunities
and readiness activities for new rack-scale platforms and AI infrastructure
How You'll Work.
Team & Collaboration
Work closely with hardware, firmware, driver, system software, performance, and validation teams to define characterization methodologies and debug cross-stack issues
Communication Scope
Good communication skills; ability to drive technical work
Full Job Description
At NVIDIA, we are pioneers in innovation, transforming computer graphics, PC gaming, and accelerated computing for over 25 years. Our team is driven by powerful technology and outstanding people who expand the limits of what’s achievable. Now, we are unlocking the potential of AI to usher in the next era of computing. As part of our engineering organization, you will play a key hands-on role in developing and executing software-driven characterization workflows on NVIDIA rack-scale systems. This role is focused on running AI workloads across the full stack to analyze, characterize, and optimize power, performance, and drive behavior at system level. This is an opportunity to work at the intersection of software, infrastructure, silicon, and large-scale AI platforms, with direct impact on next-generation NVIDIA systems. **What you’ll be doing:** * Develop and run software tools, automation, and workloads to characterize power, performance, and drive behavior across NVIDIA rack-scale systems. * Execute AI and system-level workloads to stress and evaluate behavior across the stack, including GPUs, CPUs, networking, storage, firmware, drivers, and system software. * Build automated frameworks for data collection, telemetry, validation, correlation, and analysis of characterization results. * Investigate system behavior under different workloads and operating conditions to identify bottlenecks, anomalies, and optimization opportunities. * Work closely with hardware, firmware, driver, system software, performance, and validation teams to define characterization methodologies and debug cross-stack issues. * Support bring-up, validation, and readiness activities for new rack-scale platforms and AI infrastructure. * Create clear documentation, test flows, and repeatable processes to improve coverage, efficiency, and reproducibility. **What we need to see:** * B.Sc. or M.Sc. in Computer Science, Electrical Engineering, or a related field. * 5+ years of software engineering ex
Applying for this Senior Software Engineer, Data Center Workloads role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Workday
- Workday has a multi-step form — save your progress after every section.
- "Apply With LinkedIn" can fail or lose data; manual entry is more reliable.
- Watch for the "Submit for Review" final step — hitting "Save" alone does not submit.
- Job requisition numbers are useful when following up with HR by email.
ANONYMOUS · UNFILTERED
What do employees actually say about Nvidia?
Real rants from real employees. Read before you apply.