NVIDIA

CapacityOperationsandAnalyticsManager

$168–270k Santa Clara, California, United States FULL TIME
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is
optimal for Senior candidates.

The Brief

“Capacity Operations and Analytics Manager at NVIDIA. Skills: GPU capacity management, Data analytics, Resource planning, Cloud operations. Manage and optimize GPU capacity. Optimize compute resources across cloud providers”

What You'll Achieve.

meet growing demands; ensure efficient utilization; support NVIDIA Infrastructure governance programs; support strategic capacity decisions; improve resource usage and automation; inform strategic capacity decisions; align cloud capacity management with company goals; match Customer satisfaction

Industry & Context.

Problems you'll solve

Identify performance bottlenecks; resolve them; extract useful signals and insights

What They're Looking For.

Must Have

10+ years of overall experience in cloud computing, managing or sourcing GPU capacity with cloud service providers, technical proficiency in cloud architecture, development and deployment, managing large data sets, Deep understanding of cloud service models (IaaS, PaaS, SaaS), cloud infrastructure technologies, Experience with Cloud Service Providers such as AWS, Azure, GCP, and OCI, Demonstrated experience in employing AI tools and techniques to extract useful signals and insights from data, understanding and practical application of statistical modeling and machine learning methodologies, Proficiency with data analytics, visualization, and monitoring tools such as Kibana, Grafana, Splunk, Prometheus, Tableau, Plotly, Knowledge of analytics, statistical modeling, and machine learning methodologies, Ability to operate effectively amidst uncertainty and rapidly changing business conditions, agile mindset, commitment to ongoing improvement

Nice to Have

A proven track record of large-scale computing operations and planning is a plus

What You'll Do.

Manage and optimize GPU capacity

Optimize compute resources across cloud providers

Build and maintain data models

Develop reporting systems and dashboards

Analyze technical and business needs for GPU capacity

Identify performance bottlenecks

Resolve performance bottlenecks

Drive infrastructure resource efficiency initiatives

Develop and enhance tooling for cloud infrastructure

Optimize resource usage and performance

Leverage AI techniques for insights

Align cloud capacity management with company goals

Develop Infrastructure and Service Level KPIs

Lead multi-year budget-based compute resource planning

How You'll Work.

Team & Collaboration

collaborate with relevant infrastructure teams; Partner and cross-collaborate with Finance, Product, Service Owners, and Infrastructure Engineering teams

Full Job Description

Our technology has no boundaries! NVIDIA is building the world’s most groundbreaking and pioneering computing platforms. Because of our work, scientists, researchers, and engineers can advance their ideas. At its core, our visual computing technology not only enables an outstanding computing experience but it is also energy efficient! We pioneered a supercharged form of computing loved by the most fast-paced computer users in the world - scientists, designers, artists, and gamers. It’s not just technology, though! It is our people, some of the brightest in the world, and our company makes NVIDIA one of the most fun, innovative, and dynamic places to work! At the center of NVIDIA are our core values, like innovation, excellence, determination, and team, that guide us to be the best we can be. We are looking for a Capacity Operations Manager to help lead efforts with large-scale computing operations and planning. **What you will be doing:** * Manage and optimize GPU capacity and other compute resources across various cloud service providers to meet growing demands and ensure efficient utilization. * Build, develop, and maintain data models, reporting systems, data automation systems, dashboards, and performance metrics that support NVIDIA Infrastructure governance programs and strategic capacity decisions. * Analyze the technical and business needs for GPU capacity and other compute resources from various internal and external teams. * Identify performance bottlenecks in day-to-day usage of compute resources and collaborate with relevant infrastructure teams to resolve them. * Drive infrastructure resource efficiency initiatives in partnership with engineering, finance, and product teams. * Develop and enhance tooling for our cloud infrastructure and analytics platform to optimize resource usage and performance for NVIDIA and its customers. This includes crafting and developing tools for automating workflows and potentially leveraging AI techniques to extract useful s

Free ATS check

Applying for this Capacity Operations and Analytics Manager role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

How to Apply on Workday

  • Workday has a multi-step form — save your progress after every section.
  • "Apply With LinkedIn" can fail or lose data; manual entry is more reliable.
  • Watch for the "Submit for Review" final step — hitting "Save" alone does not submit.
  • Job requisition numbers are useful when following up with HR by email.

ANONYMOUS · UNFILTERED

What do employees actually say about NVIDIA?

Real rants from real employees. Read before you apply.

Read Company Rants →