TensorWave

Operations

TechnicalProgramManager,DataCenterOperations

Las Vegas, Nevada, United States FULL TIME
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is
optimal for Senior candidates.

The Brief

“Technical Program Manager, Data Center Operations at TensorWave. Skills: technical program management, data center operations, cross-functional coordination, operational discipline, risk mitigation, continuous improvement. Own end-to-end program management for data center operations across multiple sites, covering hardware lifecycle, capacity planning, change management, incident response, and operational readiness. Serve as the primary coordination point across facilities, networking, hardware,”

What You'll Achieve.

deliver seamless, secure, reliable, and resilient AI compute at scale; keep our next-generation AI infrastructure running at peak performance; ensure our AMD-powered AI clusters deliver on customer commitments at scale, with zero tolerance for preventable downtime; driving accountability to operational SLAs, program schedules, and customer commitments

Industry & Context.

Operations
Problems you'll solve

Identify, escalate, and mitigate risks to site reliability, capacity availability, and customer-facing uptime before they become incidents

What They're Looking For.

Must Have

5+ years of technical program management experience, at least 3 years directly managing data center operations, infrastructure programs, or critical facilities at scale, Demonstrated experience coordinating across facilities, network, and hardware engineering teams in a live production environment, Familiarity with data center operational systems and infrastructure: power distribution, cooling, structured cabling, and physical layer dependencies, Proven track record managing complex, multi-site operational programs on compressed timelines in a high-growth environment, technical communication skills: able to translate operational status and risk to both field teams and executive stakeholders, Experience with Jira or equivalent program management tooling for milestone tracking, incident management, and cross-team coordination

Nice to Have

Experience with high-density GPU or AI compute deployments and their operational demands, Background managing colocation or multi-tenant infrastructure environments, Familiarity with network infrastructure in data center environments: top-of-rack switching, structured fiber, spine-leaf topology, Experience with observability tooling (Grafana, Prometheus, or equivalent) for operational visibility, Prior experience at a hyperscaler, cloud provider, or high-growth AI infrastructure company, PMP or equivalent project management certification

What You'll Do.

Own end-to-end program management for data center operations across multiple sites

covering hardware lifecycle

and operational readiness

Serve as the primary coordination point across facilities

and software teams: driving accountability to operational SLAs

and customer commitments

Define and track program milestones

critical path dependencies

and resource requirements across concurrent multi-site operational programs

Translate operational status and risk into clear reporting for engineering

and executive leadership audiences

and mitigate risks to site reliability

capacity availability

and customer-facing uptime before they become incidents

Coordinate hardware deployment

and maintenance sequencing across sites in alignment with capacity and customer commitments

and infrastructure engineering teams to drive operational readiness for high-density GPU compute clusters

Own post-incident retrospectives and corrective action tracking

driving lessons learned into durable process improvements

Maintain program documentation including operational runbooks

and change management records

How You'll Work.

Team & Collaboration

Serve as the primary coordination point across facilities, networking, hardware, and software teams; Partner with network, power, facilities, and infrastructure engineering teams

Communication Scope

technical communication skills: able to translate operational status and risk to both field teams and executive stakeholders

Process & Methodology

program management, milestone tracking, incident management, cross-team coordination, critical path dependencies management, resource requirements planning, risk management, change management

Full Job Description

About TensorWave Our mission is simple: deliver seamless, secure, reliable, and resilient AI compute at scale. We've built a versatile cloud platform that eliminates infrastructure barriers, empowering builders to focus on innovation instead of fighting their stack. Because breakthrough AI should move at the speed of ideas, not infrastructure.   About the Role TensorWave is seeking an experienced Technical Program Manager with a strong data center operations background to lead and scale the operational programs that keep our next-generation AI infrastructure running at peak performance. In this role, you’ll own the full lifecycle of data center operations programs: from hardware deployment and capacity management through incident response, change management, and continuous reliability improvement. You’ll be the operational spine connecting facilities engineers, network, hardware, DevOps, and SRE teams, and executive leadership to ensure our AMD-powered AI clusters deliver on customer commitments at scale, with zero tolerance for preventable downtime. This is a high-visibility, high-impact role for someone who brings operational discipline to complex, fast-moving environments and maintains clear communication and structured execution under pressure.   What You’ll Do - Own end-to-end program management for data center operations across multiple sites, covering hardware lifecycle, capacity planning, change management, incident response, and operational readiness - Serve as the primary coordination point across facilities, networking, hardware, and software teams: driving accountability to operational SLAs, program schedules, and customer commitments - Define and track program milestones, critical path dependencies, and resource requirements across concurrent multi-site operational programs - Translate operational status and risk into clear reporting for engineering, product, and executive leadership audiences - Identify, escalate, and mitigate risks to site reliability

Free ATS check

Applying for this Technical Program Manager, Data Center Operations role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

How to Apply on Ashby

  • Ashby is a fast modern ATS — most applications take under 3 minutes.
  • The resume parser is strong; verify parsed experience dates and job titles.
  • Custom screening questions are often scored algorithmically — answer completely.
  • Location field affects geo-based screening; use your actual metro area.

ANONYMOUS · UNFILTERED

What do employees actually say about TensorWave?

Real rants from real employees. Read before you apply.

Read Company Rants →