Nscale

Technology

Senior Infrastructure Support Engineer

$100–150k United States

The Brief

“Senior Infrastructure Support Engineer at Nscale. Skills: Infrastructure Support, Networking, Automation, Troubleshooting. Contribute to SLOs and post-incident reviews.”

Industry & Context.

Technology

Problems you'll solve

Troubleshooting; Root cause analysis; Diagnose failures

What They're Looking For.

Must Have

Networking fundamentals: solid grasp of L2/L3, routing, BGP, VLANs, VXLAN, firewalls, and load balancing; Scripting or software skills in Bash, Python, or JavaScript; Experience with Infrastructure Automation tools; Familiarity with virtualisation technologies; Investigating issues; Performing deep dive investigation; Performing root cause analysis

Nice to Have

OpenStack operations experience preferred, Automated Network Configuration experience, Experience automating network deployment configurations, Making safe, repeatable changes, GPU HPC concepts, RDMA/InfiniBand experience, Performance tuning for distributed workloads, Support scheduling for large multi-GPU jobs, Containers via Pyxis/Enroot, MPI experience, Diagnose queue failures, Diagnose topology failures, Diagnose job failures, GitOps tooling experience, Cluster/app automation pipelines experience, Build CI/CD pipelines, Maintain CI/CD pipelines, Re-architecting old scripts to use GitHub Actions

What You'll Do.

Contribute to post-incident reviews

Reduce human intervention

Use scripts for tooling

Use small tools for automation

Use software for operational tooling

Use software for integrations

Administer cloud infrastructure

Troubleshoot cloud infrastructure

Use virtualisation technologies

Perform deep dive investigation

Perform root cause analysis

Automate network deployment configurations

Support scheduling for large multi-GPU jobs

Support containers via Pyxis/Enroot

Diagnose queue failures

Diagnose topology failures

Diagnose job failures

Maintain CI/CD pipelines

Re-architect old scripts

How You'll Work.

Team & Collaboration

Cross-functional teams; Agile teams

Process & Methodology

Agile

Full Job Description

About Nscale

Nscale is the GPU cloud engineered for AI. We provide cost-effective, high-performance infrastructure for AI start-ups and large enterprise customers. Nscale enables AI-focused companies to achieve superior results by reducing the complexity of AI development. Our GPU cloud bolsters technical capabilities and directly supports strategic business outcomes, including cost management, rapid innovation, and environmental responsibility.

At Nscale, our Support and Operations team plays a critical role in maintaining service availability, driving service reliability, and responding rapidly to customer tickets.

We thrive on a culture of relentless innovation, ownership, and accountability, where every team member takes pride in their work and drives it with excellence and urgency. As an Nscaler, you’ll build trust through openness and transparency, where everyone is inspired to do their best work. If you join our team, you’ll be contributing to building the technology that powers the future.

About The Role

We’re looking for an Engineer that has good people, leadership contribute to SLOs and post‑incident reviews.

Strong Networking fundamentals. Solid grasp of L2/L3, routing, BGP, VLANs, VXLAN, firewalls, load balancing. Understanding of high‑performance fabrics (RDMA/NVLink basics) for cluster‑to‑cluster traffic.

SRE‑style operations. Write and maintain runbooks, automate diagnostics, and reduce human intervention using scripts or small tools.

Automation and Git. Scripting or software skills in Bash, Python, or JavaScript (or equivalent) for operational tooling and integrations, and experience with Infrastructure Automation tools (Ansible, Puppet, Terraform, Chef).

Cloud Infrastructure Administration and Troubleshooting. Strong familiarity with virtualisation technologies and with investigating issues that arise, including deep dive investigation and root cause analysis. OpenStack operations experience preferred.

Nice to Have: Automated Network Configuratio
