Nscale
Technology
Senior Infrastructure Support Engineer
“Senior Infrastructure Support Engineer at Nscale. Skills: Infrastructure Support, Networking, Automation, Troubleshooting. Contribute to SLOs. Contribute to post-incident reviews”
Industry & Context.
Troubleshooting; Root cause analysis; Diagnose failures
What They're Looking For.
Must Have
Networking fundamentals, Solid grasp of L2/L3, Solid grasp of routing, Solid grasp of BGP, Solid grasp of VLANs, Solid grasp of VXLAN, Solid grasp of firewalls, Solid grasp of load balancing, Scripting or software skills in Bash, Scripting or software skills in Python, Scripting or software skills in JavaScript, Experience with Infrastructure Automation tools, Familiarity with virtualisation technologies, Investigating issues, Performing deep dive investigation, Perform root cause analysis
Nice to Have
OpenStack operations experience preferred, Automated Network Configuration experience, Experience automating network deployment configurations, Making safe, repeatable changes, GPU HPC concepts, RDMA/InfiniBand experience, Performance tuning for distributed workloads, Support scheduling for large multi-GPU jobs, Containers via Pyxis/Enroot, MPI experience, Diagnose queue failures, Diagnose topology failures, Diagnose job failures, GitOps tooling experience, Cluster/app automation pipelines experience, Build CI/CD pipelines, Maintain CI/CD pipelines, Re-architecting old scripts to use GitHub Actions
What You'll Do.
Contribute to post-incident reviews
Reduce human intervention
Use scripts for tooling
Use small tools for automation
Use software for operational tooling
Use software for integrations
Administer cloud infrastructure
Troubleshoot cloud infrastructure
Use virtualisation technologies
Perform deep dive investigation
Perform root cause analysis
Automate network deployment configurations
Support scheduling for large multi-GPU jobs
Support containers via Pyxis/Enroot
Diagnose queue failures
Diagnose topology failures
Diagnose job failures
Maintain CI/CD pipelines
Re-architect old scripts
How You'll Work.
Team & Collaboration
Cross-functional teams; Agile teams
Process & Methodology
Agile
Full Job Description
About Nscale
Nscale is the GPU cloud engineered for AI. We provide cost-effective, high-performance infrastructure for AI start-ups and large enterprise customers. Nscale enables AI-focused companies to achieve superior results by reducing the complexity of AI development. Our GPU cloud bolsters technical capabilities and directly supports strategic business outcomes, including cost management, rapid innovation, and environmental responsibility.
At Nscale, our Support and Operations team plays a critical role in maintaining service availability, driving service reliability and rapid response to customer tickets. We thrive on a culture of relentless innovation, ownership, and accountability, where every team member takes pride in their work and drives it with excellence and urgency. As an Nscaler, you’ll build trust through openness and transparency, where everyone is inspired to do their best work. If you join our team, you’ll be contributing to building the technology that powers the future.
About The Role
We’re looking for an Engineer that has good people, leadership contribute to SLOs and post‑incident reviews.
- Strong Networking fundamentals. Solid grasp of L2/L3, routing, BGP, VLANs, VXLAN, firewalls, load balancing. Understanding of high‑performance fabrics (RDMA/NVLink basics) for cluster‑to‑cluster traffic.
- SRE‑style operations. Write and maintain runbooks, automate diagnostics, and reduce human intervention using scripts or small tools.
- Automation and Git. Scripting or software skills in Bash, Python, or JavaScript (or equivalent) for operational tooling and integrations, and experience with Infrastructure Automation tools (Ansible, Puppet, Terraform, Chef).
- Cloud Infrastructure Administration and Troubleshooting. Strong familiarity with using virtualisation technologies, and investigating issues that arise, performing deep dive investigation to perform root cause analysis. OpenStack operations experience preferred.
Nice to Have: Automated Network Configuratio
How to Apply on Greenhouse
- Create a Greenhouse profile before applying — it saves time across multiple applications.
- Upload your resume as a PDF; the parser handles it better than Word.
- Answer all knockout questions carefully — wrong answers auto-reject before a human sees you.
- Enable email notifications to track application status in real time.