Nscale
Technology
SeniorInfrastructureSupportEngineer
Neural analysis suggests this role is
optimal for Senior candidates.
“Senior Infrastructure Support Engineer at Nscale. Skills: Infrastructure Support, SRE, Automation, Cloud. Contribute to SLOs. Contribute to post-incident reviews”
What You'll Achieve.
Maintaining service availability; Driving service reliability; Rapid response to customer tickets; Achieve superior results; Reducing complexity; Cost management; Rapid innovation; Environmental responsibility
Industry & Context.
Root cause analysis; Troubleshooting; Diagnose failures
What They're Looking For.
Must Have
Networking fundamentals, L2/L3 routing, BGP, VLANs, VXLAN, Firewalls, Load balancing, Scripting or software skills in Bash, Python, or JavaScript, Infrastructure Automation tools experience, Virtualisation technologies familiarity, Investigating issues, Deep dive investigation, Root cause analysis
Nice to Have
Openstack operations experience, Automated Network Configuration, Automating network deployment configurations, GPU HPC concepts, RDMA/InfiniBand experience, Performance tuning for distributed workloads, Support scheduling for large multi-GPU jobs, Containers via Pyxis/Enroot, MPI, Diagnose queue failures, Diagnose topology failures, Diagnose job failures, GitOps tooling, Cluster automation pipelines, App automation pipelines, Build CICD pipelines, Re-architecting old scripts
What You'll Do.
Contribute to post-incident reviews
Reduce human intervention
Perform deep dive investigation
Perform root cause analysis
Automate network deployment configurations
Make repeatable changes
Diagnose queue failures
Diagnose topology failures
Diagnose job failures
Maintain CICD pipelines
Re-architect old scripts
How You'll Work.
Team & Collaboration
Collaborative environment; Cross-functional teams
Process & Methodology
CICD pipelines
Full Job Description
About Nscale Nscale is the GPU cloud engineered for AI. We provide cost-effective, high-performance infrastructure for AI start-ups and large enterprise customers. Nscale enables AI-focused companies to achieve superior results by reducing the complexity of AI development. Our GPU cloud bolsters technical capabilities and directly supports strategic business outcomes, including cost management, rapid innovation, and environmental responsibility. At Nscale, our Support and Operations team plays a critical role in maintaining service availability, driving service reliability and rapid response to customer tickets We thrive on a culture of relentless innovation, ownership, and accountability, where every team member takes pride in their work and drives it with excellence and urgency. As an Nscaler, you’ll build trust through openness and transparency, where everyone is inspired to do their best work. If you join our team, you’ll be contributing to building the technology that powers the future. About the Role (Job Purpose) We’re looking for an Engineer that has good people, leadership contribute to SLOs and post‑incident reviews. Strong Networking fundamentals. Solid grasp of L2/L3, routing, BGP, VLANs, VXLAN, firewalls, load balancing. Understanding of high‑performance fabrics (RDMA/NVLink basics) for cluster‑to‑cluster traffic. SRE‑style operations. Write and maintain runbooks, automate diagnostics, and reduce human intervention using scripts or small tools. Automation and Git. Scripting or software skills in Bash, Python, or JavaScript (or equivalent) for operational tooling and integrations, and experience with Infrastructure Automation tools (Ansible, Puppet, Terraform, Chef) Cloud Infrastructure Administration and Troubleshooting. Strong familiarity with using virtualisation technologies, and investigating issues that arise, performing deep dive investigation to perform root cause analysis. Openstack operations experience preferred. Nice to Have: Automated Networ
Applying for this Senior Infrastructure Support Engineer role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Greenhouse
- Create a Greenhouse profile before applying — it saves time across multiple applications.
- Upload your resume as a PDF; the parser handles it better than Word.
- Answer all knockout questions carefully — wrong answers auto-reject before a human sees you.
- Enable email notifications to track application status in real time.
ANONYMOUS · UNFILTERED
What do employees actually say about Nscale?
Real rants from real employees. Read before you apply.