Nscale
Technology
InfrastructureSupportEngineer
Neural analysis suggests this role is
optimal for Mid+ candidates.
“Infrastructure Support Engineer at Nscale. Skills: Infrastructure Support, Cloud, AI Infrastructure, GPU Cloud. Join support duty rotation. Handle day-to-day tickets”
What You'll Achieve.
Achieve superior results; Reduce complexity; Manage costs; Drive rapid innovation; Environmental responsibility; Maintain service availability; Drive service reliability; Rapid response to tickets; Get value from services; Optimize processes
Industry & Context.
Problem solving; Troubleshooting; Root cause analysis
On-call rotation, Out-of-hours work, Availability to travel, Supplier training courses
What They're Looking For.
Must Have
Comfortable with the CLI, Comfortable with services via systemd, Comfortable with filesystems, Comfortable with permissions, Comfortable with basic networking tools, Able to troubleshoot common issues, Know when to escalate, Solid grasp of IP addressing, Solid grasp of subnets, Solid grasp of VLANs, Solid grasp of routing at high level, Solid grasp of DNS, Solid grasp of firewalls, Understand core Kubernetes concepts, Perform basic Kubernetes troubleshooting, Follow Kubernetes runbooks, Familiar with basic GPU diagnostics, Able to use dashboards, Able to use alerts, Identify symptoms, Gather evidence, Follow runbooks, Comfortable reading simple Bash snippets, Comfortable writing simple Bash snippets, Comfortable reading simple Python snippets, Comfortable writing simple Python snippets, Use Git for version control, Familiarity with cloud troubleshooting flows, Familiarity with hypervisor troubleshooting flows
Nice to Have
Advanced networking topics a plus, BGP a plus, VXLAN a plus, Cluster-level administration experience a nice to have, Experience with Ansible beneficial, Experience with Terraform beneficial, OpenStack experience a plus, Hands-on exposure to Kubernetes administration, Hands-on exposure to Kubernetes operators, Hands-on exposure to Kubernetes storage add-ons, Hands-on exposure to Kubernetes networking add-ons, Deeper GPU/HPC concepts, RDMA/InfiniBand awareness, Performant distributed workload basics awareness, Job schedulers awareness, Used NCCL for performance troubleshooting, Infrastructure as Code tools, Config management tools, GitOps participation, CI/CD participation, Modernising scripts using GitHub Actions, Experience with Teleport, Experience with Vault
What You'll Do.
Join support duty rotation
Handle day-to-day tickets
Escalate early and appropriately
Collaborate with Engineering
Keep parties informed
Follow established runbooks
Resolve common issues
Contribute incremental fixes
Keep tickets up to date
Document customer communications
Learn platform fundamentals
Help customers get value
Participate in monitoring
Participate in troubleshooting
Participate in triage
Enable efficient handover
Deliver assigned tasks
Seek help when needed
Document validated steps
Contribute to training materials
Take part in incident reviews
Track preventative follow-ups
Identify areas for automation
Collaborate with cross-functional teams
Be escalation point for onsite staff
Participate in on-call work
Participate in out-of-hours work
Assist with deployments
Assist with troubleshooting
Assist with operational tasks
Attend supplier training courses
How You'll Work.
Team & Collaboration
Collaborate with Engineering; Collaborate with cross-functional teams; Escalation point for onsite staff
Communication Scope
Customer communications
Process & Methodology
Project work
Full Job Description
About Nscale Nscale is the GPU cloud engineered for AI. We provide cost-effective, high-performance infrastructure for AI start-ups and large enterprise customers. Nscale enables AI-focused companies to achieve superior results by reducing the complexity of AI development. Our GPU cloud bolsters technical capabilities and directly supports strategic business outcomes, including cost management, rapid innovation, and environmental responsibility. At Nscale, our Support and Operations team plays a critical role in maintaining service availability, driving service reliability and rapid response to customer tickets We thrive on a culture of relentless innovation, ownership, and accountability, where every team member takes pride in their work and drives it with excellence and urgency. As an Nscaler, you’ll build trust through openness and transparency, where everyone is inspired to do their best work. If you join our team, you’ll be contributing to building the technology that powers the future. About The Role We’re looking for an Engineer that has good people, leadership & technical skills. A technical expert responsible for ensuring the efficiency, reliability, and scalability of data centre infrastructure. You're comfortable problem solving & making decisions on complex topics with high levels of ambiguity in a results driven environment. You’re comfortable influencing without authority and exceptional at building relationships with senior stakeholders across the business to get things done. You have the understanding and skillset to grasp technical concepts and problems quickly You have strong analytical skills You’re a doer who is extremely organised and diligent You’re a self starter, curious, and quick to learn, knowing what questions to ask to get up to speed quickly What You'll Be Doing Join the Support duty rotation and handle day‑to‑day tickets and alerts, escalating early and appropriately. Collaborate with Engineering with guidance when incidents or changes
Applying for this Infrastructure Support Engineer role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Greenhouse
- Create a Greenhouse profile before applying — it saves time across multiple applications.
- Upload your resume as a PDF; the parser handles it better than Word.
- Answer all knockout questions carefully — wrong answers auto-reject before a human sees you.
- Enable email notifications to track application status in real time.
ANONYMOUS · UNFILTERED
What do employees actually say about Nscale?
Real rants from real employees. Read before you apply.