Nexthink

SaaS

Senior Site Reliability Engineer

€60–85k ~AI est. Madrid, Spain FULL TIME
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is optimal for mid-level candidates.

The Brief

“Senior Site Reliability Engineer at Nexthink. Skills: Site Reliability Engineering, Cloud-native systems, Kubernetes, SaaS platform. Implement and manage cloud-native systems.”

What You'll Achieve.

Enhance ability to deploy systems; Enhance ability to monitor systems; Enhance ability to scale systems; Reduce Mean Time to Detect; Reduce Mean Time to Recovery; Minimize need for escalation

Industry & Context.

SaaS

Problems you'll solve

System-level troubleshooting; Troubleshooting outages; Diagnose complex issues; Resolve complex issues

Eligibility Requirements

Shared on-call rotation

What They're Looking For.

Must Have

Bachelor’s degree in Computer Science, 5+ years SRE or Platform Engineer experience, Software development best practices knowledge, Public cloud services experience (AWS, GCP, Azure), SaaS product support experience, Programming or scripting skills (Python, Go, Bash), Infrastructure-as-code experience (Terraform), Proficiency with Kubernetes, Container-based deployment experience (Docker), Experience with CI/CD pipelines & tools, Experience managing monitoring solutions (Datadog), Comfortable with rotating on-call schedule, Experience managing critical incidents, Experience leading post-incident reviews, Operating and managing production systems, System-level troubleshooting skills

Nice to Have

Kubernetes ecosystem experience, Helm experience, FluxCD experience, Crossplane experience

What You'll Do.

Implement cloud-native systems

Manage cloud-native systems

Operate Kubernetes clusters

Enhance Kubernetes clusters

Operate deployment pipelines

Enhance deployment pipelines

Operate service meshes

Enhance service meshes

Design SaaS platform infrastructure

Build SaaS platform infrastructure

Maintain SaaS platform infrastructure

Maintain error budgets

Address availability issues

Address performance issues

Develop infrastructure-as-code

Provision infrastructure

Build internal platform tools

Support operational efficiency

Monitor infrastructure

Participate in on-call rotation

Drive timely resolution

Drive timely communication

Act as Incident Commander

Coordinate cross-team responses

Drive incident response processes

Refine incident response processes

Reduce Mean Time to Detect

Reduce Mean Time to Recovery

Diagnose complex issues

Resolve complex issues

Embed fault tolerance

Embed reliability principles

Automate health checks

Support automated testing

Support canary deployments

Support rollback strategies

Ensure reliable releases

Contribute to security best practices

Contribute to compliance automation

Contribute to cost optimization

How You'll Work.

Team & Collaboration

Work with Product Engineering teams; Work with Technical Platform Engineering; Work with Security teams; Work with Architecture teams; Cross-team responses

Full Job Description

Nexthink is the leader in digital employee experience management software. The company provides IT leaders with unprecedented insight, allowing them to see, diagnose and fix issues at scale impacting employees anywhere, with any application or network, before employees notice the issue. As the first solution to allow IT to progress from reactive problem solving to proactive optimization, Nexthink enables its more than 1,300 customers to provide better digital experiences to more than 18 million employees. Dual-headquartered in Lausanne, Switzerland and Boston, Massachusetts, Nexthink has 9 offices worldwide.

#LI-Hybrid

At Nexthink, we empower our customers with industry-leading solutions to enable continuous improvement of employee experience. We deliver unmatched visibility across all environments, so IT teams can consistently see, diagnose, and fix digital workplace issues. As a SaaS provider, our commitment is to deliver a seamless, resilient, and scalable platform around the clock.

We are looking for an experienced, proactive and innovative professional who is keen to join as a Senior Site Reliability Engineer!

The mission of Nexthink's SRE team is to strengthen our infrastructure and enhance our ability to deploy, monitor, and scale systems effectively and reliably. The team works closely with over 50 Product Engineering teams that develop our products and services, as well as with the Technical Platform Engineering, Security and Architecture teams, to understand the reliability requirements, design and implement solutions, and promote them for adoption and usage.

Join our vibrant team of diverse and experienced engineers where cutting-edge technology meets innovation. Be a part of Nexthink's Digital Employee Experience technological revolution, ensuring our global customers enjoy a seamless user experience. Apply now and become a key player in our dynamic SRE organisation.

As a Senior Site Reliability Engineer, you will:

* Implement and manage cloud-native systems (

How to Apply on SmartRecruiters

  • SmartRecruiters often includes a video screening step — check camera and mic permissions.
  • Link your GitHub or portfolio directly in the profile section for technical roles.
  • Applications may be reviewed by AI scoring before reaching a recruiter — use keywords from the job description.