Diligent Corporation
SaaS
SeniorSiteReliabilityEngineer
“Senior Site Reliability Engineer at Diligent Corporation. Skills: Site Reliability Engineering, Kubernetes, Observability, Resilience, Security, Automation, GitOps. Operate and continuously improve Kubernetes production platforms. Contribute to zero-downtime upgrades and multi-AZ resilience”
What You'll Achieve.
Keep our Kubernetes platforms observable, resilient and boring-to-upgrade; Keep the bar high for the DAX 30 and other DACH customers we serve; Make an impact; Drive greater impact and accountability
Industry & Context.
Takes the initiative when something can be better — observability, resilience, a tricky upgrade, or the way the team thinks about security; Solve problems related to platform security posture; Debug workload without SRE hand-holding
Participate in our Standby and Daily Business rotation, Comfortable being on-call, Hybrid work model: expected to work onsite at least 50% of the time if within commuting distance
What They're Looking For.
Must Have
Several years hands-on SRE, DevOps or Platform Engineering, including meaningful time running production Kubernetes at scale, Kubernetes expertise with deep hands-on experience in at least one area — cluster lifecycle and upgrades, workload identity and RBAC, admission control, network policies, or custom resources and operators — and working familiarity with the rest, Solid grasp of Kubernetes and container security — secrets management, network segmentation and runtime protection — and an interest in growing into our security champion alongside our Application Security Engineer, Proven depth in the ELK stack (or a very similar log platform) — pipelines, indexing, dashboards, alerting — with an interest in growing into the team’s observability expert, Working knowledge of Prometheus and Grafana, Comfortable with GitOps and CI/CD as a daily way of working (we run Flux and GitLab equivalents like Argo CD, GitHub Actions or Jenkins are fine), and hands-on experience with Helm and Kustomize for managing manifests, Solid coding in Go, Python or Bash, with a love for automating away repetitive work, Comfortable being on-call and leading incidents calmly under pressure, Professional fluency in German and excellent at home working in a diverse team
Nice to Have
Experience in regulated industries (financial services, legal, healthcare, defence) or under compliance frameworks such as ISO 27001 or C5, Track record of designing or contributing to custom Kubernetes Operators, Service-mesh experience (Istio, Linkerd, Cilium), A demonstrated interest in working shoulder-to-shoulder with AppSec engineers to raise platform security posture, Experience operating Couchbase (Couchbase Operator, server groups, XDCR) or another stateful data platform on Kubernetes, Experience migrating ingress controllers or other cluster-wide components with zero customer downtime, Experience with anomaly detection on platform telemetry
What You'll Do.
Operate and continuously improve Kubernetes production platforms
Contribute to zero-downtime upgrades and multi-AZ resilience
Grow into the team’s expert on the ELK-based log platform
Maintain and evolve Prometheus alerting rules and Grafana dashboards
Partner on Kubernetes and container security
Chip away at operational toil — deployments
Ship reliably through our GitOps workflow
Participate in Standby and Daily Business rotation
Lead incident response
Run blameless post-mortems
Drive resulting action items to completion
How You'll Work.
Team & Collaboration
Partner closely with our Application Security Engineer; Work shoulder-to-shoulder with AppSec engineers; Collaborate with service owners; Work in a diverse team
Communication Scope
Professional fluency in German; Excellent communication within a diverse team
Process & Methodology
Drive resulting action items to completion
Applying for this Senior Site Reliability Engineer role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Greenhouse
- Create a Greenhouse profile before applying — it saves time across multiple applications.
- Upload your resume as a PDF; the parser handles it better than Word.
- Answer all knockout questions carefully — wrong answers auto-reject before a human sees you.
- Enable email notifications to track application status in real time.
ANONYMOUS · UNFILTERED
What do employees actually say about Diligent Corporation?
Real rants from real employees. Read before you apply.