Civica

Tech / AI / Software

SeniorSiteReliabilityEngineer

Remote Remote Friendly
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is
optimal for Senior candidates.

The Brief

“Senior Site Reliability Engineer at Civica. Skills: Site Reliability Engineering, DevOps, infrastructure, cloud platform, automation, reliability, performance, security, AWS, Azure, VMware, IaC, container orchestration, monitoring, logging, tracing. owning the reliability, performance and security of the cloud platform. work together with principal engineers, developers, and product teams to make roadmaps reality”

Industry & Context.

Tech / AI / Software
Problems you'll solve

proactively identify risks before it impacts our users; translating complex technical issues into clear, actionable plans

Eligibility Requirements

Own the on-call rota

What They're Looking For.

Must Have

Demonstrable experience in a production SRE, DevOps or infrastructure role, ideally within a SaaS or large-scale web environment, Expert in at least one public cloud (AWS, Azure, or GCP) and comfortable designing hybrid migrations from on-prem to cloud, coding/scripting and troubleshooting skills (on either of Go,. NET, Java, Python, etc.) and a passion for building reusable tested libraries and tooling, Proven track record with IaC tools (Terraform, CloudFormation, or similar) and container orchestration (Kubernetes, ECS, AKS, OpenShift), Proven track record with virtual machine orchestration / provisioning and resiliency strategies (Kubevirt, packer, ansible), Deep understanding of monitoring, logging, and tracing frameworks (Prometheus/Grafana, ELK/Opensearch, Jaeger, etc.), Excellent communicator who thrives in cross-functional teams, with passion for translating complex technical issues into clear, actionable plans

Nice to Have

Kubernetes, ECS, AKS, OpenShift, Kubevirt, packer, ansible, Prometheus/Grafana, ELK/Opensearch, Jaeger, etc.

What You'll Do.

owning the reliability, performance and security of the cloud platform, work together with principal engineers, developers, and product teams to make roadmaps reality, driving automation and best practices through the process, Designing and implementing for scale & resilience, Architect, implement and continuously improve our existing Data Center and Cloud environments on AWS, Azure, and VMware, ensuring they meet our SLAs and adapt dynamically to demand working alongside the Platform teams providing PaaS/IaaS, Driving automation, Build and evolve infrastructure as code (Terraform, etc.

) and CI/CD pipelines (GitHub Actions, etc.

) to ship new features safely and at speed, Defining and measuring reliability, Partner with teams to set up meaningful SLIs/SLOs, implement real-time observability (Datadog, Prometheus, Grafana,.

) and proactively identify risks before it impacts our users, Leading incident response, Own the on-call rota, coach teams through blameless post-mortems, and embed a culture of continuous improvement so outages become learning opportunities, Mentoring & evangelism, Share your deep expertise by pairing with engineers, running brown-bag sessions on reliability best practices, and helping raise the bar across our global engineering organisation, Securing our stack, Collaborate with our Security team and include security controls into CI/CD, runtime environments and disaster-recovery so, our customers and citizens are always protected.

How You'll Work.

Team & Collaboration

work together with principal engineers, developers, and product teams; working alongside the Platform teams; Partner with teams; thrives in cross-functional teams; Collaborate with our Security team; Join employee-led communities

Communication Scope

Excellent communicator; translating complex technical issues into clear, actionable plans

Full Job Description

We’re Civica and we make software that helps deliver critical services for citizens all around the world. From local to state government, to education, to health and care, over 5,000 public bodies across the globe use our software to help provide critical services to over 100 million citizens. Our aspiration is to be a GovTech champion everywhere we work around the globe, supporting the needs of citizens and those that serve them every day. Building on 21 years of continuous growth and success, we're at a pivotal point on our journey to realise that aspiration. **Why you will love this opportunity as Senior SRE at Civica:** As our Senior Site Reliability Engineer, you will be at the heart of Civica’s SaaS transformation; owning the reliability, performance and security of the cloud platform that powers our education, health & care, and government products. You will work together with principal engineers, developers, and product teams to make roadmaps reality, driving automation and best practices through the process. **What you will be doing:** * Designing and implementing for scale & resilience: Architect, implement and continuously improve our existing Data Center and Cloud environments on AWS, Azure, and VMware, ensuring they meet our SLAs and adapt dynamically to demand working alongside the Platform teams providing PaaS/IaaS. * Driving automation: Build and evolve infrastructure as code (Terraform, etc.) and CI/CD pipelines (GitHub Actions, etc.) to ship new features safely and at speed. * Defining and measuring reliability: Partner with teams to set up meaningful SLIs/SLOs, implement real-time observability (Datadog, Prometheus, Grafana, ...) and proactively identify risks before it impacts our users. * Leading incident response: Own the on-call rota, coach teams through blameless post-mortems, and embed a culture of continuous improvement so outages become learning opportunities. * Mentoring & evangelism: Share your deep expertise by pairing with engineers, r

Free ATS check

Applying for this Senior Site Reliability Engineer role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

ANONYMOUS · UNFILTERED

What do employees actually say about Civica?

Real rants from real employees. Read before you apply.

Read Company Rants →