Arbor Education
SiteReliabilityTechnicalLead
Neural analysis suggests this role is
optimal for Lead candidates.
“Site Reliability Technical Lead at Arbor Education. Skills: Site Reliability Engineering, AWS, System Architecture, Leadership. Define and guide system architecture. Ensure systems are observable and meet SLOs”
What You'll Achieve.
Ensure products are robust, scalable, and secure; Meet agreed Service Level Objectives (SLOs); Reduce operational toil; Improve system efficiency
Industry & Context.
Root Cause Analysis (RCA)
No visa sponsorship
What They're Looking For.
Must Have
Extensive professional experience in SRE, DevOps, or Platform Engineering on complex, scalable systems, Extensive expertise with AWS and distributed cloud architectures, Proven experience operating platforms serving a high volume of requests (~1000 req/sec), Advanced proficiency with Terraform and configuration management tools, skills in Python, Go, or a similar language for automation and tooling, Deep experience with monitoring and observability platforms (e. g. , DataDog, Prometheus, or equivalent), plus incident/problem management, Expert understanding of distributed systems, microservices, and resilience patterns, Hands-on experience with containerization and orchestration technologies (Docker, Kubernetes, ECS), Practical experience with building and maintaining CI/CD pipelines for automated deployments, Demonstrated ability in mentoring and supporting the growth of fellow engineers
Nice to Have
Experience with chaos engineering and reliability testing, Knowledge of security best practices and compliance frameworks, Background in agile and lean methodologies (Scrum/Kanban), Contributions to open-source projects or the SRE community
What You'll Do.
Define and guide system architecture
Ensure systems are observable and meet SLOs
Drive continuous improvement in platform reliability
Lead Root Cause Analysis (RCA)
Optimize incident response process
Drive automation initiatives
Uphold coding standards
Promote automated testing
Ensure production readiness standards
Lead technical estimation
Contribute to release planning
Mentor and coach engineers
Foster alignment and galvanize team
How You'll Work.
Team & Collaboration
Work closely with Product Managers; Collaborate with Engineering Managers; Align technical direction with product strategy; Communicate complex technical concepts clearly
Communication Scope
Communicate complex technical concepts clearly to both technical and non-technical stakeholders
Process & Methodology
Technical estimation, Feasibility assessments, Structured release planning, Post-release reviews
Full Job Description
**Location:** Remote **Salary:** £80,000 - £90,000 ### About us At Arbor, we’re on a mission to transform the way schools work for the better. We believe in a future of work in schools where being challenged doesn’t mean being burnt out and overworked. Where data guides progress without overwhelming staff. And where everyone working in a school is reminded why they got into education every day. Our MIS and school management tools are already making a difference in over 7,000 schools and trusts. Giving time and power back to staff, turning data into clear, actionable insights, and supporting happier working days. At the heart of our brand is a recognition that the challenges schools face today aren’t just about efficiency, outputs and productivity - but about creating happier working lives for the people who drive education everyday: the staff. We want to make schools more joyful places to work, as well as learn. ### About the role We are looking for an experienced and collaborative Site Reliability Technical Lead to join our Site Reliability team and take ownership of system and solution design to ensure our products are robust, scalable, and secure. The remit and focus of the role is to blend deep technical expertise with leadership, requiring you to mentor and coach engineers, embed a culture of quality and reliability, and guide the team in making sound technical decisions. It’s a broad and exciting role, so we’re looking for someone up for a challenge - if you’re highly technical and a good communicator, this is the role for you. **Core responsibilities** * **Architectural Leadership:** Define and guide system architecture, balancing trade-offs between speed, scalability, maintainability, and security to meet business goals. * **Reliability and Performance:** Champion accountability from design through to production by ensuring systems are observable and meet agreed Service Level Objectives (SLOs). Drive continuous improvement in platform reliability, performanc
Applying for this Site Reliability Technical Lead role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
ANONYMOUS · UNFILTERED
What do employees actually say about Arbor Education?
Real rants from real employees. Read before you apply.