Arbor Education

SiteReliabilityTechnicalLead

£80–90k Port Hueneme, California, United States; United States FULL TIME Remote Friendly
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is
optimal for Lead candidates.

The Brief

“Site Reliability Technical Lead at Arbor Education. Skills: Site Reliability Engineering, AWS, System Architecture, Leadership. Define and guide system architecture. Ensure systems are observable and meet SLOs”

What You'll Achieve.

Ensure products are robust, scalable, and secure; Meet agreed Service Level Objectives (SLOs); Reduce operational toil; Improve system efficiency

Industry & Context.

Problems you'll solve

Root Cause Analysis (RCA)

Eligibility Requirements

No visa sponsorship

What They're Looking For.

Must Have

Extensive professional experience in SRE, DevOps, or Platform Engineering on complex, scalable systems, Extensive expertise with AWS and distributed cloud architectures, Proven experience operating platforms serving a high volume of requests (~1000 req/sec), Advanced proficiency with Terraform and configuration management tools, skills in Python, Go, or a similar language for automation and tooling, Deep experience with monitoring and observability platforms (e. g. , DataDog, Prometheus, or equivalent), plus incident/problem management, Expert understanding of distributed systems, microservices, and resilience patterns, Hands-on experience with containerization and orchestration technologies (Docker, Kubernetes, ECS), Practical experience with building and maintaining CI/CD pipelines for automated deployments, Demonstrated ability in mentoring and supporting the growth of fellow engineers

Nice to Have

Experience with chaos engineering and reliability testing, Knowledge of security best practices and compliance frameworks, Background in agile and lean methodologies (Scrum/Kanban), Contributions to open-source projects or the SRE community

What You'll Do.

Define and guide system architecture

Ensure systems are observable and meet SLOs

Drive continuous improvement in platform reliability

Lead Root Cause Analysis (RCA)

Optimize incident response process

Drive automation initiatives

Uphold coding standards

Promote automated testing

Ensure production readiness standards

Lead technical estimation

Contribute to release planning

Mentor and coach engineers

Foster alignment and galvanize team

How You'll Work.

Team & Collaboration

Work closely with Product Managers; Collaborate with Engineering Managers; Align technical direction with product strategy; Communicate complex technical concepts clearly

Communication Scope

Communicate complex technical concepts clearly to both technical and non-technical stakeholders

Process & Methodology

Technical estimation, Feasibility assessments, Structured release planning, Post-release reviews

Full Job Description

**Location:** Remote **Salary:** £80,000 - £90,000 ### About us At Arbor, we’re on a mission to transform the way schools work for the better. We believe in a future of work in schools where being challenged doesn’t mean being burnt out and overworked. Where data guides progress without overwhelming staff. And where everyone working in a school is reminded why they got into education every day. Our MIS and school management tools are already making a difference in over 7,000 schools and trusts. Giving time and power back to staff, turning data into clear, actionable insights, and supporting happier working days. At the heart of our brand is a recognition that the challenges schools face today aren’t just about efficiency, outputs and productivity - but about creating happier working lives for the people who drive education everyday: the staff. We want to make schools more joyful places to work, as well as learn. ### About the role We are looking for an experienced and collaborative Site Reliability Technical Lead to join our Site Reliability team and take ownership of system and solution design to ensure our products are robust, scalable, and secure. The remit and focus of the role is to blend deep technical expertise with leadership, requiring you to mentor and coach engineers, embed a culture of quality and reliability, and guide the team in making sound technical decisions. It’s a broad and exciting role, so we’re looking for someone up for a challenge - if you’re highly technical and a good communicator, this is the role for you. **Core responsibilities** * **Architectural Leadership:** Define and guide system architecture, balancing trade-offs between speed, scalability, maintainability, and security to meet business goals. * **Reliability and Performance:** Champion accountability from design through to production by ensuring systems are observable and meet agreed Service Level Objectives (SLOs). Drive continuous improvement in platform reliability, performanc

Free ATS check

Applying for this Site Reliability Technical Lead role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

ANONYMOUS · UNFILTERED

What do employees actually say about Arbor Education?

Real rants from real employees. Read before you apply.

Read Company Rants →