Wikimedia Foundation
Technology
SeniorSiteReliabilityEngineer
Neural analysis suggests this role is
optimal for Senior candidates.
“Senior Site Reliability Engineer at Wikimedia Foundation. Skills: Site Reliability Engineering, Cloud Infrastructure, CI/CD, Observability. Define Service Level Objectives. Track Service Level Objectives”
What You'll Achieve.
Ensure reliability targets met; Enable proactive detection; Enable faster troubleshooting; Drive continuous improvement; Maintain performance; Maintain availability
Industry & Context.
Troubleshooting; Data-driven decisions
On-call
What They're Looking For.
Must Have
5+ years experience, Proficiency in at least one programming language, Experience with incident response, Experience with on-call practices, Experience with continuous improvement, Experience with operational excellence, Experience operating highly available, large-scale distributed systems
Nice to Have
Familiarity with Wikimedia or open source projects
What You'll Do.
Define Service Level Objectives
Track Service Level Objectives
Improve Service Level Objectives
Improve error budgets
Build observability systems
Enhance observability systems
Drive reliability engineering practices
Perform capacity planning
Improve developer experience
Enable self-service infrastructure
Streamline deployment workflows
Embed reliability best practices
Design CI/CD workflows
Implement CI/CD workflows
Optimize CI/CD workflows
Design GitOps workflows
Implement GitOps workflows
Optimize GitOps workflows
Enable automated deployments
Enable reliable deployments
Support progressive delivery
Implement secure-by-default infrastructure
Enforce security best practices
Optimize infrastructure cost
Optimize infrastructure efficiency
Establish operational metrics
Track operational metrics
Reduce operational toil
Identify repetitive work
Implement automation-first solutions
Contribute to platform capabilities
Evolve platform capabilities
Standardize infrastructure
Improve scalability across teams
Collaborate with global team
Communicate asynchronously
How You'll Work.
Team & Collaboration
Cross department collaboration; Enterprise SRE team; Foundation SRE teams; Security teams; Release teams; Software engineers
Communication Scope
Documentation skills
Process & Methodology
Agile
Full Job Description
Summary The Wikimedia Foundation is looking for a Senior Site Reliability Engineer to join our team, reporting to the Sr. Engineering Manager. As the Site Reliability Engineer, you will play a key role in designing, developing, and maintaining reliable, scalable, and highly available infrastructure for our API services. You will contribute heavily to the high impact challenges behind innovating, building, and maintaining Wikipedia’s data feeds for high volume reusers. In this role, you will foster cross department collaboration with the wikimedia foundation SRE teams. You will own reliability targets (SLOs) for critical APIs, balancing performance, cost, and availability through data-driven decisions. You will be involved in designing and running the infrastructure and services that interact with the base of Wikimedia Foundation’s projects, including, but not limited to: Kubernetes clusters, application servers, code collaboration infrastructure, and other developer-facing services. You will participate in incident response and be on-call. This role requires frequent work with other members of the enterprise and Foundation SRE team to maintain and improve our systems, as well as interacting with people not in SRE, like Security, Release and Software Engineers, together striving to move our projects and technologies forward. Wikimedia Enterprise is a new, revenue-generating product that provides fast, comprehensive, reliable, and secure data ingestion for organizations that wish to repurpose Wikimedia/Wikipedia content in third party environments. Wikimedia Enterprise aims to improve the user experience for Wikimedia/Wikipedia readers beyond our own websites; increase the reach and discoverability of Wikimedia/Wikipedia content; and improve awareness and ease of attribution and verifiability of Wikimedia/Wikipedia content by the organizations that reuse our content the most. You can learn more about the project in WIRED and Insider. We are a distributed and diverse t
Applying for this Senior Site Reliability Engineer role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Greenhouse
- Create a Greenhouse profile before applying — it saves time across multiple applications.
- Upload your resume as a PDF; the parser handles it better than Word.
- Answer all knockout questions carefully — wrong answers auto-reject before a human sees you.
- Enable email notifications to track application status in real time.
ANONYMOUS · UNFILTERED
What do employees actually say about Wikimedia Foundation?
Real rants from real employees. Read before you apply.