Company
Technology
SeniorSiteReliabilityEngineer(DevTools)
Neural analysis suggests this role is
optimal for Senior candidates.
“Senior Site Reliability Engineer (DevTools). Skills: Site Reliability Engineering, Software Engineering, Developer Infrastructure, Internal Tooling. Design developer infrastructure. Operate developer infrastructure”
What You'll Achieve.
Enhance developer experience; Enhance developer productivity; Optimize system performance; Reduce operational friction; Improve development workflows; Improve reliability
Industry & Context.
Troubleshooting; Analytical thinking; Problem solving
What They're Looking For.
Must Have
Site Reliability Engineering experience, Software Engineering experience, Java programming skills, Kotlin programming skills, Go programming skills, Python programming skills, Ruby programming skills, Unix/Linux operating systems knowledge, System internals knowledge, Infrastructure troubleshooting knowledge, JVM-based applications knowledge, Performance optimization knowledge, Operational best practices knowledge, Highly available systems design, Scalable systems design, Highly available systems operation, Scalable systems operation, Highly available systems improvement, Scalable systems improvement, Analytical thinking, Troubleshooting capabilities, Platform Engineering experience, Developer platforms experience, Internal tooling environments experience
Nice to Have
Spring Framework experience, Java-based monolithic applications experience, Large-scale enterprise systems experience, GitLab proficiency, TeamCity proficiency
What You'll Do.
Design developer infrastructure
Operate developer infrastructure
Improve developer infrastructure
Design internal tooling platforms
Operate internal tooling platforms
Improve internal tooling platforms
Build reliable systems
Maintain reliable systems
Build fault-tolerant systems
Maintain fault-tolerant systems
Build self-healing systems
Maintain self-healing systems
Ensure high availability
Analyze user feedback
Enhance developer experience
Enhance developer productivity
Optimize system performance
Reduce operational friction
Improve development workflows
Monitor platform health
Troubleshoot incidents
Implement preventive measures
Collaborate with engineering teams
Define operational metrics
Validate improvements
Resolve technical issues
Ensure platform stability
Explore emerging technologies
Integrate emerging technologies
Integrate AI-assisted workflows
Integrate developer productivity solutions
How You'll Work.
Team & Collaboration
Cross-functional engineering teams
Full Job Description
## Accountabilities Design, operate, and continuously improve large-scale developer infrastructure and internal tooling platforms. Build and maintain reliable, fault-tolerant, and self-healing systems that ensure high availability and performance. Analyze user feedback, identify pain points, and implement solutions that enhance developer experience and productivity. Optimize system performance, reduce operational friction, and improve the efficiency of development workflows. Develop, customize, and extend both open-source and commercial tools to better meet organizational needs. Contribute to software development initiatives across multiple programming languages and technology stacks. Monitor platform health, troubleshoot incidents, and implement preventive measures to improve reliability. Collaborate with engineering teams to define meaningful operational metrics and validate improvements through measurable outcomes. Support users by resolving technical issues, providing guidance, and ensuring platform stability. Explore and integrate emerging technologies, including AI-assisted workflows and developer productivity solutions. Requirements Proven experience combining Site Reliability Engineering and Software Engineering responsibilities in production environments. Strong programming skills and hands-on development experience with languages such as Java, Kotlin, Go, Python, Ruby, or similar. Solid understanding of Unix/Linux operating systems, system internals, and infrastructure troubleshooting. Strong knowledge of JVM-based applications, performance optimization, and operational best practices. Experience designing, operating, and improving highly available and scalable systems. Passion for enhancing user experience through engineering excellence and continuous improvement. Ability to adapt quickly, solve complex technical problems, and perform effectively in fast-changing environments. Strong analytical thinking, troubleshooting capabilities, and attention to deta
Applying for this Senior Site Reliability Engineer (DevTools) role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Lever
- Lever uses a streamlined one-page form — apply in under 5 minutes.
- LinkedIn import works well; review parsed data before submitting.
- The cover letter field is optional but visible to reviewers — use it to differentiate.
- Referral codes from employees can significantly boost visibility of your application.
ANONYMOUS · UNFILTERED
What do employees actually say about this company?
Real rants from real employees. Read before you apply.