Veeam Software
Technology
Senior Production Engineer
“Senior Production Engineer at Veeam Software. Skills: Production Engineering, Site Reliability, Distributed Systems, Cloud Platforms. Own reliability of production services. Own performance of production services”
What You'll Achieve.
Accelerate safe AI at scale; Ensure data is understood, secured, and resilient; Drive reliability and observability improvements; Drive improvements across support and development teams; Ensure services are production-ready, performant, and fault-tolerant; Rapidly incorporate user feedback; Ensure measurable reliability improvements
Industry & Context.
Advanced troubleshooting; Problem-solving; Root cause analysis; Incident analysis; On-call response
What They're Looking For.
Must Have
5-8+ years software engineering, 5-8+ years site reliability, 5-8+ years production engineering, 5-8+ years senior technical support, Log analysis experience, Advanced troubleshooting experience, Programming experience (JS, Go, TypeScript, Java, C#), Deploying systems on public cloud, Troubleshooting systems on public cloud, Familiarity with observability tooling, Solid understanding of distributed systems, Solid understanding of networking, Solid understanding of automation, Solid understanding of CI/CD
Nice to Have
Prior on-call experience, Prior incident response experience, Leading significant incidents experience, Leading problem-management efforts experience, Automation background, Performance testing background, Service scalability background, Familiarity with compliance best practices, Familiarity with security best practices, Experience incorporating compliance into production, Experience incorporating security into production
What You'll Do.
Own reliability of production services
Own performance of production services
Own operability of production services
Own reliability of production workflows
Own performance of production workflows
Own operability of production workflows
Own complex production issues
Own escalated production issues
Drive long-term fixes
Address systemic risks
Convert risks into improvements
Lead production efficiency initiatives
Define knowledge base integrity
Develop knowledge base integrity
Maintain knowledge base integrity
Build production monitoring systems
Maintain production monitoring systems
Minimize alerting noise
Ensure actionable alerting
Ensure well-documented runbooks
Guide operational decisions
Guide product decisions
Turn manual processes into automation
Champion automation patterns
Champion automation tooling adoption
Own post-mortem review process
Drive post-mortem actions
Ensure high-quality follow-up
Ensure measurable reliability improvements
Collaborate with support organization
Feed back tooling enhancements
Feed back improvement recommendations
Collaborate with developers
Ensure safe deployments
Ensure efficient incident mitigation
Ensure services are operable
Minimize manual intervention
Share learnings through documentation
Share learnings through feedback
Mentor other engineers
Coach other engineers
Raise operational bar
How You'll Work.
Team & Collaboration
Collaborating with support; Collaborating with developers; Collaborating with product managers; Collaborating with security professionals; Collaborating with engineering; Collaborating with development teams; Collaborating with support teams; Collaborating with product teams; Collaborating with cloud engineering; Collaborating with security teams
Communication Scope
Communicate with product managers; Communicate with security professionals; Share learnings
Process & Methodology
Incident lifecycle management, Problem-management efforts
Full Job Description
Veeam is the Data and AI Trust Company, specializing in helping organizations ensure their data and AI are fully understood, secured, and resilient to enable the acceleration of safe AI at scale. As the market leader in both data resilience and data security posture management, Veeam is built for the convergence of identity, data, security, and AI risk. Headquartered in Seattle with offices in more than 30 countries, Veeam protects over 550,000 customers worldwide, who trust Veeam to keep their businesses running. Join us as we go fearlessly forward together, growing, learning, and making a real impact for some of the world’s biggest brands.
About the Role
As a Senior Production Engineer, you will play a leading role in designing and operating reliable, scalable systems for Veeam's Data Cloud platform. You will own high‑impact production efficiency, automation, and documentation initiatives, drive reliability and observability improvements, and own or participate in the full incident lifecycle, from on‑call response, through mitigation, to leading post‑incident reviews and driving improvements across support and development teams.
You will work as part of a team of skilled engineers, collaborating with support and development as a senior bridge and driving force for change. You will communicate with product managers and security professionals to ensure our services are production‑ready, performant, and fault‑tolerant, and that we rapidly incorporate user feedback into improvements.
What You Will Do
Production
Own the reliability, performance, and operability of complex, business‑critical production services and workflows.
Own complex and escalated production issues from support, and drive long‑term fixes in collaboration with engineering, including code, configuration, and architecture changes.
Proactively identify and address systemic risks that are identified during the problem‑solving process, and convert them into long‑term engineering improvements.
Lead produc
How to Apply on Greenhouse
- Create a Greenhouse profile before applying — it saves time across multiple applications.
- Upload your resume as a PDF; the parser handles it better than Word.
- Answer all knockout questions carefully — wrong answers auto-reject before a human sees you.
- Enable email notifications to track application status in real time.