Sanofi
IncidentManagementReliabilityEngineer
Neural analysis suggests this role is
optimal for Senior candidates.
“Incident Management Reliability Engineer at Sanofi. Skills: Incident Management, Reliability Engineering, Automation. Manage Major Incidents. Act as command centre lead”
What You'll Achieve.
Minimize disruptions; Rapid recovery from major incidents; Continuously improve system performance; Continuously improve system availability; Reduction in number of recurring incidents; Reduction in impact of recurring incidents; Adherence to SLA/SLO targets; Completion rate of post-incident actions; Stakeholder satisfaction; Transparency during incidents
Industry & Context.
Root cause elimination
What They're Looking For.
Must Have
8+ years' experience
Nice to Have
ITIL v4 or Service Operations certification, SRE Foundation / Practitioner certification, Cloud certifications (AWS, Azure, or GCP), Incident Command System (ICS) or equivalent leadership training in crisis response
What You'll Do.
Manage Major Incidents
Act as command centre lead
Ensure incident documentation
Drive post-incident-reviews
Maintain communication processes
Enhance service reliability
Implement automated recovery
Analyze incident trends
Develop reliability plans
Participate in capacity planning
Participate in change reviews
Participate in failure analysis
Develop SLOs/SLIs/SLAs
Identify recurring issues
Lead root cause elimination
Automate operational tasks
Enhance service recovery
Contribute to Major Incident Process evolution
How You'll Work.
Team & Collaboration
Collaborate with service owners; Collaborate with platform teams; Coordinate across technical teams; Coordinate across business teams; Partner with problem management
Communication Scope
Verbal communication; Written communication; Stakeholder communication
Process & Methodology
Incident Command
Full Job Description
**Our Team:** Service Quality cultivates a culture of service excellence where quality is more than a benchmark – it's a shared purpose. Through synergistic collaboration, advanced monitoring, and empathetic customer advocacy, we strive to elevate every interaction and transform challenges into opportunities for growth. **Main responsibilities:** The Incident Management Reliability Engineer is responsible for ensuring the stability, resilience, and reliability of critical IT services. This role combines strong incident management expertise with reliability engineering principles to minimize disruptions, drive rapid recovery from major incidents, and continuously improve system performance and availability. * **Incident Management** * Lead the end-to-end management of Major Incidents (P1/P2), ensuring timely resolution and effective stakeholder communication. * Act as command centre lead during critical outages, coordinating across technical and business teams. * Ensure accurate and detailed incident documentation, including root cause, timeline and resolution steps. * Drive post-incident-reviews and ensure action items are implemented to prevent recurrence. * Maintain consistent communication and escalation processes aligned with ITSM best practices (e.g. ITIL) * **Reliability Engineering** * Collaborate with service owners and platform teams to enhance service reliability, observability, and fault tolerance. * Implement proactive monitoring, alerting, and automated recovery mechanisms. * Analyse incident trends and develop reliability improvement plans. * Participate in capacity planning, change reviews, and failure mode analysis to anticipate and mitigate risks. * Develop and track SLOs/SLIs/SLAs to measure service health and performance. * **Continuous Improvemen** t * Partner with problem management to identify recurring issues and lead root cause elimination initiatives. * Automate operational tasks and enhance service recovery using scripts, runbooks, and AIOp
Applying for this Incident Management Reliability Engineer role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Workday
- Workday has a multi-step form — save your progress after every section.
- "Apply With LinkedIn" can fail or lose data; manual entry is more reliable.
- Watch for the "Submit for Review" final step — hitting "Save" alone does not submit.
- Job requisition numbers are useful when following up with HR by email.
ANONYMOUS · UNFILTERED
What do employees actually say about Sanofi?
Real rants from real employees. Read before you apply.