Cambria
Manufacturing
Senior Manager, Systems and Site Reliability Engineering
“Senior Manager, Systems and Site Reliability Engineering at Cambria. Skills: Systems Engineering, Site Reliability Engineering, Infrastructure Management, Containerization. Manage day-to-day deliverables. Assist in service delivery”
What You'll Achieve.
Ensure services are available; Ensure services perform optimally; Meet or exceed SLO targets
Industry & Context.
Problem-solving; Analytical skills; Troubleshooting
On-call rotation, Travel between local locations
What They're Looking For.
Must Have
10+ years Systems Administrator/Engineering, Bachelor's degree in Computer Science or equivalent experience, Experience with container orchestration, Experience with virtualization platforms, Solid foundation in Linux administration, Experience writing infrastructure as code, Experience with Agile delivery methodologies, Experience with ALM toolsets, Experience with collaboration software, Managerial experience
Nice to Have
Kubernetes experience a plus, Nutanix EKS a plus, AWS EKS a plus, Ansible a plus, Cohesity a plus, Pure Storage a plus, Windows Server a plus, Linux a plus, Active Directory a plus, Okta a plus, AWS a plus, Terraform a plus, Git a plus, Red Hat Satellite a plus, Red Hat Identity Management a plus
What You'll Do.
Manage day-to-day deliverables
Assist in service delivery
Participate in on-call rotation
Hold daily standup meetings
Plan and prioritize work
Partner with senior leadership
Ensure technical work alignment
Participate in quarterly planning
Lead staff performance and development
Define skill competency models
Develop gap closure plans
Act as primary contact for customers
Provide technical updates
Lead critical issue resolution
Manage formal escalations
Participate in root cause analysis
Ensure permanent resolution
Communicate findings to stakeholders
Build scalable infrastructure
Operate resilient infrastructure
Maintain highly available infrastructure
Develop automation strategy
Automate SRE requests
Analyze metrics for monitors
Create actionable alerts
Ensure critical service availability
Lead container platform adoption
Implement container platform lifecycle
Manage CI/CD pipelines
Implement compute systems
Lifecycle compute systems
Implement storage systems
Lifecycle storage systems
Plan virtualization systems
Implement virtualization systems
Lifecycle virtualization systems
Plan server operating systems
Implement server operating systems
Lifecycle server operating systems
Ensure SLO targets are met
How You'll Work.
Team & Collaboration
Internal business partners; Product teams; Senior leadership; Internal customers; Clients; Users
Communication Scope
Technical updates; Articulate complex ideas
Process & Methodology
Agile, Scrum, Kanban, Roadmap planning
Full Job Description
_**Job Description:**_

The Senior Manager, Systems and Site Reliability Engineering serves as the operational engine for Cambria’s infrastructure, acting as the primary bridge between IT strategy and technical execution. This role is responsible for the performance and development of Systems Engineers and Site Reliability Engineers. While driving the modernization to a container-first model, this leader ensures that services are available and perform optimally to scale manufacturing operations. As the #2 leader in IT Operations, you will translate strategic roadmaps into daily deliverables, manage high-pressure incident response, and serve as the technical face of the team to internal business partners.

**Essential Duties & Responsibilities:**

**Operational Leadership & Execution**

* Day-to-Day Delivery: Manage the day-to-day deliverables of the team, ensuring tasks align with the workstreams translated from IT leadership strategy.
* Hands-on Contributions: When necessary, assist in service delivery and participate in on-call rotation to ensure critical services are available.
* Agile Management: Hold daily standup cadence meetings with the team and plan/prioritize work across the team to maintain high velocity.
* Strategic Alignment: Partner with senior leadership to ensure technical work is aligned with business priorities and participate in quarterly project planning.
* People Management: Lead the performance and development of systems and SRE staff, defining skill competency models and gap closure plans.

**Customer Engagement & Crisis Command**

* Front-Line Communication: Act as the primary point of contact for internal customers, clients, and users to understand needs and provide transparent technical updates.
* Incident Command: Take the lead on organizing the team to resolve critical issues, including formal escalation management and standing up "war rooms" for P1/P2 items.
* Root Cause Advocacy: Participate in root cause analysis after incidents to ensure pe
How to Apply on Workday
- Workday has a multi-step form — save your progress after every section.
- "Apply With LinkedIn" can fail or lose data; manual entry is more reliable.
- Watch for the "Submit for Review" final step — hitting "Save" alone does not submit.
- Job requisition numbers are useful when following up with HR by email.