Cambria

Manufacturing

SeniorManager,SystemsandSiteReliabilityEngineering

$118–156k Le Sueur, Minnesota, United States FULL TIME Remote Friendly
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is
optimal for Manager candidates.

The Brief

“Senior Manager, Systems and Site Reliability Engineering at Cambria. Skills: Systems Engineering, Site Reliability Engineering, Infrastructure Management, Containerization. Manage day-to-day deliverables. Assist in service delivery”

What You'll Achieve.

Ensure services are available; Ensure services perform optimally; Meet or exceed SLO targets

Industry & Context.

Manufacturing
Problems you'll solve

Problem-solving; Analytical skills; Troubleshooting

Eligibility Requirements

On-call rotation, Travel between local locations

What They're Looking For.

Must Have

10+ years Systems Administrator/Engineering, Bachelor's degree in Computer Science or equivalent experience, Experience with container orchestration, Experience with virtualization platforms, Solid foundation in Linux administration, Experience writing infrastructure as code, Experience with Agile delivery methodologies, Experience with ALM toolsets, Experience with collaboration software, Managerial experience

Nice to Have

Kubernetes experience a plus, Nutanix EKS a plus, AWS EKS a plus, Ansible a plus, Cohesity a plus, Pure Storage a plus, Windows Server a plus, Linux a plus, Active Directory a plus, Okta a plus, AWS a plus, Terraform a plus, Git a plus, Red Hat Satellite a plus, Red Hat Identity Management a plus

What You'll Do.

Manage day-to-day deliverables

Assist in service delivery

Participate in on-call rotation

Hold daily standup meetings

Plan and prioritize work

Partner with senior leadership

Ensure technical work alignment

Participate in quarterly planning

Lead staff performance and development

Define skill competency models

Develop gap closure plans

Act as primary contact for customers

Provide technical updates

Lead critical issue resolution

Manage formal escalations

Participate in root cause analysis

Ensure permanent resolution

Communicate findings to stakeholders

Build scalable infrastructure

Operate resilient infrastructure

Maintain highly available infrastructure

Develop automation strategy

Automate SRE requests

Analyze metrics for monitors

Create actionable alerts

Ensure critical service availability

Lead container platform adoption

Implement container platform lifecycle

Manage CI/CD pipelines

Implement compute systems

Lifecycle compute systems

Implement storage systems

Lifecycle storage systems

Plan virtualization systems

Implement virtualization systems

Lifecycle virtualization systems

Plan server operating systems

Implement server operating systems

Lifecycle server operating systems

Ensure SLO targets are met

Participate in on-call rotation

How You'll Work.

Team & Collaboration

Internal business partners; Product teams; Senior leadership; Internal customers; Clients; Users

Communication Scope

Technical updates; Articulate complex ideas

Process & Methodology

Agile, Scrum, Kanban, Roadmap planning

Full Job Description

_**Job Description:**_ The Senior Manager, Systems and Site Reliability Engineering serves as the operational engine for Cambria’s infrastructure, acting as the primary bridge between IT strategy and technical execution. This role is responsible for the performance and development of Systems Engineers and Site Reliability Engineers. While driving the modernization to a container-first model , this leader ensures that services are available and perform optimally to scale manufacturing operations. As the #2 leader in IT Operations, you will translate strategic roadmaps into daily deliverables, manage high-pressure incident response, and serve as the technical face of the team to internal business partners. **Essential Duties & Responsibilities:** **Operational Leadership & Execution** * Day-to-Day Delivery: Manage the day-to-day deliverables of the team, ensuring tasks align with the workstreams translated from IT leadership strategy. * Hands-on Contributions: When necessary, assist in service delivery and participate in on-call rotation to ensure critical services are available * Agile Management: Hold daily standup cadence meetings with the team and plan/prioritize work across the team to maintain high velocity. * Strategic Alignment: Partner with senior leadership to ensure technical work is aligned with business priorities and participate in quarterly project planning. * People Management: Lead the performance and development of systems and SRE staff, defining skill competency models and gap closure plans. **Customer Engagement & Crisis Command** * Front-Line Communication: Act as the primary point of contact for internal customers, clients, and users to understand needs and provide transparent technical updates. * Incident Command: Take the lead on organizing the team to resolve critical issues, including formal escalation management and standing up "war rooms" for P1/P2 items. * Root Cause Advocacy: Participate in root cause analysis after incidents to ensure pe

Free ATS check

Applying for this Senior Manager, Systems and Site Reliability Engineering role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

How to Apply on Workday

  • Workday has a multi-step form — save your progress after every section.
  • "Apply With LinkedIn" can fail or lose data; manual entry is more reliable.
  • Watch for the "Submit for Review" final step — hitting "Save" alone does not submit.
  • Job requisition numbers are useful when following up with HR by email.

ANONYMOUS · UNFILTERED

What do employees actually say about Cambria?

Real rants from real employees. Read before you apply.

Read Company Rants →