Docker

Technology

StaffSoftwareEngineer,Infrastructure

CA$238–382k Canada FULL TIME Remote Friendly
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is
optimal for Staff candidates.

The Brief

“Staff Software Engineer, Infrastructure at Docker. Skills: Infrastructure, Platform engineering, Cloud services, DevOps. Turn ambiguous infrastructure problems into proposals. Drive proposals through RFCs and architecture reviews”

What You'll Achieve.

Spinning up new region in hours; Platform teams trust platform; Teams stop thinking about platform; Consume teams feel faster provisioning; Consume teams feel faster shipping; Consume teams do more without us; Services run reliably

Industry & Context.

Technology
Problems you'll solve

Root cause analysis; Troubleshooting

Eligibility Requirements

On-call rotation

What They're Looking For.

Must Have

8+ years software engineering experience, Bachelor's degree or equivalent experience, Software engineering in Go, Track record designing/shipping/operating cloud services, Deep expertise in Kubernetes, networking, cloud platforms, reliability engineering, or developer platforms, Solid Linux, networking, and production-ops fundamentals, Experience setting technical direction, Experience leading cross-team alignment, Clear written and verbal communication

Nice to Have

EKS and ingress/CNI/service-mesh observability with OpenTelemetry/Prometheus/Grafana, CI/CD and progressive delivery (GitHub Actions, Argo CD, canaries), Experience leading migrations or adoption programs

What You'll Do.

Turn ambiguous infrastructure problems into proposals

Drive proposals through RFCs and architecture reviews

Design self-service capabilities

Design platform APIs for onboarding

Design platform APIs for provisioning

Design platform APIs for deployment

Design platform APIs for observability defaults

Design platform APIs for day-2 operations

Set delivery standards using Terraform

Set delivery standards using GitOps

Set delivery standards using Argo CD

Set delivery standards using progressive rollout

Set delivery standards using good testing

Build continuous-deployment flow

Evolve multi-tenant EKS foundations

Improve reliability of EKS foundations

Improve security of EKS foundations

Improve scale of EKS foundations

Improve cost of EKS foundations

Manage Envoy Gateway ingress

Manage traffic routing

Manage multi-region connectivity

Manage cross-account connectivity

Improve incident follow-up

Shape AI agentic workflows

Develop alert enrichment

Develop incident context-gathering

Develop runbook-assisted diagnosis

Develop remediation recommendations

Develop onboarding assistants

Develop readiness assistants

Improve health of on-call

Conduct blameless postmortems

How You'll Work.

Team & Collaboration

Cross-team alignment; Partner teams

Communication Scope

Written communication; Verbal communication

Process & Methodology

RFCs, Architecture reviews

Full Job Description

Docker has been one of the most loved brands in developer tooling, trusted by more than 20 million monthly users and over 20 billion container image pulls. From solo founders to the world's largest companies, developers rely on Docker to build, share, and run their applications across our suite of products including Docker Desktop, Docker Hub, and Docker Scout. We are a globally distributed, remote-first team building the tools that define how software gets built and delivered. As AI agents redefine software development, Docker is at the center of that shift, providing the sandboxed environments, verified images, and secure infrastructure that make autonomous workflows trustworthy by default. Docker is shipping a wave of new products this year, with R&D initiatives likely to lead to more, and we're investing heavily in the platform underneath all of it. That platform supports hundreds of engineers across many development teams and carries high-scale production traffic and data transfer every day. It has grown faster than its foundations, and this year is about closing that gap. Today, much of that work still leans on a handful of experts unblocking the same provisioning and operational workflows by hand. The top priority for this role is moving that work from expert-driven support to paved roads: self-service systems with clear ownership, safe defaults, strong guardrails, and adoption we can measure. The goal is a platform teams trust enough to stop thinking about it, one that just works, so they can focus on their own products instead of ours. The concrete version sits on this year's roadmap: spinning up a new global region or application environment should take hours, not days. Right now it takes days. Getting there means building the foundations underneath it. We need a real multi-region, cross-account network architecture and a testing and continuous-deployment flow teams can trust, then a self-service layer on top. We're the container company building our own i

Free ATS check

Applying for this Staff Software Engineer, Infrastructure role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

How to Apply on Ashby

  • Ashby is a fast modern ATS — most applications take under 3 minutes.
  • The resume parser is strong; verify parsed experience dates and job titles.
  • Custom screening questions are often scored algorithmically — answer completely.
  • Location field affects geo-based screening; use your actual metro area.

ANONYMOUS · UNFILTERED

What do employees actually say about Docker?

Real rants from real employees. Read before you apply.

Read Company Rants →