Docker
Technology
StaffSoftwareEngineer,Infrastructure
Neural analysis suggests this role is
optimal for Staff candidates.
“Staff Software Engineer, Infrastructure at Docker. Skills: Infrastructure, Platform engineering, Cloud services, DevOps. Turn ambiguous infrastructure problems into proposals. Drive proposals through RFCs and architecture reviews”
What You'll Achieve.
Spinning up new region in hours; Platform teams trust platform; Teams stop thinking about platform; Consume teams feel faster provisioning; Consume teams feel faster shipping; Consume teams do more without us; Services run reliably
Industry & Context.
Root cause analysis; Troubleshooting
On-call rotation
What They're Looking For.
Must Have
8+ years software engineering experience, Bachelor's degree or equivalent experience, Software engineering in Go, Track record designing/shipping/operating cloud services, Deep expertise in Kubernetes, networking, cloud platforms, reliability engineering, or developer platforms, Solid Linux, networking, and production-ops fundamentals, Experience setting technical direction, Experience leading cross-team alignment, Clear written and verbal communication
Nice to Have
EKS and ingress/CNI/service-mesh observability with OpenTelemetry/Prometheus/Grafana, CI/CD and progressive delivery (GitHub Actions, Argo CD, canaries), Experience leading migrations or adoption programs
What You'll Do.
Turn ambiguous infrastructure problems into proposals
Drive proposals through RFCs and architecture reviews
Design self-service capabilities
Design platform APIs for onboarding
Design platform APIs for provisioning
Design platform APIs for deployment
Design platform APIs for observability defaults
Design platform APIs for day-2 operations
Set delivery standards using Terraform
Set delivery standards using GitOps
Set delivery standards using Argo CD
Set delivery standards using progressive rollout
Set delivery standards using good testing
Build continuous-deployment flow
Evolve multi-tenant EKS foundations
Improve reliability of EKS foundations
Improve security of EKS foundations
Improve scale of EKS foundations
Improve cost of EKS foundations
Manage Envoy Gateway ingress
Manage traffic routing
Manage multi-region connectivity
Manage cross-account connectivity
Improve incident follow-up
Shape AI agentic workflows
Develop alert enrichment
Develop incident context-gathering
Develop runbook-assisted diagnosis
Develop remediation recommendations
Develop onboarding assistants
Develop readiness assistants
Improve health of on-call
Conduct blameless postmortems
How You'll Work.
Team & Collaboration
Cross-team alignment; Partner teams
Communication Scope
Written communication; Verbal communication
Process & Methodology
RFCs, Architecture reviews
Full Job Description
Docker has been one of the most loved brands in developer tooling, trusted by more than 20 million monthly users and over 20 billion container image pulls. From solo founders to the world's largest companies, developers rely on Docker to build, share, and run their applications across our suite of products including Docker Desktop, Docker Hub, and Docker Scout. We are a globally distributed, remote-first team building the tools that define how software gets built and delivered. As AI agents redefine software development, Docker is at the center of that shift, providing the sandboxed environments, verified images, and secure infrastructure that make autonomous workflows trustworthy by default. Docker is shipping a wave of new products this year, with R&D initiatives likely to lead to more, and we're investing heavily in the platform underneath all of it. That platform supports hundreds of engineers across many development teams and carries high-scale production traffic and data transfer every day. It has grown faster than its foundations, and this year is about closing that gap. Today, much of that work still leans on a handful of experts unblocking the same provisioning and operational workflows by hand. The top priority for this role is moving that work from expert-driven support to paved roads: self-service systems with clear ownership, safe defaults, strong guardrails, and adoption we can measure. The goal is a platform teams trust enough to stop thinking about it, one that just works, so they can focus on their own products instead of ours. The concrete version sits on this year's roadmap: spinning up a new global region or application environment should take hours, not days. Right now it takes days. Getting there means building the foundations underneath it. We need a real multi-region, cross-account network architecture and a testing and continuous-deployment flow teams can trust, then a self-service layer on top. We're the container company building our own i
Applying for this Staff Software Engineer, Infrastructure role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Ashby
- Ashby is a fast modern ATS — most applications take under 3 minutes.
- The resume parser is strong; verify parsed experience dates and job titles.
- Custom screening questions are often scored algorithmically — answer completely.
- Location field affects geo-based screening; use your actual metro area.
ANONYMOUS · UNFILTERED
What do employees actually say about Docker?
Real rants from real employees. Read before you apply.