Amazon Development Center U.S., Inc.

Technology

SoftwareDevelopmentEngineerII-AmazonMSK,ManagedStreamingKafka(MSK),MSKHoover

$144–194k Seattle, Washington, United States FULL TIME
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is
optimal for Mid candidates.

The Brief

“Software Development Engineer II - Amazon MSK, Managed Streaming Kafka (MSK), MSK Hoover at Amazon Development Center U.S., Inc.. Skills: Distributed systems, Large-scale automation, Infrastructure engineering. Design automation to patch hosts. Maintain hundreds of thousands of hosts”

What You'll Achieve.

Keep fleet healthy, secure, always on; Ensure customer availability guarantees; Make maintenance invisible to customers; Reduce manual effort

Industry & Context.

Technology
Problems you'll solve

Root cause analysis; Troubleshooting

Eligibility Requirements

On-call rotation

What They're Looking For.

Must Have

3+ years software development experience, 2+ years system design or architecture experience, Experience programming one language

Nice to Have

3+ years full SDLC experience, Bachelor's degree in computer science

What You'll Do.

Design automation to patch hosts

Maintain hundreds of thousands of hosts

Keep fleet maintenance invisible

Build systems to detect unhealthy hosts

Remediate unhealthy hosts

Develop rollout mechanisms

Develop rollback mechanisms

Test changes before customers

Reverse changes if needed

Debug production issues

Reduce manual effort to operate fleet

Write design documents

Collaborate with engineers

Raise engineering bar

Conduct design reviews

Improve automation for new cases

Push routine work into systems

Feed manual steps into automation

Recover unhealthy hosts automatically

How You'll Work.

Team & Collaboration

Collaborate with engineers; Partner with engineering; Partner with product; Partner with operations

Communication Scope

Write design documents

Full Job Description

Come keep one of the world's largest Apache Kafka fleets healthy, secure, and always on. On Amazon MSK, you will build the automation that maintains hundreds of thousands of streaming hosts, so customers never have to think about the infrastructure underneath their applications. This is infrastructure engineering at a scale where every change has to be safe by design. MSK is stateful: every host holds customer data that must stay replicated and in sync, so routine maintenance is never as simple as rebooting or replacing a host. Patching, repairing, and replacing nodes across a fleet this size means coordinating each action against the availability guarantees customers depend on, so that maintenance stays invisible to them. You will design and build the automation that does this, turning hard operational problems into systems that run themselves. If you enjoy distributed systems, large-scale automation, and work whose reliability countless streaming applications quietly depend on, this is a rare place to do it. Key job responsibilities - Design, build, and operate automation that patches and maintains hundreds of thousands of stateful hosts, keeping fleet maintenance invisible to customers. - Build systems that automatically detect unhealthy hosts and remediate them, balancing fast recovery against avoiding needless disruption. - Develop rollout and rollback mechanisms that keep the blast radius of any change small at fleet scale, and that let changes be tested before they reach customers and reversed if something goes wrong. - Own your services end to end: take part in on-call, debug production issues, and continually reduce the manual effort needed to operate the fleet. - Write design documents, collaborate with engineers across MSK, and raise the engineering bar through design and code reviews. A day in the life No two days look the same, but most blend building with operating. You might spend the morning designing a safer way to roll out a fleet-wide change, pair

Free ATS check

Applying for this Software Development Engineer II - Amazon MSK, Managed Streaming Kafka (MSK), MSK Hoover role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

ANONYMOUS · UNFILTERED

What do employees actually say about Amazon Development Center U.S., Inc.?

Real rants from real employees. Read before you apply.

Read Company Rants →