Jump Trading
Financial Services
LeadSiteReliabilityEngineer
Neural analysis suggests this role is
optimal for Lead candidates.
“Lead Site Reliability Engineer at Jump Trading. Skills: Site Reliability Engineering, Production Infrastructure, Automation. Manage engineers. Mentor engineers”
What You'll Achieve.
Increase resilience; Reduce downtime; Reduce operational toil
Industry & Context.
Root cause analysis
What They're Looking For.
Must Have
Proven leadership experience, Managed people across distributed teams, Solving reliability challenges, Strategic thinking skills, Tackling complex problems, Programming skills in Python, Go, or equivalent
Nice to Have
Experience in large-scale production environments
What You'll Do.
Architect monitoring systems
Implement monitoring systems
Architect alerting systems
Implement alerting systems
Architect automation frameworks
Implement automation frameworks
Oversee incident management
Improve incident management
Oversee change management
Improve change management
Improve post-incident review
Identify sources of toil
Eliminate sources of toil
Partner with engineering teams
Partner with networking teams
Partner with trading teams
Investigate performance issues
Optimize for throughput
Influence strategic direction
Influence tooling roadmap
Influence infrastructure scaling
Influence vendor partnerships
How You'll Work.
Team & Collaboration
Distributed teams; Engineering teams; Networking teams; Trading teams; Multiple regions
Process & Methodology
Roadmap planning
Full Job Description
Jump Trading Group is committed to world class research. We empower exceptional talents in Mathematics, Physics, and Computer Science to seek scientific boundaries, push through them, and apply cutting edge research to global financial markets. Our culture is unique. Constant innovation requires fearlessness, creativity, intellectual honesty, and a relentless competitive streak. We believe in winning together and unlocking unique individual talent by incenting collaboration and mutual respect. At Jump, research outcomes drive more than superior risk adjusted returns. We design, develop, and deploy technologies that change our world, fund start-ups across industries, and partner with leading global research organizations and universities to solve problems. CORE (Central Ops and Reliability Engineering) is the Production Infrastructure team responsible for operating and improving Jump’s production trading environment. The team combines deep operational ownership with software and reliability engineering practices to support production systems, drive incident and change management, improve observability and deployment workflows, and reduce operational toil across a fast-moving global trading platform. What You’ll Do: As Lead Site Reliability Engineer in CORE, you will both manage and mentor engineers across teams and contribute directly to key projects, balancing leadership responsibilities with hands-on work. Design & Build: Architect and implement high-performance monitoring and alerting systems, real-time packet/flow analysis tooling, and automation frameworks for managing Jump’s global production footprint. Lead Operational Maturity: Oversee and improve incident management, change management, and post-incident review processes to increase resilience and reduce downtime. Drive Efficiency: Identify and eliminate sources of operational toil through automation and tooling. Collaborate Globally: Partner with engineering, networking, and trading teams in multiple regions
Applying for this Lead Site Reliability Engineer role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
ANONYMOUS · UNFILTERED
What do employees actually say about Jump Trading?
Real rants from real employees. Read before you apply.