Aleph Alpha
AI
Senior AI Software Engineer – Model Training (f/m/d)
“Senior AI Software Engineer – Model Training (f/m/d) at Aleph Alpha. Skills: Model Training, Software Engineering, Distributed Systems. Co-own the training pipeline end-to-end. Design infrastructure and components”
What You'll Achieve.
Iterate fast on experiments; Stay successful long term; Iterate faster together; Speed us up long-term; Experiment and improve our models
Industry & Context.
Take ownership of problems
Willingness to relocate to Germany, Regular travel to Heidelberg
What They're Looking For.
Must Have
A track record of taking initiative to deliver high-impact work; Experience contributing in high-performing teams; Degree in computer science, engineering, or a related field; Willingness to relocate to Germany; Ability to write software that other engineers want to read and build on; Desire to take ownership of problems and collaborate with other teams to solve them; Deep interest in how state-of-the-art foundation models work; Communication skills, with the ability to convey technical solutions to diverse audiences
Nice to Have
Experience working with distributed systems; Experience working with Kubernetes; Experience bringing AI research innovations into production; Experience in areas such as large-scale data processing or distributed computation for foundation model training or inference; Experience with performance engineering: profiling, benchmarking, and optimizing code for throughput, latency, or memory
What You'll Do.
Co-own the training pipeline end-to-end
Design infrastructure and components
Build high-quality tooling
Invest in tooling and infrastructure
Shape the direction of the team
How You'll Work.
Team & Collaboration
Collaborate across disciplines; Engineers and researchers work closely; Contribute in high-performing teams; Collaborate with other teams
Communication Scope
Convey technical solutions to diverse audiences
Full Job Description
OUR MISSION
Aleph Alpha is one of the few companies in Europe doing serious foundation model pre- and post-training. We're building models that have general-purpose capabilities, and specifically excel at addressing the needs of our customers. We're looking for exceptional Software Engineers to join our model training team. Most of the team is based in Heidelberg.
TEAM CULTURE
At Aleph Alpha, we foster a culture built on ownership, autonomy, and empowerment. Teams and individual contributors are trusted to take responsibility for their work and drive meaningful impact. We maintain a flat organizational structure with efficient, supportive management that enables quick decision-making, open communication, and a strong sense of shared purpose. We believe a strong engineering culture is the key to model training success. We like Extreme Programming and favor trunk-based development. We often mob-program, which keeps us aligned and means we always learn from each other.
ABOUT THE ROLE
As a Software Engineer in Model Training, you'll work across our full stack. Some weeks you might be optimizing how training loads are scheduled on our cluster and making the pipeline more robust and performant so we can iterate faster. Other weeks, you'll be enabling large-scale code execution for reinforcement learning. And at other times, you might dig deep into our evaluation codebase to lift inference throughput on evals. No two days are the same. Things move fast, and your ability to focus and prioritize is what lets you unblock the team day-to-day while designing the high-quality tooling and infrastructure that speeds us up long-term.
We're still building out our training pipeline and infrastructure. Some pieces exist, some don't, and you'll have real influence on what gets built and how. Your work directly shapes how quickly we can experiment and improve our models.
YOUR RESPONSIBILITIES
- Co-own the training pipeline end-to-end. Design, build, and maintain the infrastructure and
How to Apply on Ashby
- Ashby is a fast, modern ATS; most applications take under 3 minutes.
- The resume parser is strong; verify parsed experience dates and job titles.
- Custom screening questions are often scored algorithmically — answer completely.
- Location field affects geo-based screening; use your actual metro area.