Abridge

Healthcare

EngineeringManager,ModelInference

$220–270k San Francisco, California, United States FULL TIME
Market Sentiment
HIGH DEMAND

Neural analysis suggests this role is
optimal for Manager candidates.

The Brief

“Engineering Manager, Model Inference at Abridge. Skills: Model Inference, Inference Systems, LLM Serving, Infrastructure Scaling, Team Leadership. Lead and grow a high-performing team of AI inference engineers. Build and scale infrastructure for Abridge’s products and APIs”

What You'll Achieve.

Ensure the systems underpinning every clinician interaction are operating at peak efficiency and reliability

Industry & Context.

Healthcare
Problems you'll solve

Benchmark and eliminate bottlenecks; Performance analysis

What They're Looking For.

Must Have

5+ years of engineering experience, 1+ years in a technical leadership or management role, Deep, hands-on experience with ML systems and inference frameworks, Understanding of LLM architecture, Experience with inference optimizations, Familiarity with GPU characteristics, roofline models, and performance analysis, Experience deploying reliable, distributed, real-time systems at scale, Experience with parallelism strategies, Skilled at hiring and mentorship, Technical communication and cross-functional collaboration skills, Comfortable giving constructive feedback on technical designs and code reviews, Has thrived in a fast-growing startup and knows how to operate with urgency and focus

Nice to Have

Background in training infrastructure and RL workloads, Skilled in building secure, compliant systems on major cloud platforms (GCP preferred, AWS experience welcome), Experience with Kubernetes and container orchestration at scale, Published work or contributions to inference optimization research

What You'll Do.

Lead and grow a high-performing team of AI inference engineers

Build and scale infrastructure for Abridge’s products and APIs

Own the technical direction of inference systems

Make key decisions around batching

Architect and scale inference infrastructure for reliability

Lead incident response

Benchmark and eliminate bottlenecks throughout the inference stack

Partner with ML Research teams on model optimization

Develop APIs for AI inference

and develop engineers

Establish team processes

engineering standards

and operational excellence

Plan and execute projects that directly impact clinicians and patients

How You'll Work.

Team & Collaboration

Partner closely with ML Research and the broader AI Platform; Work closely with the GenAI Platform, Data, and Product teams; Cross-functional collaboration skills

Communication Scope

Technical communication

Process & Methodology

Plan and execute projects

Full Job Description

ABOUT ABRIDGE Abridge was founded in 2018 with the mission of powering deeper understanding in healthcare. Our AI-powered platform was purpose-built for medical conversations, improving clinical documentation efficiencies while enabling clinicians to focus on what matters most—their patients. Our enterprise-grade technology transforms patient-clinician conversations into structured clinical notes in real-time, with deep EMR integrations. Powered by Linked Evidence and our purpose-built, auditable AI, we are the only company that maps AI-generated summaries to ground truth, helping providers quickly trust and verify the output. As pioneers in generative AI for healthcare, we are setting the industry standards for the responsible deployment of AI across health systems. We are a growing team of practicing MDs, AI scientists, PhDs, creatives, technologists, and engineers working together to empower people and make care make more sense. We have offices located in the Mission District in San Francisco, the SoHo neighborhood of New York, and East Liberty in Pittsburgh. THE ROLE Our generative AI-powered products are transforming the practice of medicine—and the inference systems that power them need to be fast, reliable, and world-class. We’re looking for an Engineering Manager to lead and grow our Model Inference team. The Inference team owns the end-to-end technical direction of how our models are served: from architecting low-latency, high-throughput infrastructure to pushing the frontier of LLM serving techniques. You’ll lead a high-performing team of AI inference engineers, partner closely with ML Research and the broader AI Platform, and ensure the systems underpinning every clinician interaction are operating at peak efficiency and reliability. WHAT YOU’LL DO - Lead and grow a high-performing team of AI inference engineers focused on building and scaling infrastructure for Abridge’s products and APIs - Own the technical direction of our inference systems—making key

Free ATS check

Applying for this Engineering Manager, Model Inference role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

How to Apply on Ashby

  • Ashby is a fast modern ATS — most applications take under 3 minutes.
  • The resume parser is strong; verify parsed experience dates and job titles.
  • Custom screening questions are often scored algorithmically — answer completely.
  • Location field affects geo-based screening; use your actual metro area.

ANONYMOUS · UNFILTERED

What do employees actually say about Abridge?

Real rants from real employees. Read before you apply.

Read Company Rants →