Capital One

banking

Distinguished Engineer

$245–279k · San Jose, California, United States · Full Time · Remote Friendly

The Brief

“Distinguished Engineer at Capital One. Skills: Foundation Model (FM) Hosting, LLM inference, Distributed systems, Kubernetes, AI infrastructure, Scalability, Performance optimization, GPU utilization, Model serving. Define the future of banking in the cloud. Devise practical and reusable solutions to complex problems”

What You'll Achieve.

Optimize business outcomes; Drive towards technology solutions; Ensure high throughput, ultra-low latency, and optimal GPU utilization across massive, multi-tenant workloads; Seamlessly transition foundational models from the lab to highly optimized production environments

Industry & Context.

banking

Problems you'll solve

Devise practical and reusable solutions to complex problems; Decompose complex problems into practical and operational solutions

What They're Looking For.

Must Have

Bachelor's degree; at least 7 years of experience in software engineering

Nice to Have

  • Bachelor's or Master's degree in Computer Science or a related field
  • 10+ years of experience coding in commonly used languages such as Java, Python, Go, JavaScript or TypeScript, and Swift
  • 9+ years of experience in the full lifecycle of system development, from conception through architecture, implementation, testing, deployment and production support
  • 3+ years of experience with public or private cloud technologies
  • 8+ years of experience with networking (BGP, Wi-Fi, SD-WAN, cloud networking and data center networking)
  • Contributions, active maintainer status, or core authorship in open-source AI infrastructure or serving projects (vLLM, TensorRT-LLM, Hugging Face TGI, Ray, or Triton Inference Server)
  • Experience in distributed inference communication primitives
  • Experience optimizing NCCL, heavily utilizing NVLink/NVSwitch, and tuning network fabrics such as InfiniBand/RDMA for complex Tensor Parallelism (TP) and Pipeline Parallelism (PP) architectures
  • Published research or papers at top-tier Machine Learning and Systems conferences such as MLSys, OSDI, SOSP, NeurIPS, or ICML, or patents related to distributed systems, model compression, or AI inference scaling
  • Experience designing routing and scheduling mechanisms for split-architecture serving or multi-LoRA serving architectures to support thousands of dynamic, personalized model adapters simultaneously

What You'll Do.

Define the future of banking in the cloud

Devise practical and reusable solutions to complex problems

Drive innovation at multiple levels

Optimize business outcomes

Drive towards technology solutions

Articulate and evangelize a bold technical vision for your domain

Decompose complex problems into practical and operational solutions

Ensure the quality of technical design and implementation

Serve as an authoritative expert on non-functional system characteristics, including scalability and operability

Continue learning and injecting advanced technical knowledge into our community

Handle several projects simultaneously, balancing your time to maximize impact

Act as a role model and mentor within the tech community

Design and drive the long-term technical roadmap for our Foundation Model Hosting platform

Lead performance engineering across both the platform and model layers

Pioneer the implementation of advanced techniques such as speculative decoding, KV-cache optimization (PagedAttention), and custom quantization strategies (FP8 …)

Act as the primary engineering counterpart to our AI Research & Science teams

Co-design model architectures for deployability

Mentor senior engineers

Establish rigorous engineering standards for AI deployment

Foster a culture of uncompromising technical excellence

How You'll Work.

Team & Collaboration

Work alongside our talented team of developers, machine learning experts, product managers and people leaders; Influence, collaborate and provide the most innovative solutions across organizational boundaries; Act as the primary engineering counterpart to our AI Research & Science teams

Communication Scope

Creating clear and concise communications; Code samples; Blog posts

Process & Methodology

Handle several projects simultaneously, balancing your time to maximize impact

Full Job Description

Distinguished Engineer

As a Distinguished Engineer at Capital One, you will be a part of a community of technical experts working to define the future of banking in the cloud. You will work alongside our talented team of developers, machine learning experts, product managers and people leaders.

Our Distinguished Engineers are leading experts in their domains, helping devise practical and reusable solutions to complex problems. You will drive innovation at multiple levels, helping optimize business outcomes while driving towards strong technology solutions. At Capital One, we believe diversity of thought strengthens our ability to influence, collaborate and provide the most innovative solutions across organizational boundaries. You will promote a culture of engineering excellence, and strike the right balance between lending expertise and providing an inclusive environment where the ideas of others can be heard and championed.

You will lead the way in creating next-generation talent for Capital One Tech, mentoring internal talent and actively recruiting to keep building our community.

Distinguished Engineers are expected to lead through technical contribution. You will operate as a trusted advisor for our key technologies, platforms and capability domains, creating clear and concise communications, code samples, blog posts and other material to share knowledge both inside and outside the organization. You will specialize in a particular subject area, but your input and impact will be sought and expected throughout the organization.

We are looking for a visionary technologist to anchor our Foundation Model (FM) Hosting team. As generative AI becomes the core engine of our business, the frontier of our success lies in how efficiently, reliably, and rapidly we can serve massive large language models at scale. In this Distinguished-level role, you won't just be using existing tools; you will be pushing the absolute limits of LLM inference physics. You will own the techni

How to Apply on Workday

  • Workday has a multi-step form — save your progress after every section.
  • "Apply With LinkedIn" can fail or lose data; manual entry is more reliable.
  • Watch for the "Submit for Review" final step — hitting "Save" alone does not submit.
  • Job requisition numbers are useful when following up with HR by email.
