Capital One
banking
DistinguishedEngineer
Neural analysis suggests this role is
optimal for Distinguished candidates.
“Distinguished Engineer at Capital One. Skills: Foundation Model (FM) Hosting, LLM inference, Distributed systems, Kubernetes, AI infrastructure, Scalability, Performance optimization, GPU utilization, Model serving. Define the future of banking in the cloud. Devise practical and reusable solutions to complex problems”
What You'll Achieve.
Optimize business outcomes; Drive towards technology solutions; Ensure high throughput, ultra-low latency, and optimal GPU utilization across massive, multi-tenant workloads; Seamlessly transition foundational models from the lab to highly optimized production environments
Industry & Context.
Devise practical and reusable solutions to complex problems; Decompose complex problems into practical and operational solutions
What They're Looking For.
Must Have
Bachelor's Degree, At least 7 years of experience in Software engineering
Nice to Have
Bachelor's or Master's Degree in Computer Science or a related field, 10+ years of experience coding in commonly used languages like Java, Python, Go, JavaScript or TypeScript and Swift., 9+ years of experience in the full lifecycle of system development, from conception through architecture, implementation, testing, deployment and production support, 3+ years of experience with public or private cloud technologies, 8+ years of experience with Networking (BGP, Wi-Fi, SD-WAN, Cloud Networking and Data Center Networking), Contributions, active maintainer status, or core authorship in open-source AI infrastructure or serving projects (_vLLM, TensorRT-LLM, Hugging Face TGI, Ray, or Triton Inference Server_)., Experience in distributed inference communication primitives., Experience optimizing NCCL, heavily utilizing NVLink/NVSwitch, and tuning network fabrics such as InfiniBand/RDMA for complex Tensor Parallelism (TP) and Pipeline Parallelism (PP) architectures., Published research or papers at top-tier Machine Learning and Systems conferences such as _MLSys, OSDI, SOSP, NeurIPS, or ICML_, or hold patents related to distributed systems, model compression, or AI inference scaling., Experience designing routing and scheduling mechanisms for split-architecture serving or multi-LoRA serving architectures to support thousands of dynamic, personalized model adapters simultaneously.
What You'll Do.
Define the future of banking in the cloud
Devise practical and reusable solutions to complex problems
Drive innovation at multiple levels
Optimize business outcomes
Drive towards technology solutions
Articulate and evangelize a bold technical vision for your domain
Decompose complex problems into practical and operational solutions
Ensure the quality of technical design and implementation
Serve as an authoritative expert on non-functional system characteristics
scalability and operability
Continue learning and injecting advanced technical knowledge into our community
Handle several projects simultaneously
balancing your time to maximize impact
Act as a role model and mentor within the tech community
Design and drive the long-term technical roadmap for our Foundation Model Hosting platform
Lead performance engineering across both the platform and model layers
Pioneer the implementation of advanced techniques such as speculative decoding
kv-cache optimization (PagedAttention)
and custom quantization strategies (FP8
Act as the primary engineering counterpart to our AI Research & Science teams
Co-design model architectures for deployability
Mentor senior engineers
Establish rigorous engineering standards for AI deployment
Foster a culture of uncompromising technical excellence
How You'll Work.
Team & Collaboration
Work alongside our talented team of developers, machine learning experts, product managers and people leaders; Influence, collaborate and provide the most innovative solutions across organizational boundaries; Act as the primary engineering counterpart to our AI Research & Science teams
Communication Scope
Creating clear and concise communications; Code samples; Blog posts
Process & Methodology
Handle several projects simultaneously, balancing your time to maximize impact
Full Job Description
Distinguished Engineer As a Distinguished Engineer at Capital One, you will be a part of a community of technical experts working to define the future of banking in the cloud. You will work alongside our talented team of developers, machine learning experts, product managers and people leaders. Our Distinguished Engineers are leading experts in their domains, helping devise practical and reusable solutions to complex problems. You will drive innovation at multiple levels, helping optimize business outcomes while driving towards strong technology solutions. At Capital One, we believe diversity of thought strengthens our ability to influence, collaborate and provide the most innovative solutions across organizational boundaries. You will promote a culture of engineering excellence, and strike the right balance between lending expertise and providing an inclusive environment where the ideas of others can be heard and championed. You will lead the way in creating next-generation talent for Capital One Tech, mentoring internal talent and actively recruiting to keep building our community. Distinguished Engineers are expected to lead through technical contribution. You will operate as a trusted advisor for our key technologies, platforms and capability domains, creating clear and concise communications, code samples, blog posts and other material to share knowledge both inside and outside the organization. You will specialize in a particular subject area, but your input and impact will be sought and expected throughout the organization. We are looking for a visionary technologist to anchor our Foundation Model (FM) Hosting team. As generative AI becomes the core engine of our business, the frontier of our success lies in how efficiently, reliably, and rapidly we can serve massive large language models at scale. In this Distinguished-level role, you won't just be using existing tools; you will be pushing the absolute limits of LLM inference physics. You will own the techni
Applying for this Distinguished Engineer role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Workday
- Workday has a multi-step form — save your progress after every section.
- "Apply With LinkedIn" can fail or lose data; manual entry is more reliable.
- Watch for the "Submit for Review" final step — hitting "Save" alone does not submit.
- Job requisition numbers are useful when following up with HR by email.
ANONYMOUS · UNFILTERED
What do employees actually say about Capital One?
Real rants from real employees. Read before you apply.