Company
Technology
DatabaseEngineering-LeadEngineer
Neural analysis suggests this role is
optimal for Lead candidates.
“Database Engineering - Lead Engineer. Skills: Database Engineering, AI/ML Infrastructure, DevOps, Cloud Engineering. Architect AI/ML infrastructure. Build AI/ML infrastructure”
Industry & Context.
Troubleshooting; Performance tuning
On-call support, 24x7 environment, Global coverage
What They're Looking For.
Must Have
8+ years platform engineering, 3+ years AI/ML infrastructure, Experience with model serving frameworks, Deep knowledge vector databases, Kubernetes (EKS) experience, Experience cloud-native databases, Understanding LLM application patterns, Experience building infrastructure as code, Expertise monitoring observability, Solid understanding database security, Ability operate on-call production environment
Nice to Have
SageMaker experience, Bedrock experience, VLLM experience, TGI experience, OpenSearch k-NN indexing, Embedding optimization, GPU workloads experience, Autoscaling experience, Distributed system operations experience, PostgreSQL experience, Redis experience, Document stores experience, LangChain experience, LlamaIndex experience, CloudFormation experience, Incident response experience
What You'll Do.
Architect AI/ML infrastructure
Build AI/ML infrastructure
Operate AI/ML infrastructure
Own database systems lifecycle
Manage database performance
Manage database scaling
Manage database disaster recovery
Design infrastructure as code
Implement infrastructure as code
Develop CI/CD pipelines
Maintain CI/CD pipelines
Implement monitoring systems
Implement logging systems
Implement alerting systems
Optimize vector search
Optimize embedding systems
Support Kubernetes ML workloads
Ensure database security
Configure IAM policies
Configure access controls
Participate on-call support
How You'll Work.
Team & Collaboration
Production systems support
Full Job Description
## Accountabilities Architect, build, and operate production-grade AI/ML and database infrastructure supporting large-scale AI applications. Own the full lifecycle of database systems including OpenSearch, DocumentDB, Aurora PostgreSQL, and Redis across performance, scaling, and disaster recovery. Design and implement infrastructure as code using Terraform, Crossplane, and CloudFormation for cloud-native environments. Develop and maintain CI/CD pipelines for ML systems, including automated testing and model validation workflows. Implement monitoring, logging, and alerting systems using CloudWatch, Grafana, and related observability tools. Optimize vector search and embedding systems for retrieval-augmented generation (RAG) use cases. Support Kubernetes-based ML workloads including GPU scaling, service mesh, and performance tuning. Ensure database security through encryption, IAM policies, TLS configurations, and fine-grained access controls. Participate in rotating on-call support for production systems operating in a 24x7 environment. Requirements: 8+ years of experience in platform engineering, infrastructure, or database engineering roles. At least 3+ years of hands-on experience in AI/ML infrastructure or production ML systems. Strong experience with model serving frameworks such as SageMaker, Bedrock, vLLM, or TGI. Deep knowledge of vector databases and search systems, including OpenSearch k-NN indexing and embedding optimization. Strong Kubernetes (EKS) experience, including GPU workloads, autoscaling, and distributed system operations. Experience designing and operating cloud-native databases such as PostgreSQL, Redis, and document stores at scale. Strong understanding of LLM application patterns including retrieval systems, memory management, and agent frameworks (LangChain, LlamaIndex). Experience building infrastructure as code using Terraform, CloudFormation, or similar tools. Strong expertise in monitoring, observability, and incident response in product
Applying for this Database Engineering - Lead Engineer role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Lever
- Lever uses a streamlined one-page form — apply in under 5 minutes.
- LinkedIn import works well; review parsed data before submitting.
- The cover letter field is optional but visible to reviewers — use it to differentiate.
- Referral codes from employees can significantly boost visibility of your application.
ANONYMOUS · UNFILTERED
What do employees actually say about this company?
Real rants from real employees. Read before you apply.