Cohere
AI
Senior Search Applications Performance Engineer
“Senior Search Applications Performance Engineer at Cohere. Skills: Search Applications, Performance Engineering, AI Search. Implement performance monitoring and optimization strategies.”
What You'll Achieve.
Ensure users receive fast, reliable, and intelligent search experiences; optimize and scale Search Applications and infrastructure
Industry & Context.
Continuous optimization; infrastructure enhancements
What They're Looking For.
Must Have
Python, backend search technologies, OpenSearch, ElasticSearch, Weaviate, FastAPI, data or evaluation pipelines, performance benchmarking, profiling applications, CPU, GPU, autoscaled compute nodes, communicate technical performance metrics, 4+ years of experience, production environments
Nice to Have
Kubernetes, Helm, infrastructure deployment, GPU-based model inference optimization, ONNX, Triton, vLLM, search and discovery domain
What You'll Do.
Implement performance monitoring and optimization strategies
Develop benchmarking frameworks
Optimize search models
Scale the search services
Develop new tool surfaces
How You'll Work.
Team & Collaboration
Collaborate with modeling teams; Partner with product teams; communicate technical performance metrics to cross-functional teams
Communication Scope
communicate technical performance metrics effectively
Full Job Description
Who are we?
Our mission is to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI.

We obsess over what we build. Each one of us is responsible for contributing to increasing the capabilities of our models and the value they drive for our customers. We like to work hard and move fast to do what’s best for our customers.

Cohere is a team of researchers, engineers, designers, and more, who are passionate about their craft. Each person is one of the best in the world at what they do. We believe that a diverse range of perspectives is a requirement for building great products.

Join us on our mission and shape the future!

WHY THIS ROLE?
Join the Compass team to optimize and scale our Search Applications and infrastructure, ensuring users receive fast, reliable, and intelligent search experiences. You'll work at the intersection of search technology and performance engineering, revolutionizing how users interact with AI-powered search through continuous optimization, benchmarking, and infrastructure enhancements. Your work will directly enhance document understanding capabilities and support the development of new tool surfaces designed for agentic users.

IN THIS ROLE, YOU WILL…
- Implement performance monitoring and optimization strategies for Compass search services and the integration with North
- Develop and maintain benchmarking frameworks to evaluate search model performance and infrastructure efficiency
- Collaborate with modeling teams to optimize search models for faster response times and reduced resource consumption
- Work on scaling the search services while maintaining high availability and low latency
- Partner with product teams to translate performance requirements into technical implementations
- Develop and
How to Apply on Ashby
- Ashby is a fast modern ATS — most applications take under 3 minutes.
- The resume parser is strong; verify parsed experience dates and job titles.
- Custom screening questions are often scored algorithmically — answer completely.
- Location field affects geo-based screening; use your actual metro area.