Cohere
AI
SoftwareEngineer,DataInfrastructure
Neural analysis suggests this role is
optimal for Mid candidates.
“Software Engineer, Data Infrastructure at Cohere. Skills: Data Infrastructure, Storage Infrastructure, Distributed Data Processing. Build and maintain the high-performance data layer our Modeling teams rely on for training and evaluation jobs. Work directly on petabyte-scale storage infrastructure, and the networking and performance challenges that come with it”
Industry & Context.
Networking and performance challenges
What They're Looking For.
Must Have
4+ years of experience working on data storage infrastructure, Command of Python, Kubernetes experience, especially on the storage side (Persistent Volumes, CSI drivers, etc.), The ability to transform unstructured data into performant datasets across diverse storage backends including S3, GCS, and POSIX, Experience with distributed data processing frameworks such as Apache Beam, Spark, or Flink
Nice to Have
Familiarity with modern analytics tooling such as BigQuery, Airflow, or dbt, Genuine excitement about AI. You follow the research, have opinions, and enjoy being in the weeds, Comfort operating at the edge of what's known, with a desire to build something genuinely new rather than optimize what already exists
What You'll Do.
Build and maintain the high-performance data layer our Modeling teams rely on for training and evaluation jobs
Work directly on petabyte-scale storage infrastructure
and the networking and performance challenges that come with it
How You'll Work.
Team & Collaboration
Collaborate daily with researchers and engineers who are some of the best in the world at what they do
Full Job Description
Who are we? Our mission is to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI. We obsess over what we build. Each one of us is responsible for contributing to increasing the capabilities of our models and the value they drive for our customers. We like to work hard and move fast to do what’s best for our customers. Cohere is a team of researchers, engineers, designers, and more, who are passionate about their craft. Each person is one of the best in the world at what they do. We believe that a diverse range of perspectives is a requirement for building great products. Join us on our mission and shape the future! Why this role? We're building the data infrastructure behind some of the most demanding AI training workloads in the world, and we want sharp, curious people to help us do it. In this role, you'll build and maintain the high-performance data layer our Modeling teams rely on for training and evaluation jobs. As a Software Engineer, Data Infrastructure, you will: - Work directly on petabyte-scale storage infrastructure, and the networking and performance challenges that come with it. - Collaborate daily with researchers and engineers who are some of the best in the world at what they do. You may be a good fit if you have: - 4+ years of experience working on data storage infrastructure - Strong command of Python - Kubernetes experience, especially on the storage side (Persistent Volumes, CSI drivers, etc.) - The ability to transform unstructured data into performant datasets across diverse storage backends including S3, GCS, and POSIX - Experience with distributed data processing frameworks such as Apache Beam, Spark, or Flink - [Nice-to-have] Familiarity with modern analytics tooling such as BigQuery, Airflow
Applying for this Software Engineer, Data Infrastructure role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Ashby
- Ashby is a fast modern ATS — most applications take under 3 minutes.
- The resume parser is strong; verify parsed experience dates and job titles.
- Custom screening questions are often scored algorithmically — answer completely.
- Location field affects geo-based screening; use your actual metro area.
ANONYMOUS · UNFILTERED
What do employees actually say about Cohere?
Real rants from real employees. Read before you apply.