Together AI

Technology

SystemsResearchEngineerIntern-GPUProgramming(Fall2026)

$0–0k San Francisco, California, United States INTERNSHIP Remote Friendly

Market Sentiment

HIGH DEMAND

Neural analysis suggests this role is
optimal for Mid+ candidates.

The Brief

“Systems Research Engineer Intern - GPU Programming (Fall 2026) at Together AI. Skills: GPU Programming, Parallel Computing, ML/AI. Develop GPU-accelerated kernels. Optimize GPU-accelerated algorithms”

What You'll Achieve.

Enhance performance; Enhance efficiency

Industry & Context.

Technology

Problems you'll solve

Problem-solving skills; Analytical skills

What They're Looking For.

Must Have

GPU programming experience, Parallel computing experience, CUDA knowledge, Triton knowledge, ML/AI applications knowledge, AI models knowledge, Performance profiling tools knowledge, Optimization tools knowledge

Nice to Have

Contribute to open source projects

What You'll Do.

Develop GPU-accelerated kernels

Optimize GPU-accelerated algorithms

Co-design GPU kernels

Co-design model architecture

Co-design GPU architectures

Co-design programming models

Achieve better performance

Integrate GPU-accelerated solutions

How You'll Work.

Team & Collaboration

Cross-functional teams; Modeling team; Algorithm team; Hardware teams; Software teams

Full Job Description

About The Role As a Systems Research Engineer Intern specialized in GPU Programming, you will play a crucial role in developing and optimizing GPU-accelerated kernels and algorithms for ML/AI applications. Working closely with the modeling and algorithm team, you will co-design GPU kernels and model architecture to enhance the performance and efficiency of our AI systems. Collaborating with the hardware and software teams, you will contribute to the co-design of efficient GPU architectures and programming models, leveraging your expertise in GPU programming and parallel computing. Your research skills will be vital in staying up-to-date with the latest advancements in GPU programming techniques, ensuring that our AI infrastructure remains at the forefront of innovation. Responsibilities Optimize and fine-tune GPU code to achieve better performance and scalability Collaborate with cross-functional teams to integrate GPU-accelerated solutions into existing software systems Stay up-to-date with the latest advancements in GPU programming techniques and technologies Requirements Strong background in GPU programming and parallel computing, such as CUDA and/or Triton. Knowledge of ML/AI applications and models Knowledge of performance profiling and optimization tools for GPU programming Excellent problem-solving and analytical skills Internship Program Details Our fall internship program spans over 12 to 16 weeks where you’ll have the opportunity to work with industry-leading engineers building a cloud from the ground up and possibly contribute to influential open source projects. Our internship dates are September 14th to December 18th. About Together AI Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, algorithms, and models. We have contri

Free ATS check

Applying for this Systems Research Engineer Intern - GPU Programming (Fall 2026) role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

How to Apply on Greenhouse

Create a Greenhouse profile before applying — it saves time across multiple applications.
Upload your resume as a PDF; the parser handles it better than Word.
Answer all knockout questions carefully — wrong answers auto-reject before a human sees you.
Enable email notifications to track application status in real time.

ANONYMOUS · UNFILTERED

What do employees actually say about Together AI?

Real rants from real employees. Read before you apply.

Read Company Rants →