Together AI
Technology
Systems Research Engineer Intern - GPU Programming (Fall 2026)
“Systems Research Engineer Intern - GPU Programming (Fall 2026) at Together AI. Skills: GPU Programming, Parallel Computing, ML/AI. Develop GPU-accelerated kernels. Optimize GPU-accelerated algorithms.”
What You'll Achieve.
Enhance the performance and efficiency of Together AI's AI systems
Industry & Context.
Problem-solving skills; Analytical skills
What They're Looking For.
Must Have
GPU programming experience, Parallel computing experience, CUDA knowledge, Triton knowledge, ML/AI applications knowledge, AI models knowledge, Performance profiling tools knowledge, Optimization tools knowledge
Nice to Have
Contribute to open source projects
What You'll Do.
Develop GPU-accelerated kernels
Optimize GPU-accelerated algorithms
Co-design GPU kernels
Co-design model architecture
Co-design GPU architectures
Co-design programming models
Achieve better performance
Integrate GPU-accelerated solutions
How You'll Work.
Team & Collaboration
Cross-functional teams; Modeling team; Algorithm team; Hardware teams; Software teams
Full Job Description
About The Role
As a Systems Research Engineer Intern specialized in GPU Programming, you will play a crucial role in developing and optimizing GPU-accelerated kernels and algorithms for ML/AI applications. Working closely with the modeling and algorithm team, you will co-design GPU kernels and model architecture to enhance the performance and efficiency of our AI systems. Collaborating with the hardware and software teams, you will contribute to the co-design of efficient GPU architectures and programming models, leveraging your expertise in GPU programming and parallel computing. Your research skills will be vital in staying up-to-date with the latest advancements in GPU programming techniques, ensuring that our AI infrastructure remains at the forefront of innovation.
Responsibilities
- Optimize and fine-tune GPU code to achieve better performance and scalability
- Collaborate with cross-functional teams to integrate GPU-accelerated solutions into existing software systems
- Stay up-to-date with the latest advancements in GPU programming techniques and technologies
Requirements
- Strong background in GPU programming and parallel computing, such as CUDA and/or Triton
- Knowledge of ML/AI applications and models
- Knowledge of performance profiling and optimization tools for GPU programming
- Excellent problem-solving and analytical skills
Internship Program Details
Our fall internship program spans over 12 to 16 weeks where you’ll have the opportunity to work with industry-leading engineers building a cloud from the ground up and possibly contribute to influential open source projects. Our internship dates are September 14th to December 18th.
About Together AI
Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, algorithms, and models. We have contri
How to Apply on Greenhouse
- Create a Greenhouse profile before applying — it saves time across multiple applications.
- Upload your resume as a PDF; the parser handles it better than Word.
- Answer all knockout questions carefully — wrong answers auto-reject before a human sees you.
- Enable email notifications to track application status in real time.