Samsung Semiconductor

Semiconductor

StaffCompilerEngineer-PyTorch+KernelDSL

$350–550k ~AI est. San Jose, California, United States

Market Sentiment

HIGH DEMAND

Neural analysis suggests this role is
optimal for Staff candidates.

The Brief

“Staff Compiler Engineer - PyTorch + Kernel DSL at Samsung Semiconductor. Skills: Compiler engineering, PyTorch, Kernel DSL. Adapt torch.compile to backend. Lower Inductor's IR to hardware”

Industry & Context.

Semiconductor

Problems you'll solve

Root cause analysis

What They're Looking For.

Must Have

Bachelor's with 10+ years experience, Master's with 8+ years experience, PhD's with 5+ years experience, 3-5+ years industry experience in Triton, 3-5+ years industry experience in Helion, 3-5+ years industry experience in MLIR, 3-5+ years industry experience in XLA, 3-5+ years industry experience in TVM, 3-5+ years industry experience in Inductor, 3-5+ years industry experience in IREE, 3-5+ years industry experience in CUTLASS, Experience designing kernel DSL from scratch, Experience making language-level changes to DSL, Experience with MLIR writing dialects, Experience with MLIR writing passes, Experience with MLIR backend integration, Experience building PyTorch backends, Experience with kernel autotuning, Experience with performance modeling, Experience with cost-based compilation, Background in HPC, Background in distributed systems, Background in NUMA-aware programming

Nice to Have

Open-source contributions to PyTorch, Open-source contributions to Triton, Open-source contributions to Helion, Open-source contributions to LLVM/MLIR, Open-source contributions to similar projects

What You'll Do.

Adapt torch.compile to backend

Lower Inductor's IR to hardware

Define kernel specialization

Define compiler yield points

Build kernel DSLs for hardware

Extend kernel DSLs for hardware

Lower tile-based programming model

Lower higher-level expression

Lower DSL to memory hierarchy

Lower DSL to collective primitives

Design placement passes

Design scheduling passes

Decide tensor placement

Decide tensor migration

Overlap compute with data movement

Implement parallelism-aware lowering

Make tensor parallelism first-class

Make pipeline parallelism first-class

Make expert parallelism first-class

Make sequence parallelism first-class

Perform memory planning

Contribute upstream to PyTorch

Contribute upstream to Triton

Contribute upstream to Helion

Engage with upstream review

How You'll Work.

Team & Collaboration

Hardware engineers; Software engineers

Full Job Description

Please Note: To provide the best candidate experience amidst our high application volumes, each candidate is limited to 10 applications across all open jobs within a 6-month period. Advancing the World’s Technology Together Our technology solutions power the tools you use every day--including smartphones, electric vehicles, hyperscale data centers, IoT devices, and so much more. Here, you’ll have an opportunity to be part of a global leader whose innovative designs are pushing the boundaries of what’s possible and powering the future. We believe innovation and growth are driven by an inclusive culture and a diverse workforce. We’re dedicated to empowering people to be their true selves. Together, we’re building a better tomorrow for our employees, customers, partners, and communities. The AGI (Artificial General Intelligence) Computing Lab is dedicated to solving the complex system-level challenges posed by the growing demands of future AI/ML workloads. Our team is committed to designing and developing scalable platforms that can effectively handle the computational and memory requirements of these workloads while minimizing energy consumption and maximizing performance. To achieve this goal, we collaborate closely with both hardware and software engineers to identify and address the unique challenges posed by AI/ML workloads and to explore new computing abstractions that can provide a better balance between the hardware and software components of our systems. Additionally, we continuously conduct research and development in emerging technologies and trends across memory, computing, interconnect, and AI/ML, ensuring that our platforms are always equipped to handle the most demanding workloads of the future. By working together as a dedicated and passionate team, we aim to revolutionize the way AI/ML applications are deployed and executed, ultimately contributing to the advancement of AGI in an affordable and sustainable manner. Join us in our passion to shape the fu

Free ATS check

Applying for this Staff Compiler Engineer - PyTorch + Kernel DSL role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

How to Apply on Greenhouse

Create a Greenhouse profile before applying — it saves time across multiple applications.
Upload your resume as a PDF; the parser handles it better than Word.
Answer all knockout questions carefully — wrong answers auto-reject before a human sees you.
Enable email notifications to track application status in real time.

ANONYMOUS · UNFILTERED

What do employees actually say about Samsung Semiconductor?

Real rants from real employees. Read before you apply.

Read Company Rants →