FuriosaAI

SolutionsArchitect

Santa Clara, California, United States FULL TIME

Market Sentiment

HIGH DEMAND

Neural analysis suggests this role is
optimal for Mid candidates.

The Brief

“Solutions Architect at FuriosaAI. Skills: AI/LLM model deployments, RNGD chips/servers, inference frameworks, serving stack evolution, modern inference stacks, agent and orchestration frameworks, DNN frameworks. Own end-to-end technical enablement for US customers deploying AI models on FuriosaAI's RNGD NPU using the Furiosa SDK. Develop POCs, benchmarking studies, and live debugging sessions directly in customer environments”

What You'll Achieve.

bring the full potential of our powerful RNGD chips/servers to our customers; empowering customers with FuriosaAI’s powerful solutions; translate deep technical capability into business value

Industry & Context.

Eligibility Requirements

able to travel to customer sites and to Seoul HQ periodically

What They're Looking For.

Must Have

2–5 years in a US customer-facing technical role: Solutions Architect, Sales Engineer, Forward Deployed Engineer, or equivalent at an AI infra, cloud, or semiconductor company, Actively current on the AI/LLM landscape — tracking model releases, inference frameworks, and serving stack evolution in real time, Hands-on experience with modern inference stacks: vLLM, SGLang, TensorRT-LLM, Triton Inference Server, or similar, Hands-on experience with agent and orchestration frameworks: LangChain, LlamaIndex, LangGraph, AutoGen, or MCP-based tooling, Proficiency in comfortable with DNN frameworks (PyTorch, TensorFlow), written and verbal communication — able to engage credibly with ML engineers at frontier labs and VP/C-suite executives, Authorized to work in the able to travel to customer sites and to Seoul HQ periodically

Nice to Have

Prior experience at a US AI chip company, cloud silicon team, or AI infrastructure startup, Familiarity with NPU/GPU accelerator ecosystems, PCIe integration, and data center hardware deployment, Experience with inference optimization: quantization, kernel tuning, batching strategies, memory bandwidth optimization, Proficiency in C, C++, or Rust, Experience working with distributed or cross-timezone engineering teams

What You'll Do.

Own end-to-end technical enablement for US customers deploying AI models on FuriosaAI's RNGD NPU using the Furiosa SDK

and live debugging sessions directly in customer environments

Act as the technical authority to the US BD/Sales team during pre-sales and enterprise

current expertise in FuriosaAI's hardware and software stack and demonstrate it at US technical forums

and customer workshops

Onboard and train customers on integration patterns

optimization workflows

and best practices post-purchase

Serve as a technical feedback loop from US customers back to Seoul HQ product and engineering teams

How You'll Work.

Team & Collaboration

Act as the technical authority to the US BD/Sales team; Serve as a technical feedback loop from US customers back to Seoul HQ product and engineering teams; Experience working with distributed or cross-timezone engineering teams

Communication Scope

written and verbal communication

Full Job Description

ABOUT THE JOB FuriosaAI is looking for a Solutions Architect to bring the full potential of our powerful RNGD chips/servers to our customers by acting as the primary technical authority in AI/LLM model deployments. From running POCs to benchmarking and debugging, you will translate RNGD’s powerful system to real-world deployments of customers’ models, empowering customers with FuriosaAI’s powerful solutions. If you are interested in providing the technical expertise in challenging the current status-quo of AI infrastructure in real-world environments, join us in our path to a sustainable future of AI. WHAT YOU’LL DO - Own end-to-end technical enablement for US customers deploying AI models on FuriosaAI's RNGD NPU using the Furiosa SDK - Develop POCs, benchmarking studies, and live debugging sessions directly in customer environments - Act as the technical authority to the US BD/Sales team during pre-sales and enterprise evaluations; translate deep technical capability into business value for engineering and C-suite audiences - Develop deep, current expertise in FuriosaAI's hardware and software stack and demonstrate it at US technical forums, AI conferences, and customer workshops - Onboard and train customers on integration patterns, optimization workflows, and best practices post-purchase - Serve as a technical feedback loop from US customers back to Seoul HQ product and engineering teams QUALIFICATIONS - 2–5 years in a US customer-facing technical role: Solutions Architect, Sales Engineer, Forward Deployed Engineer, or equivalent at an AI infra, cloud, or semiconductor company - Actively current on the AI/LLM landscape — tracking model releases, inference frameworks, and serving stack evolution in real time - Hands-on experience with modern inference stacks: vLLM, SGLang, TensorRT-LLM, Triton Inference Server, or similar - Hands-on experience with agent and orchestration frameworks: LangChain, LlamaIndex, LangGraph, AutoGen, or MCP-based tooling - Proficiency in

Free ATS check

Applying for this Solutions Architect role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

Should you apply? AI reads your resume vs this job — match score, gaps to address, ATS keywords.

SKILL SIGNAL 38 detected · ranked by frequency

RNGD chips/servers ×3

running POCs ×3

benchmarking ×3

debugging ×3

integration patterns ×3

optimization workflows ×3

inference optimization ×3

quantization ×3

kernel tuning ×3

batching strategies ×3

memory bandwidth optimization ×3

AI/LLM model deployments ×2

inference frameworks ×2

serving stack evolution ×2

modern inference stacks ×2

agent and orchestration frameworks ×2

DNN frameworks ×2

vLLM ×2

SGLang ×2

TensorRT-LLM ×2

Triton Inference Server ×2

LangChain ×2

LlamaIndex ×2

LangGraph ×2

AutoGen ×2

PyTorch ×2

TensorFlow ×2

AI/LLM

AI infra

cloud

semiconductor

NPU

BEHAVIOURAL

written and verbal communication

Role Details

Experience 2–5 yrs

Level Mid

Type FULL TIME

Category tech-sales(sa)

AI-Extracted Insights

Domain Areas

ai-llm-landscapeai-infrastructureai-chip-companycloud-silicon-teamai-infrastructure-startupnpu-gpu-accelerator-ecosystems

How to Apply on Ashby

Ashby is a fast modern ATS — most applications take under 3 minutes.
The resume parser is strong; verify parsed experience dates and job titles.
Custom screening questions are often scored algorithmically — answer completely.
Location field affects geo-based screening; use your actual metro area.

ANONYMOUS · UNFILTERED

What do employees actually say about FuriosaAI?

Real rants from real employees. Read before you apply.

Read Company Rants →