Sunday

Robotics

SoftwareEngineer,VoiceInteraction

Redwood City, California, United States FULL TIME

Market Sentiment

HIGH DEMAND

Neural analysis suggests this role is
optimal for Mid candidates.

The Brief

“Software Engineer, Voice Interaction at Sunday. Skills: Voice Pipeline, Speech-to-Text, Text-to-Speech, C++. Develop voice pipeline. Configure microphone array”

What You'll Achieve.

transform raw audio signals into actionable instructions; enable the robot to receive, interpret, and act intelligently on voice commands; deliver natural, responsive spoken interactions in real time; ensure the system operates within intended use; making informed tradeoffs between accuracy, latency, and resource consumption; runs on our robot, Memo, under real-world conditions

Industry & Context.

Robotics

Problems you'll solve

debugging problems that cut across both

What They're Looking For.

Must Have

2+ years experience developing voice-driven systems, speech-to-text, text-to-speech, real-time audio processing, end-to-end pipeline shipped to users or deployed on hardware, classical decision-making approaches, state machines, behavior trees, planning, modern ML-driven reasoning, LLMs, VLMs, compute-constrained platforms, software meets hardware, robotics, edge devices, consumer electronics, debugging problems that cut across both, C++, asynchronous programming, streaming buffering patterns, integration with cloud API services

Nice to Have

founding or early experience, define a release roadmap where no blueprint exists, shipping responsive AI systems, video games, embodied AI, ML-driven control for embodied AI, end-to-end learning, reinforcement learning, VLAs, interfacing with multimodal models, Publications in multimodal models, audio interpretation, robotics

What You'll Do.

Develop voice pipeline

Configure microphone array

Integrate voice subsystem

Evaluate STT/TTS engines

Build reliable software

Deliver voice interaction experience

How You'll Work.

Team & Collaboration

contribute to the behavior stack; work with product

Full Job Description

Join Us in Building the Future of Home Robotics At Sunday, we're developing personal robots to reclaim the hours lost to repetitive tasks. We're focused on an ambitious goal to make generalized robots broadly accessible, enabling households to take back quality time. We have spent the last 18 months building a talented team, securing capital, and validating our technology. We are now seeking passionate individuals to join us in the next phase of our growth. If you are ready to apply your skills to the forefront of robotics innovation, we’d love to hear from you. What To Expect As a Software Engineer, Voice Interaction, you will own the full voice pipeline that connects our users with Memo's core robotic and AI systems. Integrating machine learning models across local and cloud compute, you will transform raw audio signals into actionable instructions in a domestic environment. As part of the broader team, you will also contribute to the behavior stack that drives Memo's high-level decision making and task execution. What You’ll Do - Develop and maintain the full voice pipeline from microphone array input through wake word detection, speech-to-text, natural language understanding, and text-to-speech output - Configure and integrate microphone array for domestic use, tuning onboard audio processing (beamforming, noise suppression, echo cancellation) and supplementing with additional processing where needed - Integrate the voice subsystem with high level robot behaviors, enabling the robot to receive, interpret, and act intelligently on voice commands - Design and optimize TTS output to deliver natural, responsive spoken interactions in real time on embedded hardware - Define and enforce guardrails around voice input and output, including content filtering, prompt boundary enforcement, output length limits, and auditing to ensure the system operates within intended use - Evaluate and integrate STT/TTS engines and models, making informed tradeoffs between accuracy, late

Free ATS check

Applying for this Software Engineer, Voice Interaction role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

Should you apply? AI reads your resume vs this job — match score, gaps to address, ATS keywords.

SKILL SIGNAL 20 detected · ranked by frequency

Speech-to-Text ×5

Text-to-Speech ×5

Voice Pipeline ×3

Natural Language Understanding ×3

Beamforming ×3

Noise Suppression ×3

Echo Cancellation ×3

Asynchronous Programming ×3

Streaming Buffering ×3

LLMs

VLMs

audio processing

robot behavior integration

TTS output optimization

voice input/output guardrails

STT/TTS engine evaluation

software development

real-world conditions

voice interaction experience

Role Details

Experience 2–5 yrs

Level Mid

Type FULL TIME

Category software

AI-Extracted Insights

Domain Areas

home-roboticsai-systemsdomestic-environmentembedded-hardwarecompute-constrained-platformsroboticsedge-devicesconsumer-electronics

How to Apply on Ashby

Ashby is a fast modern ATS — most applications take under 3 minutes.
The resume parser is strong; verify parsed experience dates and job titles.
Custom screening questions are often scored algorithmically — answer completely.
Location field affects geo-based screening; use your actual metro area.

ANONYMOUS · UNFILTERED

What do employees actually say about Sunday?

Real rants from real employees. Read before you apply.

Read Company Rants →