Sunday
Robotics
SoftwareEngineer,VoiceInteraction
Neural analysis suggests this role is
optimal for Mid candidates.
“Software Engineer, Voice Interaction at Sunday. Skills: Voice Pipeline, Speech-to-Text, Text-to-Speech, C++. Develop voice pipeline. Configure microphone array”
What You'll Achieve.
transform raw audio signals into actionable instructions; enable the robot to receive, interpret, and act intelligently on voice commands; deliver natural, responsive spoken interactions in real time; ensure the system operates within intended use; making informed tradeoffs between accuracy, latency, and resource consumption; runs on our robot, Memo, under real-world conditions
Industry & Context.
debugging problems that cut across both
What They're Looking For.
Must Have
2+ years experience developing voice-driven systems, speech-to-text, text-to-speech, real-time audio processing, end-to-end pipeline shipped to users or deployed on hardware, classical decision-making approaches, state machines, behavior trees, planning, modern ML-driven reasoning, LLMs, VLMs, compute-constrained platforms, software meets hardware, robotics, edge devices, consumer electronics, debugging problems that cut across both, C++, asynchronous programming, streaming buffering patterns, integration with cloud API services
Nice to Have
founding or early experience, define a release roadmap where no blueprint exists, shipping responsive AI systems, video games, embodied AI, ML-driven control for embodied AI, end-to-end learning, reinforcement learning, VLAs, interfacing with multimodal models, Publications in multimodal models, audio interpretation, robotics
What You'll Do.
Develop voice pipeline
Configure microphone array
Integrate voice subsystem
Evaluate STT/TTS engines
Build reliable software
Deliver voice interaction experience
How You'll Work.
Team & Collaboration
contribute to the behavior stack; work with product
Full Job Description
Join Us in Building the Future of Home Robotics At Sunday, we're developing personal robots to reclaim the hours lost to repetitive tasks. We're focused on an ambitious goal to make generalized robots broadly accessible, enabling households to take back quality time. We have spent the last 18 months building a talented team, securing capital, and validating our technology. We are now seeking passionate individuals to join us in the next phase of our growth. If you are ready to apply your skills to the forefront of robotics innovation, we’d love to hear from you. What To Expect As a Software Engineer, Voice Interaction, you will own the full voice pipeline that connects our users with Memo's core robotic and AI systems. Integrating machine learning models across local and cloud compute, you will transform raw audio signals into actionable instructions in a domestic environment. As part of the broader team, you will also contribute to the behavior stack that drives Memo's high-level decision making and task execution. What You’ll Do - Develop and maintain the full voice pipeline from microphone array input through wake word detection, speech-to-text, natural language understanding, and text-to-speech output - Configure and integrate microphone array for domestic use, tuning onboard audio processing (beamforming, noise suppression, echo cancellation) and supplementing with additional processing where needed - Integrate the voice subsystem with high level robot behaviors, enabling the robot to receive, interpret, and act intelligently on voice commands - Design and optimize TTS output to deliver natural, responsive spoken interactions in real time on embedded hardware - Define and enforce guardrails around voice input and output, including content filtering, prompt boundary enforcement, output length limits, and auditing to ensure the system operates within intended use - Evaluate and integrate STT/TTS engines and models, making informed tradeoffs between accuracy, late
Applying for this Software Engineer, Voice Interaction role?
Most applicants get filtered before a human reads their resume. See if yours makes the cut.
How to Apply on Ashby
- Ashby is a fast modern ATS — most applications take under 3 minutes.
- The resume parser is strong; verify parsed experience dates and job titles.
- Custom screening questions are often scored algorithmically — answer completely.
- Location field affects geo-based screening; use your actual metro area.
ANONYMOUS · UNFILTERED
What do employees actually say about Sunday?
Real rants from real employees. Read before you apply.