NVIDIA

AI, high-performance computing, visualization, conversational AI

SeniorDeepLearningScientist,SpeechSynthesis

Ho Chi Minh City, Vietnam FULL TIME
The Brief

“Senior Deep Learning Scientist, Speech Synthesis at NVIDIA. Skills: Speech Synthesis model training, ML/DL techniques, Python programming, PyTorch. Train Speech Synthesis mel-spectrogram and vocoder models. Measure, benchmark, and analyze model performance, accuracy, and recommend improvements”

What You'll Achieve.

improve the experience of millions of customers

Industry & Context.

AI, high performance computing, visualization, conversational AI
Problems you'll solve

solving real‑world conversational AI problems; recommend improvements

What They're Looking For.

Must Have

Master’s degree (or equivalent experience) or PhD in Computer Science, Electrical Engineering, AI, Applied Math, Linguistics, or Computational Linguistics, 5+ years of experience in machine learning and AI model development, Python programming skills, with solid fundamentals in software design and optimization, knowledge of ML/DL techniques and tools, including CNNs, RNNs/LSTMs, and Transformers, Hands-on experience training speech synthesis models, including TTS, voice cloning, or speech-to-speech systems, Proficiency with PyTorch, Experience with Git, Gerrit, or GitLab

Nice to Have

Experience with multilingual or code-switched TTS, voice cloning, or cross-lingual voice cloning, Familiarity with text normalization, inverse text normalization, and multilingual G2P systems, Interest in linguistics, phonetics, phonology, and language technologies, C++ programming skills and familiarity with CUDA, cuDNN, or TensorRT, Experience deploying ML models on data center, cloud, or embedded systems

What You'll Do.

Train Speech Synthesis mel-spectrogram and vocoder models

and analyze model performance

and recommend improvements

Maintain the TTS model evaluation system and characterize quality metrics across platforms

Improve processes for speech data processing

and TTS training set preparation

Build knowledge of TTS datasets for training and evaluation

How You'll Work.

Team & Collaboration

Collaborate with cross-functional teams on new features, improvements, and issue triage; Participate in code reviews, design reviews, use case reviews, and test plan reviews

Free ATS check

Applying for this Senior Deep Learning Scientist, Speech Synthesis role?

Most applicants get filtered before a human reads their resume. See if yours makes the cut.

How to Apply on Workday

  • Workday has a multi-step form — save your progress after every section.
  • "Apply With LinkedIn" can fail or lose data; manual entry is more reliable.
  • Watch for the "Submit for Review" final step — hitting "Save" alone does not submit.
  • Job requisition numbers are useful when following up with HR by email.

ANONYMOUS · UNFILTERED

What do employees actually say about NVIDIA?

Real rants from real employees. Read before you apply.

Read Company Rants →