Explain to me this big difference between SAPI5 and Google speech synthesizer voices like I am five.

jochemstoel

Jochem Stoel

Posted on April 5, 2018

Explain to me this big difference between SAPI5 and Google speech synthesizer voices like I am five.

Traditional Windows SAPI5 speech synthesis voices are language specific. The voice is designed/recorded to be a specific language. This means that if you feed English text to a French voice it will read and pronounce it as if it were French, making it sound idiotic.

Google has a speech synthesis service as well, available in Chrome browser and as API and it behaves a lot differently. If you feed English text to a Dutch Google voice, it speaks/pronounces it properly but it gets a strong Dutch accent. This is not possible with traditional text to speech.

Please describe both processes and explain to me like I am 5 the fundamental difference between them that is responsible for this.

Thanks!

💖 💪 🙅 🚩
jochemstoel
Jochem Stoel

Posted on April 5, 2018

Join Our Newsletter. No Spam, Only the good stuff.

Sign up to receive the latest update from our blog.

Related