Polly can also deal with units, fractions, abbreviations, currencies, dates, times, and other speech components in sophisticated, language-specific fashion. Or, what about the “St.” Depending on the language and the context, this could mean (and should be pronounced) as either “street” or “saint.” Again, Polly knows what to do here. For example, consider the difference in pronunciation of the word “live” in the phrases “I live in Seattle” and “Live from New York.” Polly knows that this pair of homographs are spelled the same but are pronounced quite differently. Polly was designed to address many of the more challenging aspects of speech generation. Polly currently supports a total of 47 male & female voices spread across 24 languages, with additional languages and voices on the roadmap. In order to address these use cases (and others that you will dream up), we are introducing Polly, a cloud service that converts text to lifelike speech that you can use in your own tools and applications. Many of these applications are great fits for mobile environments where connectivity is very good but local processing power and storage are so-so at best. Entertainment, gaming, public announcement systems, e-learning, telephony, assistive apps & devices and personal assistants are just a few starting points. At that time, most people expected mechanically generated speech to sound precise, clipped, and devoid of human emotion.įast forward many years and we now see that there are many great applications and use cases for computer-generated speech, commonly known as Text-to-Speech or TTS. In just a few minutes, pleasant memories of HAL-9000, B9 ( Lost in Space), the original Star Trek Computer, and Rosie (from The Jetsons) all came to mind.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |