This allows developers to create speech-enabled products, and apps that talk to users.
For instance, if coders were to create interactive audio stories for Alexa, they could use the new voices to represent different characters, Amazon said. It will give skills more varied personalities, which should lead to a richer experience - or at least that's the idea.
A team of Chinese and U.S. researchers has discovered a new method of attacking smart assistants like Amazon Alexa and Google Home, which they have named "voice squatting".
"Using Amazon Polly, you can choose a different voice for any utterance by using the Structured Speech Markup Language (SSML) and specifying an Amazon Polly voice using the "voice name" tag", the post stated. Polly will let skill developers choose from eight different English voices and a wide variety of non-English voices. You can cache and replay Amazon Polly's speech output to prompt callers through interactive voice response (IVR) systems, such as Amazon Connect.
Below are two videos demonstrating a voice squatting attack on an Amazon Alexa and Google Home device.
Amazon Polly is a text-to-speech service that uses advanced deep learning technologies to synthesize speech that sounds like a natural human voice. Additionally, you can leverage Amazon Polly's API to deliver automated real-time information such as service status, account and billing inquiries, addresses, and contact information.