Supported Voices and Languages
Below are the supported voices, languages, and voice codes for each text-to-speech provider.
Google
Google has developed a suite of cloud services for various applications,
one of which is text-to-speech conversion.
Google's text-to-speech service provides three types of voices: WaveNet, Neural2, and Standard.
Supported Voices
You can find the list of supported voices here.
Google Neural2 Voices
Neural2 voices are the latest generation of Google's text-to-speech voices.
They are powered by advanced neural network technology and are designed to produce speech
with more natural, human-like pronunciation and intonation.
Neural2 voices are based on the same technology used to create Custom Voices.
More information on Neural2 voices can be found here.
To use a Neural2 voice, include Neural2 in the voice parameter.
Neural2 Format:
"gcloud.<language-code>-Neural2-<variant>"
Neural2 Example:
"gcloud.en-US-Neural2-A"
Google WaveNet Voices
These are advanced voices powered by deep learning technology. They offer a more natural
and lifelike speech output. However, they come at a premium price compared to the standard voices.
More information on WaveNet voices can be found here.
To use a WaveNet voice, include Wavenet (with this exact casing) in the voice parameter.
WaveNet Format:
"gcloud.<language-code>-Wavenet-<variant>"
WaveNet Example:
"gcloud.en-GB-Wavenet-A"
Google Standard Voices
These are the traditional text-to-speech voices provided by Google. Suitable for a wide range
of applications, they are budget-friendly and serve as a reliable option for many developers and businesses.
They sound less natural than WaveNet and Neural2 voices.
More information on Standard voices can be found here.
To use a Standard voice, include Standard in the voice parameter.
Standard Format:
"gcloud.<language-code>-Standard-<variant>"
Standard Example:
"gcloud.en-US-Standard-A"
Supported Languages
You can find the list of supported languages here.
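As a minimal sketch, assuming the language code comes from the supported languages list and the variant letter from the supported voices list, the three Google voice formats described above can be assembled with a small helper. The names `google_voice` and `GOOGLE_VOICE_TYPES` are hypothetical and are not part of any provider API.

```python
# Tokens exactly as they appear in the documented formats.
# Note the "Wavenet" casing used inside the voice string.
GOOGLE_VOICE_TYPES = {
    "neural2": "Neural2",
    "wavenet": "Wavenet",
    "standard": "Standard",
}


def google_voice(language_code: str, voice_type: str, variant: str) -> str:
    """Build a Google voice string such as "gcloud.en-US-Neural2-A".

    Hypothetical helper: it only formats the string and does not check
    that the resulting voice actually exists.
    """
    try:
        token = GOOGLE_VOICE_TYPES[voice_type.lower()]
    except KeyError:
        raise ValueError(f"unknown Google voice type: {voice_type!r}") from None
    return f"gcloud.{language_code}-{token}-{variant}"


if __name__ == "__main__":
    print(google_voice("en-US", "Neural2", "A"))   # gcloud.en-US-Neural2-A
    print(google_voice("en-GB", "WaveNet", "A"))   # gcloud.en-GB-Wavenet-A
    print(google_voice("en-US", "Standard", "A"))  # gcloud.en-US-Standard-A
```

The lookup is case-insensitive so that callers can write either "WaveNet" or "Wavenet" and still get the token the voice parameter expects.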
SSML
SSML may be entered as a string wrapped in <speak> tags.
You can find more information on Google Cloud SSML here.
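The following sketch shows one way to wrap plain text in `<speak>` tags. It assumes the input is plain text with no SSML markup of its own, and it escapes XML-reserved characters so the resulting string stays well-formed. The helper name `to_ssml` is hypothetical.

```python
from xml.sax.saxutils import escape


def to_ssml(text: str) -> str:
    """Wrap plain text in <speak> tags, escaping &, < and > first."""
    return f"<speak>{escape(text)}</speak>"


if __name__ == "__main__":
    print(to_ssml("Terms & conditions apply"))
    # <speak>Terms &amp; conditions apply</speak>
```

Because the helper escapes everything it receives, text that already contains SSML elements should be wrapped by hand instead of being passed through it.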
Amazon
Amazon offers a text-to-speech service called Polly through its cloud platform, AWS. Polly turns text into lifelike speech, so developers can build applications that talk.
Supported Voices
You can find the list of supported Neural voices here.
You can find the list of supported Standard voices here.
Polly Neural Voices
Powered by advanced neural network technology, these voices are designed to produce speech that is more natural and closer to human-like pronunciation and intonation. Due to the sophistication of the technology, they come at a premium price.
To use a Neural voice, append Neural to the voice name in the voice parameter.
Neural Format:
"polly.<language-code>-<voice-name>-Neural"
Neural Example:
"polly.en-GB-Emma-Neural"
Polly Standard Voices
These are the traditional text-to-speech voices provided by Polly. Suitable for a wide range
of applications, they are budget-friendly and serve as a reliable option for many developers and businesses.
They sound less natural than Polly Neural voices.
Standard Format:
"polly.<language-code>-<voice-name>"
Standard Example:
"polly.en-GB-Emma"
Supported Languages
You can find the list of supported languages here.
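As a minimal sketch, assuming the language code and voice name are taken from the supported lists, the two Polly voice formats described above differ only in the trailing `-Neural` suffix. The helper name `polly_voice` is hypothetical and is not part of the AWS SDK.

```python
def polly_voice(language_code: str, voice_name: str, neural: bool = False) -> str:
    """Build a Polly voice string such as "polly.en-GB-Emma-Neural".

    Hypothetical helper: it only formats the string and does not check
    that the voice exists or that a Neural version is available.
    """
    base = f"polly.{language_code}-{voice_name}"
    return f"{base}-Neural" if neural else base


if __name__ == "__main__":
    print(polly_voice("en-GB", "Emma", neural=True))  # polly.en-GB-Emma-Neural
    print(polly_voice("en-GB", "Emma"))               # polly.en-GB-Emma
```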
SSML
SSML may be entered as a string wrapped in <speak> tags.
You can find more information on Amazon's Polly SSML here.
ElevenLabs
Supported Voices
Currently, we support the voices from the ElevenLabs Multilingual v2 model.
ElevenLabs Model | Supported Voices |
---|---|
Multilingual v2 | rachel, clyde, domi, dave, fin, antoni, thomas, charlie, emily, elli, callum, patrick, harry, liam, dorothy, josh, arnold, charlotte, matilda, matthew, james, joseph, jeremy, michael, ethan, gigi, freya, grace, daniel, serena, adam, nicole, jessie, ryan, sam, glinda, giovanni, mimi |
ElevenLabs Format:
"elevenlabs.<voice-name>"
ElevenLabs Example:
"elevenlabs.rachel"
Supported Languages
We currently support the Multilingual v2 model from ElevenLabs, which supports multiple languages.
The supported languages are listed below.
ElevenLabs Model | Supported Languages |
---|---|
Multilingual v2 | US - [English - USA], GB - [English - UK], AU - [English - Australia], CA - [English - Canada], JP - [Japanese], CN - [Chinese], DE - [German], IN - [Hindi], FR - [French - France], CA - [French - Canada], KR - [Korean], BR - [Portuguese - Brazil], PT - [Portuguese - Portugal], IT - [Italian], ES - [Spanish - Spain], MX - [Spanish - Mexico], ID - [Indonesian], NL - [Dutch], TR - [Turkish], PH - [Filipino], PL - [Polish], SE - [Swedish], BG - [Bulgarian], RO - [Romanian], SA - [Arabic - Saudi Arabia], AE - [Arabic - UAE], CZ - [Czech], GR - [Greek], FI - [Finnish], HR - [Croatian], MY - [Malay], SK - [Slovak], DK - [Danish], IN - [Tamil], UA - [Ukrainian] |
SSML
ElevenLabs does not currently support SSML, so SSML cannot be used with an ElevenLabs voice.
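The ElevenLabs rules in this section can be sketched as two small checks: one that builds an `elevenlabs.<voice-name>` string from the supported voice list, and one that rejects SSML input for ElevenLabs voices. The function names are hypothetical, and treating any text that starts with `<speak` as SSML is an assumption made for illustration.

```python
# Voice names copied from the Multilingual v2 row of the supported voices table.
ELEVENLABS_VOICES = frozenset(
    """
    rachel clyde domi dave fin antoni thomas charlie emily elli callum patrick
    harry liam dorothy josh arnold charlotte matilda matthew james joseph jeremy
    michael ethan gigi freya grace daniel serena adam nicole jessie ryan sam
    glinda giovanni mimi
    """.split()
)


def elevenlabs_voice(voice_name: str) -> str:
    """Return "elevenlabs.<voice-name>" if the name is in the supported list."""
    if voice_name not in ELEVENLABS_VOICES:
        raise ValueError(f"unsupported ElevenLabs voice: {voice_name!r}")
    return f"elevenlabs.{voice_name}"


def check_text(voice: str, text: str) -> str:
    """Reject SSML input when the voice string targets ElevenLabs.

    Assumption: input counts as SSML when it starts with "<speak".
    """
    if voice.startswith("elevenlabs.") and text.lstrip().startswith("<speak"):
        raise ValueError("SSML is not supported with ElevenLabs voices")
    return text


if __name__ == "__main__":
    voice = elevenlabs_voice("rachel")
    print(voice)                               # elevenlabs.rachel
    print(check_text(voice, "Hello there"))    # Hello there
```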