Amazon Polly Text to Speech Converter

Advanced deep learning technology to synthesize text to speech that sounds like a human voice

Launch our demos


Neural TTS voices
Neural ML Voices

*4 Different Languages & Dialects

*Up to 3000 Characters

Standard TTS Voices
Standard ML Voices

*29 Different Languages & Dialects

*Up to 3000 Characters

Large Text
Synthesize Big Texts Instantly

*For more than 3K characters

*Up to 100K Characters

Amazon Polly Features

Natural Human-like Voices

with Neural Text-to-Speech

Wide selection of voices & Languages

More than 30 different types

Synchronize Speech

for Enhanced Visual Experience

Supported Audio Stream formats

MP3 | Vorbis | PCM

Various Audio Sampling Rates

24.00kHz | 22.05kHz | 16.05kHz | 8.00 kHz

Adjustable Speaking Style

Speech Rate, Pitch, and Loudness

Newscaster Speaking Style

as TV or Radio newscaster (NTTS)

Payment Method

Pay as you go model

Backend By

Advanced Deep Learning Technology

Cost of Standard TTS

$4 for 1 million characters

Cost of Neural TTS

$16 for 1 million characters

Fully Customizable

Easy to Implement

Supported Neural Languages

British English
US English
US Spanish
Brazilian Portuguese

Supported Standard Languages

Arabic
Chinese, Mandarin
Danish
Dutch
English, Australian
English, British
English, Indian
English, US
English, Welsh
French
French, Canadian
German
Hindi
Icelandic
Italian
Japanese
Korean
Norwegian
Polish
Portuguese
Portuguese, Brazilian
Romanian
Russian
Spanish, European
Spanish, Mexican
Spanish, US
Turkish
Welsh