> For the complete documentation index, see [llms.txt](https://irosyadi.gitbook.io/irosyadi/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://irosyadi.gitbook.io/irosyadi/digitalmedia/text-to-speech.md).

# Text to Speech Apps

Related links:\
🔗 [Speech to Text Apps](/irosyadi/digitalmedia/speech-to-text.md)\
🔗 [Text to Speech Apps](/irosyadi/digitalmedia/text-to-speech.md)\
🔗 [Speech to Speech (Fake Voice Generator)](/irosyadi/digitalmedia/speech-to-speech.md)

## Text to Speech Apps

* [Modern Google-level STT Models Released](https://habr.com/en/post/519562/)
* [codeforequity-at/botium-speech-processing: Botium Speech Processing](https://github.com/codeforequity-at/botium-speech-processing) : open source
  * [demo](https://speech.botiumbox.com/api-docs/)
* [Create. Edit. Publish. - Narration Box](https://narrationbox.com/)
* [Text-to-Speech: Lifelike Speech Synthesis - Google Cloud](https://cloud.google.com/text-to-speech/)
* [Hyper-Realistic Artificial Voices](https://www.sonantic.io/)
* [mozilla/TTS: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)](https://github.com/mozilla/TTS)
* [Mozilla Common Voice](https://commonvoice.mozilla.org/en)
* [Amazon Polly](https://aws.amazon.com/polly/)
* [Descript - Create podcasts, videos, and transcripts](https://www.descript.com/)
* [Synthesize Voice AI and Natural Sounding Text-to-Speech—Replica](https://replicastudios.com/)
* [15.ai: Natural TTS with minimal data](https://15.ai/)
* [CookiePPP/cookietts: TTS from Cookie. Messy and experimental!](https://github.com/CookiePPP/cookietts)
* [coqui](https://github.com/coqui-ai) [Coqui](https://coqui.ai/) STT and TTS
* [Narration Box - Everything you need to engage your audience with voice and audio.](https://narrationbox.com/)
* [Text to Speech–Realistic AI Voice Generator - Microsoft Azure](https://azure.microsoft.com/en-us/services/cognitive-services/text-to-speech/#overview=)
* [neonbjb/tortoise-tts: A multi-voice TTS system trained with an emphasis on quality](https://github.com/neonbjb/tortoise-tts)
* [15.ai: Natural TTS with minimal viable data](https://15.ai/)
* [Introducing Mimic 3 by Mycroft](https://mycroft.ai/blog/introducing-mimic-3/)
* [snakers4/silero-models: Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple](https://github.com/snakers4/silero-models)

## Research

* [Audio samples related to Tacotron, an end-to-end speech synthesis system by Google.](https://google.github.io/tacotron/)
* [Kyubyong/speaker\_adapted\_tts: Making a TTS model with 1 minute of speech samples within 10 minutes](https://github.com/Kyubyong/speaker_adapted_tts)
* [A highly efficient, real-time text to speech system deployed on CPUs](https://ai.facebook.com/blog/a-highly-efficient-real-time-text-to-speech-system-deployed-on-cpus/)
* [An open source implementation of Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning](https://r9y9.github.io/deepvoice3_pytorch/)
* [Audio samples from "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"](https://google.github.io/tacotron/publications/global_style_tokens/index.html)
* [Kyubyong/tacotron: A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model](https://github.com/Kyubyong/tacotron)
* [buriburisuri/speech-to-text-wavenet: Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's WaveNet and tensorflow](https://github.com/buriburisuri/speech-to-text-wavenet)
* [NVIDIA/tacotron2: Tacotron 2 - PyTorch implementation with faster-than-realtime inference](https://github.com/NVIDIA/tacotron2)
* [CorentinJ/Real-Time-Voice-Cloning: Clone a voice in 5 seconds to generate arbitrary speech in real-time](https://github.com/CorentinJ/Real-Time-Voice-Cloning)
* [r9y9/tacotron\_pytorch: PyTorch implementation of Tacotron speech synthesis model.](https://github.com/r9y9/tacotron_pytorch)

## Text to Speech

* [TinyGem.org - bookmarking and content recommendations for people who love to read Hacker News.](https://tinygem.org/listen/)
* <https://www.locserendipity.com/TTS.html>
* [MycroftAI/mimic3: A fast local neural text to speech engine for Mycroft](https://github.com/MycroftAI/mimic3)
* [Matter - Chrome Web Store](https://chrome.google.com/webstore/detail/matter/knjbgabkeojmfdhindppcmhhfiembkeb)
* [Balabolka](http://www.cross-plus-a.com/balabolka.htm)

## Rank

* [Murf AI](https://murf.ai/) ⭐⭐⭐⭐⭐
* [Coqui](https://coqui.ai/) with speed 1.2 ⭐⭐⭐
* [Text To Speech](https://tts.cns.wtf/) with speed 1.5 ⭐⭐
* [Natural Readers](https://www.naturalreaders.com/online/) ⭐⭐⭐⭐⭐
* [FakeYou](https://fakeyou.com/) ⭐⭐⭐
* [Flowtron: an Autoregressive Flow-based Generative Network for Text-to-Speech Synthesis - NVIDIA ADLR](https://nv-adlr.github.io/Flowtron)

## Rank Indonesian

* [Murf AI](https://murf.ai/) ⭐⭐⭐
* [Micmonster](https://micmonster.com/text-to-speech/indonesian-indonesia/) ⭐⭐⭐⭐⭐
* [Play.ht](https://play.ht/text-to-speech-voices/indonesian/) ⭐⭐⭐
* [Ondoku](https://ondoku3.com/id/) ⭐⭐⭐
* [Narakeet](https://www.narakeet.com/)
* [Wideo](https://wideo.co/text-to-speech/)
* [Voicemaker](https://voicemaker.in/) ⭐⭐⭐⭐⭐
* [Free TTS](https://freetts.com/) ⭐⭐
* [Notevibes](https://notevibes.com/) ⭐⭐

### Text to Speech

* [ElevenLabs || Prime Voice AI](https://beta.elevenlabs.io/)
* [VALL-E](https://valle-demo.github.io/)
* [AI Voice Generator: Versatile Text to Speech Software | Murf AI](https://murf.ai/)
* [Play.ht dashboard](https://play.ht/app/audio-files)