# Text to Speech Apps

Related links:\
🔗 [Speech to Text Apps](/irosyadi/digitalmedia/speech-to-text.md)\
🔗 [Text to Speech Apps](/irosyadi/digitalmedia/text-to-speech.md)\
🔗 [Speech to Speech (Fake Voice Generator)](/irosyadi/digitalmedia/speech-to-speech.md)

## Text to Speech Apps

* [Modern Google-level STT Models Released](https://habr.com/en/post/519562/)
* [codeforequity-at/botium-speech-processing: Botium Speech Processing](https://github.com/codeforequity-at/botium-speech-processing) (open source)
  * [demo](https://speech.botiumbox.com/api-docs/)
* [Create. Edit. Publish. - Narration Box](https://narrationbox.com/)
* [Text-to-Speech: Lifelike Speech Synthesis - Google Cloud](https://cloud.google.com/text-to-speech/)
* [Hyper-Realistic Artificial Voices](https://www.sonantic.io/)
* [mozilla/TTS: Deep learning for Text to Speech](https://github.com/mozilla/TTS) ([discussion forum](https://discourse.mozilla.org/c/tts))
* [Mozilla Common Voice](https://commonvoice.mozilla.org/en)
* [Amazon Polly](https://aws.amazon.com/polly/)
* [Descript - Create podcasts, videos, and transcripts](https://www.descript.com/)
* [Synthesize Voice AI and Natural Sounding Text-to-Speech—Replica](https://replicastudios.com/)
* [15.ai: Natural TTS with minimal data](https://15.ai/)
* [CookiePPP/cookietts: TTS from Cookie. Messy and experimental!](https://github.com/CookiePPP/cookietts)
* [Coqui](https://coqui.ai/) ([GitHub](https://github.com/coqui-ai)): STT and TTS
* [Text to Speech–Realistic AI Voice Generator - Microsoft Azure](https://azure.microsoft.com/en-us/services/cognitive-services/text-to-speech/#overview=)
* [neonbjb/tortoise-tts: A multi-voice TTS system trained with an emphasis on quality](https://github.com/neonbjb/tortoise-tts)
* [Introducing Mimic 3 by Mycroft](https://mycroft.ai/blog/introducing-mimic-3/)
* [snakers4/silero-models: Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple](https://github.com/snakers4/silero-models)

## Research

* [Audio samples related to Tacotron, an end-to-end speech synthesis system by Google.](https://google.github.io/tacotron/)
* [Kyubyong/speaker\_adapted\_tts: Making a TTS model with 1 minute of speech samples within 10 minutes](https://github.com/Kyubyong/speaker_adapted_tts)
* [A highly efficient, real-time text to speech system deployed on CPUs](https://ai.facebook.com/blog/a-highly-efficient-real-time-text-to-speech-system-deployed-on-cpus/)
* [An open source implementation of Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning](https://r9y9.github.io/deepvoice3_pytorch/)
* [Audio samples from "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"](https://google.github.io/tacotron/publications/global_style_tokens/index.html)
* [Kyubyong/tacotron: A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model](https://github.com/Kyubyong/tacotron)
* [buriburisuri/speech-to-text-wavenet: Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's WaveNet and tensorflow](https://github.com/buriburisuri/speech-to-text-wavenet)
* [NVIDIA/tacotron2: Tacotron 2 - PyTorch implementation with faster-than-realtime inference](https://github.com/NVIDIA/tacotron2)
* [CorentinJ/Real-Time-Voice-Cloning: Clone a voice in 5 seconds to generate arbitrary speech in real-time](https://github.com/CorentinJ/Real-Time-Voice-Cloning)
* [r9y9/tacotron\_pytorch: PyTorch implementation of Tacotron speech synthesis model.](https://github.com/r9y9/tacotron_pytorch)

## Text to Speech

* [TinyGem.org - bookmarking and content recommendations for people who love to read Hacker News.](https://tinygem.org/listen/)
* <https://www.locserendipity.com/TTS.html>
* [MycroftAI/mimic3: A fast local neural text to speech engine for Mycroft](https://github.com/MycroftAI/mimic3)
* [Matter - Chrome Web Store](https://chrome.google.com/webstore/detail/matter/knjbgabkeojmfdhindppcmhhfiembkeb)
* [Balabolka](http://www.cross-plus-a.com/balabolka.htm)

## Ranking

* [Murf AI](https://murf.ai/) ⭐⭐⭐⭐⭐
* [Coqui](https://coqui.ai/) at speed 1.2 ⭐⭐⭐
* [Text To Speech](https://tts.cns.wtf/) at speed 1.5 ⭐⭐
* [Natural Readers](https://www.naturalreaders.com/online/) ⭐⭐⭐⭐⭐
* [FakeYou](https://fakeyou.com/) ⭐⭐⭐
* [Flowtron: an Autoregressive Flow-based Generative Network for Text-to-Speech Synthesis - NVIDIA ADLR](https://nv-adlr.github.io/Flowtron)

## Ranking (Indonesian)

* [Murf AI](https://murf.ai/) ⭐⭐⭐
* [Micmonster](https://micmonster.com/text-to-speech/indonesian-indonesia/) ⭐⭐⭐⭐⭐
* [Play.ht](https://play.ht/text-to-speech-voices/indonesian/) ⭐⭐⭐
* [Ondoku](https://ondoku3.com/id/) ⭐⭐⭐
* [Narakeet](https://www.narakeet.com/)
* [Wideo](https://wideo.co/text-to-speech/)
* [Voicemaker](https://voicemaker.in/) ⭐⭐⭐⭐⭐
* [Free TTS](https://freetts.com/) ⭐⭐
* [Notevibes](https://notevibes.com/) ⭐⭐

## Text to Speech

* [ElevenLabs || Prime Voice AI](https://beta.elevenlabs.io/)
* [VALL-E](https://valle-demo.github.io/)
* [AI Voice Generator: Versatile Text to Speech Software | Murf AI](https://murf.ai/)
* [Play.ht dashboard](https://play.ht/app/audio-files)

