Speech to Text to Speech, sends text as OSC messages
Comprehensive Gradio WebUI for audio processing
Qwen3-TTS is an open-source series of TTS models
Spark-TTS Inference Code
A lightweight text-to-speech model with zero-shot voice cloning
State-of-the-art TTS model under 25MB
Controllable & emotion-expressive zero-shot TTS
Use Microsoft Edge's online text-to-speech service from Python
1 min voice data can also be used to train a good TTS model
A single Gradio + React WebUI with extensions for ACE-Step
A sound cloning tool with a web interface, using your voice
A deep learning toolkit for Text-to-Speech, battle-tested in research
Foundational model for human-like, expressive TTS
Real-time voice interactive digital human
Towards Human-Sounding Speech
Free, high-quality text-to-speech API endpoint to replace OpenAI
Industrial-level controllable zero-shot text-to-speech system
Generate audiobooks from e-books, voice cloning & 1107+ languages
Bailing is a voice dialogue robot similar to GPT-4o
Code for openai.fm, a demo for the OpenAI Speech API
A fast, local neural text to speech system
Virtual AI anchor that combines state-of-the-art technology
A high-quality rapid TTS voice cloning model
The open-source voice synthesis studio powered by Qwen3-TTS
Conversational voice AI agents