Comprehensive Gradio WebUI for audio processing
Qwen3-TTS is an open-source series of TTS models
Spark-TTS Inference Code
A lightweight text-to-speech model with zero-shot voice cloning
State-of-the-art TTS model under 25MB
Controllable & emotion-expressive zero-shot TTS
Use Microsoft Edge's online text-to-speech service from Python
1 min voice data can also be used to train a good TTS model
A sound cloning tool with a web interface, using your voice
A deep learning toolkit for Text-to-Speech, battle-tested in research
Foundational model for human-like, expressive TTS
Real-time voice interactive digital human
Free, high-quality text-to-speech API endpoint to replace OpenAI
Towards Human-Sounding Speech
Industrial-level controllable zero-shot text-to-speech system
Generate audiobooks from e-books, voice cloning & 1107+ languages
Bailing is a voice dialogue robot similar to GPT-4o
Virtual AI anchor that combines state-of-the-art technology
A high-quality rapid TTS voice cloning model
SoTA open-source TTS
Conversational voice AI agents
Official PyTorch Implementation
Open-source framework for intelligent speech interaction
The official Python SDK for the ElevenLabs API
Multi-lingual large voice generation model, providing inference