LLM-based reinforcement learning audio editing model
VITS2 backbone with multilingual-bert
Mice speech to text with MX Cinnamon OS ISO
Open source implementation of Microsoft's VALL-E X zero-shot TTS model
Unofficial Parallel WaveGAN
Chinese voice dialogue robot/smart speaker project
A walk along memory lane
Singing Voice Synthesis via Shallow Diffusion Mechanism
Clone a voice in 5 seconds to generate arbitrary speech in real-time
PAddle PARAllel text-to-speech toolKIT
Real-Time State-of-the-art Speech Synthesis for TensorFlow 2
An implementation of Tacotron 2 that supports multilingual experiments
Dia-1.6B generates lifelike English dialogue and vocal expressions