Multi-Tacotron-Voice-Cloning
Multi-Tacotron Voice Cloning is a multilingual, phoneme-based voice cloning implementation for Russian and English, built with deep learning. It extends the Real-Time-Voice-Cloning project and creates a numeric representation of a voice (a speaker embedding) from a brief audio sample. That representation conditions text-to-speech synthesis, so the generated speech takes on the sampled voice. The project includes pre-trained models and the necessary datasets. Speech is produced by neural networks such as Tacotron 2 (synthesis) and WaveRNN (vocoding), and training on both Russian and English data is what gives the system its multilingual capability.
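To make the idea of a numeric voice representation more concrete, the sketch below compares fixed-length vectors with cosine similarity, a common way to judge whether two clips come from the same speaker. It does not use the project's code: the function name `cosine_similarity` and the three tiny 4-dimensional vectors are hypothetical values chosen for illustration, and real speaker embeddings are much longer.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine similarity between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("embeddings must be non-zero")
    return dot / (norm_a * norm_b)


# Hypothetical embeddings (illustrative values, not produced by any model).
speaker_a_clip1 = [0.9, 0.1, 0.3, 0.2]
speaker_a_clip2 = [0.8, 0.2, 0.35, 0.15]
speaker_b_clip1 = [0.1, 0.9, 0.05, 0.4]

# Clips from the same voice should score higher than clips from different voices.
same = cosine_similarity(speaker_a_clip1, speaker_a_clip2)
different = cosine_similarity(speaker_a_clip1, speaker_b_clip1)
print(f"same speaker: {same:.3f}, different speakers: {different:.3f}")
```

In this toy data the two clips assigned to speaker A point in nearly the same direction, so their similarity is close to 1, while the clip assigned to speaker B scores much lower.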