#voice cloning

Logo of WhisperSpeech
WhisperSpeech
Explore an innovative open source text-to-speech system designed for flexibility and commercial use. Currently supporting English with plans for multilingual compatibility, recent updates enhance performance and introduce voice cloning. Test its capabilities on Google Colab with models leveraging Whisper, EnCodec, and Vocos.
Logo of elevenlabs-python
elevenlabs-python
Experience comprehensive text-to-speech capabilities with the Python library by ElevenLabs. This API is intended for developers and content creators, offering vibrant, realistic voices across numerous languages and accents efficiently. Featuring advanced models such as Eleven Multilingual v2 and Eleven Turbo v2.5, the library ensures consistent performance with a focus on diversity and speed. Installation and integration are straightforward, allowing users to generate audio, clone voices, and adjust settings to meet various project needs. This makes it suitable for anyone in search of professional-quality audio tools.
Logo of Multi-Tacotron-Voice-Cloning
Multi-Tacotron-Voice-Cloning
The Multi-Tacotron Voice Cloning project is a multilingual phonemic implementation for Russian and English, built on a deep learning framework. The project, an extension of Real-Time-Voice-Cloning, facilitates the creation of numeric voice representations from brief audio samples. It includes pre-trained models and necessary datasets, providing efficient pathways for text-to-speech conversion. The diverse datasets and neural networks such as Tacotron 2 and WaveRNN enable seamless multilingual capabilities, suited for advanced TTS synthesis requirements.
Logo of coqui-ai-TTS
coqui-ai-TTS
Discover the capabilities of a leading Text-to-Speech library supporting 16 languages and delivering efficient performance with latency below 200ms. The library includes models such as Tacotron, Glow-TTS, and VITS, with options for fine-tuning and multi-speaker TTS support. Utilize over 1100 Fairseq models for various linguistic needs and access numerous tools for training and refining speech models. Designed for a diverse range of applications, this library offers developers a flexible solution for generating high-quality speech.