voicesmith

Multispeaker Text-to-Speech Training without Coding Experience

Product Description

VoiceSmith lets people without coding experience train and run text-to-speech models for single or multiple speakers. It uses a modified DelightfulTTS and UnivNet architecture, fine-tunes the models on your own datasets, and includes tools for automatic text normalization. The pretrained models were trained on data from 5000 speakers, which helps them adapt to new voices. VoiceSmith runs on Windows and Linux and is optimized for NVIDIA GPUs. Developers can clone the repository and run the project, which is released under the Apache-2.0 license.
Project Details