PortaSpeech
PortaSpeech delivers a PyTorch-based generative text-to-speech system known for its compact model size and flexibility. It allows exploration of audio samples and employs pretrained models for single and batch inference. Featuring TTS controllability and supporting datasets like LJSpeech, it is designed with concise preprocessing and training guidance. It integrates vocoder options via HiFi-GAN and MelGAN for quality synthesis, making it a versatile choice for developers interested in speech synthesis. Moreover, it accommodates custom datasets and enhances alignment configurations, all while providing real-time functionality exemplified by TensorBoard.