bigvsan
This open-source project provides a PyTorch implementation of BigVSAN, which applies the Slicing Adversarial Network (SAN) to GAN-based neural vocoders. The approach improves sound quality metrics such as M-STFT, PESQ, and MCD. The code is built on BigVGAN and supports model training on the LibriTTS dataset. Instructions and pretrained model checkpoints are available, making the project a useful resource for researchers and developers working on audio synthesis and speech processing.
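To make one of the metrics above concrete, here is a minimal sketch of mel-cepstral distortion (MCD) using its commonly cited definition. It is not taken from this repository's evaluation code. The function name `mel_cepstral_distortion` is hypothetical, and the sketch assumes the mel-cepstral coefficients were already extracted and time-aligned by some other tool. Lower MCD means the synthesized audio is closer to the reference.

```python
import math


def mel_cepstral_distortion(ref_frames, syn_frames):
    """Return the mean MCD in dB over paired frames.

    Assumptions (not from the project): each frame is a sequence of
    mel-cepstral coefficients, the two inputs are already time-aligned,
    and the 0th coefficient (energy) is excluded, as is common practice.
    """
    if not ref_frames or len(ref_frames) != len(syn_frames):
        raise ValueError("inputs must be non-empty and have the same number of frames")

    scale = 10.0 / math.log(10.0)  # converts the distance to decibels
    total = 0.0
    for ref, syn in zip(ref_frames, syn_frames):
        if len(ref) != len(syn):
            raise ValueError("paired frames must have the same number of coefficients")
        # Squared Euclidean distance over coefficients 1..D (index 0 skipped).
        squared = sum((r - s) ** 2 for r, s in zip(ref[1:], syn[1:]))
        total += scale * math.sqrt(2.0 * squared)
    return total / len(ref_frames)


if __name__ == "__main__":
    reference = [[0.0, 1.0, 0.5], [0.0, 0.8, 0.4]]
    synthesized = [[0.0, 0.9, 0.5], [0.0, 0.8, 0.3]]
    print(f"MCD: {mel_cepstral_distortion(reference, synthesized):.3f} dB")
```

In practice the frames would come from a mel-cepstrum extractor applied to the ground-truth and vocoder-generated waveforms. For M-STFT, lower is likewise better, while for PESQ higher is better.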