melgan
This PyTorch-based implementation of MelGAN provides an efficient solution for lightweight and swift audio generation. It leverages the same mel-spectrogram function as NVIDIA's Tacotron2, ensuring seamless conversion into raw audio. Features highlight improved adaptability to new speakers versus WaveGlow and include a pretrained model on PyTorch Hub. Suitable for those seeking efficient audio synthesis in projects, it supports dataset preparation, model training with Tensorboard, and inference, tested on Python 3.6 using sets like LJSpeech-1.1.