Project Icon

FastSpeech2

FastSpeech 2 Text-to-Speech with PyTorch and MelGAN integration

Product DescriptionThis repository offers a PyTorch implementation of FastSpeech 2 using NVIDIA's Tacotron 2 preprocessing and MelGAN vocoder for enhanced audio synthesis. It leverages Espnet's framework for FastSpeech 2 replication with room for performance adjustments. Key features include phoneme-based synthesis and TorchScript export. Contributions are invited to extend its high-quality audio promise. Essential documentation and samples are provided for further exploration.
Project Details