Expressive-FastSpeech2
This open-source project leverages non-autoregressive TTS to enhance expressive voice synthesis, including emotional and conversational applications. It utilizes AIHub and IEMOCAP datasets to support multilingual processing, with a focus on English and Korean. Developers can easily adapt to other languages using the project's guidelines. Built on the FastSpeech2 framework, it facilitates multi-speaker synthesis with intricate emotional tones. The repository includes branches for emotional TTS using categorical and continuous descriptors, as well as conversational TTS incorporating dialogue history. This project serves as a vital tool for researchers and developers in the field of expressive voice synthesis.