
# Expressive-FastSpeech2


Expressive-FastSpeech2 is an open-source project that applies non-autoregressive text-to-speech (TTS) to expressive speech synthesis, covering both emotional and conversational use cases. It uses the AIHub and IEMOCAP datasets and focuses on English and Korean; the project's guidelines describe how to adapt it to other languages. Built on FastSpeech2, it supports multi-speaker synthesis with varied emotional tones. The repository is organized into branches: emotional TTS with categorical emotion descriptors, emotional TTS with continuous emotion descriptors, and conversational TTS that incorporates dialogue history. It is aimed at researchers and developers working on expressive speech synthesis.
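To make the difference between categorical and continuous emotion descriptors concrete, here is a minimal toy sketch in plain Python. It is not taken from the repository: the label set, the arousal/valence inputs, the hidden size, and the choice to add the emotion vector to every encoder state are all assumptions made for illustration, and the random weights stand in for parameters a real model would learn.

```python
import random

HIDDEN = 4  # toy hidden size (assumption); real models use far more dimensions
EMOTIONS = ["neutral", "happy", "sad", "angry"]  # hypothetical label set

rng = random.Random(0)

# Categorical descriptor: one vector per emotion label, i.e. a lookup table.
emotion_table = {e: [rng.uniform(-1, 1) for _ in range(HIDDEN)] for e in EMOTIONS}

# Continuous descriptor: a linear projection from (arousal, valence) to HIDDEN.
proj = [[rng.uniform(-1, 1) for _ in range(2)] for _ in range(HIDDEN)]


def categorical_embedding(label):
    """Look up the vector for a discrete emotion label."""
    return emotion_table[label]


def continuous_embedding(arousal, valence):
    """Project a point in (arousal, valence) space to a HIDDEN-sized vector."""
    return [row[0] * arousal + row[1] * valence for row in proj]


def condition(encoder_out, emotion_vec):
    """Add the same emotion vector to every phoneme-level hidden state."""
    return [[h + e for h, e in zip(frame, emotion_vec)] for frame in encoder_out]


# Stand-in for a text encoder's output: 3 phonemes, HIDDEN dimensions each.
encoder_out = [[0.0] * HIDDEN for _ in range(3)]

cat_out = condition(encoder_out, categorical_embedding("happy"))
cont_out = condition(encoder_out, continuous_embedding(arousal=0.8, valence=0.6))
```

In this sketch the categorical path can only select among a fixed set of labels, while the continuous path accepts any point in the descriptor space, so intermediate emotional intensities can be requested without adding new labels.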
