# AudioGPT

AudioGPT is an open-source project that provides tools for analyzing and generating speech, music, and other audio. For speech, it supports tasks such as text-to-speech, style transfer, and speech recognition through models like FastSpeech and Whisper. For general audio, it covers tasks like text-to-audio and image-to-audio using models such as Make-An-Audio. It also offers talking head synthesis with GeneFace. Some features are still being refined, and the project continues to add functionality for a wider range of audio work.
