Project Icon

Bert-VITS2-ext

Explore TTS Technology with Synchronized Facial Expression Data Integration

Product DescriptionThe Bert-VITS2-ext project expands the capabilities of the original Bert-VITS2 by generating synchronized facial expressions with TTS outputs, aimed at improving emotion recognition. This project focuses on incorporating facial data into TTS training to enhance expression accuracy across frameworks such as CosyVoice and GPT-SoVITS. It allows for the collection of synchronized audio and facial data, enabling the training of models to convert audio into expressions and providing audio-to-photoreal interfaces. Additionally, the project delves into generating body animations using MotionGPT, which encounters some language translation challenges. This methodology facilitates more realistic digital interactions.
Project Details