#CVPR 2023

Logo of Collaborative-Diffusion
Collaborative-Diffusion
The project introduces advancements in multi-modal face generation and editing through pre-trained uni-modal diffusion models. It allows precise generation and editing of high-quality facial images via text and segmentation mask controls, focusing on identity preservation and dynamic diffusion. Notable updates involve FreeU integration and comprehensive training pipelines. This repository is a valuable resource for researchers and developers in facial image synthesis and modification.
Logo of SadTalker
SadTalker
This article presents a method for generating realistic talking head videos through the use of single portrait images combined with audio inputs. The technique involves using advanced 3D motion coefficient learning to produce lifelike facial animations. The project is integrated with platforms like Discord, allowing for the creation of high-quality videos from text inputs at no cost. It is compatible with multiple operating systems such as Windows, Linux, macOS, and Docker, broadening its accessibility. Recent updates have improved image quality with new full-body modes and enhancements. Being open-source, the project allows community involvement and feature expansions. With enriched WebUI demonstrations and upgraded performance, it serves as a crucial tool for developers involved in facial animation and multimedia projects.