#text-to-video

Logo of AI-text-to-video-model-from-scratch
AI-text-to-video-model-from-scratch
Discover the method for creating text-to-video models using GANs in Python. This guide covers key processes such as data coding, pre-processing, and GAN implementation for efficient video generation, suitable for those with limited computing resources.
Logo of VGen
VGen
The VGen project is an open-source video synthesis platform from the Tongyi Lab at Alibaba, featuring state-of-the-art models for generating videos. It facilitates the creation of high-quality videos from text and images, with the capability to integrate feedback from users. The repository includes various models like I2VGen-xl for image-to-video conversion and VideoComposer for videos with controlled motion. VGen offers comprehensive tools for visualization, training, and performance evaluation in video generation. Known for its flexibility and high performance across video tasks, recent updates include the release of InstructVideo and the ModelScopeT2V V1.5 model, which enhances video synthesis through improved customization and scalability.