
# Cascaded Latent Models

LaVie is a text-to-video (T2V) generation framework built on cascaded latent diffusion models. Part of the Vchitect system, it chains three stages, a base T2V model, a video interpolation model, and a video super-resolution model, which can be combined to customize the output video. It includes pre-trained models such as LaVie base and Stable Diffusion, and is available on OpenXLab and Hugging Face Spaces. Inference supports several sampling methods and adjustable guidance scales. Step-by-step installation and inference tutorials are provided for developers. LaVie is open for both academic research and commercial use.
