make-a-video-pytorch
A PyTorch implementation of Make-A-Video, Meta AI's text-to-video generator. It uses pseudo-3D convolutions and temporal attention to fuse information across frames. The approach extends state-of-the-art text-to-image models such as DALL-E 2 with modifications aimed at efficient computation and frame interpolation. The modules accept either images or videos, so training can draw on both kinds of data. Developed with support from Stability.ai and contributions from AI researchers.
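
To make the idea of a pseudo-3D convolution concrete, here is a minimal sketch that factorizes a 3D convolution into a 2D spatial convolution followed by a 1D temporal convolution. The class name `PseudoConv3dSketch`, its arguments, and the tensor layout `(batch, channels, frames, height, width)` are assumptions made for illustration and are not this repository's API. Initializing the temporal convolution to the identity is also an assumption of this sketch; it is one common way to let a layer start out behaving like a per-frame image layer.

```python
import torch
from torch import nn


class PseudoConv3dSketch(nn.Module):
    """Hypothetical factorized convolution: 2D over space, then 1D over time."""

    def __init__(self, dim, kernel_size=3, temporal_kernel_size=3):
        super().__init__()
        self.spatial_conv = nn.Conv2d(
            dim, dim, kernel_size, padding=kernel_size // 2
        )
        self.temporal_conv = nn.Conv1d(
            dim, dim, temporal_kernel_size, padding=temporal_kernel_size // 2
        )
        # Assumption: start the temporal conv as an identity map, so the
        # layer initially acts like a plain per-frame spatial conv.
        nn.init.dirac_(self.temporal_conv.weight)
        nn.init.zeros_(self.temporal_conv.bias)

    def forward(self, x):
        # Images (batch, channels, height, width) skip the temporal step.
        if x.ndim == 4:
            return self.spatial_conv(x)

        b, c, f, h, w = x.shape

        # Fold frames into the batch axis and convolve each frame in 2D.
        x = x.permute(0, 2, 1, 3, 4).reshape(b * f, c, h, w)
        x = self.spatial_conv(x)

        # Fold spatial positions into the batch axis and convolve over time.
        x = x.reshape(b, f, c, h, w).permute(0, 3, 4, 2, 1)
        x = x.reshape(b * h * w, c, f)
        x = self.temporal_conv(x)

        # Restore the (batch, channels, frames, height, width) layout.
        return x.reshape(b, h, w, c, f).permute(0, 3, 4, 1, 2)


conv = PseudoConv3dSketch(dim=8)
images = torch.randn(2, 8, 16, 16)      # (batch, channels, height, width)
video = torch.randn(2, 8, 4, 16, 16)    # (batch, channels, frames, height, width)
image_out = conv(images)
video_out = conv(video)
```

Both calls return a tensor with the same shape as their input. Because the same module handles 4D and 5D inputs, a sketch like this can be exercised on image batches and video batches alike.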