Vlogger
Vlogger is an AI system that generates detailed vlogs from user input, with a Large Language Model (LLM) overseeing the process. It divides vlog creation into distinct phases, including scripting, acting, videography, and narration, and assigns each phase to a tailored model to preserve narrative coherence and visual quality. Its new ShowMaker model improves the spatial-temporal alignment between script and visuals. Comprehensive evaluations show that the system produces coherent, extended vlogs and advances results on zero-shot video generation benchmarks.
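The phased design described above can be illustrated with a minimal sketch. This is not Vlogger's actual code or API: every name here (`Scene`, `write_script`, `cast_actors`, `shoot_clip`, `narrate`, `make_vlog`) is hypothetical, and each stage is a trivial stub standing in for a real model. It only shows how an overseeing component might pass a script through per-phase models in order.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Sequence


@dataclass
class Scene:
    """One scripted scene. All fields are hypothetical placeholders."""
    description: str
    actors: List[str] = field(default_factory=list)
    clip: str = ""        # stand-in for a generated video snippet
    narration: str = ""   # stand-in for a generated audio track


def write_script(user_input: str) -> List[Scene]:
    # Scripting phase: a real system would query an LLM here.
    # This stub simply treats each sentence as one scene.
    parts = [p.strip() for p in user_input.split(".") if p.strip()]
    return [Scene(description=p) for p in parts]


def cast_actors(scene: Scene) -> Scene:
    # Acting phase: stub that assigns a single fixed character.
    scene.actors = ["protagonist"]
    return scene


def shoot_clip(scene: Scene) -> Scene:
    # Videography phase: stub in place of a video generation model.
    scene.clip = f"<clip: {scene.description}>"
    return scene


def narrate(scene: Scene) -> Scene:
    # Narration phase: stub in place of a text-to-speech model.
    scene.narration = f"<voice: {scene.description}>"
    return scene


Stage = Callable[[Scene], Scene]


def make_vlog(
    user_input: str,
    stages: Sequence[Stage] = (cast_actors, shoot_clip, narrate),
) -> List[Scene]:
    # Overseeing role: produce the script, then run every scene
    # through each phase in a fixed order.
    scenes = write_script(user_input)
    for stage in stages:
        scenes = [stage(scene) for scene in scenes]
    return scenes


if __name__ == "__main__":
    for scene in make_vlog("A cat wakes up. The cat explores the garden."):
        print(scene.description, scene.actors, scene.clip, scene.narration)
```

Keeping each phase behind the same `Scene -> Scene` signature is one possible design choice: it lets a single stage be swapped for a different model without touching the others.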