Lumina-T2X
Lumina-T2X uses flow-based diffusion transformers to convert text into multiple modalities, including images, videos, and music. It supports outputs at resolutions up to 2K and accepts multilingual prompts and emojis. Recent updates improve visual quality and add new demos that show its range across vision-language tasks. The project is aimed at developers and researchers working on generative AI.