#3D Generation

Logo of normal-depth-diffusion
normal-depth-diffusion
Normal-Depth Diffusion Model provides an adaptable method for creating detailed 3D models from text prompts. It includes extensive inference and training codes, pre-trained models for normal, multi-view, and albedo diffusion, and offers a multi-view dataset from Objaverse, facilitating detailed 3D generation. The project continuously evolves with new updates, offering accessible training datasets and integration with leading AI frameworks to enhance accuracy in text-to-3D rendering.
Logo of GeoDream
GeoDream
GeoDream enhances precision in 3D generation by integrating 3D and 2D priors, ensuring realistic, high-resolution outputs and consistent geometric structures. The Uni3D-score metric advances semantic coherence evaluation in 3D. Recent integrations with ThreeStudio and Stable-Zero123 extend its capabilities. Designed for research efficiency, GeoDream supports dual virtual environments, optimizing performance on NVIDIA GPUs with reduced VRAM need, and provides robust texture and mesh export options.
Logo of Awesome-Text-to-3D
Awesome-Text-to-3D
Discover techniques for converting text and images into 3D models using advanced 2D priors such as stable diffusion and CLIP. This repository showcases various methods including zero-shot text-guided generation and multi-view consistent image transformation, presenting developments since 2022. As research progresses, directly training on 3D data models offers promising results, with comprehensive listings of the latest advancements in text and image-guided 3D creation.
Logo of richdreamer
richdreamer
RichDreamer applies a Normal-Depth Diffusion Model to convert text into detailed 3D images, suitable for various applications. The model, accepted at CVPR2024, supports MultiView-ND and Albedo Diffusion for enhanced detail. It includes installation instructions and pre-trained weights, and works efficiently with various GPUs. Resources are available on ModelScope's 3D Object Generation platform.