InstantMesh - Generate 3D Meshes from Single Images with InstantMesh

Introduction to InstantMesh

InstantMesh is an innovative tool for generating 3D meshes from single images using sparse-view large reconstruction models. Developed as a part of the LRM/Instant3D architecture, this framework stands out due to its efficiency and ease of use in the 3D graphics domain. The project is readily accessible, with an official implementation that is available to the public.

Key Features

InstantMesh provides a variety of features catering to different needs in 3D mesh generation. It includes the Zero123++ fine-tuning code, which is already released for public use. The tool supports a Gradio demo that can be run on two GPUs, optimizing memory use. Additionally, users can run the demo with Docker, making deployment flexible and convenient.

For those interested in practical applications, the InstantMesh team has released both inference and training codes, alongside model weights. A Gradio demo is available on HuggingFace, providing an interactive experience for users.

A noteworthy aspect is the plan to include support for more multi-view diffusion models, indicating a commitment to continuous improvement and expansion of functionalities.

System Requirements

To effectively use InstantMesh, certain software dependencies need to be fulfilled. It is recommended to use Python version 3.10 or above, along with PyTorch 2.1.0 and CUDA 12.1. The installation process is simplified with provided scripts and commands that set up the necessary environment.

Usage Guide

Model Download

InstantMesh comes with four model variants for reconstructing sparse-view images. There's also a specialized Zero123++ UNet model for generating images with white backgrounds. You can choose to let the inference script automatically download these models, or manually download them to a designated directory.

Running a Local Demo

Running the InstantMesh demo locally is simple. By executing the provided script, users can start a Gradio demo. The demo is designed to run on two GPUs to economize memory usage, but it can be configured to run on a single GPU if needed. There is also provision for running the demo through Docker.

Command Line Execution

For those who prefer command-line operations, InstantMesh allows generating 3D meshes directly. Commands are provided to process images, and options are available for exporting meshes with or without texture maps, depending on the user's preference.

Training Capabilities

InstantMesh offers training code, enabling users to contribute to or modify the existing models. The scripts support training on both NeRF and mesh representations. Furthermore, there is code available for fine-tuning the Zero123++ model, which is frequently requested by the community.

Acknowledgements

The InstantMesh project acknowledges the significant contributions of several related projects in the field of 3D generative AI, including Zero123++, OpenLRM, FlexiCubes, and Instant3D. Several community members also receive thanks for enhancing the project's accessibility through demos and other implementations.

In summary, InstantMesh is a powerful, user-friendly tool for 3D mesh generation, providing a comprehensive suite of features for both developers and end-users in the 3D graphics community.