LaRa - Optimize Radiance Field Reconstruction for Extensive Baseline Applications

LaRa: Efficient Large-Baseline Radiance Fields

LaRa is an innovative project designed to excel in creating radiance fields with large baselines. It presents a significant advancement in the field of computer vision, particularly focusing on the efficient reconstruction of 3D scenes from various forms of input such as multi-view images, text, and single-view images.

New Features

Recently, the LaRa project introduced a crucial update—support for half precision training. This enhancement allows for faster convergence, achieving over 100% improvement in speed while also reaching about 1.5dB gains with fewer iterations. The updated model performs better, reducing geometric errors and providing more accurate outputs in lesser time.

Installation

Installing LaRa is a straightforward process. Users can clone the repository from GitHub and set up the environment using Conda, a popular package management system. Once installed, it can be activated easily to start using the toolkit.

Dataset

LaRa utilizes the processed GObjaverse dataset for training, which is a substantial dataset requiring approximately 1.4TB of storage space. A script is provided for automatic downloading, and users can opt to download the entire dataset or just a subset. For those preferring to preprocess their data, scripts are available for both the GObjaverse and Co3D datasets.

Training

Training in LaRa is carried out using a straightforward command, with additional configuration options available through a YAML file. This setup allows users to specify their GPU preferences and other parameters necessary for customizing the training process.

Evaluation

LaRa stands out with its ability to reconstruct radiance fields from diverse inputs:

Multi-view to 3D Reconstruction: Users can evaluate the reconstruction performance by following a simple command. Additional settings allow for output options like video and mesh during evaluation.
Text to 3D Conversion: LaRa includes a unique feature to generate 3D scenes from textual descriptions, although this feature is currently limited due to permissions issues.
Single View to 3D: With support for advanced generator types, LaRa can transform a single image view into a 3D structure. This feature is facilitated using specific pre-trained models and configurations.

Acknowledgements

The LaRa project is built upon various foundational projects like 2D Gaussian Splatting and has integrated tools from others, such as Splatter-Image and GRM. The collaborative nature of this project, leveraging tools and insights from several contributors, underscores the collective effort in pushing the boundaries of radiance field reconstruction.

Citation

Those who find LaRa useful in their research or projects are encouraged to cite it. The project, led by a team of researchers including Anpei Chen, Haofei Xu, Stefano Esposito, Siyu Tang, and Andreas Geiger, is set to be presented at the European Conference on Computer Vision (ECCV) in 2024.

Through its cutting-edge features and enhancements, LaRa offers a comprehensive toolkit for researchers and practitioners in the domain of radiance fields, paving the way for more efficient and effective 3D scene reconstruction.