Project Icon

lance

Reliable and Modern Data Format for Machine Learning Workflows

Product DescriptionLance is an advanced data format designed for machine learning workflows, offering significantly faster random access than Parquet. It supports efficient IO operations crucial for large-scale ML training and integrates well with tools like Pandas, DuckDB, and Polars. Lance features vector search capabilities, automated data versioning, and works seamlessly with Apache Arrow, making it suitable for a range of applications such as search engines and robotics. The project is actively developed and open to community contributions for improvements. Discover its streamlined and adaptable structure to accelerate ML development.
Project Details