deltacat
DeltaCAT is a Python data catalog powered by Ray. It offers git-like workflows for defining and managing scalable, ACID-compliant data catalogs, up to exabyte-scale data lakes. It uses Ray together with Apache Arrow for table management tasks such as change-data-capture and table repair, which help maintain data consistency and integrity.