lhotse

Adaptive Python Library for Seamless Audio Data Handling with PyTorch

Product Description

Lhotse is a Python library for preparing speech and audio data. It integrates with PyTorch and serves both newcomers and experienced users through its command-line interface and standardized data preparation methods. Its central abstraction is the audio cut: operations such as mixing and truncation are applied to cuts on the fly rather than written out in advance, which reduces storage and bandwidth usage. Data augmentation and feature extraction are supported in both pre-computed and on-the-fly modes, and cuts can also be mixed in feature space. Lhotse works with the Kaldi and ESPnet frameworks, which makes it useful to researchers and developers in audio processing.
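
To make the idea of on-the-fly cuts more concrete, here is a minimal plain-Python sketch. It is not Lhotse's API: the names LazyCut, truncate, load_audio, mix, and SAMPLE_RATE are hypothetical and chosen only for illustration, and an in-memory tuple stands in for a recording on disk. The point it tries to show is that truncation only rewrites metadata, while samples are sliced and mixed when they are actually requested.

```python
from dataclasses import dataclass, replace
from typing import List, Tuple

SAMPLE_RATE = 4  # samples per second; deliberately tiny so results are easy to read


@dataclass(frozen=True)
class LazyCut:
    """Hypothetical stand-in for a cut: metadata that points into a recording."""

    samples: Tuple[float, ...]  # stands in for audio stored on disk
    start: float                # offset into the recording, in seconds
    duration: float             # length of the cut, in seconds

    def truncate(self, offset: float, duration: float) -> "LazyCut":
        # Only the metadata changes here; no audio is copied or written.
        if offset < 0 or duration <= 0 or offset + duration > self.duration:
            raise ValueError("requested span falls outside the cut")
        return replace(self, start=self.start + offset, duration=duration)

    def load_audio(self) -> List[float]:
        # Samples are sliced only when the audio is actually needed.
        first = round(self.start * SAMPLE_RATE)
        last = first + round(self.duration * SAMPLE_RATE)
        return list(self.samples[first:last])


def mix(a: LazyCut, b: LazyCut, gain_b: float = 1.0) -> List[float]:
    """Sum two cuts sample by sample, zero-padding the shorter one."""
    x, y = a.load_audio(), b.load_audio()
    n = max(len(x), len(y))
    x += [0.0] * (n - len(x))
    y += [0.0] * (n - len(y))
    return [p + gain_b * q for p, q in zip(x, y)]


# A 2-second "recording" at 4 samples per second.
recording = tuple(float(i) for i in range(8))
full = LazyCut(samples=recording, start=0.0, duration=2.0)

middle = full.truncate(offset=0.5, duration=1.0)  # metadata only
intro = full.truncate(offset=0.0, duration=0.5)   # metadata only
mixed = mix(middle, intro, gain_b=0.5)            # audio is touched only here
```

Because each truncated cut is just a few numbers, many overlapping views of one recording can coexist without duplicating audio, which is the storage and bandwidth saving described above.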
Project Details