Project Icon

stable-ts

Improving Reliable Timestamps for Whisper Transcriptions

Product DescriptionThe stable-ts library enhances the functionality of Whisper by providing reliable timestamps in audio transcription. Key integrations include voice isolation, noise reduction, and dynamic time warping to achieve precise word-level timestamps. It supports diverse model configurations and preprocessing methods, improving transcription accuracy by suppressing silence and refining outputs. The library requires FFmpeg and PyTorch for installation, offering cross-platform compatibility and customizable options via denoisers and voice detection methods. Additionally, it connects with any ASR system, enabling its application in various audio transcription scenarios.
Project Details