stt
An offline speech recognition tool that effectively converts audio or video into text using the fast-whisper model. It offers output in formats like JSON and SRT, making it a viable alternative to OpenAI and Baidu's APIs. Users can choose model sizes to match hardware capabilities and leverage CUDA acceleration with NVIDIA GPUs. This tool is easy to deploy on Windows, Linux, and Mac, with straightforward setup and detailed API documentation.