sherpa
Sherpa is an open-source speech-to-text inference framework built on PyTorch and focused on end-to-end models. It is intended for deploying pre-trained transducer and CTC-based models, and it exposes speech transcription through both C++ and Python APIs. Training and fine-tuning are outside its scope; for those, refer to the Icefall project. If you need a runtime without a PyTorch dependency, consider the related projects Sherpa-ONNX and Sherpa-NCNN, which also support iOS, Android, and embedded systems. Documentation and a browser-based demo are also available.