willow
Willow Inference Server is a self-hosted server for language inference tasks: speech-to-text (STT), text-to-speech (TTS), and large language models (LLM). It is compatible with technologies such as WebRTC. Discussions and documentation are available on GitHub and at heywillow.io.