CosyVoice
CosyVoice is a multilingual AI model for voice processing. Its features include repetition-aware sampling for more stable generation, streaming inference, and cross-lingual voice conversion. It supports three inference modes: zero-shot, SFT (supervised fine-tuning), and instruct. Pre-trained models make text-to-speech and voice manipulation accessible to both experts and novices. Possible future additions include music generation and support for more multilingual data.
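
To illustrate the general idea behind repetition-aware sampling, here is a minimal sketch in plain Python. It is not CosyVoice's actual implementation. The function names (`nucleus_sample`, `repetition_aware_sample`) and the default values for `top_p`, `window`, and `max_ratio` are assumptions chosen for illustration. The sketch first draws a token with nucleus (top-p) sampling. If that token already appears too often among the most recent tokens, it redraws from the full distribution, which helps the decoder escape repetition loops.

```python
import random
from typing import List, Sequence


def nucleus_sample(probs: Sequence[float], top_p: float, rng: random.Random) -> int:
    """Sample a token index from the smallest set of tokens whose mass reaches top_p."""
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept: List[int] = []
    mass = 0.0
    for i in order:
        kept.append(i)
        mass += probs[i]
        if mass >= top_p:
            break
    weights = [probs[i] for i in kept]
    return rng.choices(kept, weights=weights, k=1)[0]


def repetition_aware_sample(
    probs: Sequence[float],
    history: Sequence[int],
    rng: random.Random,
    top_p: float = 0.8,      # illustrative value, an assumption
    window: int = 10,        # illustrative value, an assumption
    max_ratio: float = 0.1,  # illustrative value, an assumption
) -> int:
    """Nucleus-sample a token; fall back to full sampling if it repeats too often."""
    token = nucleus_sample(probs, top_p, rng)
    recent = list(history[-window:])
    repeats = recent.count(token)
    if repeats >= window * max_ratio:
        # The candidate is over-represented in the recent window, so redraw
        # from the whole distribution to give other tokens a chance.
        token = rng.choices(list(range(len(probs))), weights=list(probs), k=1)[0]
    return token


if __name__ == "__main__":
    rng = random.Random(0)
    probs = [0.9, 0.05, 0.05]
    # With no history, nucleus sampling keeps only token 0 (0.9 >= top_p).
    print(repetition_aware_sample(probs, [], rng))
    # With a history saturated by token 0, the fallback can pick other tokens.
    print({repetition_aware_sample(probs, [0] * 10, rng) for _ in range(200)})
```

In a real decoder, `probs` would come from the model's softmax output at each step and `history` would be the tokens generated so far. The threshold `window * max_ratio` controls how readily the sampler abandons the nucleus candidate.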