so-vits-svc
This project pairs the SoftVC content encoder with the VITS model for singing voice conversion. Speech features extracted from the source audio are fed directly into VITS without conversion to a text-based intermediate representation, so the original pitch and intonation are preserved. The vocoder is NSF HiFiGAN, which addresses the problem of sound interruption. Other features include a visual f0 editor and a speaker mix timeline editor. The project is intended for offline use, with the stated aim of converting fictional character voices, and it does not support real-time conversion. It is positioned as an academic project, and users are responsible for ensuring that their training datasets are properly authorized. The 4.1-Stable update offers improved sound quality and dynamic fusion capabilities.
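To make the idea of a speaker mix timeline more concrete, here is a minimal sketch of how per-speaker blend weights could be interpolated over time. It is not taken from the project's code: the function name `mix_at`, the keyframe format, and the choice of linear interpolation are all assumptions made for illustration only.

```python
from bisect import bisect_right

def mix_at(keyframes, t):
    """Return normalized speaker weights at time t (seconds).

    Hypothetical helper, not part of so-vits-svc. `keyframes` is assumed
    to be a list of (time, {speaker: weight}) pairs sorted by time.
    Weights are linearly interpolated between neighbouring keyframes,
    held constant outside the timeline, and rescaled to sum to 1.
    """
    times = [time for time, _ in keyframes]
    speakers = sorted({s for _, weights in keyframes for s in weights})

    if t <= times[0]:
        raw = dict(keyframes[0][1])        # before the first keyframe
    elif t >= times[-1]:
        raw = dict(keyframes[-1][1])       # after the last keyframe
    else:
        i = bisect_right(times, t)         # first keyframe strictly after t
        (t0, w0), (t1, w1) = keyframes[i - 1], keyframes[i]
        a = (t - t0) / (t1 - t0)           # interpolation factor in [0, 1)
        raw = {s: (1 - a) * w0.get(s, 0.0) + a * w1.get(s, 0.0)
               for s in speakers}

    total = sum(raw.get(s, 0.0) for s in speakers)
    if total <= 0:
        raise ValueError("speaker weights must sum to a positive value")
    return {s: raw.get(s, 0.0) / total for s in speakers}

# Example timeline: fade from speaker "a" to speaker "b" over two seconds.
timeline = [(0.0, {"a": 1.0, "b": 0.0}), (2.0, {"a": 0.0, "b": 1.0})]
print(mix_at(timeline, 0.5))  # {'a': 0.75, 'b': 0.25}
```

In a real system the resulting weights would presumably be used to blend speaker representations frame by frame; how the project actually does this is not described in the text above.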