Project Icon

pflowtts_pytorch

Data-Efficient Zero-Shot Speech Synthesis Using P-Flow Model for Rapid Speaker Adaptation

Product DescriptionP-Flow utilizes a speech-prompted text encoder and flow matching generative decoder for efficient zero-shot TTS, achieving notable speaker adaptation and synthesis speed improvements compared to large-scale models. Trained on the LibriTTS dataset, P-Flow maintains high speaker similarity and pronunciation quality.
Project Details