PL-BERT

Improve Text-to-Speech Prosody with Phoneme-Level BERT

Product Description
PL-BERT is a phoneme-level BERT for text-to-speech. It is pre-trained to predict masked phonemes together with their corresponding graphemes, and it improves speech naturalness over a baseline StyleTTS model. The model is pre-trained on a large English dataset, can be adapted to other languages, and can be integrated into a range of TTS models; setup and training guides are provided. It generates prosodic patterns from phoneme-only input.
Project Details