dc_tts
The dc_tts project introduces a text-to-speech model that employs deep convolutional networks with guided attention, emphasizing efficient training and quality synthesis. The project examines diverse datasets such as LJ Speech and KSS, incorporating techniques like layer normalization and adaptive learning rates to improve performance. Training scripts are available for users to generate and evaluate synthetic speech, aiming for greater efficiency over Tacotron through exclusive use of convolutional layers.