News

We implement the following features in this framework: data processing for non-autoregressive text-to-speech using the Montreal Forced Aligner; a convenient and scalable framework for training and inference ...
Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks with Guided Attention: This work aimed at more efficient text-to-speech generation by using fully convolutional layers ...