Joint Training Framework for Text-to-Speech and Voice Conversion Using Multi-Source Tacotron and WaveNet

Mingyang Zhang, Xin Wang, Fuming Fang, Haizhou Li, Junichi Yamagishi. Joint Training Framework for Text-to-Speech and Voice Conversion Using Multi-Source Tacotron and WaveNet. In Gernot Kubin, Zdravko Kacic, editors, Interspeech 2019, 20th Annual Conference of the International Speech Communication Association, Graz, Austria, 15-19 September 2019. Pages 1298-1302, ISCA, 2019.