End-to-End Text-to-Speech Synthesis with Unaligned Multiple Language Units Based on Attention

Masashi Aso, Shinnosuke Takamichi, Hiroshi Saruwatari. End-to-End Text-to-Speech Synthesis with Unaligned Multiple Language Units Based on Attention. In Helen Meng, Bo Xu 0011, Thomas Fang Zheng, editors, Interspeech 2020, 21st Annual Conference of the International Speech Communication Association, Virtual Event, Shanghai, China, 25-29 October 2020. pages 4009-4013, ISCA, 2020. [doi]

Abstract

Abstract is missing.