A Multi-Stage Multi-Codebook VQ-VAE Approach to High-Performance Neural TTS

Haohan Guo, Feng-Long Xie, Frank K. Soong, Xixin Wu, Helen Meng. A Multi-Stage Multi-Codebook VQ-VAE Approach to High-Performance Neural TTS. In Hanseok Ko, John H. L. Hansen, editors, Interspeech 2022, 23rd Annual Conference of the International Speech Communication Association, Incheon, Korea, 18-22 September 2022. Pages 1611-1615, ISCA, 2022.
