Vocoder-free text-to-speech synthesis incorporating generative adversarial networks using low-/multi-frequency STFT amplitude spectra

Yuki Saito, Shinnosuke Takamichi, Hiroshi Saruwatari. Vocoder-free text-to-speech synthesis incorporating generative adversarial networks using low-/multi-frequency STFT amplitude spectra. Computer Speech & Language, 58:347-363, 2019.
