nnSpeech: Speaker-Guided Conditional Variational Autoencoder for Zero-Shot Multi-speaker text-to-speech

Botao Zhao, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao 0006. nnSpeech: Speaker-Guided Conditional Variational Autoencoder for Zero-Shot Multi-speaker text-to-speech. In IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2022, Virtual and Singapore, 23-27 May 2022. pages 4293-4297, IEEE, 2022.

Authors

Botao Zhao

Xulong Zhang

Jianzong Wang

Ning Cheng

Jing Xiao 0006
