Multi-Rate Attention Architecture for Fast Streamable Text-to-Speech Spectrum Modeling

Qing He, Zhiping Xiu, Thilo Köhler, Jilong Wu. Multi-Rate Attention Architecture for Fast Streamable Text-to-Speech Spectrum Modeling. In IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2021, Toronto, ON, Canada, June 6-11, 2021. pages 5689-5693, IEEE, 2021. [doi]

Abstract

Abstract is missing.