Context-Aware Coherent Speaking Style Prediction with Hierarchical Transformers for Audiobook Speech Synthesis

Shun Lei, Yixuan Zhou, Liyang Chen, Zhiyong Wu 0001, Shiyin Kang, Helen Meng. Context-Aware Coherent Speaking Style Prediction with Hierarchical Transformers for Audiobook Speech Synthesis. In IEEE International Conference on Acoustics, Speech and Signal Processing ICASSP 2023, Rhodes Island, Greece, June 4-10, 2023. pages 1-5, IEEE, 2023. [doi]

Authors

Shun Lei

This author has not been identified. Look up 'Shun Lei' in Google

Yixuan Zhou

This author has not been identified. Look up 'Yixuan Zhou' in Google

Liyang Chen

This author has not been identified. Look up 'Liyang Chen' in Google

Zhiyong Wu 0001

This author has not been identified. Look up 'Zhiyong Wu 0001' in Google

Shiyin Kang

This author has not been identified. Look up 'Shiyin Kang' in Google

Helen Meng

This author has not been identified. Look up 'Helen Meng' in Google