MSStyleTTS: Multi-Scale Style Modeling With Hierarchical Context Information for Expressive Speech Synthesis

Shun Lei, Yixuan Zhou, Liyang Chen, Zhiyong Wu 0001, Xixin Wu, Shiyin Kang, Helen Meng. MSStyleTTS: Multi-Scale Style Modeling With Hierarchical Context Information for Expressive Speech Synthesis. IEEE Transactions on Audio, Speech & Language Processing, 31:3290-3303, 2023. [doi]

Authors

Shun Lei

This author has not been identified. Look up 'Shun Lei' in Google

Yixuan Zhou

This author has not been identified. Look up 'Yixuan Zhou' in Google

Liyang Chen

This author has not been identified. Look up 'Liyang Chen' in Google

Zhiyong Wu 0001

This author has not been identified. Look up 'Zhiyong Wu 0001' in Google

Xixin Wu

This author has not been identified. Look up 'Xixin Wu' in Google

Shiyin Kang

This author has not been identified. Look up 'Shiyin Kang' in Google

Helen Meng

This author has not been identified. Look up 'Helen Meng' in Google