Unsupervised Multi-scale Expressive Speaking Style Modeling with Hierarchical Context Information for Audiobook Speech Synthesis

Xueyuan Chen, Shun Lei, Zhiyong Wu 0001, Dong Xu, Weifeng Zhao, Helen Meng. Unsupervised Multi-scale Expressive Speaking Style Modeling with Hierarchical Context Information for Audiobook Speech Synthesis. In Nicoletta Calzolari, Chu-Ren Huang, Hansaem Kim, James Pustejovsky, Leo Wanner, Key-Sun Choi, Pum-Mo Ryu, Hsin-Hsi Chen, Lucia Donatelli, Heng Ji, Sadao Kurohashi, Patrizia Paggio, Nianwen Xue, Seokhwan Kim, YoungGyun Hahm, Zhong He, Tony Kyungil Lee, Enrico Santus, Francis Bond, Seung-Hoon Na, editors, Proceedings of the 29th International Conference on Computational Linguistics, COLING 2022, Gyeongju, Republic of Korea, October 12-17, 2022. pages 7193-7202, International Committee on Computational Linguistics, 2022. [doi]

Authors

Xueyuan Chen

This author has not been identified. Look up 'Xueyuan Chen' in Google

Shun Lei

This author has not been identified. Look up 'Shun Lei' in Google

Zhiyong Wu 0001

This author has not been identified. Look up 'Zhiyong Wu 0001' in Google

Dong Xu

This author has not been identified. Look up 'Dong Xu' in Google

Weifeng Zhao

This author has not been identified. Look up 'Weifeng Zhao' in Google

Helen Meng

This author has not been identified. Look up 'Helen Meng' in Google