CALM: Constrastive Cross-modal Speaking Style Modeling for Expressive Text-to-Speech Synthesis - researchr publication authors

researchr

You are not signed in
Sign in
Sign up

Yi Meng, Xiang Li, Zhiyong Wu 0001, Tingtian Li, Zixun Sun, Xinyu Xiao, Chi Sun, Hui Zhan, Helen Meng. CALM: Constrastive Cross-modal Speaking Style Modeling for Expressive Text-to-Speech Synthesis. In Hanseok Ko, John H. L. Hansen, editors, Interspeech 2022, 23rd Annual Conference of the International Speech Communication Association, Incheon, Korea, 18-22 September 2022. pages 5533-5537, ISCA, 2022. [doi]

This author has not been identified. Look up 'Yi Meng' in GoogleThis author has not been identified. Look up 'Xiang Li' in GoogleThis author has not been identified. Look up 'Zhiyong Wu 0001' in GoogleThis author has not been identified. Look up 'Tingtian Li' in GoogleThis author has not been identified. Look up 'Zixun Sun' in GoogleThis author has not been identified. Look up 'Xinyu Xiao' in GoogleThis author has not been identified. Look up 'Chi Sun' in GoogleThis author has not been identified. Look up 'Hui Zhan' in GoogleThis author has not been identified. Look up 'Helen Meng' in Google

runs on WebDSL