Prosody Modeling with 3D Visual Information for Expressive Video Dubbing

Zhihan Yang, Shansong Liu, Xu Li, Haozhe Wu, Zhiyong Wu, Ying Shan, Jia Jia. Prosody Modeling with 3D Visual Information for Expressive Video Dubbing. In Naomi Harte, Julie Carson-Berndsen, Gareth Jones, editors, 24th Annual Conference of the International Speech Communication Association, Interspeech 2023, Dublin, Ireland, August 20-24, 2023, pages 4863-4867. ISCA, 2023.
