Inferring Emphasis for Real Voice Data: An Attentive Multimodal Neural Network Approach

Suping Zhou, Jia Jia 0001, Long Zhang, Yanfeng Wang, Wei Chen, Fanbo Meng, Fei Yu, Jialie Shen. Inferring Emphasis for Real Voice Data: An Attentive Multimodal Neural Network Approach. In Wen-Huang Cheng, Junmo Kim, Wei-Ta Chu, Peng Cui 0001, Jung-Woo Choi, Min-Chun Hu, Wesley De Neve, editors, MultiMedia Modeling - 26th International Conference, MMM 2020, Daejeon, South Korea, January 5-8, 2020, Proceedings, Part II. Volume 11962 of Lecture Notes in Computer Science, pages 52-62, Springer, 2020. [doi]

Abstract

Abstract is missing.