Multi-Scale Hybrid Fusion Network for Mandarin Audio-Visual Speech Recognition

Jinxin Wang, Zhongwen Guo, Chao Yang, Xiaomei Li, Ziyuan Cui. Multi-Scale Hybrid Fusion Network for Mandarin Audio-Visual Speech Recognition. In IEEE International Conference on Multimedia and Expo, ICME 2023, Brisbane, Australia, July 10-14, 2023. pages 642-647, IEEE, 2023. [doi]

Abstract

Abstract is missing.