An End-to-End Mandarin Audio-Visual Speech Recognition Model with a Feature Enhancement Module

Jinxin Wang, Chao Yang, Zhongwen Guo, Xiaomei Li, Weigang Wang. An End-to-End Mandarin Audio-Visual Speech Recognition Model with a Feature Enhancement Module. In IEEE International Conference on Systems, Man, and Cybernetics, SMC 2023, Honolulu, Oahu, HI, USA, October 1-4, 2023. pages 572-577, IEEE, 2023. [doi]

Abstract

Abstract is missing.