Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition

Yuchen Hu, Ruizhe Li, Chen Chen, Heqing Zou, Qiushi Zhu, Eng Siong Chng. Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition. In Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, IJCAI 2023, 19th-25th August 2023, Macao, SAR, China. pages 5076-5084, ijcai.org, 2023. [doi]

Authors

Yuchen Hu

This author has not been identified. Look up 'Yuchen Hu' in Google

Ruizhe Li

This author has not been identified. Look up 'Ruizhe Li' in Google

Chen Chen

This author has not been identified. Look up 'Chen Chen' in Google

Heqing Zou

This author has not been identified. Look up 'Heqing Zou' in Google

Qiushi Zhu

This author has not been identified. Look up 'Qiushi Zhu' in Google

Eng Siong Chng

This author has not been identified. Look up 'Eng Siong Chng' in Google