AlignVSR: Audio-Visual Cross-Modal Alignment for Visual Speech Recognition

Zehua Liu, Xiaolou Li, Chen Chen 0075, Li Guo, Lantian Li, Dong Wang 0013. AlignVSR: Audio-Visual Cross-Modal Alignment for Visual Speech Recognition. In Proceedings of the 2025 11th International Conference on Communication and Information Processing, ICCIP 2025, Lingshui, Hainan, China, November 12-15, 2025. pages 161-165, ACM, 2025.

Abstract

Abstract is missing.