Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition

Yuchen Hu, Ruizhe Li, Chen Chen, Chengwei Qin, Qiu-Shi Zhu, Eng Siong Chng. Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition. In Anna Rogers, Jordan L. Boyd-Graber, Naoaki Okazaki, editors, Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2023, Toronto, Canada, July 9-14, 2023. pages 15213-15232, Association for Computational Linguistics, 2023. [doi]

Authors

Yuchen Hu

This author has not been identified. Look up 'Yuchen Hu' in Google

Ruizhe Li

This author has not been identified. Look up 'Ruizhe Li' in Google

Chen Chen

This author has not been identified. Look up 'Chen Chen' in Google

Chengwei Qin

This author has not been identified. Look up 'Chengwei Qin' in Google

Qiu-Shi Zhu

This author has not been identified. Look up 'Qiu-Shi Zhu' in Google

Eng Siong Chng

This author has not been identified. Look up 'Eng Siong Chng' in Google