Deep Audio-visual System for Closed-set Word-level Speech Recognition

Yougen Yuan, Wei Tang, Minhao Fan, Yue Cao, Peng Zhang, Lei Xie. Deep Audio-visual System for Closed-set Word-level Speech Recognition. In Wen Gao, Helen Mei-Ling Meng, Matthew Turk, Susan R. Fussell, Björn W. Schuller, Yale Song, Kai Yu, editors, International Conference on Multimodal Interaction, ICMI 2019, Suzhou, China, October 14-18, 2019, pages 540-545. ACM, 2019.
