Improving Visual Speech Enhancement Network by Learning Audio-visual Affinity with Multi-head Attention

Xinmeng Xu, Yang Wang, Jie Jia, Binbin Chen, Dejun Li. Improving Visual Speech Enhancement Network by Learning Audio-visual Affinity with Multi-head Attention. In Hanseok Ko, John H. L. Hansen, editors, Interspeech 2022, 23rd Annual Conference of the International Speech Communication Association, Incheon, Korea, 18-22 September 2022, pages 971-975. ISCA, 2022.

Abstract

Abstract is missing.