VisualVoice: Audio-Visual Speech Separation With Cross-Modal Consistency

Ruohan Gao, Kristen Grauman. VisualVoice: Audio-Visual Speech Separation With Cross-Modal Consistency. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2021, virtual, June 19-25, 2021. pages 15495-15505, Computer Vision Foundation / IEEE, 2021. [doi]

Abstract

Abstract is missing.