Audio-Visual Event Localization via Recursive Fusion by Joint Co-Attention

Bin Duan, Hao Tang, Wei Wang, Ziliang Zong, Guowei Yang, Yan Yan. Audio-Visual Event Localization via Recursive Fusion by Joint Co-Attention. In IEEE Winter Conference on Applications of Computer Vision, WACV 2021, Waikoloa, HI, USA, January 3-8, 2021, pages 4012-4021. IEEE, 2021.
