Audio-Visual Event Localization via Recursive Fusion by Joint Co-Attention

Bin Duan, Hao Tang, Wei Wang, Ziliang Zong, Guowei Yang, Yan Yan 0002. Audio-Visual Event Localization via Recursive Fusion by Joint Co-Attention. In IEEE Winter Conference on Applications of Computer Vision, WACV 2021, Waikoloa, HI, USA, January 3-8, 2021. pages 4012-4021, IEEE, 2021. [doi]

Authors

Bin Duan

This author has not been identified. Look up 'Bin Duan' in Google

Hao Tang

This author has not been identified. Look up 'Hao Tang' in Google

Wei Wang

This author has not been identified. Look up 'Wei Wang' in Google

Ziliang Zong

This author has not been identified. Look up 'Ziliang Zong' in Google

Guowei Yang

This author has not been identified. Look up 'Guowei Yang' in Google

Yan Yan 0002

This author has not been identified. Look up 'Yan Yan 0002' in Google