Audio-Visual Event Localization via Recursive Fusion by Joint Co-Attention

Bin Duan, Hao Tang, Wei Wang, Ziliang Zong, Guowei Yang, Yan Yan 0002. Audio-Visual Event Localization via Recursive Fusion by Joint Co-Attention. In IEEE Winter Conference on Applications of Computer Vision, WACV 2021, Waikoloa, HI, USA, January 3-8, 2021. pages 4012-4021, IEEE, 2021. [doi]

@inproceedings{DuanTWZY021,
  title = {Audio-Visual Event Localization via Recursive Fusion by Joint Co-Attention},
  author = {Bin Duan and Hao Tang and Wei Wang and Ziliang Zong and Guowei Yang and Yan Yan 0002},
  year = {2021},
  doi = {10.1109/WACV48630.2021.00406},
  url = {https://doi.org/10.1109/WACV48630.2021.00406},
  researchr = {https://researchr.org/publication/DuanTWZY021},
  cites = {0},
  citedby = {0},
  pages = {4012-4021},
  booktitle = {IEEE Winter Conference on Applications of Computer Vision, WACV 2021, Waikoloa, HI, USA, January 3-8, 2021},
  publisher = {IEEE},
  isbn = {978-1-6654-0477-8},
}