Localizing Visual Sounds the Hard Way

Honglie Chen, Weidi Xie, Triantafyllos Afouras, Arsha Nagrani, Andrea Vedaldi, Andrew Zisserman. Localizing Visual Sounds the Hard Way. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2021, virtual, June 19-25, 2021. pages 16867-16876, Computer Vision Foundation / IEEE, 2021.

@inproceedings{ChenXANVZ21,
  title = {Localizing Visual Sounds the Hard Way},
  author = {Honglie Chen and Weidi Xie and Triantafyllos Afouras and Arsha Nagrani and Andrea Vedaldi and Andrew Zisserman},
  year = {2021},
  url = {https://openaccess.thecvf.com/content/CVPR2021/html/Chen_Localizing_Visual_Sounds_the_Hard_Way_CVPR_2021_paper.html},
  researchr = {https://researchr.org/publication/ChenXANVZ21},
  cites = {0},
  citedby = {0},
  pages = {16867--16876},
  booktitle = {IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2021, virtual, June 19-25, 2021},
  publisher = {Computer Vision Foundation / IEEE},
}