Does Audio help in deep Audio-Visual Saliency prediction models?

Ritvik Agrawal, Shreyank Jyoti, Rohit Girmaji, Sarath Sivaprasad, Vineet Gandhi. Does Audio help in deep Audio-Visual Saliency prediction models?. In Raj Tumuluri, Nicu Sebe, Gopal Pingali, Dinesh Babu Jayagopi, Abhinav Dhall, Richa Singh 0001, Lisa Anthony, Albert Ali Salah, editors, International Conference on Multimodal Interaction, ICMI 2022, Bengaluru, India, November 7-11, 2022. pages 48-56, ACM, 2022. [doi]

@inproceedings{AgrawalJGSG22,
  title = {Does Audio help in deep Audio-Visual Saliency prediction models?},
  author = {Ritvik Agrawal and Shreyank Jyoti and Rohit Girmaji and Sarath Sivaprasad and Vineet Gandhi},
  year = {2022},
  doi = {10.1145/3536221.3556625},
  url = {https://doi.org/10.1145/3536221.3556625},
  researchr = {https://researchr.org/publication/AgrawalJGSG22},
  cites = {0},
  citedby = {0},
  pages = {48-56},
  booktitle = {International Conference on Multimodal Interaction, ICMI 2022, Bengaluru, India, November 7-11, 2022},
  editor = {Raj Tumuluri and Nicu Sebe and Gopal Pingali and Dinesh Babu Jayagopi and Abhinav Dhall and Richa Singh 0001 and Lisa Anthony and Albert Ali Salah},
  publisher = {ACM},
  isbn = {978-1-4503-9390-4},
}