DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction

Junwen Xiong, Peng Zhang, Tao You, Chuanyue Li, Wei Huang, Yufei Zha. DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024, Seattle, WA, USA, June 16-22, 2024. pages 27263-27273, IEEE, 2024. [doi]

Abstract

Abstract is missing.