Incorporating Audio-Guided Visual Attention into Sound Event Localization and Detection with Source Distance Estimation

Qing Wang 0008, Jun Du 0002, Hengyi Hong, Maocheng Hu, Mingqi Cai, Xin Fang. Incorporating Audio-Guided Visual Attention into Sound Event Localization and Detection with Source Distance Estimation. In IEEE International Conference on Multimedia and Expo, ICME 2025, Nantes, France, June 30 - July 4, 2025. pages 1-6, IEEE, 2025. [doi]

Abstract

Abstract is missing.