Masked co-attention model for audio-visual event localization

Hengwei Liu, Xiaodong Gu. Masked co-attention model for audio-visual event localization. Appl. Intell., 54(2):1691-1705, 2024.

Abstract

Abstract is missing.