Multi-scale network with shared cross-attention for audio-visual correlation learning

Jiwei Zhang, Yi Yu, Suhua Tang, Wei Li, Jianming Wu. Multi-scale network with shared cross-attention for audio-visual correlation learning. Neural Computing and Applications, 35(27):20173-20187, September 2023.
