Multi-encoder attention-based architectures for sound recognition with partial visual assistance

Wim Boes, Hugo Van Hamme. Multi-encoder attention-based architectures for sound recognition with partial visual assistance. EURASIP J. Audio, Speech and Music Processing, 2022(1):25, 2022. [doi]

Abstract

Abstract is missing.