Information Fusion in Attention Networks Using Adaptive and Multi-Level Factorized Bilinear Pooling for Audio-Visual Emotion Recognition

Hengshun Zhou, Jun Du, Yuanyuan Zhang, Qing Wang 0008, Qing-Feng Liu, Chin-Hui Lee. Information Fusion in Attention Networks Using Adaptive and Multi-Level Factorized Bilinear Pooling for Audio-Visual Emotion Recognition. IEEE Transactions on Audio, Speech & Language Processing, 29:2617-2629, 2021. [doi]

Abstract

Abstract is missing.