Interactive Co-Learning with Cross-Modal Transformer for Audio-Visual Emotion Recognition

Akihiko Takashima, Ryo Masumura, Atsushi Ando, Yoshihiro Yamazaki, Mihiro Uchida, Shota Orihashi. Interactive Co-Learning with Cross-Modal Transformer for Audio-Visual Emotion Recognition. In Hanseok Ko, John H. L. Hansen, editors, Interspeech 2022, 23rd Annual Conference of the International Speech Communication Association, Incheon, Korea, 18-22 September 2022. pages 4740-4744, ISCA, 2022. [doi]

Abstract

Abstract is missing.