Speech emotion recognition based on crossmodal transformer and attention weight correction

Ryusei Terui, Takeshi Yamada. Speech emotion recognition based on crossmodal transformer and attention weight correction. In Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2024, Macau, December 3-6, 2024. Pages 1-5, IEEE, 2024.

Abstract

Abstract is missing.