MS-Swinformer and DMTL: Multi-scale spatial fusion and dynamic multi-task learning for speech emotion recognition

Defu Lan, Hai Cheng. MS-Swinformer and DMTL: Multi-scale spatial fusion and dynamic multi-task learning for speech emotion recognition. Computer Speech & Language, 99:101908, 2026. [doi]

Abstract

Abstract is missing.