MaxMViT-MLP: Multiaxis and Multiscale Vision Transformers Fusion Network for Speech Emotion Recognition

Kah Liang Ong, Chin-Poo Lee, Heng Siong Lim, Kian-Ming Lim, Ali Alqahtani 0001. MaxMViT-MLP: Multiaxis and Multiscale Vision Transformers Fusion Network for Speech Emotion Recognition. IEEE Access, 12:18237-18250, 2024. [doi]

Abstract

Abstract is missing.