Auditory-Inspired End-to-End Speech Emotion Recognition Using 3D Convolutional Recurrent Neural Networks Based on Spectral-Temporal Representation

Zhichao Peng, Zhi Zhu, Masashi Unoki, Jianwu Dang, Masato Akagi. Auditory-Inspired End-to-End Speech Emotion Recognition Using 3D Convolutional Recurrent Neural Networks Based on Spectral-Temporal Representation. In 2018 IEEE International Conference on Multimedia and Expo, ICME 2018, San Diego, CA, USA, July 23-27, 2018. pages 1-6, IEEE Computer Society, 2018. [doi]

Abstract

Abstract is missing.