Recurrent multi-head attention fusion network for combining audio and text for speech emotion recognition

Chung Soo Ahn, Chamara Kasun, Sunil Sivadas, Jagath C. Rajapakse. Recurrent multi-head attention fusion network for combining audio and text for speech emotion recognition. In Hanseok Ko, John H. L. Hansen, editors, Interspeech 2022, 23rd Annual Conference of the International Speech Communication Association, Incheon, Korea, 18-22 September 2022. pages 744-748, ISCA, 2022. [doi]

Authors

Chung Soo Ahn

This author has not been identified. Look up 'Chung Soo Ahn' in Google

Chamara Kasun

This author has not been identified. Look up 'Chamara Kasun' in Google

Sunil Sivadas

This author has not been identified. Look up 'Sunil Sivadas' in Google

Jagath C. Rajapakse

This author has not been identified. Look up 'Jagath C. Rajapakse' in Google