Multi-Modal Learning for Speech Emotion Recognition: An Analysis and Comparison of ASR Outputs with Ground Truth Transcription

Saurabh Sahu, Vikramjit Mitra, Nadee Seneviratne, Carol Y. Espy-Wilson. Multi-Modal Learning for Speech Emotion Recognition: An Analysis and Comparison of ASR Outputs with Ground Truth Transcription. In Gernot Kubin, Zdravko Kacic, editors, Interspeech 2019, 20th Annual Conference of the International Speech Communication Association, Graz, Austria, 15-19 September 2019. pages 3302-3306, ISCA, 2019. [doi]

Authors

Saurabh Sahu

This author has not been identified. Look up 'Saurabh Sahu' in Google

Vikramjit Mitra

This author has not been identified. Look up 'Vikramjit Mitra' in Google

Nadee Seneviratne

This author has not been identified. Look up 'Nadee Seneviratne' in Google

Carol Y. Espy-Wilson

This author has not been identified. Look up 'Carol Y. Espy-Wilson' in Google