Recurrent multi-head attention fusion network for combining audio and text for speech emotion recognition

researchr

You are not signed in
Sign in
Sign up

Chung Soo Ahn, Chamara Kasun, Sunil Sivadas, Jagath C. Rajapakse. Recurrent multi-head attention fusion network for combining audio and text for speech emotion recognition. In Hanseok Ko, John H. L. Hansen, editors, Interspeech 2022, 23rd Annual Conference of the International Speech Communication Association, Incheon, Korea, 18-22 September 2022. pages 744-748, ISCA, 2022. [doi]

@inproceedings{AhnKSR22,
  title = {Recurrent multi-head attention fusion network for combining audio and text for speech emotion recognition},
  author = {Chung Soo Ahn and Chamara Kasun and Sunil Sivadas and Jagath C. Rajapakse},
  year = {2022},
  doi = {10.21437/Interspeech.2022-888},
  url = {https://doi.org/10.21437/Interspeech.2022-888},
  researchr = {https://researchr.org/publication/AhnKSR22},
  cites = {0},
  citedby = {0},
  pages = {744-748},
  booktitle = {Interspeech 2022, 23rd Annual Conference of the International Speech Communication Association, Incheon, Korea, 18-22 September 2022},
  editor = {Hanseok Ko and John H. L. Hansen},
  publisher = {ISCA},
}

External Links

Cite Key

Statistics

PDF

Researchr

Recurrent multi-head attention fusion network for combining audio and text for speech emotion recognition