A Novel end-to-end Speech Emotion Recognition Network with Stacked Transformer Layers

Xianfeng Wang, Min Wang, Wenbo Qi, Wanqi Su, Xiangqian Wang 0001, Huan Zhou. A Novel end-to-end Speech Emotion Recognition Network with Stacked Transformer Layers. In IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2021, Toronto, ON, Canada, June 6-11, 2021. pages 6289-6293, IEEE, 2021. [doi]

@inproceedings{WangWQS0Z21,
  title = {A Novel end-to-end Speech Emotion Recognition Network with Stacked Transformer Layers},
  author = {Xianfeng Wang and Min Wang and Wenbo Qi and Wanqi Su and Xiangqian Wang 0001 and Huan Zhou},
  year = {2021},
  doi = {10.1109/ICASSP39728.2021.9414314},
  url = {https://doi.org/10.1109/ICASSP39728.2021.9414314},
  researchr = {https://researchr.org/publication/WangWQS0Z21},
  cites = {0},
  citedby = {0},
  pages = {6289-6293},
  booktitle = {IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2021, Toronto, ON, Canada, June 6-11, 2021},
  publisher = {IEEE},
  isbn = {978-1-7281-7605-5},
}