NISQA: A Deep CNN-Self-Attention Model for Multidimensional Speech Quality Prediction with Crowdsourced Datasets

Gabriel Mittag, Babak Naderi, Assmaa Chehadi, Sebastian Möller 0001. NISQA: A Deep CNN-Self-Attention Model for Multidimensional Speech Quality Prediction with Crowdsourced Datasets. In Hynek Hermansky, Honza Cernocký, Lukás Burget, Lori Lamel, Odette Scharenborg, Petr Motlícek, editors, Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August - 3 September 2021. pages 2127-2131, ISCA, 2021. [doi]

@inproceedings{MittagNC021,
  title = {NISQA: A Deep CNN-Self-Attention Model for Multidimensional Speech Quality Prediction with Crowdsourced Datasets},
  author = {Gabriel Mittag and Babak Naderi and Assmaa Chehadi and Sebastian Möller 0001},
  year = {2021},
  doi = {10.21437/Interspeech.2021-299},
  url = {https://doi.org/10.21437/Interspeech.2021-299},
  researchr = {https://researchr.org/publication/MittagNC021},
  cites = {0},
  citedby = {0},
  pages = {2127-2131},
  booktitle = {Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August - 3 September 2021},
  editor = {Hynek Hermansky and Honza Cernocký and Lukás Burget and Lori Lamel and Odette Scharenborg and Petr Motlícek},
  publisher = {ISCA},
}