Audio-Visual Multi-Talker Speech Recognition in a Cocktail Party

Yifei Wu, Chenda Li, Song Yang, Zhongqin Wu, Yanmin Qian. Audio-Visual Multi-Talker Speech Recognition in a Cocktail Party. In Hynek Hermansky, Honza Cernocký, Lukás Burget, Lori Lamel, Odette Scharenborg, Petr Motlícek, editors, Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August - 3 September 2021, pages 3021-3025. ISCA, 2021.

@inproceedings{WuLYWQ21,
  title = {Audio-Visual Multi-Talker Speech Recognition in a Cocktail Party},
  author = {Yifei Wu and Chenda Li and Song Yang and Zhongqin Wu and Yanmin Qian},
  year = {2021},
  doi = {10.21437/Interspeech.2021-2128},
  url = {https://doi.org/10.21437/Interspeech.2021-2128},
  researchr = {https://researchr.org/publication/WuLYWQ21},
  cites = {0},
  citedby = {0},
  pages = {3021--3025},
  booktitle = {Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August - 3 September 2021},
  editor = {Hynek Hermansky and Honza Cernocký and Lukás Burget and Lori Lamel and Odette Scharenborg and Petr Motlícek},
  publisher = {ISCA},
}