Multi-talker audio-visual speech recognition towards diverse scenarios

Yuxiao Lin, Tao Jin 0004, Xize Cheng, Zhou Zhao 0001, Fei Wu 0001. Multi-talker audio-visual speech recognition towards diverse scenarios. Journal of Zhejiang University - Science C, 26(11):2310-2323, November 2025. [doi]

@article{LinJCZW25,
  title = {Multi-talker audio-visual speech recognition towards diverse scenarios},
  author = {Yuxiao Lin and Tao Jin 0004 and Xize Cheng and Zhou Zhao 0001 and Fei Wu 0001},
  year = {2025},
  month = {November},
  doi = {10.1631/FITEE.2500411},
  url = {https://doi.org/10.1631/FITEE.2500411},
  researchr = {https://researchr.org/publication/LinJCZW25},
  cites = {0},
  citedby = {0},
  journal = {Journal of Zhejiang University - Science C},
  volume = {26},
  number = {11},
  pages = {2310-2323},
}