Uncertainty-Guided End-to-End Audio-Visual Speaker Diarization for Far-Field Recordings

Chenyu Yang, Mengxi Chen, Yanfeng Wang, Yu Wang 0027. Uncertainty-Guided End-to-End Audio-Visual Speaker Diarization for Far-Field Recordings. In Abdulmotaleb El-Saddik, Tao Mei, Rita Cucchiara, Marco Bertini 0001, Diana Patricia Tobon Vallejo, Pradeep K. Atrey, M. Shamim Hossain, editors, Proceedings of the 31st ACM International Conference on Multimedia, MM 2023, Ottawa, ON, Canada, 29 October 2023 - 3 November 2023, pages 4031-4041. ACM, 2023.

@inproceedings{YangCW023,
  title = {Uncertainty-Guided End-to-End Audio-Visual Speaker Diarization for Far-Field Recordings},
  author = {Chenyu Yang and Mengxi Chen and Yanfeng Wang and Yu Wang 0027},
  year = {2023},
  doi = {10.1145/3581783.3612424},
  url = {https://doi.org/10.1145/3581783.3612424},
  researchr = {https://researchr.org/publication/YangCW023},
  cites = {0},
  citedby = {0},
  pages = {4031--4041},
  booktitle = {Proceedings of the 31st ACM International Conference on Multimedia, MM 2023, Ottawa, ON, Canada, 29 October 2023 - 3 November 2023},
  editor = {Abdulmotaleb El-Saddik and Tao Mei and Rita Cucchiara and Marco Bertini 0001 and Diana Patricia Tobon Vallejo and Pradeep K. Atrey and M. Shamim Hossain},
  publisher = {ACM},
}