MSDWild: Multi-modal Speaker Diarization Dataset in the Wild

Tao Liu, Shuai Fan 0005, Xu Xiang, Hongbo Song, Shaoxiong Lin, Jiaqi Sun, Tianyuan Han, Siyuan Chen, Binwei Yao, Sen Liu, Yifei Wu, Yanmin Qian, Kai Yu 0004. MSDWild: Multi-modal Speaker Diarization Dataset in the Wild. In Hanseok Ko, John H. L. Hansen, editors, Interspeech 2022, 23rd Annual Conference of the International Speech Communication Association, Incheon, Korea, 18-22 September 2022. pages 1476-1480, ISCA, 2022. [doi]

Abstract

Abstract is missing.