Streaming Multi-Talker ASR with Token-Level Serialized Output Training

Naoyuki Kanda, Jian Wu, Yu Wu, Xiong Xiao, Zhong Meng, Xiaofei Wang, Yashesh Gaur, Zhuo Chen, Jinyu Li 0001, Takuya Yoshioka. Streaming Multi-Talker ASR with Token-Level Serialized Output Training. In Hanseok Ko, John H. L. Hansen, editors, Interspeech 2022, 23rd Annual Conference of the International Speech Communication Association, Incheon, Korea, 18-22 September 2022. pages 3774-3778, ISCA, 2022. [doi]

Abstract

Abstract is missing.