WENETSPEECH: A 10000+ Hours Multi-Domain Mandarin Corpus for Speech Recognition

Binbin Zhang, Hang Lv 0001, Pengcheng Guo, Qijie Shao, Chao Yang, Lei Xie 0001, Xin Xu, Hui Bu, Xiaoyu Chen, Chenchen Zeng, Di Wu, Zhendong Peng. WENETSPEECH: A 10000+ Hours Multi-Domain Mandarin Corpus for Speech Recognition. In IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2022, Virtual and Singapore, 23-27 May 2022. pages 6182-6186, IEEE, 2022. [doi]

Abstract

Abstract is missing.