WENETSPEECH: A 10000+ Hours Multi-Domain Mandarin Corpus for Speech Recognition

Binbin Zhang, Hang Lv 0001, Pengcheng Guo, Qijie Shao, Chao Yang, Lei Xie 0001, Xin Xu, Hui Bu, Xiaoyu Chen, Chenchen Zeng, Di Wu, Zhendong Peng. WENETSPEECH: A 10000+ Hours Multi-Domain Mandarin Corpus for Speech Recognition. In IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2022, Virtual and Singapore, 23-27 May 2022. Pages 6182-6186, IEEE, 2022.

Authors

Binbin Zhang

Hang Lv 0001

Pengcheng Guo

Qijie Shao

Chao Yang

Lei Xie 0001

Xin Xu

Hui Bu

Xiaoyu Chen

Chenchen Zeng

Di Wu

Zhendong Peng
