WENETSPEECH: A 10000+ Hours Multi-Domain Mandarin Corpus for Speech Recognition

Binbin Zhang, Hang Lv, Pengcheng Guo, Qijie Shao, Chao Yang, Lei Xie, Xin Xu, Hui Bu, Xiaoyu Chen, Chenchen Zeng, Di Wu, Zhendong Peng. WENETSPEECH: A 10000+ Hours Multi-Domain Mandarin Corpus for Speech Recognition. In IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2022, Virtual and Singapore, 23-27 May 2022, pages 6182-6186. IEEE, 2022. doi:10.1109/ICASSP43922.2022.9746682

@inproceedings{ZhangLGSYXXBCZW22,
  title = {WENETSPEECH: A 10000+ Hours Multi-Domain Mandarin Corpus for Speech Recognition},
  author = {Binbin Zhang and Hang Lv and Pengcheng Guo and Qijie Shao and Chao Yang and Lei Xie and Xin Xu and Hui Bu and Xiaoyu Chen and Chenchen Zeng and Di Wu and Zhendong Peng},
  year = {2022},
  doi = {10.1109/ICASSP43922.2022.9746682},
  url = {https://doi.org/10.1109/ICASSP43922.2022.9746682},
  researchr = {https://researchr.org/publication/ZhangLGSYXXBCZW22},
  pages = {6182--6186},
  booktitle = {IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2022, Virtual and Singapore, 23-27 May 2022},
  publisher = {IEEE},
  isbn = {978-1-6654-0540-9},
}