Automatic Speech Recognition Datasets in Cantonese: A Survey and New Dataset

Tiezheng Yu, Rita Frieske, Peng Xu 0008, Samuel Cahyawijaya, Cheuk Tung Shadow Yiu, Holy Lovenia, Wenliang Dai, Elham J. Barezi, Qifeng Chen, Xiaojuan Ma, Bertram E. Shi, Pascale Fung. Automatic Speech Recognition Datasets in Cantonese: A Survey and New Dataset. In Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Jan Odijk, Stelios Piperidis, editors, Proceedings of the Thirteenth Language Resources and Evaluation Conference, LREC 2022, Marseille, France, 20-25 June 2022. pages 6487-6494, European Language Resources Association, 2022. [doi]

@inproceedings{YuF0CYLDBCMSF22,
  title = {Automatic Speech Recognition Datasets in Cantonese: A Survey and New Dataset},
  author = {Tiezheng Yu and Rita Frieske and Peng Xu 0008 and Samuel Cahyawijaya and Cheuk Tung Shadow Yiu and Holy Lovenia and Wenliang Dai and Elham J. Barezi and Qifeng Chen and Xiaojuan Ma and Bertram E. Shi and Pascale Fung},
  year = {2022},
  url = {https://aclanthology.org/2022.lrec-1.696},
  researchr = {https://researchr.org/publication/YuF0CYLDBCMSF22},
  cites = {0},
  citedby = {0},
  pages = {6487-6494},
  booktitle = {Proceedings of the Thirteenth Language Resources and Evaluation Conference, LREC 2022, Marseille, France, 20-25 June 2022},
  editor = {Nicoletta Calzolari and Frédéric Béchet and Philippe Blache and Khalid Choukri and Christopher Cieri and Thierry Declerck and Sara Goggi and Hitoshi Isahara and Bente Maegaard and Joseph Mariani and Hélène Mazo and Jan Odijk and Stelios Piperidis},
  publisher = {European Language Resources Association},
}