MaSS: A Large and Clean Multilingual Corpus of Sentence-aligned Spoken Utterances Extracted from the Bible

Marcely Zanon Boito, William Havard, Mahault Garnerin, Éric Le Ferrand, Laurent Besacier. MaSS: A Large and Clean Multilingual Corpus of Sentence-aligned Spoken Utterances Extracted from the Bible. In Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asunción Moreno, Jan Odijk, Stelios Piperidis, editors, Proceedings of The 12th Language Resources and Evaluation Conference, LREC 2020, Marseille, France, May 11-16, 2020. pages 6486-6493, European Language Resources Association, 2020. [doi]

@inproceedings{BoitoHGFB20,
  title = {MaSS: A Large and Clean Multilingual Corpus of Sentence-aligned Spoken Utterances Extracted from the Bible},
  author = {Marcely Zanon Boito and William Havard and Mahault Garnerin and Éric Le Ferrand and Laurent Besacier},
  year = {2020},
  url = {https://www.aclweb.org/anthology/2020.lrec-1.799/},
  researchr = {https://researchr.org/publication/BoitoHGFB20},
  cites = {0},
  citedby = {0},
  pages = {6486-6493},
  booktitle = {Proceedings of The 12th Language Resources and Evaluation Conference, LREC 2020, Marseille, France, May 11-16, 2020},
  editor = {Nicoletta Calzolari and Frédéric Béchet and Philippe Blache and Khalid Choukri and Christopher Cieri and Thierry Declerck and Sara Goggi and Hitoshi Isahara and Bente Maegaard and Joseph Mariani and Hélène Mazo and Asunción Moreno and Jan Odijk and Stelios Piperidis},
  publisher = {European Language Resources Association},
  isbn = {979-10-95546-34-4},
}