No Data to Crawl? Monolingual Corpus Creation from PDF Files of Truly low-Resource Languages in Peru

Gina Bustamante, Arturo Oncevay, Roberto Zariquiey. No Data to Crawl? Monolingual Corpus Creation from PDF Files of Truly low-Resource Languages in Peru. In Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asunción Moreno, Jan Odijk, Stelios Piperidis, editors, Proceedings of The 12th Language Resources and Evaluation Conference, LREC 2020, Marseille, France, May 11-16, 2020. pages 2914-2923, European Language Resources Association, 2020. [doi]

@inproceedings{BustamanteOZ20,
  title = {No Data to Crawl? Monolingual Corpus Creation from PDF Files of Truly low-Resource Languages in Peru},
  author = {Gina Bustamante and Arturo Oncevay and Roberto Zariquiey},
  year = {2020},
  url = {https://www.aclweb.org/anthology/2020.lrec-1.356/},
  researchr = {https://researchr.org/publication/BustamanteOZ20},
  cites = {0},
  citedby = {0},
  pages = {2914-2923},
  booktitle = {Proceedings of The 12th Language Resources and Evaluation Conference, LREC 2020, Marseille, France, May 11-16, 2020},
  editor = {Nicoletta Calzolari and Frédéric Béchet and Philippe Blache and Khalid Choukri and Christopher Cieri and Thierry Declerck and Sara Goggi and Hitoshi Isahara and Bente Maegaard and Joseph Mariani and Hélène Mazo and Asunción Moreno and Jan Odijk and Stelios Piperidis},
  publisher = {European Language Resources Association},
  isbn = {979-10-95546-34-4},
}