ReadOCR: A Novel Dataset and Readability Assessment of OCRed Texts

Thi-Tuyet-Hai Nguyen, Adam Jatowt, Mickaël Coustaty, Antoine Doucet. ReadOCR: A Novel Dataset and Readability Assessment of OCRed Texts. In Seiichi Uchida, Elisa Barney, Véronique Eglin, editors, Document Analysis Systems - 15th IAPR International Workshop, DAS 2022, La Rochelle, France, May 22-25, 2022, Proceedings. Volume 13237 of Lecture Notes in Computer Science, pages 479-491, Springer, 2022.

Abstract

Abstract is missing.