Cleaning Dirty Books: Post-OCR Processing for Previously Scanned Texts

Allen Kim, Charuta Pethe, Naoya Inoue, Steven Skiena. Cleaning Dirty Books: Post-OCR Processing for Previously Scanned Texts. In Marie-Francine Moens, Xuanjing Huang, Lucia Specia, Scott Wen-tau Yih, editors, Findings of the Association for Computational Linguistics: EMNLP 2021, Virtual Event / Punta Cana, Dominican Republic, 16-20 November, 2021. pages 4217-4226, Association for Computational Linguistics, 2021. [doi]

Abstract

Abstract is missing.