Generating Correction Candidates for OCR Errors using BERT Language Model and FastText SubWord Embeddings

Mahdi Hajiali, Jorge Ramón Fonseca Cacho, Kazem Taghva. Generating Correction Candidates for OCR Errors using BERT Language Model and FastText SubWord Embeddings. In Kohei Arai, editor, Intelligent Computing - Proceedings of the 2021 Computing Conference, Volume 1, SAI 2021, Virtual Event, 15-16 July, 2021. Volume 283 of Lecture Notes in Networks and Systems, pages 1045-1053, Springer, 2021.
