How to Improve Optical Character Recognition of Historical Finnish Newspapers Using Open Source Tesseract OCR Engine - Final Notes on Development and Evaluation

Mika Koistinen, Kimmo Kettunen, Jukka Kervinen. How to Improve Optical Character Recognition of Historical Finnish Newspapers Using Open Source Tesseract OCR Engine - Final Notes on Development and Evaluation. In Zygmunt Vetulani, Patrick Paroubek, Marek Kubis, editors, Human Language Technology. Challenges for Computer Science and Linguistics - 8th Language and Technology Conference, LTC 2017, Poznań, Poland, November 17-19, 2017, Revised Selected Papers. Volume 12598 of Lecture Notes in Computer Science, pages 17-30, Springer, 2017.
