Figure and Figure Caption Extraction for Mixed Raster and Vector PDFs: Digitization of Astronomical Literature with OCR Features

Jill P. Naiman, Peter K. G. Williams, Alyssa Goodman. Figure and Figure Caption Extraction for Mixed Raster and Vector PDFs: Digitization of Astronomical Literature with OCR Features. In Gianmaria Silvello, Óscar Corcho, Paolo Manghi, Giorgio Maria Di Nunzio, Koraljka Golub, Nicola Ferro 0001, Antonella Poggi, editors, Linking Theory and Practice of Digital Libraries - 26th International Conference on Theory and Practice of Digital Libraries, TPDL 2022, Padua, Italy, September 20-23, 2022, Proceedings. Volume 13541 of Lecture Notes in Computer Science, pages 52-67, Springer, 2021. [doi]

Abstract

Abstract is missing.