SCI-3000: A Dataset for Figure, Table and Caption Extraction from Scientific PDFs

Filip Darmanovic, Allan Hanbury, Markus Zlabinger. SCI-3000: A Dataset for Figure, Table and Caption Extraction from Scientific PDFs. In Gernot A. Fink, Rajiv Jain, Koichi Kise, Richard Zanibbi, editors, Document Analysis and Recognition - ICDAR 2023 - 17th International Conference, San José, CA, USA, August 21-26, 2023, Proceedings, Part I. Volume 14187 of Lecture Notes in Computer Science, pages 234-251, Springer, 2023.
