- Hierarchical Structure Understanding in Complex Tables with VLLMs: a Benchmark and Experiments. Luca Bindini, Simone Giovannini, Simone Marinai, Valeria Nardoni, Kimiya Noor Ali. 3-16
- Archival Faces: Detection of Faces in Digitized Historical Documents. Marek Vasko, Adam Herout, Michal Hradis. 17-34
- AnonED: Complex Region Anonymisation in Electrical Diagrams Using Hybrid Density-Based Spatial Clustering. Olumayowa Onabanjo, Carlos Francisco Moreno-García, Gemma Martinez Huerta, Marina Díaz Piloñeta, Francisco Ortega Fernández. 35-49
- AnnoPage Dataset: Dataset of Non-textual Elements in Documents with Fine-Grained Categorization. Martin Kiss, Michal Hradis, Martina Dvoráková, Václav Jirousek, Filip Kersch. 50-66
- GAN-Based Content-Conditioned Generation of Handwritten Musical Symbols. Gerard Asbert, Pau Torras, Lei Kang, Alicia Fornés, Josep Lladós. 67-81
- SynFinTabs: A Dataset of Synthetic Financial Tables for Information and Table Extraction. Ethan Bradley, Muhammad Roman, Karen Rafferty, Barry Devereux. 85-100
- BengaliDiff: Diffusion Model for Few-Shot Bengali Font Generation. Md Bilayet Hossain, Honghui Yuan, Shabnur Anonna Akhy, Keiji Yanai. 101-115
- DAA-Net: Dynamic Adaptive Aggregation Network for Document Image Rectification. Xinyue Zhou, Nanfeng Jiang, Da-Han Wang, Wang Man. 116-133
- Visual Text Generation in Khmer Language: Challenges and Trends with Diffusion Models. Saly Keo, Vannkinh Nom, Souhail Bakkali, Muhammad Muzzamil Luqman, Mickaël Coustaty, Jean-Marc Ogier. 134-152
- EroPT: Benchmarking Robustness of OCR Methods on Eroded Printed Text. Gyan Singh Budhiraja, Shrey Chandola, Anandita Jamwal, Manikandan Ravikiran, Rohit Saluja. 153-165
- BiNet: A Deep Encoder-Decoder Network for Binarizing Degraded Ancient Manuscripts. Maruf A. Dhali, Jan Willem de Wit, Lambert Schomaker. 166-193
- Modular OCR Using Web Scraping Data. Guy Gisfan, Eli (Omid) David, Nathan S. Netanyahu. 194-210
- Semi-supervised Writing Style Classification in Medieval Hebrew Manuscripts. Reem Alaasam, Jihad El-Sana, Irina Rabaev, Daria Vasyutinsky Shapira. 211-226
- Enhancing Khmer-English Machine Translation via Document Analysis Techniques. Rina Buoy, Sovisal Chenda, Nguonly Taing, Marry Kong, Masakazu Iwamura, Koichi Kise. 229-245
- PALM-LAY: A Multi-script Cross-Regional Dataset for Layout Analysis of Palm Leaf Manuscripts. Nimol Thuon, Jun Du, Panhapin Theang, Ratana Thuon. 246-262
- Open Set Oracle Character Recognition via Adaptive Decision Boundary. Shuangping Huang, Zonghao Liu, Beibei Liu, Wenjie Peng, Yongge Liu. 263-276
- TMAWS: A Manchu Archives Word Spotting Method Supporting Both Image and String Query Modes. Jianjun He, Ligen Cheng, Zihang Zhang, Yu Zhou, Xinshu Cui, Ruirui Zheng. 277-295
- The Research on End-to-End Tibetan Text Detection and Recognition in Natural Scenes. Xing Peiran, Rinchen Dongrub, Nyima Tashi, Dorje Tashi, Yuqing Cai, Zhuoya Liu. 296-310
- Multi-type Tibetan Ancient Book Text Line Recognition Based on Adapter Fine-Tuning. Feixiang Cui, Dorje Tashi, Yong Tso, Nyima Tashi. 311-328
- ClapperText: A Benchmark for Text Recognition in Low-Resource Archival Documents. Tingyu Lin, Marco Peer, Florian Kleber, Robert Sablatnig. 329-346
- Cross-Lingual Learning for Low-Resource Khmer Scene Text Detection and Recognition. Vannkinh Nom, Saly Keo, Souhail Bakkali, Muhammad Muzzamil Luqman, Mickaël Coustaty, Jean-Marc Ogier. 347-365
- Text Enhancement of Degraded Historical Documents. Reem Alaasam, Boraq Madi, Jihad El-Sana. 366-375
- HePU: Hebrew Paleography Understanding Dataset For Semantic Script Segmentation. Nour Atamni, Boraq Madi, Islam Amar, Said Naamneh, Raid Saabni, Jihad El-Sana. 376-394