- Evaluation of Different Tagging Schemes for Named Entity Recognition in Handwritten Documents. David Villanova-Aparisi, Carlos D. Martínez-Hinarejos, Verónica Romero, Moisés Pastor-i-Gadea. 3-16
- Analyzing the Impact of Tokenization on Multilingual Epidemic Surveillance in Low-Resource Languages. Stephen Mutuvi, Emanuela Boros, Antoine Doucet, Gaël Lejeune, Adam Jatowt, Moses Odeo. 17-32
- DAMGCN: Entity Linking in Visually Rich Documents with Dependency-Aware Multimodal Graph Convolutional Network. Yi-Ming Chen, Xiangting Hou, Dongfang Lou, Zhilin Liao, Cheng-Lin Liu. 33-47
- Analyzing Textual Information from Financial Statements for Default Prediction. Chinesh Doshi, Himani Shrotiya, Rohit Bhiogade, Himanshu S. Bhatt, Abhishek Jha. 48-65
- RealCQA: Scientific Chart Question Answering as a Test-Bed for First-Order Logic. Saleem Ahmed, Bhavin Jawade, Shubham Pandey, Srirangaraj Setlur, Venu Govindaraju. 66-83
- An Iterative Graph Learning Convolution Network for Key Information Extraction Based on the Document Inductive Bias. Jiyao Deng, Yi Zhang, Xinpeng Zhang, Zhi Tang, Liangcai Gao. 84-97
- QuOTeS: Query-Oriented Technical Summarization. Juan Antonio Ramirez-Orta, Eduardo Xamena, Ana Gabriela Maguitman, Axel J. Soto, Flavia P. Zanoto, Evangelos E. Milios. 98-114
- A Benchmark of Nested Named Entity Recognition Approaches in Historical Structured Documents. Solenn Tual, Nathalie Abadie, Joseph Chazalon, Bertrand Duménieu, Edwin Carlinet. 115-131
- "Explain Thyself Bully": Sentiment Aided Cyberbullying Detection with Explanation. Krishanu Maity, Prince Jha, Raghav Jain, Sriparna Saha, Pushpak Bhattacharyya. 132-148
- LayoutGCN: A Lightweight Architecture for Visually Rich Document Understanding. Dengliang Shi, Siliang Liu, Jintao Du, Huijia Zhu. 149-165
- Topic Shift Detection in Chinese Dialogues: Corpus and Benchmark. Jiangyi Lin, Yaxin Fan, Feng Jiang, Xiaomin Chu, Peifeng Li. 166-183
- Detecting Forged Receipts with Domain-Specific Ontology-Based Entities & Relations. Beatriz Martínez Tornés, Emanuela Boros, Antoine Doucet, Petra Gomez-Krämer, Jean-Marc Ogier. 184-199
- CED: Catalog Extraction from Documents. Tong Zhu, Guoliang Zhang, Zechang Li, Zijian Yu, Junfei Ren, Mengsong Wu, Zhefeng Wang, Baoxing Huai, Pingfu Chao, Wenliang Chen. 200-215
- A Character-Level Document Key Information Extraction Method with Contrastive Learning. Xinpeng Zhang, Jiyao Deng, Liangcai Gao. 216-230
- Multimodal Rumour Detection: Catching News that Never Transpired! Raghvendra Kumar, Ritika Sinha, Sriparna Saha, Adam Jatowt. 231-248
- Semantic Triple-Assisted Learning for Question Answering Passage Re-ranking. Dinesh Nagumothu, Bahadorreza Ofoghi, Peter W. Eklund. 249-264
- I-WAS: A Data Augmentation Method with GPT-2 for Simile Detection. Yongzhu Chang, Rongsheng Zhang, Jiashu Pu. 265-279
- Information Redundancy and Biases in Public Document Information Extraction Benchmarks. Seif Laatiri, Pirashanth Ratnamogan, Joël Tang, Laurent Lam, William Vanhuffel, Fabien Caspani. 280-294
- On Web-based Visual Corpus Construction for Visual Document Understanding. Donghyun Kim, Teakgyu Hong, Moonbin Yim, Yoonsik Kim, Geewook Kim. 297-313
- Ambigram Generation by a Diffusion Model. Takahiro Shirakawa, Seiichi Uchida. 314-330
- Analyzing Font Style Usage and Contextual Factors in Real Images. Naoya Yasukochi, Hideaki Hayashi, Daichi Haraguchi, Seiichi Uchida. 331-347
- CCpdf: Building a High Quality Corpus for Visually Rich Documents from Web Crawl Data. Michal Turski, Tomasz Stanislawek, Karol Kaczmarek, Pawel Dyda, Filip Gralinski. 348-365
- ESTER-Pt: An Evaluation Suite for TExt Recognition in Portuguese. Moniele Kunrath Santos, Guilherme Torresan Bazzo, Lucas Lima de Oliveira, Viviane Pereira Moreira. 366-383
- Augraphy: A Data Augmentation Library for Document Images. Alexander Groleau, Kok Wei Chee, Stefan Larson, Samay Maini, Jonathan Boarman. 384-401
- TextREC: A Dataset for Referring Expression Comprehension with Reading Comprehension. Chenyang Gao, Biao Yang, Hao Wang, Mingkun Yang, Wenwen Yu, Yuliang Liu, Xiang Bai. 402-420
- SIMARA: A Database for Key-Value Information Extraction from Full-Page Handwritten Documents. Solène Tarride, Mélodie Boillet, Jean-François Moufflet, Christopher Kermorvant. 421-437
- Diffusion Models for Document Image Generation. Noman Tanveer, Adnan Ul-Hasan, Faisal Shafait. 438-453
- Receipt Dataset for Document Forgery Detection. Beatriz Martínez Tornés, Théo Taburet, Emanuela Boros, Kais Rouis, Antoine Doucet, Petra Gomez-Krämer, Nicolas Sidere, Vincent Poulain D'Andecy. 454-469
- EnsExam: A Dataset for Handwritten Text Erasure on Examination Papers. Liufeng Huang, Bangdong Chen, Chongyu Liu, Dezhi Peng, Weiying Zhou, Yaqiang Wu, Hui Li, Hao Ni, Lianwen Jin. 470-485
- MIDV-Holo: A Dataset for ID Document Hologram Detection in a Video Stream. L. I. Koliaskina, Ekaterina Emelianova, Daniil V. Tropin, V. V. Popov, Konstantin B. Bulatov, Dmitry P. Nikolaev, Vladimir V. Arlazarov. 486-503