Abstract is missing.
- SFDLA: Source-Free Document Layout AnalysisSebastian Tewes, Yufan Chen 0001, Omar Moured, Jiaming Zhang 0001, Rainer Stiefelhagen. 3-22 [doi]
- IndicDLP: A Foundational Dataset for Multi-lingual and Multi-domain Document Layout ParsingOikantik Nath, Sahithi Kukkala, Mitesh M. Khapra, Ravi Kiran Sarvadevabhatla. 23-39 [doi]
- UniLayDet: Simple Multi-dataset Document Layout AnalysisPrasidh Srikumar, Ajoy Mondal, C. V. Jawahar. 40-57 [doi]
- HIP: Hierarchical Point Modeling and Pre-training for Visual Information ExtractionRujiao Long, Pengfei Wang, Zhibo Yang 0003, Wenqing Cheng. 58-75 [doi]
- The Return of Structural Handwritten Mathematical Expression RecognitionJakob Seitz, Tobias Lengfeld, Radu Timofte. 79-95 [doi]
- PACM: Position-Aware Cross-Modality Decoder for Handwritten Mathematical Expression RecognitionZeng Li, Jin Wei, Zhijie Shen, Can Ma, Yaqiang Wu, Yu Zhou 0015. 96-114 [doi]
- From Scribbles to Text: A Novel Transformer-Based Recognition Model for Child HandwritingSahana Rangasrinivasan, M. S. Sumi Suresh, Srirangaraj Setlur, Bharat Jayaraman, Venu Govindaraju. 115-131 [doi]
- DCC: Plug-and-Play Dynamic Category Compression for Enhanced Handwritten Text GenerationYiming Wang, Hongxi Wei, Heng Wang, Shiwen Sun. 132-146 [doi]
- Template-Guided Cascaded Diffusion for Stylized Handwritten Chinese Text-Line GenerationHonglie Wang, Minsi Ren, Yan-Ming Zhang, Fei Yin, Cheng-Lin Liu. 149-166 [doi]
- FSTDiff: One-Shot Font Generation via Cross-Font Style Transformation LearningShilin Li, Anna Zhu. 167-182 [doi]
- AI-Generated Lecture Slides for Improving Slide Element Detection and RetrievalSuyash Maniyar, Vishvesh Trivedi, Ajoy Mondal, Anand Mishra 0001, C. V. Jawahar. 183-199 [doi]
- InfoDesignLM: An LLM for Interactive and Controllable Infographic Designing Through TextXilin Zhang, Hao Wang, Jianbiao Dai, Pinpin Zhu. 200-218 [doi]
- Layout-Aware Text Editing for Efficient Transformation of Academic PDFs to MarkdownChangxu Duan. 221-241 [doi]
- HiDReader: Human-Inspired Document Reading Agent via Reinforcement LearningChangqing Wang, Hao Wang, Pinpin Zhu, Huiran Zhang. 242-258 [doi]
- SemiTabDETR: End-to-End Semi-supervised Table Detection with Transformer-Based Enhanced Query ApproachTahira Shehzadi, Didier Stricker, Muhammad Zeshan Afzal. 259-279 [doi]
- TexTAR: Textual Attribute Recognition in Multi-domain and Multi-lingual Document ImagesRohan Kumar, Jyothi Swaroopa Jinka, Ravi Kiran Sarvadevabhatla. 280-296 [doi]
- A Novel Multi-modal Dataset and Method for Handwritten Signature Recognition with Image-Audio FusionQixiang Li, Xirali Ablat, Xiaoya Lin, Mahpirat Muhammat, Kurban Ubul. 299-317 [doi]
- Personality Trait Prediction from Twitter Data Using Text and Image FeaturesKunal Biswas, Shivakumara Palaiahnakote, Umapada Pal 0001, Daniel P. Lopresti, Tong Lu. 318-336 [doi]
- ComicsPAP: Understanding Comic Strips by Picking the Correct PanelEmanuele Vivoli, Artemis Llabrés, Mohamed Ali Souibgui, Marco Bertini 0001, Ernest Valveny Llobet, Dimosthenis Karatzas. 337-350 [doi]
- T-LLaVA: An Effective Saliency-Aware Slicing Strategy for Text RecognitionMengze Wei, Chun Yang, Min Liang, Fang Zhou, Xiaobin Zhu 0001, Xu-Cheng Yin. 351-369 [doi]
- From Conversations to Insights: A Multimodal Approach to Discussion SummarizationPunit Kumar Singh, Nishant Kumar, Hrushik Mehta, Sriparna Saha 0001. 373-391 [doi]
- SemSyn-LCE: A Charge Prediction Method Based on Semantic Syntactic Fusion and Legal Constituent Elements MatchingWenjun Chen, Bianxia Du, Wenhui Xia, Qiao Hu, Yupeng Hu. 392-409 [doi]
- Expertise Finding: Domain Extraction from Documents Using Fuzzy ClusteringDipendra Sharma Kafle, Esma Talhi, Mickaël Coustaty, Antoine Doucet. 410-427 [doi]
- MATATA: Weakly Supervised End-to-End MAthematical Tool-Augmented Reasoning for Tabular ApplicationsVishnou Vinayagame, Gregory Senay, Luis Martí. 428-467 [doi]