Yaxian Wang, Bifan Wei, Jun Liu 0002, Lingling Zhang 0005, Shuting He, Jun Li, Qika Lin. GlFoMR: A Glance-then-Focus Multimodal Reasoning Framework for Diagram Question Answering. In Nicola Ferro 0001, Maria Maistro, Gabriella Pasi, Omar Alonso, Andrew Trotman, Suzan Verberne, editors, Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2025, Padua, Italy, July 13-18, 2025. pages 1130-1140, ACM, 2025. [doi]
Abstract is missing.