Textual Grounding for Open-Vocabulary Visual Information Extraction in Layout-Diversified Documents

Mengjun Cheng, Chengquan Zhang, Chang Liu 0047, Yuke Li, Bohan Li, Kun Yao, Xiawu Zheng, Rongrong Ji, Jie Chen. Textual Grounding for Open-Vocabulary Visual Information Extraction in Layout-Diversified Documents. In Ales Leonardis, Elisa Ricci 0001, Stefan Roth 0001, Olga Russakovsky, Torsten Sattler, Gül Varol, editors, Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part XLV. Volume 15103 of Lecture Notes in Computer Science, pages 474-491, Springer, 2024. [doi]

Abstract

Abstract is missing.