ViBERTgrid: A Jointly Trained Multi-modal 2D Document Representation for Key Information Extraction from Documents

Weihong Lin, Qifang Gao, Lei Sun, Zhuoyao Zhong, Kai Hu, Qin Ren, Qiang Huo. ViBERTgrid: A Jointly Trained Multi-modal 2D Document Representation for Key Information Extraction from Documents. In Josep Lladós, Daniel Lopresti, Seiichi Uchida, editors, 16th International Conference on Document Analysis and Recognition, ICDAR 2021, Lausanne, Switzerland, September 5-10, 2021, Proceedings, Part I. Volume 12821 of Lecture Notes in Computer Science, pages 548-563, Springer, 2021.
