ViBERTgrid: A Jointly Trained Multi-modal 2D Document Representation for Key Information Extraction from Documents

Weihong Lin, Qifang Gao, Lei Sun, Zhuoyao Zhong, Kai Hu, Qin Ren, Qiang Huo. ViBERTgrid: A Jointly Trained Multi-modal 2D Document Representation for Key Information Extraction from Documents. In Josep Lladós, Daniel Lopresti, Seiichi Uchida, editors, 16th International Conference on Document Analysis and Recognition, ICDAR 2021, Lausanne, Switzerland, September 5-10, 2021, Proceedings, Part I. Volume 12821 of Lecture Notes in Computer Science, pages 548-563, Springer, 2021.

Authors

Weihong Lin

Qifang Gao

Lei Sun

Zhuoyao Zhong

Kai Hu

Qin Ren

Qiang Huo
