XYLayoutLM: Towards Layout-Aware Multimodal Networks For Visually-Rich Document Understanding

Zhangxuan Gu, Changhua Meng, Ke Wang, Jun Lan, Weiqiang Wang, Ming Gu, Liqing Zhang 0001. XYLayoutLM: Towards Layout-Aware Multimodal Networks For Visually-Rich Document Understanding. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022, New Orleans, LA, USA, June 18-24, 2022. pages 4573-4582, IEEE, 2022. [doi]

Abstract

Abstract is missing.