StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training

Yuechen Yu, Yulin Li, Chengquan Zhang, Xiaoqiang Zhang, Zengyuan Guo, Xiameng Qin, Kun Yao, Junyu Han, Errui Ding, Jingdong Wang 0001. StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training. In The Eleventh International Conference on Learning Representations, ICLR 2023, Kigali, Rwanda, May 1-5, 2023. OpenReview.net, 2023. [doi]

Authors

Yuechen Yu

This author has not been identified. Look up 'Yuechen Yu' in Google

Yulin Li

This author has not been identified. Look up 'Yulin Li' in Google

Chengquan Zhang

This author has not been identified. Look up 'Chengquan Zhang' in Google

Xiaoqiang Zhang

This author has not been identified. Look up 'Xiaoqiang Zhang' in Google

Zengyuan Guo

This author has not been identified. Look up 'Zengyuan Guo' in Google

Xiameng Qin

This author has not been identified. Look up 'Xiameng Qin' in Google

Kun Yao

This author has not been identified. Look up 'Kun Yao' in Google

Junyu Han

This author has not been identified. Look up 'Junyu Han' in Google

Errui Ding

This author has not been identified. Look up 'Errui Ding' in Google

Jingdong Wang 0001

This author has not been identified. Look up 'Jingdong Wang 0001' in Google