DocumentNet: Bridging the Data Gap in Document Pre-training

Lijun Yu, Jin Miao, Xiaoyu Sun, Jiayi Chen, Alexander G. Hauptmann, Hanjun Dai, Wei Wei 0019. DocumentNet: Bridging the Data Gap in Document Pre-training. In Mingxuan Wang, Imed Zitouni, editors, Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: EMNLP 2023 - Industry Track, Singapore, December 6-10, 2023. pages 707-722, Association for Computational Linguistics, 2023. [doi]

Authors

Lijun Yu

This author has not been identified. Look up 'Lijun Yu' in Google

Jin Miao

This author has not been identified. Look up 'Jin Miao' in Google

Xiaoyu Sun

This author has not been identified. Look up 'Xiaoyu Sun' in Google

Jiayi Chen

This author has not been identified. Look up 'Jiayi Chen' in Google

Alexander G. Hauptmann

This author has not been identified. Look up 'Alexander G. Hauptmann' in Google

Hanjun Dai

This author has not been identified. Look up 'Hanjun Dai' in Google

Wei Wei 0019

This author has not been identified. Look up 'Wei Wei 0019' in Google