Hierarchical Multimodal Pre-training for Visually Rich Webpage Understanding

Hongshen Xu, Lu Chen, Zihan Zhao, Da Ma, Ruisheng Cao, Zichen Zhu, Kai Yu. Hierarchical Multimodal Pre-training for Visually Rich Webpage Understanding. In Luz Angelica Caudillo-Mata, Silvio Lattanzi, Andrés Muñoz Medina, Leman Akoglu, Aristides Gionis, Sergei Vassilvitskii, editors, Proceedings of the 17th ACM International Conference on Web Search and Data Mining, WSDM 2024, Merida, Mexico, March 4-8, 2024. Pages 864-872, ACM, 2024.
