mmLayout: Multi-grained MultiModal Transformer for Document Understanding

Wenjin Wang, Zhengjie Huang, Bin Luo, Qianglong Chen, Qiming Peng, Yinxu Pan, Weichong Yin, Shikun Feng, Yu Sun, Dianhai Yu, Yin Zhang. mmLayout: Multi-grained MultiModal Transformer for Document Understanding. In João Magalhães, Alberto Del Bimbo, Shin'ichi Satoh 0001, Nicu Sebe, Xavier Alameda-Pineda, Qin Jin, Vincent Oria, Laura Toni, editors, MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10 - 14, 2022. pages 4877-4886, ACM, 2022. [doi]

Authors

Wenjin Wang

This author has not been identified. Look up 'Wenjin Wang' in Google

Zhengjie Huang

This author has not been identified. Look up 'Zhengjie Huang' in Google

Bin Luo

This author has not been identified. Look up 'Bin Luo' in Google

Qianglong Chen

This author has not been identified. Look up 'Qianglong Chen' in Google

Qiming Peng

This author has not been identified. Look up 'Qiming Peng' in Google

Yinxu Pan

This author has not been identified. Look up 'Yinxu Pan' in Google

Weichong Yin

This author has not been identified. Look up 'Weichong Yin' in Google

Shikun Feng

This author has not been identified. Look up 'Shikun Feng' in Google

Yu Sun

This author has not been identified. Look up 'Yu Sun' in Google

Dianhai Yu

This author has not been identified. Look up 'Dianhai Yu' in Google

Yin Zhang

This author has not been identified. Look up 'Yin Zhang' in Google