Unifying Vision, Text, and Layout for Universal Document Processing

Zineng Tang, Ziyi Yang, Guoxin Wang, Yuwei Fang, Yang Liu, Chenguang Zhu, Michael Zeng 0001, Cha Zhang, Mohit Bansal. Unifying Vision, Text, and Layout for Universal Document Processing. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023, Vancouver, BC, Canada, June 17-24, 2023. pages 19254-19264, IEEE, 2023. [doi]

@inproceedings{TangYWFLZ0ZB23,
  title = {Unifying Vision, Text, and Layout for Universal Document Processing},
  author = {Zineng Tang and Ziyi Yang and Guoxin Wang and Yuwei Fang and Yang Liu and Chenguang Zhu and Michael Zeng 0001 and Cha Zhang and Mohit Bansal},
  year = {2023},
  doi = {10.1109/CVPR52729.2023.01845},
  url = {https://doi.org/10.1109/CVPR52729.2023.01845},
  researchr = {https://researchr.org/publication/TangYWFLZ0ZB23},
  cites = {0},
  citedby = {0},
  pages = {19254-19264},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023, Vancouver, BC, Canada, June 17-24, 2023},
  publisher = {IEEE},
  isbn = {979-8-3503-0129-8},
}