Light-Weight Multi-modality Feature Fusion Network for Visually-Rich Document Understanding

Jeff Yang, Huynh The Vu, Hai Luu Tuan. Light-Weight Multi-modality Feature Fusion Network for Visually-Rich Document Understanding. In Elisa H. Barney Smith, Marcus Liwicki, Liangrui Peng, editors, Document Analysis and Recognition - ICDAR 2024 - 18th International Conference, Athens, Greece, August 30 - September 4, 2024, Proceedings, Part I. Volume 14804 of Lecture Notes in Computer Science, pages 191-207, Springer, 2024. [doi]

@inproceedings{YangVT24,
  title = {Light-Weight Multi-modality Feature Fusion Network for Visually-Rich Document Understanding},
  author = {Jeff Yang and Huynh The Vu and Hai Luu Tuan},
  year = {2024},
  doi = {10.1007/978-3-031-70533-5_12},
  url = {https://doi.org/10.1007/978-3-031-70533-5_12},
  researchr = {https://researchr.org/publication/YangVT24},
  cites = {0},
  citedby = {0},
  pages = {191-207},
  booktitle = {Document Analysis and Recognition - ICDAR 2024 - 18th International Conference, Athens, Greece, August 30 - September 4, 2024, Proceedings, Part I},
  editor = {Elisa H. Barney Smith and Marcus Liwicki and Liangrui Peng},
  volume = {14804},
  series = {Lecture Notes in Computer Science},
  publisher = {Springer},
  isbn = {978-3-031-70533-5},
}