BARTPhoBEiT: Pre-trained Sequence-to-Sequence and Image Transformers Models for Vietnamese Visual Question Answering

Khiem Vinh Tran, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen. BARTPhoBEiT: Pre-trained Sequence-to-Sequence and Image Transformers Models for Vietnamese Visual Question Answering. In International Conference on Multimedia Analysis and Pattern Recognition, MAPR 2023, Quy Nhon, Vietnam, October 5-6, 2023. pages 1-6, IEEE, 2023. [doi]

@inproceedings{TranNN23,
  title = {BARTPhoBEiT: Pre-trained Sequence-to-Sequence and Image Transformers Models for Vietnamese Visual Question Answering},
  author = {Khiem Vinh Tran and Kiet Van Nguyen and Ngan Luu-Thuy Nguyen},
  year = {2023},
  doi = {10.1109/MAPR59823.2023.10288874},
  url = {https://doi.org/10.1109/MAPR59823.2023.10288874},
  researchr = {https://researchr.org/publication/TranNN23},
  cites = {0},
  citedby = {0},
  pages = {1-6},
  booktitle = {International Conference on Multimedia Analysis and Pattern Recognition, MAPR 2023, Quy Nhon, Vietnam, October 5-6, 2023},
  publisher = {IEEE},
  isbn = {979-8-3503-2741-0},
}