BARTPhoBEiT: Pre-trained Sequence-to-Sequence and Image Transformers Models for Vietnamese Visual Question Answering

Khiem Vinh Tran, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen. BARTPhoBEiT: Pre-trained Sequence-to-Sequence and Image Transformers Models for Vietnamese Visual Question Answering. In International Conference on Multimedia Analysis and Pattern Recognition, MAPR 2023, Quy Nhon, Vietnam, October 5-6, 2023. pages 1-6, IEEE, 2023. [doi]

Abstract

Abstract is missing.