Khiem Vinh Tran, Hao Phu Phan, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen. ViCLEVR: a visual reasoning dataset and hybrid multimodal fusion model for visual question answering in Vietnamese. Multimedia Syst., 30(4):199, August 2024. [doi]
No references recorded for this publication.
No citations of this publication recorded.