TRANS-VQA: Fully Transformer-Based Image Question-Answering Model Using Question-guided Vision Attention

Dipali Koshti, Ashutosh Gupta, Mukesh Kalla, Arvind Sharma. TRANS-VQA: Fully Transformer-Based Image Question-Answering Model Using Question-guided Vision Attention. Inteligencia Artificial, Revista Iberoamericana de Inteligencia Artificial, 27(73):111-128, January 2024. [doi]

Abstract

Abstract is missing.