Question-guided attention and cross-modal alignment for knowledge-based visual question answering

Wei Li, Fuyun Deng, Zhixin Li. Question-guided attention and cross-modal alignment for knowledge-based visual question answering. Inf. Process. Manage., 63(4):104578, 2026. [doi]

Abstract

Abstract is missing.