Mingyang Ma, Turdi Tohti, Yi Liang, Zicheng Zuo, Askar Hamdulla. A focus fusion attention mechanism integrated with image captions for knowledge graph-based visual question answering. Signal, Image and Video Processing, 18(4):3471-3482, June 2024. [doi]
Abstract is missing.