A focus fusion attention mechanism integrated with image captions for knowledge graph-based visual question answering

Mingyang Ma, Turdi Tohti, Yi Liang, Zicheng Zuo, Askar Hamdulla. A focus fusion attention mechanism integrated with image captions for knowledge graph-based visual question answering. Signal, Image and Video Processing, 18(4):3471-3482, June 2024. [doi]

Abstract

Abstract is missing.