VQA-GNN: Reasoning with Multimodal Knowledge via Graph Neural Networks for Visual Question Answering

Yanan Wang 0002, Michihiro Yasunaga, Hongyu Ren, Shinya Wada, Jure Leskovec. VQA-GNN: Reasoning with Multimodal Knowledge via Graph Neural Networks for Visual Question Answering. In IEEE/CVF International Conference on Computer Vision, ICCV 2023, Paris, France, October 1-6, 2023. pages 21525-21535, IEEE, 2023. [doi]

Abstract

Abstract is missing.