Discovering Multimodal Hierarchical Structures with Graph Neural Networks for Multi-modal and Multi-hop Question Answering

Qing Zhang, Haocheng Lv, Jie Liu, Zhiyun Chen, Jianyong Duan, Mingying Xv, Hao Wang. Discovering Multimodal Hierarchical Structures with Graph Neural Networks for Multi-modal and Multi-hop Question Answering. In Qingshan Liu, Hanzi Wang, Zhanyu Ma, Weishi Zheng, Hongbin Zha, Xilin Chen, Liang Wang, Rongrong Ji, editors, Pattern Recognition and Computer Vision - 6th Chinese Conference, PRCV 2023, Xiamen, China, October 13-15, 2023, Proceedings, Part I. Volume 14425 of Lecture Notes in Computer Science, pages 383-394, Springer, 2023.
