Multi-modal spatial relational attention networks for visual question answering

Haibo Yao, Lipeng Wang, Chengtao Cai, Yuxin Sun, Zhi Zhang, Yongkang Luo. Multi-modal spatial relational attention networks for visual question answering. Image Vision Comput., 140:104840, December 2023. [doi]

Abstract

Abstract is missing.