Multimodal feature fusion by relational reasoning and attention for visual question answering

Weifeng Zhang, Jing Yu, Hua Hu, HaiYang Hu, Zengchang Qin. Multimodal feature fusion by relational reasoning and attention for visual question answering. Information Fusion, 55:116-126, 2020. [doi]

Abstract

Abstract is missing.