MM-Reasoner: A Multi-Modal Knowledge-Aware Framework for Knowledge-Based Visual Question Answering

Mahmoud Khademi, Ziyi Yang, Felipe Frujeri, Chenguang Zhu. MM-Reasoner: A Multi-Modal Knowledge-Aware Framework for Knowledge-Based Visual Question Answering. In Houda Bouamor, Juan Pino 0001, Kalika Bali, editors, Findings of the Association for Computational Linguistics: EMNLP 2023, Singapore, December 6-10, 2023. pages 6571-6581, Association for Computational Linguistics, 2023. [doi]

Abstract

Abstract is missing.