VQA-MHUG: A Gaze Dataset to Study Multimodal Neural Attention in Visual Question Answering

Ekta Sood, Fabian Kögel, Florian Strohm, Prajit Dhar, Andreas Bulling. VQA-MHUG: A Gaze Dataset to Study Multimodal Neural Attention in Visual Question Answering. In Arianna Bisazza, Omri Abend, editors, Proceedings of the 25th Conference on Computational Natural Language Learning, CoNLL 2021, Online, November 10-11, 2021. pages 27-43, Association for Computational Linguistics, 2021. [doi]

Abstract

Abstract is missing.