SAViR-T: Spatially Attentive Visual Reasoning with Transformers

Pritish Sahu, Kalliopi Basioti, Vladimir Pavlovic. SAViR-T: Spatially Attentive Visual Reasoning with Transformers. In Massih-Reza Amini, Stéphane Canu, Asja Fischer, Tias Guns, Petra Kralj Novak, Grigorios Tsoumakas, editors, Machine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2022, Grenoble, France, September 19-23, 2022, Proceedings, Part III. Volume 13715 of Lecture Notes in Computer Science, pages 460-476, Springer, 2022.
