VinVL+L: Enriching Visual Representation with Location Context in VQA

Jirí Vyskocil, Lukás Picek. VinVL+L: Enriching Visual Representation with Location Context in VQA. In Robert Sablatnig, Florian Kleber, editors, Proceedings of the 26th Computer Vision Winter Workshop (CVWW 2023), Krems a.d. Donau, Austria, February 15-17, 2023. Volume 3349 of CEUR Workshop Proceedings, CEUR-WS.org, 2023. [doi]

Abstract

Abstract is missing.