Paparazzi: A Deep Dive into the Capabilities of Language and Vision Models for Grounding Viewpoint Descriptions

Henrik Voigt, Jan N. Hombeck, Monique Meuschke, Kai Lawonn, Sina Zarrieß. Paparazzi: A Deep Dive into the Capabilities of Language and Vision Models for Grounding Viewpoint Descriptions. In Andreas Vlachos 0001, Isabelle Augenstein, editors, Findings of the Association for Computational Linguistics: EACL 2023, Dubrovnik, Croatia, May 2-6, 2023. pages 798-813, Association for Computational Linguistics, 2023. [doi]

Abstract

Abstract is missing.