One does not fit all! On the Complementarity of Vision Encoders for Vision and Language Tasks

Gregor Geigle, Chen Liu, Jonas Pfeiffer, Iryna Gurevych. One does not fit all! On the Complementarity of Vision Encoders for Vision and Language Tasks. In Burcu Can, Maximilian Mozes, Samuel Cahyawijaya, Naomi Saphra, Nora Kassner, Shauli Ravfogel, Abhilasha Ravichander, Chen Zhao, Isabelle Augenstein, Anna Rogers, KyungHyun Cho, Edward Grefenstette, Lena Voita, editors, Proceedings of the 8th Workshop on Representation Learning for NLP, RepL4NLP@ACL 2023, Toronto, Canada, July 13, 2023, pages 97-117. Association for Computational Linguistics, 2023.
