Sungkyung Kim, Adam Lee, Junyoung Park, Andrew Chung, Jusang Oh, Jay Yoon Lee. Towards Efficient Visual-Language Alignment of the Q-Former for Visual Reasoning Tasks. In Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen, editors, Findings of the Association for Computational Linguistics: EMNLP 2024, Miami, Florida, USA, November 12-16, 2024. pages 15155-15165, Association for Computational Linguistics, 2024. [doi]
Abstract is missing.