Enhancing few-shot KB-VQA with panoramic image captions guided by Large Language Models

Pengpeng Qiang, Hongye Tan, Xiaoli Li 0001, Dian Wang, Ru Li, Xinyi Sun, Hu Zhang 0003, Jiye Liang. Enhancing few-shot KB-VQA with panoramic image captions guided by Large Language Models. Neurocomputing, 623:129373, 2025. [doi]

Abstract

Abstract is missing.