Guiding Audio-Visual Question Answering with Collective Question Reasoning

Baoqi Pei, Yifei Huang, Guo Chen, Jilan Xu, Yali Wang, Limin Wang, Tong Lu, Yu Qiao, Fei Wu. Guiding Audio-Visual Question Answering with Collective Question Reasoning. International Journal of Computer Vision, 133(10):6912-6929, October 2025.
