Modality Re-Balance for Visual Question Answering: A Causal Framework

Xinpeng Lv, Wanrong Huang, Haotian Wang, Ruochun Jin, Xueqiong Li, Zhipeng Lin, Shuman Li, Yongquan Feng, Yuhua Tang. Modality Re-Balance for Visual Question Answering: A Causal Framework. In IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2024, Seoul, Republic of Korea, April 14-19, 2024. Pages 5650-5654, IEEE, 2024.

Abstract

Abstract is missing.