Corvid: Improving Multimodal Large Language Models Towards Chain-of-Thought Reasoning

Jingjing Jiang, Chao Ma, Xurui Song, Hanwang Zhang, Jun Luo. Corvid: Improving Multimodal Large Language Models Towards Chain-of-Thought Reasoning. In IEEE/CVF International Conference on Computer Vision, ICCV 2025, Honolulu, HI, USA, October 19-25, 2025. pages 3034-3046, IEEE, 2025.
