Instruction-Oriented Preference Alignment for Enhancing Multi-Modal Comprehension Capability of MLLMs

Zitian Wang, Yue Liao, Kang Rong, Fengyun Rao, Yibo Yang, Si Liu. Instruction-Oriented Preference Alignment for Enhancing Multi-Modal Comprehension Capability of MLLMs. In IEEE/CVF International Conference on Computer Vision, ICCV 2025, Honolulu, HI, USA, October 19-25, 2025, pages 2010-2021. IEEE, 2025.
