CHiP: Cross-modal Hierarchical Direct Preference Optimization for Multimodal LLMs

JinLan Fu, huangfushenzhen, Hao Fei 0001, Xiaoyu Shen, Bryan Hooi, Xipeng Qiu, See-Kiong Ng. CHiP: Cross-modal Hierarchical Direct Preference Optimization for Multimodal LLMs. In The Thirteenth International Conference on Learning Representations, ICLR 2025, Singapore, April 24-28, 2025. OpenReview.net, 2025. [doi]

Abstract

Abstract is missing.