Semantic Alignment for Multimodal Large Language Models

Tao Wu, Mengze Li 0001, Jingyuan Chen, Wei Ji 0008, Wang Lin, Jinyang Gao, Kun Kuang, Zhou Zhao, Fei Wu 0001. Semantic Alignment for Multimodal Large Language Models. In Jianfei Cai 0001, Mohan S. Kankanhalli, Balakrishnan Prabhakaran 0001, Susanne Boll, Ramanathan Subramanian, Liang Zheng 0001, Vivek K. Singh 0001, Pablo César, Lexing Xie, Dong Xu 0001, editors, Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024 - 1 November 2024. pages 3489-3498, ACM, 2024. [doi]

Abstract

Abstract is missing.