ProBench: Judging Multimodal Foundation Models on Open-ended Multi-domain Expert Tasks

Yan Yang 0011, Dongxu Li, Haoning Wu, Bei Chen, Liu Liu 0009, Liyuan Pan, Junnan Li. ProBench: Judging Multimodal Foundation Models on Open-ended Multi-domain Expert Tasks. In Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar, editors, Findings of the Association for Computational Linguistics, ACL 2025, Vienna, Austria, July 27 - August 1, 2025. pages 10883-10892, Association for Computational Linguistics, 2025. [doi]

Abstract

Abstract is missing.