Revisiting Referring Expression Comprehension Evaluation in the Era of Large Multimodal Models

Jierun Chen, Fangyun Wei, Jinjing Zhao, Sizhe Song, Bohuai Wu, Zhuoxuan Peng, S.-H. Gary Chan, Hongyang Zhang. Revisiting Referring Expression Comprehension Evaluation in the Era of Large Multimodal Models. In IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPR Workshops 2025, Nashville, TN, USA, June 11-15, 2025. pages 513-524, Computer Vision Foundation / IEEE, 2025. [doi]

Abstract

Abstract is missing.