ViLEM: Visual-Language Error Modeling for Image-Text Retrieval

Yuxin Chen, Zongyang Ma, Ziqi Zhang, Zhongang Qi, Chunfeng Yuan, Ying Shan, Bing Li, Weiming Hu, Xiaohu Qie, Jianping Wu. ViLEM: Visual-Language Error Modeling for Image-Text Retrieval. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023, Vancouver, BC, Canada, June 17-24, 2023. pages 11018-11027, IEEE, 2023. [doi]

Abstract

Abstract is missing.