ViLEM: Visual-Language Error Modeling for Image-Text Retrieval

Yuxin Chen, Zongyang Ma, Ziqi Zhang, Zhongang Qi, Chunfeng Yuan, Ying Shan, Bing Li, Weiming Hu, Xiaohu Qie, Jianping Wu. ViLEM: Visual-Language Error Modeling for Image-Text Retrieval. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023, Vancouver, BC, Canada, June 17-24, 2023. pages 11018-11027, IEEE, 2023. [doi]

Authors

Yuxin Chen

This author has not been identified. Look up 'Yuxin Chen' in Google

Zongyang Ma

This author has not been identified. Look up 'Zongyang Ma' in Google

Ziqi Zhang

This author has not been identified. Look up 'Ziqi Zhang' in Google

Zhongang Qi

This author has not been identified. Look up 'Zhongang Qi' in Google

Chunfeng Yuan

This author has not been identified. Look up 'Chunfeng Yuan' in Google

Ying Shan

This author has not been identified. Look up 'Ying Shan' in Google

Bing Li

This author has not been identified. Look up 'Bing Li' in Google

Weiming Hu

This author has not been identified. Look up 'Weiming Hu' in Google

Xiaohu Qie

This author has not been identified. Look up 'Xiaohu Qie' in Google

Jianping Wu

This author has not been identified. Look up 'Jianping Wu' in Google