VL-Match: Enhancing Vision-Language Pretraining with Token-Level and Instance-Level Matching

Junyu Bi, Daixuan Cheng, Ping Yao, Bochen Pang, Yuefeng Zhan, Chuanguang Yang, Yujing Wang, Hao Sun, Weiwei Deng, Qi Zhang. VL-Match: Enhancing Vision-Language Pretraining with Token-Level and Instance-Level Matching. In IEEE/CVF International Conference on Computer Vision, ICCV 2023, Paris, France, October 1-6, 2023. pages 2584-2593, IEEE, 2023. [doi]

Authors

Junyu Bi

This author has not been identified. Look up 'Junyu Bi' in Google

Daixuan Cheng

This author has not been identified. Look up 'Daixuan Cheng' in Google

Ping Yao

This author has not been identified. Look up 'Ping Yao' in Google

Bochen Pang

This author has not been identified. Look up 'Bochen Pang' in Google

Yuefeng Zhan

This author has not been identified. Look up 'Yuefeng Zhan' in Google

Chuanguang Yang

This author has not been identified. Look up 'Chuanguang Yang' in Google

Yujing Wang

This author has not been identified. Look up 'Yujing Wang' in Google

Hao Sun

This author has not been identified. Look up 'Hao Sun' in Google

Weiwei Deng

This author has not been identified. Look up 'Weiwei Deng' in Google

Qi Zhang

This author has not been identified. Look up 'Qi Zhang' in Google