COTS: Collaborative Two-Stream Vision-Language Pre-Training Model for Cross-Modal Retrieval

Haoyu Lu, Nanyi Fei, Yuqi Huo, Yizhao Gao, Zhiwu Lu 0001, Ji-Rong Wen. COTS: Collaborative Two-Stream Vision-Language Pre-Training Model for Cross-Modal Retrieval. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022, New Orleans, LA, USA, June 18-24, 2022. pages 15671-15680, IEEE, 2022. [doi]

Authors

Haoyu Lu

This author has not been identified. Look up 'Haoyu Lu' in Google

Nanyi Fei

This author has not been identified. Look up 'Nanyi Fei' in Google

Yuqi Huo

This author has not been identified. Look up 'Yuqi Huo' in Google

Yizhao Gao

This author has not been identified. Look up 'Yizhao Gao' in Google

Zhiwu Lu 0001

This author has not been identified. Look up 'Zhiwu Lu 0001' in Google

Ji-Rong Wen

This author has not been identified. Look up 'Ji-Rong Wen' in Google