Multimodal Contrastive Training for Visual Representation Learning

Xin Yuan, Zhe Lin, Jason Kuen, Jianming Zhang, Yilin Wang, Michael Maire, Ajinkya Kale, Baldo Faieta. Multimodal Contrastive Training for Visual Representation Learning. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2021, virtual, June 19-25, 2021, pages 6995-7004. Computer Vision Foundation / IEEE, 2021.
