Multimodal Contrastive Training for Visual Representation Learning

Xin Yuan, Zhe Lin 0001, Jason Kuen, Jianming Zhang 0001, Yilin Wang, Michael Maire, Ajinkya Kale, Baldo Faieta. Multimodal Contrastive Training for Visual Representation Learning. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2021, virtual, June 19-25, 2021. pages 6995-7004, Computer Vision Foundation / IEEE, 2021. [doi]

@inproceedings{Yuan0K0WMKF21,
  title = {Multimodal Contrastive Training for Visual Representation Learning},
  author = {Xin Yuan and Zhe Lin 0001 and Jason Kuen and Jianming Zhang 0001 and Yilin Wang and Michael Maire and Ajinkya Kale and Baldo Faieta},
  year = {2021},
  url = {https://openaccess.thecvf.com/content/CVPR2021/html/Yuan_Multimodal_Contrastive_Training_for_Visual_Representation_Learning_CVPR_2021_paper.html},
  researchr = {https://researchr.org/publication/Yuan0K0WMKF21},
  cites = {0},
  citedby = {0},
  pages = {6995-7004},
  booktitle = {IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2021, virtual, June 19-25, 2021},
  publisher = {Computer Vision Foundation / IEEE},
}