Winoground: Probing Vision and Language Models for Visio-Linguistic Compositionality

Tristan Thrush, Ryan Jiang, Max Bartolo, Amanpreet Singh, Adina Williams, Douwe Kiela, Candace Ross. Winoground: Probing Vision and Language Models for Visio-Linguistic Compositionality. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022, New Orleans, LA, USA, June 18-24, 2022. Pages 5228-5238, IEEE, 2022. doi:10.1109/CVPR52688.2022.00517

@inproceedings{ThrushJBSWKR22,
  title = {Winoground: Probing Vision and Language Models for Visio-Linguistic Compositionality},
  author = {Tristan Thrush and Ryan Jiang and Max Bartolo and Amanpreet Singh and Adina Williams and Douwe Kiela and Candace Ross},
  year = {2022},
  doi = {10.1109/CVPR52688.2022.00517},
  url = {https://doi.org/10.1109/CVPR52688.2022.00517},
  researchr = {https://researchr.org/publication/ThrushJBSWKR22},
  pages = {5228--5238},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022, New Orleans, LA, USA, June 18-24, 2022},
  publisher = {IEEE},
  isbn = {978-1-6654-6946-3},
}