Winoground: Probing Vision and Language Models for Visio-Linguistic Compositionality

Tristan Thrush, Ryan Jiang, Max Bartolo, Amanpreet Singh, Adina Williams, Douwe Kiela, Candace Ross. Winoground: Probing Vision and Language Models for Visio-Linguistic Compositionality. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022, New Orleans, LA, USA, June 18-24, 2022. pages 5228-5238, IEEE, 2022. [doi]

Authors

Tristan Thrush

This author has not been identified. Look up 'Tristan Thrush' in Google

Ryan Jiang

This author has not been identified. Look up 'Ryan Jiang' in Google

Max Bartolo

This author has not been identified. Look up 'Max Bartolo' in Google

Amanpreet Singh

This author has not been identified. Look up 'Amanpreet Singh' in Google

Adina Williams

This author has not been identified. Look up 'Adina Williams' in Google

Douwe Kiela

This author has not been identified. Look up 'Douwe Kiela' in Google

Candace Ross

This author has not been identified. Look up 'Candace Ross' in Google