@ CREPE: Can Vision-Language Foundation Models Reason Compositionally?

Zixian Ma, Jerry Hong, Mustafa Omer Gul, Mona Gandhi, Irena Gao, Ranjay Krishna. @ CREPE: Can Vision-Language Foundation Models Reason Compositionally?. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023, Vancouver, BC, Canada, June 17-24, 2023. pages 10910-10921, IEEE, 2023. [doi]

Abstract

Abstract is missing.