Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Compositional Understanding

Le Zhang, Rabiul Awal, Aishwarya Agrawal. Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Compositional Understanding. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024, Seattle, WA, USA, June 16-22, 2024. pages 13774-13784, IEEE, 2024. [doi]

Authors

Le Zhang

This author has not been identified. Look up 'Le Zhang' in Google

Rabiul Awal

This author has not been identified. Look up 'Rabiul Awal' in Google

Aishwarya Agrawal

This author has not been identified. Look up 'Aishwarya Agrawal' in Google