Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Compositional Understanding

Le Zhang, Rabiul Awal, Aishwarya Agrawal. Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Compositional Understanding. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024, Seattle, WA, USA, June 16-22, 2024. pages 13774-13784, IEEE, 2024. [doi]

@inproceedings{ZhangAA24,
  title = {Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Compositional Understanding},
  author = {Le Zhang and Rabiul Awal and Aishwarya Agrawal},
  year = {2024},
  doi = {10.1109/CVPR52733.2024.01307},
  url = {https://doi.org/10.1109/CVPR52733.2024.01307},
  researchr = {https://researchr.org/publication/ZhangAA24},
  cites = {0},
  citedby = {0},
  pages = {13774-13784},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024, Seattle, WA, USA, June 16-22, 2024},
  publisher = {IEEE},
  isbn = {979-8-3503-5300-6},
}