ContextCLIP: Contextual Alignment of Image-Text pairs on CLIP visual representations

Chanda Grover, Indra Deep Mastan, Debayan Gupta. ContextCLIP: Contextual Alignment of Image-Text pairs on CLIP visual representations. In Soma Biswas, Shanmuganathan Raman, Amit K. Roy Chowdhury, editors, Proceedings of the Thirteenth Indian Conference on Computer Vision, Graphics and Image Processing, ICVGIP 2022, Gandhinagar, India, December 8-10, 2022. ACM, 2022.

Abstract

Abstract is missing.