Video-ColBERT: Contextualized Late Interaction for Text-to-Video Retrieval

Arun Reddy, Alexander Martin 0006, Eugene Yang, Andrew Yates, Kate Sanders 0002, Kenton Murray, Reno Kriz, Celso M. de Melo, Benjamin Van Durme, Rama Chellappa. Video-ColBERT: Contextualized Late Interaction for Text-to-Video Retrieval. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2025, Nashville, TN, USA, June 11-15, 2025. pages 19691-19701, Computer Vision Foundation / IEEE, 2025. [doi]

Abstract

Abstract is missing.