Bidirectional Likelihood Estimation with Multi-Modal Large Language Models for Text-Video Retrieval

Dohwan Ko, Ji Soo Lee, Minhyuk Choi, Zihang Meng, Hyunwoo J. Kim. Bidirectional Likelihood Estimation with Multi-Modal Large Language Models for Text-Video Retrieval. In IEEE/CVF International Conference on Computer Vision, ICCV 2025, Honolulu, HI, USA, October 19-25, 2025, pages 22263-22273. IEEE, 2025.

Abstract

Abstract is missing.