VoP: Text-Video Co-Operative Prompt Tuning for Cross-Modal Retrieval

Siteng Huang, Biao Gong, Yulin Pan, Jianwen Jiang, Yiliang Lv, Yuyuan Li, Donglin Wang. VoP: Text-Video Co-Operative Prompt Tuning for Cross-Modal Retrieval. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023, Vancouver, BC, Canada, June 17-24, 2023. pages 6565-6574, IEEE, 2023. [doi]

Abstract

Abstract is missing.