RaP: Redundancy-aware Video-language Pre-training for Text-Video Retrieval

Xing Wu 0002, Chaochen Gao, Zijia Lin, Zhongyuan Wang, Jizhong Han, Songlin Hu. RaP: Redundancy-aware Video-language Pre-training for Text-Video Retrieval. In Yoav Goldberg, Zornitsa Kozareva, Yue Zhang, editors, Findings of the Association for Computational Linguistics: EMNLP 2022, Abu Dhabi, United Arab Emirates, December 7-11, 2022. pages 3036-3047, Association for Computational Linguistics, 2022. [doi]

Abstract

Abstract is missing.