Align and Tell: Boosting Text-Video Retrieval With Local Alignment and Fine-Grained Supervision

Xiaohan Wang, Linchao Zhu, Zhedong Zheng, Mingliang Xu, Yi Yang. Align and Tell: Boosting Text-Video Retrieval With Local Alignment and Fine-Grained Supervision. IEEE Transactions on Multimedia, 25:6079-6089, 2023. [doi]

Abstract

Abstract is missing.