Learning Semantics-Grounded Vocabulary Representation for Video-Text Retrieval

Yaya Shi, Haowei Liu, Haiyang Xu, Zongyang Ma, Qinghao Ye, Anwen Hu, Ming Yan, Ji Zhang, Fei Huang, Chunfeng Yuan, Bing Li, Weiming Hu, Zheng-Jun Zha. Learning Semantics-Grounded Vocabulary Representation for Video-Text Retrieval. In Abdulmotaleb El-Saddik, Tao Mei, Rita Cucchiara, Marco Bertini, Diana Patricia Tobon Vallejo, Pradeep K. Atrey, M. Shamim Hossain, editors, Proceedings of the 31st ACM International Conference on Multimedia, MM 2023, Ottawa, ON, Canada, 29 October - 3 November 2023, pages 4460-4470. ACM, 2023.
