Cross-modal Pretraining and Matching for Video Understanding

Limin Wang. Cross-modal Pretraining and Matching for Video Understanding. In Bei Liu 0001, Jianlong Fu, Shizhe Chen, Qin Jin, Alexander G. Hauptmann, Yong Rui, editors, MMPT@ICMR2021: Proceedings of the 2021 Workshop on Multi-Modal Pre-Training for Multimedia Understanding, Taipei, Taiwan, August 21, 2021. pages 1-2, ACM, 2021. [doi]

Abstract

Abstract is missing.