Align and Prompt: Video-and-Language Pre-training with Entity Prompts

Dongxu Li, Junnan Li 0001, Hongdong Li, Juan Carlos Niebles, Steven C. H. Hoi. Align and Prompt: Video-and-Language Pre-training with Entity Prompts. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022, New Orleans, LA, USA, June 18-24, 2022. pages 4943-4953, IEEE, 2022. [doi]

Abstract

Abstract is missing.