Learning Unified Video-Language Representations via Joint Modeling and Contrastive Learning for Natural Language Video Localization

Chenhao Cui, Xinnian Liang, Shuangzhi Wu, Zhoujun Li. Learning Unified Video-Language Representations via Joint Modeling and Contrastive Learning for Natural Language Video Localization. In International Joint Conference on Neural Networks, IJCNN 2023, Gold Coast, Australia, June 18-23, 2023. Pages 1-8, IEEE, 2023.
