Video-Text Representation Learning via Differentiable Weak Temporal Alignment

Dohwan Ko, Joonmyung Choi, Juyeon Ko, Shinyeong Noh, Kyoung-woon On, Eun-Sol Kim, Hyunwoo J. Kim. Video-Text Representation Learning via Differentiable Weak Temporal Alignment. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022, New Orleans, LA, USA, June 18-24, 2022. pages 5006-5015, IEEE, 2022. [doi]

Abstract

Abstract is missing.