Video-Text Representation Learning via Differentiable Weak Temporal Alignment

Dohwan Ko, Joonmyung Choi, Juyeon Ko, Shinyeong Noh, Kyoung-woon On, Eun-Sol Kim, Hyunwoo J. Kim. Video-Text Representation Learning via Differentiable Weak Temporal Alignment. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022, New Orleans, LA, USA, June 18-24, 2022. pages 5006-5015, IEEE, 2022. [doi]

@inproceedings{KoCKNOKK22,
  title = {Video-Text Representation Learning via Differentiable Weak Temporal Alignment},
  author = {Dohwan Ko and Joonmyung Choi and Juyeon Ko and Shinyeong Noh and Kyoung-woon On and Eun-Sol Kim and Hyunwoo J. Kim},
  year = {2022},
  doi = {10.1109/CVPR52688.2022.00496},
  url = {https://doi.org/10.1109/CVPR52688.2022.00496},
  researchr = {https://researchr.org/publication/KoCKNOKK22},
  cites = {0},
  citedby = {0},
  pages = {5006-5015},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022, New Orleans, LA, USA, June 18-24, 2022},
  publisher = {IEEE},
  isbn = {978-1-6654-6946-3},
}