Align and Tell: Boosting Text-Video Retrieval With Local Alignment and Fine-Grained Supervision

Xiaohan Wang, Linchao Zhu, Zhedong Zheng, Mingliang Xu, Yi Yang. Align and Tell: Boosting Text-Video Retrieval With Local Alignment and Fine-Grained Supervision. IEEE Transactions on Multimedia, 25:6079-6089, 2023. doi:10.1109/TMM.2022.3204444

@article{WangZZXY23,
  title = {Align and Tell: Boosting Text-Video Retrieval With Local Alignment and Fine-Grained Supervision},
  author = {Xiaohan Wang and Linchao Zhu and Zhedong Zheng and Mingliang Xu and Yi Yang},
  year = {2023},
  doi = {10.1109/TMM.2022.3204444},
  url = {https://doi.org/10.1109/TMM.2022.3204444},
  researchr = {https://researchr.org/publication/WangZZXY23},
  cites = {0},
  citedby = {0},
  journal = {IEEE Transactions on Multimedia},
  volume = {25},
  pages = {6079--6089},
}