The following publications are possibly variants of this publication:
- Event-Equalized Dense Video CaptioningKangyi Wu, Pengna Li, Jingwen Fu, Yizhe Li, Yang Wu, Yuhan Liu, Jinjun Wang, Sanping Zhou. cvpr 2025: 8417-8427 [doi]
- Vote2Cap-DETR++: Decoupling Localization and Describing for End-to-End 3D Dense CaptioningSijin Chen, Hongyuan Zhu, MingSheng Li, Xin Chen 0040, Peng Guo, Yinjie Lei, Gang Yu 0002, Taihao Li, Tao Chen 0003. pami, 46(11):7331-7347, November 2024. [doi]
- Event-centric multi-modal fusion method for dense video captioningZhi Chang, Dexin Zhao, Huilin Chen, Jingdan Li, Pengfei Liu. NN, 146:120-129, 2022. [doi]