Sketch, Ground, and Refine: Top-Down Dense Video Captioning

Chaorui Deng, Shizhe Chen, Da Chen, Yuan He, Qi Wu 0001. Sketch, Ground, and Refine: Top-Down Dense Video Captioning. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2021, virtual, June 19-25, 2021. pages 234-243, Computer Vision Foundation / IEEE, 2021. [doi]

Abstract

Abstract is missing.