Multi-modal Dense Video Captioning

Vladimir Iashin, Esa Rahtu. Multi-modal Dense Video Captioning. In 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR Workshops 2020, Seattle, WA, USA, June 14-19, 2020. pages 4117-4126, IEEE, 2020.