End-to-End Dense Video Captioning With Masked Transformer

Luowei Zhou, Yingbo Zhou, Jason J. Corso, Richard Socher, Caiming Xiong. End-to-End Dense Video Captioning With Masked Transformer. In 2018 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2018, Salt Lake City, UT, USA, June 18-22, 2018. pages 8739-8748, IEEE Computer Society, 2018. [doi]

Abstract

Abstract is missing.