M3: Multimodal Memory Modelling for Video Captioning

Junbo Wang, Wei Wang 0115, Yan Huang 0008, Liang Wang 0001, Tieniu Tan. M3: Multimodal Memory Modelling for Video Captioning. In 2018 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2018, Salt Lake City, UT, USA, June 18-22, 2018. pages 7512-7520, IEEE Computer Society, 2018. [doi]

Abstract

Abstract is missing.