Learning Modality Interaction for Temporal Sentence Localization and Event Captioning in Videos

Shaoxiang Chen, Wenhao Jiang, Wei Liu, Yu-Gang Jiang. Learning Modality Interaction for Temporal Sentence Localization and Event Captioning in Videos. In Andrea Vedaldi, Horst Bischof, Thomas Brox, Jan-Michael Frahm, editors, Computer Vision - ECCV 2020 - 16th European Conference, Glasgow, UK, August 23-28, 2020, Proceedings, Part IV. Volume 12349 of Lecture Notes in Computer Science, pages 333-351, Springer, 2020. [doi]

Abstract

Abstract is missing.