Describing Unseen Videos via Multi-modal Cooperative Dialog Agents

Ye Zhu, Yu Wu 0011, Yi Yang, Yan Yan 0006. Describing Unseen Videos via Multi-modal Cooperative Dialog Agents. In Andrea Vedaldi, Horst Bischof, Thomas Brox, Jan-Michael Frahm, editors, Computer Vision - ECCV 2020 - 16th European Conference, Glasgow, UK, August 23-28, 2020, Proceedings, Part XXIII. Volume 12368 of Lecture Notes in Computer Science, pages 153-169, Springer, 2020. [doi]

Abstract

Abstract is missing.