Cross-Modal and Hierarchical Modeling of Video and Text

Bowen Zhang, Hexiang Hu, Fei Sha. Cross-Modal and Hierarchical Modeling of Video and Text. In Vittorio Ferrari, Martial Hebert, Cristian Sminchisescu, Yair Weiss, editors, Computer Vision - ECCV 2018 - 15th European Conference, Munich, Germany, September 8-14, 2018, Proceedings, Part XIII. Volume 11217 of Lecture Notes in Computer Science, pages 385-401, Springer, 2018.

Abstract

Abstract is missing.