Connecting Vision and Language with Localized Narratives

Jordi Pont-Tuset, Jasper R. R. Uijlings, Soravit Changpinyo, Radu Soricut, Vittorio Ferrari. Connecting Vision and Language with Localized Narratives. In Andrea Vedaldi, Horst Bischof, Thomas Brox, Jan-Michael Frahm, editors, Computer Vision - ECCV 2020 - 16th European Conference, Glasgow, UK, August 23-28, 2020, Proceedings, Part V. Volume 12350 of Lecture Notes in Computer Science, pages 647-664, Springer, 2020.
