Improving Vision-and-Language Navigation with Image-Text Pairs from the Web

Arjun Majumdar, Ayush Shrivastava, Stefan Lee, Peter Anderson, Devi Parikh, Dhruv Batra. Improving Vision-and-Language Navigation with Image-Text Pairs from the Web. In Andrea Vedaldi, Horst Bischof, Thomas Brox, Jan-Michael Frahm, editors, Computer Vision - ECCV 2020 - 16th European Conference, Glasgow, UK, August 23-28, 2020, Proceedings, Part VI. Volume 12351 of Lecture Notes in Computer Science, pages 259-274, Springer, 2020. [doi]

Abstract

Abstract is missing.