PANDA: Prompt-Based Context- and Indoor-Aware Pretraining for Vision and Language Navigation

Ting Liu, Yue Hu, Wansen Wu, Youkai Wang, Kai Xu, Quanjun Yin. PANDA: Prompt-Based Context- and Indoor-Aware Pretraining for Vision and Language Navigation. In Stevan Rudinac, Alan Hanjalic, Cynthia C. S. Liem, Marcel Worring, Björn Þór Jónsson 0001, Bei Liu, Yoko Yamakata, editors, MultiMedia Modeling - 30th International Conference, MMM 2024, Amsterdam, The Netherlands, January 29 - February 2, 2024, Proceedings, Part I. Volume 14554 of Lecture Notes in Computer Science, pages 187-200, Springer, 2024. [doi]

Abstract

Abstract is missing.