PANDA: Prompt-Based Context- and Indoor-Aware Pretraining for Vision and Language Navigation

Ting Liu, Yue Hu, Wansen Wu, Youkai Wang, Kai Xu, Quanjun Yin. PANDA: Prompt-Based Context- and Indoor-Aware Pretraining for Vision and Language Navigation. In Stevan Rudinac, Alan Hanjalic, Cynthia C. S. Liem, Marcel Worring, Björn Þór Jónsson 0001, Bei Liu, Yoko Yamakata, editors, MultiMedia Modeling - 30th International Conference, MMM 2024, Amsterdam, The Netherlands, January 29 - February 2, 2024, Proceedings, Part I. Volume 14554 of Lecture Notes in Computer Science, pages 187-200, Springer, 2024. [doi]

Authors

Ting Liu

This author has not been identified. Look up 'Ting Liu' in Google

Yue Hu

This author has not been identified. Look up 'Yue Hu' in Google

Wansen Wu

This author has not been identified. Look up 'Wansen Wu' in Google

Youkai Wang

This author has not been identified. Look up 'Youkai Wang' in Google

Kai Xu

This author has not been identified. Look up 'Kai Xu' in Google

Quanjun Yin

This author has not been identified. Look up 'Quanjun Yin' in Google