Grounded Entity-Landmark Adaptive Pre-training for Vision-and-Language Navigation

Yibo Cui, Liang Xie, Yakun Zhang, Meishan Zhang, Ye Yan, Erwei Yin. Grounded Entity-Landmark Adaptive Pre-training for Vision-and-Language Navigation. In IEEE/CVF International Conference on Computer Vision, ICCV 2023, Paris, France, October 1-6, 2023, pages 12009-12019. IEEE, 2023.
