Following Route Instructions using Large Vision-Language Models: A Comparison between Low-level and Panoramic Action Spaces

Vebjørn Haug Kåsene, Pierre Lison. Following Route Instructions using Large Vision-Language Models: A Comparison between Low-level and Panoramic Action Spaces. In Mourad Abbas, Tariq Yousef, Lukas Galke, editors, Proceedings of the 8th International Conference on Natural Language and Speech Processing, ICNLSP 2025, Southern Denmark University, Odense, Denmark, August 25-27, 2025. pages 449-463, Association for Computational Linguistics, 2025. [doi]

Abstract

Abstract is missing.