A Cross-Modal Object-Aware Transformer for Vision-and-Language Navigation

Han Ni, Jia Chen, Dayong Zhu, Dianxi Shi. A Cross-Modal Object-Aware Transformer for Vision-and-Language Navigation. In Marek Z. Reformat, Du Zhang, Nikolaos G. Bourbakis, editors, 34th IEEE International Conference on Tools with Artificial Intelligence, ICTAI 2022, Macao, China, October 31 - November 2, 2022, pages 976-981. IEEE, 2022.
