Multimodal attention networks for low-level vision-and-language navigation

Federico Landi, Lorenzo Baraldi, Marcella Cornia, Massimiliano Corsini, Rita Cucchiara. Multimodal attention networks for low-level vision-and-language navigation. Computer Vision and Image Understanding, 210:103255, 2021. [doi]

Abstract

Abstract is missing.