History Aware Multimodal Transformer for Vision-and-Language Navigation

Shizhe Chen, Pierre-Louis Guhur, Cordelia Schmid, Ivan Laptev. History Aware Multimodal Transformer for Vision-and-Language Navigation. In Marc'Aurelio Ranzato, Alina Beygelzimer, Yann N. Dauphin, Percy Liang, Jennifer Wortman Vaughan, editors, Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021, December 6-14, 2021, virtual. pages 5834-5847, 2021. [doi]

Authors

Shizhe Chen

This author has not been identified. Look up 'Shizhe Chen' in Google

Pierre-Louis Guhur

This author has not been identified. Look up 'Pierre-Louis Guhur' in Google

Cordelia Schmid

This author has not been identified. Look up 'Cordelia Schmid' in Google

Ivan Laptev

This author has not been identified. Look up 'Ivan Laptev' in Google