Learning Vision-and-Language Navigation from YouTube Videos

Kunyang Lin, Peihao Chen, Diwei Huang, Thomas H. Li, Mingkui Tan, Chuang Gan. Learning Vision-and-Language Navigation from YouTube Videos. In IEEE/CVF International Conference on Computer Vision, ICCV 2023, Paris, France, October 1-6, 2023. pages 8283-8292, IEEE, 2023. [doi]

Authors

Kunyang Lin

This author has not been identified. Look up 'Kunyang Lin' in Google

Peihao Chen

This author has not been identified. Look up 'Peihao Chen' in Google

Diwei Huang

This author has not been identified. Look up 'Diwei Huang' in Google

Thomas H. Li

This author has not been identified. Look up 'Thomas H. Li' in Google

Mingkui Tan

This author has not been identified. Look up 'Mingkui Tan' in Google

Chuang Gan

This author has not been identified. Look up 'Chuang Gan' in Google