VisualWord2Vec (Vis-W2V): Learning Visually Grounded Word Embeddings Using Abstract Scenes

Satwik Kottur, Ramakrishna Vedantam, José M. F. Moura, Devi Parikh. VisualWord2Vec (Vis-W2V): Learning Visually Grounded Word Embeddings Using Abstract Scenes. In 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016, Las Vegas, NV, USA, June 27-30, 2016, pages 4985-4994. IEEE Computer Society, 2016.
