VisualWord2Vec (Vis-W2V): Learning Visually Grounded Word Embeddings Using Abstract Scenes

Satwik Kottur, Ramakrishna Vedantam, José M. F. Moura, Devi Parikh. VisualWord2Vec (Vis-W2V): Learning Visually Grounded Word Embeddings Using Abstract Scenes. In 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016, Las Vegas, NV, USA, June 27-30, 2016, pages 4985-4994. IEEE Computer Society, 2016. doi:10.1109/CVPR.2016.539

@inproceedings{KotturVMP16,
  title = {VisualWord2Vec (Vis-W2V): Learning Visually Grounded Word Embeddings Using Abstract Scenes},
  author = {Satwik Kottur and Ramakrishna Vedantam and José M. F. Moura and Devi Parikh},
  year = {2016},
  doi = {10.1109/CVPR.2016.539},
  url = {http://doi.ieeecomputersociety.org/10.1109/CVPR.2016.539},
  researchr = {https://researchr.org/publication/KotturVMP16},
  pages = {4985--4994},
  booktitle = {2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016, Las Vegas, NV, USA, June 27-30, 2016},
  publisher = {IEEE Computer Society},
  isbn = {978-1-4673-8851-1},
}