Flickr30k Entities: Collecting Region-to-Phrase Correspondences for Richer Image-to-Sentence Models

Bryan A. Plummer, Liwei Wang, Chris M. Cervantes, Juan C. Caicedo, Julia Hockenmaier, Svetlana Lazebnik. Flickr30k Entities: Collecting Region-to-Phrase Correspondences for Richer Image-to-Sentence Models. In 2015 IEEE International Conference on Computer Vision, ICCV 2015, Santiago, Chile, December 7-13, 2015. pages 2641-2649, IEEE, 2015. [doi]

@inproceedings{PlummerWCCHL15,
  title = {Flickr30k Entities: Collecting Region-to-Phrase Correspondences for Richer Image-to-Sentence Models},
  author = {Bryan A. Plummer and Liwei Wang and Chris M. Cervantes and Juan C. Caicedo and Julia Hockenmaier and Svetlana Lazebnik},
  year = {2015},
  doi = {10.1109/ICCV.2015.303},
  url = {http://dx.doi.org/10.1109/ICCV.2015.303},
  researchr = {https://researchr.org/publication/PlummerWCCHL15},
  cites = {0},
  citedby = {0},
  pages = {2641-2649},
  booktitle = {2015 IEEE International Conference on Computer Vision, ICCV 2015, Santiago, Chile, December 7-13, 2015},
  publisher = {IEEE},
  isbn = {978-1-4673-8391-2},
}