More Grounded Image Captioning by Distilling Image-Text Matching Model

Yuanen Zhou, Meng Wang, Daqing Liu, Zhenzhen Hu, Hanwang Zhang. More Grounded Image Captioning by Distilling Image-Text Matching Model. In 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2020, Seattle, WA, USA, June 13-19, 2020. pages 4776-4785, IEEE, 2020. [doi]

@inproceedings{ZhouWLHZ20,
  title = {More Grounded Image Captioning by Distilling Image-Text Matching Model},
  author = {Yuanen Zhou and Meng Wang and Daqing Liu and Zhenzhen Hu and Hanwang Zhang},
  year = {2020},
  doi = {10.1109/CVPR42600.2020.00483},
  url = {https://doi.org/10.1109/CVPR42600.2020.00483},
  researchr = {https://researchr.org/publication/ZhouWLHZ20},
  cites = {0},
  citedby = {0},
  pages = {4776-4785},
  booktitle = {2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2020, Seattle, WA, USA, June 13-19, 2020},
  publisher = {IEEE},
  isbn = {978-1-7281-7168-5},
}