More Grounded Image Captioning by Distilling Image-Text Matching Model

Yuanen Zhou, Meng Wang, Daqing Liu, Zhenzhen Hu, Hanwang Zhang. More Grounded Image Captioning by Distilling Image-Text Matching Model. In 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2020, Seattle, WA, USA, June 13-19, 2020. pages 4776-4785, IEEE, 2020. [doi]

Abstract

Abstract is missing.