Scaling Up Vision-Language Pretraining for Image Captioning

Xiaowei Hu 0006, Zhe Gan, Jianfeng Wang, Zhengyuan Yang, Zicheng Liu 0001, Yumao Lu, Lijuan Wang. Scaling Up Vision-Language Pretraining for Image Captioning. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022, New Orleans, LA, USA, June 18-24, 2022. pages 17959-17968, IEEE, 2022. [doi]

Abstract

Abstract is missing.