GIT: A Generative Image-to-text Transformer for Vision and Language

Jianfeng Wang, Zhengyuan Yang, Xiaowei Hu 0006, Linjie Li, Kevin Lin, Zhe Gan, Zicheng Liu 0001, Ce Liu 0001, Lijuan Wang. GIT: A Generative Image-to-text Transformer for Vision and Language. Trans. Mach. Learn. Res., 2022, 2022. [doi]

Abstract

Abstract is missing.