MAGVLT: Masked Generative Vision-and-Language Transformer

Sungwoong Kim, DaeJin Jo, Donghoon Lee, Jongmin Kim 0006. MAGVLT: Masked Generative Vision-and-Language Transformer. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023, Vancouver, BC, Canada, June 17-24, 2023. pages 23338-23348, IEEE, 2023. [doi]

Abstract

Abstract is missing.