UTC: A Unified Transformer with Inter-Task Contrastive Learning for Visual Dialog

Cheng Chen, Zhenshan Tan, Qingrong Cheng, Xin Jiang 0002, Qun Liu 0001, Yudong Zhu, Xiaodong Gu 0001. UTC: A Unified Transformer with Inter-Task Contrastive Learning for Visual Dialog. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022, New Orleans, LA, USA, June 18-24, 2022. pages 18082-18091, IEEE, 2022. [doi]

Abstract

Abstract is missing.