DeVLBert: Learning Deconfounded Visio-Linguistic Representations

Shengyu Zhang, Tan Jiang, Tan Wang, Kun Kuang, Zhou Zhao, Jianke Zhu, Jin Yu, Hongxia Yang, Fei Wu. DeVLBert: Learning Deconfounded Visio-Linguistic Representations. In Chang Wen Chen, Rita Cucchiara, Xian-Sheng Hua 0001, Guo-Jun Qi, Elisa Ricci 0001, Zhengyou Zhang, Roger Zimmermann, editors, MM '20: The 28th ACM International Conference on Multimedia, Virtual Event / Seattle, WA, USA, October 12-16, 2020. pages 4373-4382, ACM, 2020. [doi]

Abstract

Abstract is missing.