Explicit Cross-Modal Representation Learning for Visual Commonsense Reasoning

Xi Zhang, Feifei Zhang, Changsheng Xu. Explicit Cross-Modal Representation Learning for Visual Commonsense Reasoning. IEEE Transactions on Multimedia, 24:2986-2997, 2022. [doi]

Abstract

Abstract is missing.