Visual and Linguistic Double Transformer Fusion Model for Multimodal Tweet Classification

Jinyan Zhou, Xingang Wang, Ning Liu, Xiaoyu Liu, Jiandong Lv, Xiaomin Li, Hong Zhang, Rui Cao. Visual and Linguistic Double Transformer Fusion Model for Multimodal Tweet Classification. In International Joint Conference on Neural Networks, IJCNN 2023, Gold Coast, Australia, June 18-23, 2023. pages 1-8, IEEE, 2023. [doi]

Abstract

Abstract is missing.