Multi-Modality Cross Attention Network for Image and Sentence Matching

Xi Wei, Tianzhu Zhang, Yan Li, Yongdong Zhang, Feng Wu. Multi-Modality Cross Attention Network for Image and Sentence Matching. In 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2020, Seattle, WA, USA, June 13-19, 2020. pages 10938-10947, IEEE, 2020.

Abstract

Abstract is missing.