Multi-Level Multimodal Common Semantic Space for Image-Phrase Grounding

Hassan Akbari, Svebor Karaman, Surabhi Bhargava, Brian Chen, Carl Vondrick, Shih-Fu Chang. Multi-Level Multimodal Common Semantic Space for Image-Phrase Grounding. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2019, Long Beach, CA, USA, June 16-20, 2019, pages 12476-12486. Computer Vision Foundation / IEEE, 2019.

Abstract

Abstract is missing.