Predicting Visual Features From Text for Image and Video Caption Retrieval

Jianfeng Dong, Xirong Li, Cees G. M. Snoek. Predicting Visual Features From Text for Image and Video Caption Retrieval. IEEE Transactions on Multimedia, 20(12):3377-3388, 2018. [doi]

Abstract

Abstract is missing.