Predicting Visual Features From Text for Image and Video Caption Retrieval

Jianfeng Dong, Xirong Li, Cees G. M. Snoek. Predicting Visual Features From Text for Image and Video Caption Retrieval. IEEE Transactions on Multimedia, 20(12):3377-3388, 2018. [doi]

No reviews for this publication, yet.