Not All Frames Are Equal: Weakly-Supervised Video Grounding With Contextual Similarity and Visual Clustering Losses

Jing Shi 0005, Jia Xu, Boqing Gong, Chenliang Xu. Not All Frames Are Equal: Weakly-Supervised Video Grounding With Contextual Similarity and Visual Clustering Losses. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2019, Long Beach, CA, USA, June 16-20, 2019. pages 10444-10452, Computer Vision Foundation / IEEE, 2019. [doi]

Abstract

Abstract is missing.