Temporal-enhanced Cross-modality Fusion Network for Video Sentence Grounding

Zezhong Lv, Bing Su 0001. Temporal-enhanced Cross-modality Fusion Network for Video Sentence Grounding. In IEEE International Conference on Multimedia and Expo, ICME 2023, Brisbane, Australia, July 10-14, 2023. pages 1487-1492, IEEE, 2023. [doi]

Abstract

Abstract is missing.