Gist, Content, Target-Oriented: A 3-Level Human-Like Framework for Video Moment Retrieval

Di Wang 0011, Xiantao Lu, Quan Wang 0006, Yumin Tian, Bo Wan, Lihuo He. Gist, Content, Target-Oriented: A 3-Level Human-Like Framework for Video Moment Retrieval. IEEE Transactions on Multimedia, 26:11044-11056, 2024. [doi]

Abstract

Abstract is missing.