Matching and Localizing: A Simple yet Effective Framework for Human-Centric Spatio-Temporal Video Grounding

Chaolei Tan, Jian-Fang Hu, Wei-Shi Zheng. Matching and Localizing: A Simple yet Effective Framework for Human-Centric Spatio-Temporal Video Grounding. In Lu Fang, Daniel Povey, Guangtao Zhai, Tao Mei, Ruiping Wang, editors, Artificial Intelligence - Second CAAI International Conference, CICAI 2022, Beijing, China, August 27-28, 2022, Revised Selected Papers, Part I. Volume 13604 of Lecture Notes in Computer Science, pages 305-316, Springer, 2022.
