Knowing Your Target: Target-Aware Transformer Makes Better Spatio-Temporal Video Grounding

Xin Gu, Yaojie Shen, Chenxi Luo, Tiejian Luo, Yan Huang 0002, Yuewei Lin, Heng Fan 0001, Libo Zhang 0001. Knowing Your Target: Target-Aware Transformer Makes Better Spatio-Temporal Video Grounding. In The Thirteenth International Conference on Learning Representations, ICLR 2025, Singapore, April 24-28, 2025. OpenReview.net, 2025. [doi]

Abstract

Abstract is missing.