How Can Objects Help Video-Language Understanding?

Zitian Tang, Shijie Wang, Junho Cho, Jaewook Yoo, Chen Sun. How Can Objects Help Video-Language Understanding? In IEEE/CVF International Conference on Computer Vision, ICCV 2025, Honolulu, HI, USA, October 19-25, 2025, pages 21994-22003. IEEE, 2025.
