A Survey on Video Temporal Grounding With Multimodal Large Language Model

Jianlong Wu, Wei Liu 0005, Ye Liu 0002, Meng Liu 0006, Liqiang Nie, Zhouchen Lin, Chang Wen Chen. A Survey on Video Temporal Grounding With Multimodal Large Language Model. IEEE Trans. Pattern Anal. Mach. Intell., 48(2):1521-1541, February 2026. [doi]

Abstract

Abstract is missing.