Spatial-temporal video grounding with cross-modal understanding and enhancement

Shu Luo, Jingyu Pan, Da Cao, Jiawei Wang 0025, Yuquan Le, Meng Liu 0006. Spatial-temporal video grounding with cross-modal understanding and enhancement. Expert Syst. Appl., 271:126650, 2025. [doi]

Abstract

Abstract is missing.