Rongsen Wu, Jie Xu, Hao Zheng, Zhiyuan Xu, Zixuan Li, Shixue Cheng, Shumao Zhang. Spatio-temporal feature extraction with a global-local Transformer model for video scene graph generation. Digit. Commun. Networks, 12(2):364-374, 2026.