Cross-Attentional Spatio-Temporal Semantic Graph Networks for Video Question Answering

Yun Liu, Xiaoming Zhang 0001, Feiran Huang, Bo Zhang, Zhoujun Li 0001. Cross-Attentional Spatio-Temporal Semantic Graph Networks for Video Question Answering. IEEE Transactions on Image Processing, 31:1684-1696, 2022. [doi]

Abstract

Abstract is missing.