Spatiotemporal-Textual Co-Attention Network for Video Question Answering

Zheng-Jun Zha, Jiawei Liu, Tianhao Yang, Yongdong Zhang. Spatiotemporal-Textual Co-Attention Network for Video Question Answering. TOMCCAP, 15(2s), 2019. [doi]

Abstract

Abstract is missing.