Video Question Generation via Semantic Rich Cross-Modal Self-Attention Networks Learning

Yu-Siang Wang, Hung-Ting Su, Chen-Hsi Chang, Zhe Yu Liu, Winston H. Hsu. Video Question Generation via Semantic Rich Cross-Modal Self-Attention Networks Learning. In 2020 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2020, Barcelona, Spain, May 4-8, 2020. pages 2423-2427, IEEE, 2020. [doi]

Abstract

Abstract is missing.