Stacked Multimodal Attention Network for Context-Aware Video Captioning

Yi Zheng, Yuejie Zhang, Rui Feng, Tao Zhang 0022, Weiguo Fan. Stacked Multimodal Attention Network for Context-Aware Video Captioning. IEEE Trans. Circuits Syst. Video Techn., 32(1):31-42, 2022. [doi]

Abstract

Abstract is missing.