Multimodal architecture for video captioning with memory networks and an attention mechanism

Wei Li, Dashan Guo, Xiangzhong Fang. Multimodal architecture for video captioning with memory networks and an attention mechanism. Pattern Recognition Letters, 105:23-29, 2018. [doi]

Abstract

Abstract is missing.