Multimodal architecture for video captioning with memory networks and an attention mechanism

Wei Li, Dashan Guo, Xiangzhong Fang. Multimodal architecture for video captioning with memory networks and an attention mechanism. Pattern Recognition Letters, 105:23-29, 2018. [doi]

Possibly Related Publications

The following publications are possibly variants of this publication: