The following publications are possibly variants of this publication:
- Multimodal Semantic Attention Network for Video CaptioningLiang Sun, Bing Li, Chunfeng Yuan, Zhengjun Zha, Weiming Hu. icmcs 2019: 1300-1305 [doi]
- Hierarchical & multimodal video captioning: Discovering and transferring multimodal knowledge for vision to languageAn-An Liu, Ning Xu, Yongkang Wong, Junnan Li, Yuting Su, Mohan S. Kankanhalli. cviu, 163:113-125, 2017. [doi]