The following publications are possibly variants of this publication:
- Visual Relational Reasoning for Image CaptionHaolei Pei, Qiaohong Chen, Ji Wang, Qi Sun, Yubo Jia. ijcnn 2020: 1-8 [doi]
- Relational Attention with Textual Enhanced Transformer for Image CaptioningLifei Song, Yiwen Shi, Xinyu Xiao, Chunxia Zhang 0001, Shiming Xiang. prcv 2021: 151-163 [doi]
- Dynamic Transformer for Image CaptioningTiantao Xian, Zhixin Li, Tianyu Chen, Huifang Ma. icmcs 2022: 1-6 [doi]
- Entangled Transformer for Image CaptioningGuang Li, Linchao Zhu, Ping Liu, Yi Yang. iccv 2019: 8927-8936 [doi]
- Dual Graph Convolutional Networks with Transformer and Curriculum Learning for Image CaptioningXinzhi Dong, Chengjiang Long, Wenju Xu, Chunxia Xiao. mm 2021: 2615-2624 [doi]
- Exploring Visual Relationships via Transformer-based Graphs for Enhanced Image CaptioningJingyu Li, Zhendong Mao, Hao Li, Weidong Chen, Yongdong Zhang 0001. tomccap, 20(5), May 2024. [doi]