The following publications are possibly variants of this publication:
- Dynamic Transformer for Image Captioning. Tiantao Xian, Zhixin Li, Tianyu Chen, Huifang Ma. icmcs 2022: 1-6
- Entangled Transformer for Image Captioning. Guang Li, Linchao Zhu, Ping Liu, Yi Yang. iccv 2019: 8927-8936
- ICDT: Incremental Context Guided Deliberation Transformer for Image Captioning. Xinyi Lai, Yufeng Lyu, Jiang Zhong, Chen Wang, Qizhu Dai, Gang Li. pricai 2022: 444-458
- Geometry Attention Transformer with position-aware LSTMs for image captioning. Chi Wang, Yulin Shen, Luping Ji. eswa, 201:117174, 2022.
- Geometrically-Aware Dual Transformer Encoding Visual and Textual Features for Image Captioning. Yu-Ling Chang, Hao-Shang Ma, Shiou-Chi Li, Jen-Wei Huang. pakdd 2024: 15-27
- Interactive Change-Aware Transformer Network for Remote Sensing Image Change Captioning. Chen Cai, Yi Wang, Kim-Hui Yap. remotesensing, 15(23):5611, December 2023.
- Image Captioning Through Image Transformer. Sen He, Wentong Liao, Hamed R. Tavakoli, Michael Yang, Bodo Rosenhahn, Nicolas Pugeault. ACCV 2021: 153-169