Relational Attention with Textual Enhanced Transformer for Image Captioning

Lifei Song, Yiwen Shi, Xinyu Xiao, Chunxia Zhang 0001, Shiming Xiang. Relational Attention with Textual Enhanced Transformer for Image Captioning. In Huimin Ma, Liang Wang, Changshui Zhang, Fei Wu 0001, Tieniu Tan, Yaonan Wang, Jianhuang Lai, Yao Zhao 0001, editors, Pattern Recognition and Computer Vision - 4th Chinese Conference, PRCV 2021, Beijing, China, October 29 - November 1, 2021, Proceedings, Part III. Volume 13021 of Lecture Notes in Computer Science, pages 151-163, Springer, 2021. [doi]

Abstract

Abstract is missing.