Relational Attention with Textual Enhanced Transformer for Image Captioning

Lifei Song, Yiwen Shi, Xinyu Xiao, Chunxia Zhang 0001, Shiming Xiang. Relational Attention with Textual Enhanced Transformer for Image Captioning. In Huimin Ma, Liang Wang, Changshui Zhang, Fei Wu 0001, Tieniu Tan, Yaonan Wang, Jianhuang Lai, Yao Zhao 0001, editors, Pattern Recognition and Computer Vision - 4th Chinese Conference, PRCV 2021, Beijing, China, October 29 - November 1, 2021, Proceedings, Part III. Volume 13021 of Lecture Notes in Computer Science, pages 151-163, Springer, 2021. [doi]

@inproceedings{SongSXZX21,
  title = {Relational Attention with Textual Enhanced Transformer for Image Captioning},
  author = {Lifei Song and Yiwen Shi and Xinyu Xiao and Chunxia Zhang 0001 and Shiming Xiang},
  year = {2021},
  doi = {10.1007/978-3-030-88010-1_13},
  url = {https://doi.org/10.1007/978-3-030-88010-1_13},
  researchr = {https://researchr.org/publication/SongSXZX21},
  cites = {0},
  citedby = {0},
  pages = {151-163},
  booktitle = {Pattern Recognition and Computer Vision - 4th Chinese Conference, PRCV 2021, Beijing, China, October 29 - November 1, 2021, Proceedings, Part III},
  editor = {Huimin Ma and Liang Wang and Changshui Zhang and Fei Wu 0001 and Tieniu Tan and Yaonan Wang and Jianhuang Lai and Yao Zhao 0001},
  volume = {13021},
  series = {Lecture Notes in Computer Science},
  publisher = {Springer},
  isbn = {978-3-030-88010-1},
}