ArCo: Attention-reinforced transformer with contrastive learning for image captioning

Zhongan Wang, Shuai Shi, Zirong Zhai, Yingna Wu, Rui Yang. ArCo: Attention-reinforced transformer with contrastive learning for image captioning. Image Vision Comput., 128:104570, 2022.

@article{WangSZWY22,
  title = {ArCo: Attention-reinforced transformer with contrastive learning for image captioning},
  author = {Zhongan Wang and Shuai Shi and Zirong Zhai and Yingna Wu and Rui Yang},
  year = {2022},
  doi = {10.1016/j.imavis.2022.104570},
  url = {https://doi.org/10.1016/j.imavis.2022.104570},
  researchr = {https://researchr.org/publication/WangSZWY22},
  cites = {0},
  citedby = {0},
  journal = {Image Vision Comput.},
  volume = {128},
  pages = {104570},
}