Image Captioning Based on An Improved Transformer with IoU Position Encoding

Yazhou Li, Yihui Shi, Yun Liu, Ruifan Li, Zhanyu Ma. Image Captioning Based on An Improved Transformer with IoU Position Encoding. In Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2021, Tokyo, Japan, December 14-17, 2021. pages 2066-2071, IEEE, 2021. [doi]

@inproceedings{LiSLLM21,
  title = {Image Captioning Based on An Improved Transformer with IoU Position Encoding},
  author = {Yazhou Li and Yihui Shi and Yun Liu and Ruifan Li and Zhanyu Ma},
  year = {2021},
  url = {https://ieeexplore.ieee.org/document/9689357},
  researchr = {https://researchr.org/publication/LiSLLM21},
  cites = {0},
  citedby = {0},
  pages = {2066-2071},
  booktitle = {Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2021, Tokyo, Japan, December 14-17, 2021},
  publisher = {IEEE},
  isbn = {978-988-14768-9-0},
}