Multilayer Vision and Language Augmented Transformer for Image Captioning

Qiang Su, Zhixin Li. Multilayer Vision and Language Augmented Transformer for Image Captioning. In Fenrong Liu, Arun Anand Sadanandan, Duc Nghia Pham, Petrus Mursanto, Dickson Lukose, editors, PRICAI 2023: Trends in Artificial Intelligence - 20th Pacific Rim International Conference on Artificial Intelligence, PRICAI 2023, Jakarta, Indonesia, November 15-19, 2023, Proceedings, Part II. Volume 14326 of Lecture Notes in Computer Science, pages 210-222, Springer, 2023. [doi]

Abstract

Abstract is missing.