Vision Enhanced Generative Pre-trained Language Model for Multimodal Sentence Summarization

Liqiang Jing, Yiren Li, Junhao Xu, Yongcan Yu, Pei Shen, Xuemeng Song. Vision Enhanced Generative Pre-trained Language Model for Multimodal Sentence Summarization. Int. J. Autom. Comput., 20(2):289-298, April 2023. [doi]

Abstract

Abstract is missing.