COME: Clip-OCR and Master ObjEct for text image captioning

Gang Lv, Yining Sun, Fudong Nian, Maofei Zhu, Wenliang Tang, Zhenzhen Hu. COME: Clip-OCR and Master ObjEct for text image captioning. Image Vision Comput., 136:104751, August 2023.

Abstract

Abstract is missing.