COME: Clip-OCR and Master ObjEct for text image captioning

Gang Lv, Yining Sun, Fudong Nian, Maofei Zhu, Wenliang Tang, Zhenzhen Hu. COME: Clip-OCR and Master ObjEct for text image captioning. Image Vision Comput., 136:104751, August 2023.

Authors

Gang Lv

Yining Sun

Fudong Nian

Maofei Zhu

Wenliang Tang

Zhenzhen Hu
