Transferring General Multimodal Pretrained Models to Text Recognition

Junyang Lin, Xuancheng Ren, Yichang Zhang, Gao Liu, Peng Wang, An Yang, Chang Zhou. Transferring General Multimodal Pretrained Models to Text Recognition. In Anna Rogers, Jordan L. Boyd-Graber, Naoaki Okazaki, editors, Findings of the Association for Computational Linguistics: ACL 2023, Toronto, Canada, July 9-14, 2023, pages 588-597. Association for Computational Linguistics, 2023.
