A Vision Transformer Based Scene Text Recognizer with Multi-grained Encoding and Decoding

Zhi Qiao, Zhilong Ji, Ye Yuan, Jinfeng Bai. A Vision Transformer Based Scene Text Recognizer with Multi-grained Encoding and Decoding. In Utkarsh Porwal, Alicia Fornés, Faisal Shafait, editors, Frontiers in Handwriting Recognition - 18th International Conference, ICFHR 2022, Hyderabad, India, December 4-7, 2022, Proceedings. Volume 13639 of Lecture Notes in Computer Science, pages 198-212, Springer, 2022.

Abstract

Abstract is missing.