Image as a Language: Revisiting Scene Text Recognition via Balanced, Unified and Synchronized Vision-Language Reasoning Network

Jiajun Wei, Hongjian Zhan, Yue Lu 0001, Xiao Tu, Bing Yin, Cong Liu, Umapada Pal 0001. Image as a Language: Revisiting Scene Text Recognition via Balanced, Unified and Synchronized Vision-Language Reasoning Network. In Michael J. Wooldridge, Jennifer G. Dy, Sriraam Natarajan, editors, Thirty-Eigth AAAI Conference on Artificial Intelligence, AAAI 2024, Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence, IAAI 2024, Fourteenth Symposium on Educational Advances in Artificial Intelligence, EAAI 2014, February 20-27, 2024, Vancouver, Canada. pages 5885-5893, AAAI Press, 2024. [doi]

Authors

Jiajun Wei

This author has not been identified. Look up 'Jiajun Wei' in Google

Hongjian Zhan

This author has not been identified. Look up 'Hongjian Zhan' in Google

Yue Lu 0001

This author has not been identified. Look up 'Yue Lu 0001' in Google

Xiao Tu

This author has not been identified. Look up 'Xiao Tu' in Google

Bing Yin

This author has not been identified. Look up 'Bing Yin' in Google

Cong Liu

This author has not been identified. Look up 'Cong Liu' in Google

Umapada Pal 0001

This author has not been identified. Look up 'Umapada Pal 0001' in Google