LaTr: Layout-Aware Transformer for Scene-Text VQA

Ali Furkan Biten, Ron Litman, Yusheng Xie, Srikar Appalaraju, R. Manmatha. LaTr: Layout-Aware Transformer for Scene-Text VQA. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022, New Orleans, LA, USA, June 18-24, 2022. Pages 16527-16537, IEEE, 2022.
