Localize, Group, and Select: Boosting Text-VQA by Scene Text Modeling

XiaoPeng Lu, Zhen Fan, Yansen Wang, Jean Oh, Carolyn P. Rosé. Localize, Group, and Select: Boosting Text-VQA by Scene Text Modeling. In IEEE/CVF International Conference on Computer Vision Workshops, ICCVW 2021, Montreal, BC, Canada, October 11-17, 2021, pages 2631-2639. IEEE, 2021.
