Separate, Locate, and Align: Determine Context Relation of Scene Text From Multiple Perspectives in TextVQA

Chengyang Fang, Wenhui Jiang, Yuming Fang, Yuxin Peng 0001, Yang Liu 0293. Separate, Locate, and Align: Determine Context Relation of Scene Text From Multiple Perspectives in TextVQA. IEEE Trans. Circuits Syst. Video Techn., 35(11):11172-11185, 2025. [doi]

Abstract

Abstract is missing.