A multimodal attention fusion network with a dynamic vocabulary for TextVQA

Jiajia Wu, Jun Du, Fengren Wang, Chen Yang, Xinzhe Jiang, Jinshui Hu, Bing Yin, Jianshu Zhang, Lirong Dai 0001. A multimodal attention fusion network with a dynamic vocabulary for TextVQA. Pattern Recognition, 122:108214, 2022. [doi]

Abstract

Abstract is missing.