The following publications are possibly variants of this publication:
- A multimodal attention fusion network with a dynamic vocabulary for TextVQAJiajia Wu, Jun Du, Fengren Wang, Chen Yang, Xinzhe Jiang, Jinshui Hu, Bing Yin, Jianshu Zhang, Lirong Dai 0001. PR, 122:108214, 2022. [doi]
- Focus Your Attention: A Focal Attention for Multimodal LearningChunxiao Liu, Zhendong Mao, Tianzhu Zhang, An-An Liu, Bin Wang 0004, Yongdong Zhang 0001. tmm, 24:103-115, 2022. [doi]