The following publications are possibly variants of this publication:
- Self-Adaptive Neural Module Transformer for Visual Question AnsweringHuasong Zhong, Jingyuan Chen, Chen Shen, Hanwang Zhang, Jianqiang Huang, Xian-Sheng Hua 0001. tmm, 23:1264-1273, 2021. [doi]
- Dual Self-Guided Attention with Sparse Question Networks for Visual Question AnsweringXiang Shen, Dezhi Han, Chin-Chen Chang, Liang Zong. ieicetd, 105(4):785-796, 2022. [doi]
- Dual self-attention with co-attention networks for visual question answeringYun Liu, Xiaoming Zhang 0001, Qianyun Zhang, Chaozhuo Li, Feiran Huang, Xianghong Tang, Zhoujun Li. PR, 117:107956, 2021. [doi]
- A Transformer-based Medical Visual Question Answering ModelLei Liu, Xiangdong Su, Hui Guo, Daobin Zhu. icpr 2022: 1712-1718 [doi]