Multi-modal spatial relational attention networks for visual question answering

Haibo Yao, Lipeng Wang, Chengtao Cai, Yuxin Sun, Zhi Zhang, Yongkang Luo. Multi-modal spatial relational attention networks for visual question answering. Image Vision Comput., 140:104840, December 2023. [doi]

Authors

Haibo Yao

This author has not been identified. Look up 'Haibo Yao' in Google

Lipeng Wang

This author has not been identified. Look up 'Lipeng Wang' in Google

Chengtao Cai

This author has not been identified. Look up 'Chengtao Cai' in Google

Yuxin Sun

This author has not been identified. Look up 'Yuxin Sun' in Google

Zhi Zhang

This author has not been identified. Look up 'Zhi Zhang' in Google

Yongkang Luo

This author has not been identified. Look up 'Yongkang Luo' in Google