Positional Attention Guided Transformer-Like Architecture for Visual Question Answering

Aihua Mao, Zhi Yang, Ken Lin, Jun Xuan, Yong-Jin Liu. Positional Attention Guided Transformer-Like Architecture for Visual Question Answering. IEEE Transactions on Multimedia, 25:6997-7009, 2023. [doi]

Abstract

Abstract is missing.