Looking closer and smarter: Multi-scale progressive attention for visual text question answering

Kang Chen, Xiangqian Wu 0002. Looking closer and smarter: Multi-scale progressive attention for visual text question answering. Neurocomputing, 697:134131, 2026. [doi]

Abstract

Abstract is missing.