Improving visual question answering by combining scene-text information

Himanshu Sharma, Anand Singh Jalal. Improving visual question answering by combining scene-text information. Multimedia Tools Appl., 81(9):12177-12208, 2022. [doi]

Abstract

Abstract is missing.