Vision And Text Transformer For Predicting Answerability On Visual Question Answering

Tung Le, Huy-Tien Nguyen, Minh Le Nguyen. Vision And Text Transformer For Predicting Answerability On Visual Question Answering. In 2021 IEEE International Conference on Image Processing, ICIP 2021, Anchorage, AK, USA, September 19-22, 2021. pages 934-938, IEEE, 2021. [doi]

Abstract

Abstract is missing.