Vision And Text Transformer For Predicting Answerability On Visual Question Answering

Tung Le, Huy-Tien Nguyen, Minh Le Nguyen. Vision And Text Transformer For Predicting Answerability On Visual Question Answering. In 2021 IEEE International Conference on Image Processing, ICIP 2021, Anchorage, AK, USA, September 19-22, 2021. pages 934-938, IEEE, 2021. [doi]

@inproceedings{LeNN21-2,
  title = {Vision And Text Transformer For Predicting Answerability On Visual Question Answering},
  author = {Tung Le and Huy-Tien Nguyen and Minh Le Nguyen},
  year = {2021},
  doi = {10.1109/ICIP42928.2021.9506796},
  url = {https://doi.org/10.1109/ICIP42928.2021.9506796},
  researchr = {https://researchr.org/publication/LeNN21-2},
  cites = {0},
  citedby = {0},
  pages = {934-938},
  booktitle = {2021 IEEE International Conference on Image Processing, ICIP 2021, Anchorage, AK, USA, September 19-22, 2021},
  publisher = {IEEE},
  isbn = {978-1-6654-4115-5},
}