GViG: Generative Visual Grounding Using Prompt-Based Language Modeling for Visual Question Answering

Yi-Ting Li, Ying-Jia Lin, Chia-Jen Yeh, Chun-Yi Lin, Hung-Yu Kao. GViG: Generative Visual Grounding Using Prompt-Based Language Modeling for Visual Question Answering. In De-Nian Yang, Xing Xie 0001, Vincent S. Tseng, Jian Pei, Jen-Wei Huang, Jerry Chun-Wei Lin, editors, Advances in Knowledge Discovery and Data Mining - 28th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2024, Taipei, Taiwan, May 7-10, 2024, Proceedings, Part VI. Volume 14650 of Lecture Notes in Computer Science, pages 83-94, Springer, 2024. [doi]

Abstract

Abstract is missing.