Cong Phu Nguyen, Huy-Tien Nguyen, Tung Le. Fusing Visual and Textual Representations via Multi-layer Fusing Transformers for Vietnamese Visual Question Answering. In Ngoc Thanh Nguyen 0001, Bogdan Franczyk, André Ludwig, Manuel Núñez 0001, Jan Treur, Gottfried Vossen, Adrianna Kozierkiewicz, editors, Advances in Computational Collective Intelligence - 16th International Conference, ICCCI 2024, Leipzig, Germany, September 9-11, 2024, Proceedings, Part II. Volume 2166 of Communications in Computer and Information Science, pages 185-196, Springer, 2024. [doi]
Abstract is missing.