Distilled Dual-Encoder Model for Vision-Language Understanding

Zekun Wang, Wenhui Wang, Haichao Zhu, Ming Liu 0004, Bing Qin 0001, Furu Wei. Distilled Dual-Encoder Model for Vision-Language Understanding. In Yoav Goldberg, Zornitsa Kozareva, Yue Zhang, editors, Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022, Abu Dhabi, United Arab Emirates, December 7-11. pages 8901-8913, Association for Computational Linguistics, 2022. [doi]

Abstract

Abstract is missing.