Unified Multimodal Model with Unlikelihood Training for Visual Dialog

Zihao Wang, Junli Wang, Changjun Jiang. Unified Multimodal Model with Unlikelihood Training for Visual Dialog. In João Magalhães, Alberto Del Bimbo, Shin'ichi Satoh 0001, Nicu Sebe, Xavier Alameda-Pineda, Qin Jin, Vincent Oria, Laura Toni, editors, MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10 - 14, 2022. pages 4625-4634, ACM, 2022. [doi]

Abstract

Abstract is missing.