Effective End-to-End Vision Language Pretraining With Semantic Visual Loss

Xiaofeng Yang, Fayao Liu, Guosheng Lin. Effective End-to-End Vision Language Pretraining With Semantic Visual Loss. IEEE Transactions on Multimedia, 25:8408-8417, 2023. [doi]

Abstract

Abstract is missing.