Modal interaction-enhanced prompt learning by transformer decoder for vision-language models

Mingyue Liu, Honggang Zhao, Longfei Ma, Mingyong Li. Modal interaction-enhanced prompt learning by transformer decoder for vision-language models. IJMIR, 12(2):19, December 2023.

Abstract

Abstract is missing.