Modal interaction-enhanced prompt learning by transformer decoder for vision-language models

Mingyue Liu, Honggang Zhao, Longfei Ma, Mingyong Li. Modal interaction-enhanced prompt learning by transformer decoder for vision-language models. IJMIR, 12(2):19, December 2023. [doi]

Possibly Related Publications

The following publications are possibly variants of this publication: