Modal Interaction-Enhanced Prompt Learning by Transformer Decoder for Vision-Language Models

Mingyue Liu, Honggang Zhao, Longfei Ma, Xiang Li, Yucheng Ji, Mingyong Li. Modal Interaction-Enhanced Prompt Learning by Transformer Decoder for Vision-Language Models. In Zhi Jin, Yuncheng Jiang, Robert Andrei Buchmann, Yaxin Bi, Ana-Maria Ghiran, Wenjun Ma, editors, Knowledge Science, Engineering and Management - 16th International Conference, KSEM 2023, Guangzhou, China, August 16-18, 2023, Proceedings, Part IV. Volume 14120 of Lecture Notes in Computer Science, pages 163-174, Springer, 2023. [doi]

Authors

Mingyue Liu

This author has not been identified. Look up 'Mingyue Liu' in Google

Honggang Zhao

This author has not been identified. Look up 'Honggang Zhao' in Google

Longfei Ma

This author has not been identified. Look up 'Longfei Ma' in Google

Xiang Li

This author has not been identified. Look up 'Xiang Li' in Google

Yucheng Ji

This author has not been identified. Look up 'Yucheng Ji' in Google

Mingyong Li

This author has not been identified. Look up 'Mingyong Li' in Google