RMAdapter: Reconstruction-based Multi-Modal Adapter for Vision-Language Models

Xiang Lin, Weixin Li, Shu Guo, Lihong Wang, Di Huang. RMAdapter: Reconstruction-based Multi-Modal Adapter for Vision-Language Models. In Sven Koenig, Chad Jenkins, Matthew E. Taylor, editors, Fortieth AAAI Conference on Artificial Intelligence, Thirty-Eighth Conference on Innovative Applications of Artificial Intelligence, Sixteenth Symposium on Educational Advances in Artificial Intelligence, AAAI 2026, Singapore, January 20-27, 2026. Pages 23594-23602, AAAI Press, 2026.

Authors

Xiang Lin


Weixin Li


Shu Guo


Lihong Wang


Di Huang
