SmartTrim: Adaptive Tokens and Attention Pruning for Efficient Vision-Language Models

Zekun Wang, Jingchang Chen, Wangchunshu Zhou, Haichao Zhu, Jiafeng Liang, Liping Shan, Ming Liu 0004, Dongliang Xu, Qing Yang 0033, Bing Qin 0001. SmartTrim: Adaptive Tokens and Attention Pruning for Efficient Vision-Language Models. In Nicoletta Calzolari, Min-Yen Kan, VĂ©ronique Hoste, Alessandro Lenci, Sakriani Sakti, Nianwen Xue, editors, Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC/COLING 2024, 20-25 May, 2024, Torino, Italy. pages 14937-14953, ELRA and ICCL, 2024. [doi]

Authors

Zekun Wang

This author has not been identified. Look up 'Zekun Wang' in Google

Jingchang Chen

This author has not been identified. Look up 'Jingchang Chen' in Google

Wangchunshu Zhou

This author has not been identified. Look up 'Wangchunshu Zhou' in Google

Haichao Zhu

This author has not been identified. Look up 'Haichao Zhu' in Google

Jiafeng Liang

This author has not been identified. Look up 'Jiafeng Liang' in Google

Liping Shan

This author has not been identified. Look up 'Liping Shan' in Google

Ming Liu 0004

This author has not been identified. Look up 'Ming Liu 0004' in Google

Dongliang Xu

This author has not been identified. Look up 'Dongliang Xu' in Google

Qing Yang 0033

This author has not been identified. Look up 'Qing Yang 0033' in Google

Bing Qin 0001

This author has not been identified. Look up 'Bing Qin 0001' in Google