Zhengyang Liang, MeiYu Liang, Wei Huang, Yawen Li, Wu Liu, Yingxia Shao, Kangkang Lu. Dynamic Self-adaptive Multiscale Distillation from Pre-trained Multimodal Large Model for Efficient Cross-modal Retrieval. In Cathal Gurrin, Klaus Schoeffmann, Min Zhang, Luca Rossetto, Stevan Rudinac, Duc-Tien Dang-Nguyen, Wen-Huang Cheng, Phoebe Chen, Jenny Benois-Pineau, editors, Proceedings of the 33rd ACM International Conference on Multimedia, MM 2025, Dublin, Ireland, October 27-31, 2025, pages 2851-2859. ACM, 2025.