DIME-FM: DIstilling Multimodal and Efficient Foundation Models

Ximeng Sun, Pengchuan Zhang, Peizhao Zhang, Hardik Shah, Kate Saenko, Xide Xia. DIME-FM: DIstilling Multimodal and Efficient Foundation Models. In IEEE/CVF International Conference on Computer Vision, ICCV 2023, Paris, France, October 1-6, 2023, pages 15475-15487. IEEE, 2023.
