VL2Lite: Task-Specific Knowledge Distillation from Large Vision-Language Models to Lightweight Networks

Jinseong Jang, Chunfei Ma, Byeongwon Lee. VL2Lite: Task-Specific Knowledge Distillation from Large Vision-Language Models to Lightweight Networks. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2025, Nashville, TN, USA, June 11-15, 2025. pages 30073-30083, Computer Vision Foundation / IEEE, 2025.