Improving vision-language models through intra-modal contrastive learning-based hard sample mining

Chaojun Lin, Ying Shi, Gang Wang, Shijian Liu. Improving vision-language models through intra-modal contrastive learning-based hard sample mining. Neurocomputing, 652:131047, 2025. [doi]

Abstract

Abstract is missing.