Chaojun Lin, Ying Shi, Gang Wang, Shijian Liu. Improving vision-language models through intra-modal contrastive learning-based hard sample mining. Neurocomputing, 652:131047, 2025. [doi]
No references recorded for this publication.
No citations of this publication recorded.