SgVA-CLIP: Semantic-Guided Visual Adapting of Vision-Language Models for Few-Shot Image Classification

Fang Peng, Xiaoshan Yang, Linhui Xiao, Yaowei Wang, Changsheng Xu. SgVA-CLIP: Semantic-Guided Visual Adapting of Vision-Language Models for Few-Shot Image Classification. IEEE Transactions on Multimedia, 26:3469-3480, 2024. doi:10.1109/TMM.2023.3311646

@article{PengYXWX24,
  title = {SgVA-CLIP: Semantic-Guided Visual Adapting of Vision-Language Models for Few-Shot Image Classification},
  author = {Fang Peng and Xiaoshan Yang and Linhui Xiao and Yaowei Wang and Changsheng Xu},
  year = {2024},
  doi = {10.1109/TMM.2023.3311646},
  url = {https://doi.org/10.1109/TMM.2023.3311646},
  researchr = {https://researchr.org/publication/PengYXWX24},
  cites = {0},
  citedby = {0},
  journal = {IEEE Transactions on Multimedia},
  volume = {26},
  pages = {3469--3480},
}