Role-Playing in Vision-Language Models: A Comprehensive Evaluation of Image Description Performance

Zhaofeng Niu, Xiaoya Chang, Bowen Wang 0002, Xingfu Cheng, Guangshun Li, Liangzhi Li 0001. Role-Playing in Vision-Language Models: A Comprehensive Evaluation of Image Description Performance. In International Joint Conference on Neural Networks, IJCNN 2025, Rome, Italy, June 30 - July 5, 2025. pages 1-8, IEEE, 2025. [doi]

@inproceedings{NiuCWCLL25,
  title = {Role-Playing in Vision-Language Models: A Comprehensive Evaluation of Image Description Performance},
  author = {Zhaofeng Niu and Xiaoya Chang and Bowen Wang 0002 and Xingfu Cheng and Guangshun Li and Liangzhi Li 0001},
  year = {2025},
  doi = {10.1109/IJCNN64981.2025.11228995},
  url = {https://doi.org/10.1109/IJCNN64981.2025.11228995},
  researchr = {https://researchr.org/publication/NiuCWCLL25},
  cites = {0},
  citedby = {0},
  pages = {1-8},
  booktitle = {International Joint Conference on Neural Networks, IJCNN 2025, Rome, Italy, June 30 - July 5, 2025},
  publisher = {IEEE},
  isbn = {979-8-3315-1042-8},
}