WSI-LLaVA: A Multimodal Large Language Model for Whole Slide Image

Yuci Liang, Xinheng Lyu, Wenting Chen, Meidan Ding, Jipeng Zhang, Xiangjian He, Song Wu, Xiaohan Xing, Sen Yang, Xiyue Wang, LinLin Shen. WSI-LLaVA: A Multimodal Large Language Model for Whole Slide Image. In IEEE/CVF International Conference on Computer Vision, ICCV 2025, Honolulu, HI, USA, October 19-25, 2025, pages 22718-22727. IEEE, 2025.
