WSI-LLaVA: A Multimodal Large Language Model for Whole Slide Image

Yuci Liang, Xinheng Lyu, Wenting Chen, Meidan Ding, Jipeng Zhang, Xiangjian He, Song Wu, Xiaohan Xing, Sen Yang 0006, Xiyue Wang, LinLin Shen. WSI-LLaVA: A Multimodal Large Language Model for Whole Slide Image. In IEEE/CVF International Conference on Computer Vision, ICCV 2025, Honolulu, HI, USA, October 19-25, 2025. pages 22718-22727, IEEE, 2025. [doi]

Authors

Yuci Liang

This author has not been identified. Look up 'Yuci Liang' in Google

Xinheng Lyu

This author has not been identified. Look up 'Xinheng Lyu' in Google

Wenting Chen

This author has not been identified. Look up 'Wenting Chen' in Google

Meidan Ding

This author has not been identified. Look up 'Meidan Ding' in Google

Jipeng Zhang

This author has not been identified. Look up 'Jipeng Zhang' in Google

Xiangjian He

This author has not been identified. Look up 'Xiangjian He' in Google

Song Wu

This author has not been identified. Look up 'Song Wu' in Google

Xiaohan Xing

This author has not been identified. Look up 'Xiaohan Xing' in Google

Sen Yang 0006

This author has not been identified. Look up 'Sen Yang 0006' in Google

Xiyue Wang

This author has not been identified. Look up 'Xiyue Wang' in Google

LinLin Shen

This author has not been identified. Look up 'LinLin Shen' in Google