GeoMag: A Vision-Language Model for Pixel-level Fine-Grained Remote Sensing Image Parsing

Xianzhi Ma, Jianhui Li, Changhua Pei, Hao Liu 0034. GeoMag: A Vision-Language Model for Pixel-level Fine-Grained Remote Sensing Image Parsing. In Cathal Gurrin, Klaus Schoeffmann, Min Zhang, Luca Rossetto, Stevan Rudinac, Duc-Tien Dang-Nguyen, Wen-Huang Cheng, Phoebe Chen, Jenny Benois-Pineau, editors, Proceedings of the 33rd ACM International Conference on Multimedia, MM 2025, Dublin, Ireland, October 27-31, 2025. pages 5441-5450, ACM, 2025. [doi]

Authors

Xianzhi Ma

This author has not been identified. Look up 'Xianzhi Ma' in Google

Jianhui Li

This author has not been identified. Look up 'Jianhui Li' in Google

Changhua Pei

This author has not been identified. Look up 'Changhua Pei' in Google

Hao Liu 0034

This author has not been identified. Look up 'Hao Liu 0034' in Google