C&D-CLIP: Cascaded decoder and deep cross visual prompt tuning for zero-shot semantic segmentation

Linchuan Li, Yongquan Liang, Zhihui Wang 0003, Yongtang Bao, Shansong Yang. C&D-CLIP: Cascaded decoder and deep cross visual prompt tuning for zero-shot semantic segmentation. Pattern Recognition, 179:113528, 2026. [doi]

Abstract

Abstract is missing.