VLT: Vision-Language Transformer and Query Generation for Referring Segmentation

Henghui Ding, Chang Liu 0072, Suchen Wang, Xudong Jiang. VLT: Vision-Language Transformer and Query Generation for Referring Segmentation. IEEE Trans. Pattern Anal. Mach. Intell., 45(6):7900-7916, June 2023. [doi]

Abstract

Abstract is missing.