VLT: Vision-Language Transformer and Query Generation for Referring Segmentation

Henghui Ding, Chang Liu 0072, Suchen Wang, Xudong Jiang. VLT: Vision-Language Transformer and Query Generation for Referring Segmentation. IEEE Trans. Pattern Anal. Mach. Intell., 45(6):7900-7916, June 2023.

Authors

Henghui Ding

Chang Liu 0072

Suchen Wang

Xudong Jiang
