SLAN: Self-Locator Aided Network for Vision-Language Understanding

Jiang-Tian Zhai, Qi Zhang, Tong Wu, Xing-yu Chen, Jiang-Jiang Liu, Ming-Ming Cheng. SLAN: Self-Locator Aided Network for Vision-Language Understanding. In IEEE/CVF International Conference on Computer Vision, ICCV 2023, Paris, France, October 1-6, 2023, pages 21892-21901. IEEE, 2023.
