Language-guided visual attention network for visual grounding

Haibo Yao, Lipeng Wang, Chengtao Cai, Wei Wang, Zhi Zhang, Lichao Jiang. Language-guided visual attention network for visual grounding. In 30th International Conference on Mechatronics and Machine Vision in Practice, M2VIP 2024, Leeds, United Kingdom, October 3-5, 2024. pages 1-6, IEEE, 2024.

Abstract

Abstract is missing.