Dynamic Multi-modal Prompting for Efficient Visual Grounding

Wansen Wu, Ting Liu, Youkai Wang, Kai Xu, Quanjun Yin, Yue Hu. Dynamic Multi-modal Prompting for Efficient Visual Grounding. In Qingshan Liu, Hanzi Wang, Zhanyu Ma, Weishi Zheng, Hongbin Zha, Xilin Chen, Liang Wang, Rongrong Ji, editors, Pattern Recognition and Computer Vision - 6th Chinese Conference, PRCV 2023, Xiamen, China, October 13-15, 2023, Proceedings, Part VII. Volume 14431 of Lecture Notes in Computer Science, pages 359-371, Springer, 2023.
