Dynamic Multi-modal Prompting for Efficient Visual Grounding

Wansen Wu, Ting Liu, Youkai Wang, Kai Xu, Quanjun Yin, Yue Hu. Dynamic Multi-modal Prompting for Efficient Visual Grounding. In Qingshan Liu, Hanzi Wang, Zhanyu Ma, Weishi Zheng, Hongbin Zha, Xilin Chen, Liang Wang, Rongrong Ji, editors, Pattern Recognition and Computer Vision - 6th Chinese Conference, PRCV 2023, Xiamen, China, October 13-15, 2023, Proceedings, Part VII. Volume 14431 of Lecture Notes in Computer Science, pages 359-371, Springer, 2023.

Authors

Wansen Wu

Ting Liu

Youkai Wang

Kai Xu

Quanjun Yin

Yue Hu
