Unifying 3D Vision-Language Understanding via Promptable Queries

Ziyu Zhu, Zhuofan Zhang, Xiaojian Ma, Xuesong Niu, Yixin Chen, Baoxiong Jia, Zhidong Deng, Siyuan Huang, Qing Li. Unifying 3D Vision-Language Understanding via Promptable Queries. In Ales Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol, editors, Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part XLIV. Volume 15102 of Lecture Notes in Computer Science, pages 188-206, Springer, 2024.
