Jiawen Lin, Shiran Bian, Yihang Zhu, Wenbin Tan, Yachao Zhang 0001, Yuan Xie 0006, Yanyun Qu. SeqVLM: Proposal-Guided Multi-View Sequences Reasoning via VLM for Zero-Shot 3D Visual Grounding. In Cathal Gurrin, Klaus Schoeffmann, Min Zhang, Luca Rossetto, Stevan Rudinac, Duc-Tien Dang-Nguyen, Wen-Huang Cheng, Phoebe Chen, Jenny Benois-Pineau, editors, Proceedings of the 33rd ACM International Conference on Multimedia, MM 2025, Dublin, Ireland, October 27-31, 2025. pages 3094-3103, ACM, 2025. [doi]
Abstract is missing.