Dingyuan Zhang, Dingkang Liang, Zichang Tan, Xiaoqing Ye, Cheng Zhang 0020, Jingdong Wang 0001, Xiang Bai. Make Your ViT-Based Multi-view 3D Detectors Faster via Token Compression. In Ales Leonardis, Elisa Ricci 0001, Stefan Roth 0001, Olga Russakovsky, Torsten Sattler, Gül Varol, editors, Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part XLVII. Volume 15105 of Lecture Notes in Computer Science, pages 56-72, Springer, 2024. [doi]
Abstract is missing.