Abstract is missing.
- Local Occupancy-Enhanced Object Grasping with Multiple Triplanar ProjectionKangqi Ma, Hao Dong, Yadong Mu. 1-18 [doi]
- Region-Native Visual TokenizationMeng Wang 0001, Yuyao Huang, Henghui Ding, Xinlong Wang, Tiejun Huang 0001, Yao Zhao 0001, Yunchao Wei, Shuicheng Yan. 19-36 [doi]
- SparseCraft: Few-Shot Neural Reconstruction Through Stereopsis Guided Geometric LinearizationMae Younes, Amine Ouasfi, Adnane Boukhayma. 37-56 [doi]
- Sketch2Vox: Learning 3D Reconstruction from a Single Monocular SketchFei Wang. 57-73 [doi]
- DGE: Direct Gaussian 3D Editing by Consistent Multi-view EditingMinghao Chen, Iro Laina, Andrea Vedaldi. 74-92 [doi]
- The Lottery Ticket Hypothesis in Denoising: Towards Semantic-Driven InitializationJiafeng Mao, Xueting Wang, Kiyoharu Aizawa. 93-109 [doi]
- Diffusion for Out-of-Distribution Detection on Road Scenes and BeyondSilvio Galesso, Philipp Schröppel, Hssan Driss, Thomas Brox. 110-126 [doi]
- Rethinking Directional Parameterization in Neural Implicit Surface ReconstructionZijie Jiang, Tianhan Xu, Hiroharu Kato. 127-142 [doi]
- A Comprehensive Study of Multimodal Large Language Models for Image Quality AssessmentTianhe Wu, Kede Ma, Jie Liang 0007, Yujiu Yang, Lei Zhang 0006. 143-160 [doi]
- Semi-supervised Teacher-Reference-Student Architecture for Action Quality AssessmentWulian Yun, Mengshi Qi, Fei Peng 0003, Huadong Ma. 161-178 [doi]
- Efficient Neural Video Representation with Temporally Coherent ModulationSeungjun Shin, Suji Kim, Dokwan Oh. 179-195 [doi]
- Ref-AVS: Refer and Segment Objects in Audio-Visual ScenesYaoting Wang, Peiwen Sun, Dongzhan Zhou, Guangyao Li, Honggang Zhang 0002, Di Hu 0001. 196-213 [doi]
- DreamScene: 3D Gaussian-Based Text-to-3D Scene Generation via Formation Pattern SamplingHaoran Li, Haolin Shi, Wenli Zhang, Wenjun Wu, Yong Liao, Lin Wang 0025, Lik Hang Lee, Peng Yuan Zhou. 214-230 [doi]
- Multi-modal Crowd Counting via a Broker ModalityHaoliang Meng, Xiaopeng Hong, Chenhao Wang, Miao Shang, Wangmeng Zuo. 231-250 [doi]
- FastPCI: Motion-Structure Guided Fast Point Cloud Frame InterpolationTianyu Zhang, Guocheng Qian, Jin Xie, Jian Yang 0003. 251-267 [doi]
- Made to Order: Discovering Monotonic Temporal Changes via Self-supervised Video OrderingCharig Yang, Weidi Xie, Andrew Zisserman. 268-286 [doi]
- PARE-Net: Position-Aware Rotation-Equivariant Networks for Robust Point Cloud RegistrationRunzhao Yao, Shaoyi Du, Wenting Cui, Canhui Tang, Chengwu Yang. 287-303 [doi]
- Open-Vocabulary RGB-Thermal Semantic SegmentationGuoqiang Zhao, Junjie Huang, Xiaoyun Yan, Zhaojing Wang, Junwei Tang, Yangjun Ou, Xinrong Hu, Tao Peng 0006. 304-320 [doi]
- MeshVPR: Citywide Visual Place Recognition Using 3D MeshesGabriele Moreno Berton, Lorenz Junglas, Riccardo Zaccone, Thomas Pollok, Barbara Caputo, Carlo Masone. 321-339 [doi]
- Can Textual Semantics Mitigate Sounding Object Segmentation Preference?Yaoting Wang, Peiwen Sun, Yuanchao Li, Honggang Zhang 0002, Di Hu 0001. 340-356 [doi]
- Concise Plane Arrangements for Low-Poly Surface and Volume ModellingRaphael Sulzer, Florent Lafarge. 357-373 [doi]
- KeypointDETR: An End-to-End 3D Keypoint DetectorHairong Jin, Yuefan Shen, Jianwen Lou, Kun Zhou 0001, Youyi Zheng. 374-390 [doi]
- ViPer: Visual Personalization of Generative Models via Individual Preference LearningSogand Salehi, Mahdi Shafiei, Teresa Yeo, Roman Bachmann 0001, Amir Zamir. 391-406 [doi]
- MLPHand: Real Time Multi-view 3D Hand Reconstruction via MLP ModelingJian Yang 0003, Jiakun Li, Guoming Li, Huai-yu Wu, Zhen Shen, Zhaoxin Fan. 407-424 [doi]
- uCAP: An Unsupervised Prompting Method for Vision-Language ModelsA. Tuan Nguyen, Kai Sheng Tai, Bor-Chun Chen, Satya Narayan Shukla, Hanchao Yu, Philip Torr 0001, Tai-Peng Tian, Ser-Nam Lim. 425-439 [doi]
- LHRS-Bot: Empowering Remote Sensing with VGI-Enhanced Large Multimodal Language ModelDilxat Muhtar, Zhenshi Li, Feng Gu, Xueliang Zhang 0002, Pengfeng Xiao. 440-457 [doi]
- How Far Can a 1-Pixel Camera Go? Solving Vision Tasks Using Photoreceptors and Computationally Designed Visual MorphologyAndrei Atanov, Jiawei Fu, Rishubh Singh, Isabella Yu, Andrew Spielberg, Amir Zamir. 458-476 [doi]