- NeuSDFusion: A Spatial-Aware Generative Model for 3D Shape Completion, Reconstruction, and Generation. Ruikai Cui, Weizhe Liu, Weixuan Sun, Senbo Wang, Taizhang Shang, Yang Li 0041, Xibin Song, Han Yan, Zhennan Wu, Shenzhou Chen, Hongdong Li, Pan Ji. 1-18
- AID-AppEAL: Automatic Image Dataset and Algorithm for Content Appeal Enhancement and Assessment Labeling. Sherry X. Chen, Yaron Vaxman, Elad Ben Baruch, David Asulin, Aviad Moreshet, Misha Sra, Pradeep Sen. 19-36
- SEDiff: Structure Extraction for Domain Adaptive Depth Estimation via Denoising Diffusion Models. Dongseok Shim, H. Jin Kim. 37-53
- Quantized Prompt for Efficient Generalization of Vision-Language Models. Tianxiang Hao, Xiaohan Ding, Juexiao Feng, Yuhong Yang 0008, Hui Chen 0013, Guiguang Ding. 54-73
- Online Temporal Action Localization with Memory-Augmented Transformer. Youngkil Song, Dongkeun Kim, Minsu Cho, Suha Kwak. 74-91
- Efficient Cascaded Multiscale Adaptive Network for Image Restoration. Yichen Zhou, Pan Zhou 0002, Teck Khim Ng. 92-110
- MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model. Muyao Niu, Xiaodong Cun, Xintao Wang, Yong Zhang 0034, Ying Shan, Yinqiang Zheng. 111-128
- Occlusion-Aware Seamless Segmentation. Yihong Cao, Jiaming Zhang 0001, Hao Shi, Kunyu Peng, Yuhongxuan Zhang, Hui Zhang 0023, Rainer Stiefelhagen, Kailun Yang 0001. 129-147
- OpenKD: Opening Prompt Diversity for Zero- and Few-Shot Keypoint Detection. Changsheng Lu, Zheyuan Liu 0002, Piotr Koniusz. 148-165
- Referring Atomic Video Action Recognition. Kunyu Peng, Jia Fu 0001, Kailun Yang 0001, Di Wen 0006, Yufan Chen 0001, Ruiping Liu, Junwei Zheng, Jiaming Zhang 0001, M. Saquib Sarfraz, Rainer Stiefelhagen, Alina Roitberg. 166-185
- Agent3D-Zero: An Agent for Zero-Shot 3D Understanding. Sha Zhang, Di Huang, Jiajun Deng, Shixiang Tang, Wanli Ouyang, Tong He 0001, Yanyong Zhang. 186-202
- Stream Query Denoising for Vectorized HD-Map Construction. Shuo Wang, Fan Jia, Weixin Mao, Yingfei Liu, Yucheng Zhao, Zehui Chen, Tiancai Wang, Chi Zhang 0026, Xiangyu Zhang 0005, Feng Zhao 0004. 203-220
- SAGS: Structure-Aware 3D Gaussian Splatting. Evangelos Ververas, Rolandos-Alexandros Potamias, Jifei Song, Jiankang Deng, Stefanos Zafeiriou. 221-238
- Spherical Linear Interpolation and Text-Anchoring for Zero-Shot Composed Image Retrieval. Young-Kyun Jang, Dat Huynh, Ashish Shah, Wen-Kai Chen, Ser-Nam Lim. 239-254
- OneRestore: A Universal Restoration Framework for Composite Degradation. Yu Guo 0008, Yuan Gao 0015, Yuxu Lu, Huilin Zhu, Ryan Wen Liu, Shengfeng He. 255-272
- Beat-It: Beat-Synchronized Multi-condition 3D Dance Generation. Zikai Huang, Xuemiao Xu, Cheng Xu, Huaidong Zhang, Chenxi Zheng, Jing Qin 0001, Shengfeng He. 273-290
- SKYMASK: Attack-Agnostic Robust Federated Learning with Fine-Grained Learnable Masks. Peishen Yan, Hao Wang 0022, Tao Song 0003, Yang Hua, Ruhui Ma, Ningxin Hu, Mohammad Reza Haghighat, Haibing Guan. 291-308
- RePOSE: 3D Human Pose Estimation via Spatio-Temporal Depth Relational Consistency. Ziming Sun, Yuan Liang, Zejun Ma, Tianle Zhang, Linchao Bao, Guiqing Li, Shengfeng He. 309-325
- Pixel-GS: Density Control with Pixel-Aware Gradient for 3D Gaussian Splatting. Zheng Zhang, Wenbo Hu 0002, Yixing Lao, Tong He 0001, Hengshuang Zhao. 326-342
- WorldPose: A World Cup Dataset for Global 3D Human Pose Estimation. Tianjian Jiang, Johsan Billingham, Sebastian Müksch, Juan Jose Zarate, Nicolas Evans, Martin R. Oswald, Marc Pollefeys, Otmar Hilliges, Manuel Kaufmann, Jie Song 0006. 343-362
- Language-Driven 6-DoF Grasp Detection Using Negative Prompt Guidance. Toan Nguyen, Minh Nhat Vu, Baoru Huang, An Vuong, Quan Vuong, Ngan Le, Thieu Vo, Anh Nguyen 0003. 363-381
- COIN-Matting: Confounder Intervention for Image Matting. Zhaohe Liao, Jiangtong Li, Jun Lan, Huijia Zhu, Weiqiang Wang, Li Niu 0002, Liqing Zhang 0001. 382-397
- SHINE: Saliency-Aware Hierarchical Negative Ranking for Compositional Temporal Grounding. Zixu Cheng, Yujiang Pu, Shaogang Gong, Parisa KordJamshidi, Yu Kong 0001. 398-416
- Audio-Driven Talking Face Generation with Stabilized Synchronization Loss. Dogucan Yaman, Fevziye Irem Eyiokur, Leonard Bärmann, Hazim Kemal Ekenel, Alexander Waibel. 417-435
- Propose, Assess, Search: Harnessing LLMs for Goal-Oriented Planning in Instructional Videos. Md Mohaiminul Islam, Tushar Nagarajan, Huiyu Wang, Fu-Jen Chu, Kris Kitani, Gedas Bertasius, Xitong Yang. 436-452