- Revisit Human-Scene Interaction via Space Occupancy. Xinpeng Liu, Haowen Hou, Yanchao Yang 0001, Yong-Lu Li 0001, Cewu Lu. 1-19
- Face-Adapter for Pre-trained Diffusion Models with Fine-Grained ID and Attribute Control. Yue Han, Junwei Zhu, Keke He, Xu Chen 0024, Yanhao Ge, Wei Li 0190, Xiangtai Li, Jiangning Zhang, Chengjie Wang, Yong Liu 0007. 20-36
- WeConvene: Learned Image Compression with Wavelet-Domain Convolution and Entropy Model. Haisheng Fu, Jie Liang 0001, Zhenman Fang, Jingning Han, Feng Liang 0001, Guohe Zhang. 37-53
- Grid-Attention: Enhancing Computational Efficiency of Large Vision Models Without Fine-Tuning. Pengyu Li, Tianchu Guo, Biao Wang, Xian-Sheng Hua 0001. 54-70
- Mitigating Background Shift in Class-Incremental Semantic Segmentation. Gilhan Park, WonJun Moon, Subeen Lee, Tae-young Kim, Jae-Pil Heo. 71-88
- Relation DETR: Exploring Explicit Position Relation Prior for Object Detection. Xiuquan Hou, Meiqin Liu, Senlin Zhang, Ping Wei 0001, Badong Chen, Xuguang Lan. 89-105
- BKDSNN: Enhancing the Performance of Learning-Based Spiking Neural Networks Training with Blurred Knowledge Distillation. Zekai Xu, Kang You, Qinghai Guo, Xiang Wang, Zhezhi He. 106-123
- Agent Attention: On the Integration of Softmax and Linear Attention. Dongchen Han, Tianzhu Ye, Yizeng Han, Zhuofan Xia, Siyuan Pan, Pengfei Wan, Shiji Song, Gao Huang 0001. 124-140
- Learning by Aligning 2D Skeleton Sequences and Multi-modality Fusion. Quoc Huy Tran, Muhammad Ahmed 0003, Murad Popattia, M. Hassan Ahmed, Andrey Konin, M. Zeeshan Zia. 141-161
- Resolving Scale Ambiguity in Multi-view 3D Reconstruction Using Dual-Pixel Sensors. Kohei Ashida, Hiroaki Santo, Fumio Okura, Yasuyuki Matsushita. 162-178
- Object-Oriented Anchoring and Modal Alignment in Multimodal Learning. Shibin Mei, Bingbing Ni, Hang Wang, Chenglong Zhao, Fengfa Hu, Zhiming Pi, Bilian Ke. 179-196
- Towards Stable 3D Object Detection. Jiabao Wang, Qiang Meng, Guochao Liu, Liujiang Yan, Ke Wang, Ming-Ming Cheng, Qibin Hou. 197-213
- FYI: Flip Your Images for Dataset Distillation. Byunggwan Son, Youngmin Oh, Donghyeon Baek, Bumsub Ham. 214-230
- On-the-Fly Category Discovery for LiDAR Semantic Segmentation. Hyeonseong Kim, Sung Hoon Yoon, Minseok Kim, Kuk-Jin Yoon. 231-249
- Dual-Camera Smooth Zoom on Mobile Phones. Renlong Wu, Zhilu Zhang, Yu Yang, Wangmeng Zuo. 250-269
- ProtoComp: Diverse Point Cloud Completion with Controllable Prototype. Xumin Yu, Yanbo Wang, Jie Zhou 0001, Jiwen Lu. 270-286
- CONDA: Condensed Deep Association Learning for Co-salient Object Detection. Long Li, Nian Liu, Dingwen Zhang, Zhongyu Li, Salman Khan 0001, Rao Muhammad Anwer, Hisham Cholakkal, Junwei Han, Fahad Shahbaz Khan. 287-303
- Cascade Prompt Learning for Vision-Language Model Adaptation. Ge Wu, Xin Zhang, Zheng Li 0028, Zhaowei Chen, Jiajun Liang, Jian Yang 0003, Xiang Li 0041. 304-321
- PolyRoom: Room-Aware Transformer for Floorplan Reconstruction. Yuzhou Liu, Lingjie Zhu, Xiaodong Ma, Hanqiao Ye, Xiang Gao 0009, Xianwei Zheng, Shuhan Shen. 322-339
- BenchLMM: Benchmarking Cross-Style Visual Capability of Large Multimodal Models. Rizhao Cai, Zirui Song, Dayan Guan, Zhenhao Chen, Yaohang Li, Xing Luo, Chenyu Yi, Alex C. Kot. 340-358
- SMFANet: A Lightweight Self-Modulation Feature Aggregation Network for Efficient Image Super-Resolution. Mingjun Zheng, Long Sun, Jiangxin Dong, Jinshan Pan. 359-375
- HENet: Hybrid Encoding for End-to-End Multi-task 3D Perception from Multi-view Cameras. Zhongyu Xia, Zhiwei Lin, Xinhao Wang, Yongtao Wang, Yun Xing, Shengxiang Qi, Nan Dong, Ming-Hsuan Yang 0001. 376-392
- Hierarchical Unsupervised Relation Distillation for Source Free Domain Adaptation. Bowei Xing, Xianghua Ying, Ruibin Wang, Ruohao Guo, Ji Shi, Wenzhen Yue. 393-409
- Customized Generation Reimagined: Fidelity and Editability Harmonized. Jian Jin, Yang Shen, Zhenyong Fu, Jian Yang 0003. 410-426
- AUFormer: Vision Transformers Are Parameter-Efficient Facial Action Unit Detectors. Kaishen Yuan, Zitong Yu, Xin Liu 0012, Weicheng Xie, Huanjing Yue, Jingyu Yang 0002. 427-445
- Improving Video Segmentation via Dynamic Anchor Queries. Yikang Zhou, Tao Zhang 0042, Shunping Ji, Shuicheng Yan, Xiangtai Li. 446-463
- Controllable Contextualized Image Captioning: Directing the Visual Narrative Through User-Defined Highlights. Shunqi Mao, Chaoyi Zhang, Hang Su, Hwanjun Song, Igor Shalyminov, Weidong Cai 0001. 464-481