| 3924 | -- | 3939 | Yafeng Li, Yuehan Chen, Jiqing Zhang, Yudong Li, XianPing Fu. An Underwater Image Restoration Method With Polarization Imaging Optimization Model for Poor Visible Conditions |
| 3940 | -- | 3954 | Runhao Zeng, Yishen Zhuo, Jialiang Li, Yunjin Yang, Huisi Wu, Qi Chen 0014, Xiping Hu 0001, Victor C. M. Leung. Improving Video Moment Retrieval by Auxiliary Moment-Query Pairs With Hyper-Interaction |
| 3955 | -- | 3968 | Xun Jiang 0001, Liqing Zhu, Xing Xu 0001, Fumin Shen, Yang Yang 0002, Heng Tao Shen. Query as Supervision: Toward Low-Cost and Robust Video Moment and Highlight Retrieval |
| 3969 | -- | 3982 | Xinyi Zhang, Haoran Xu 0004, Chenyun Yu, Guang Tan. PCTrack: Accurate Object Tracking for Live Video Analytics on Resource-Constrained Edge Devices |
| 3983 | -- | 3999 | Lei Zhu 0005, Runbing Wu, Xinghui Zhu, Chengyuan Zhang 0001, Lin Wu 0001, Shichao Zhang 0001, Xuelong Li 0001. Bi-Direction Label-Guided Semantic Enhancement for Cross-Modal Hashing |
| 4000 | -- | 4010 | Yiheng Jiang, Haotian Zhang, Li Li 0040, Dong Liu 0002, Zhu Li 0001. Sparse Point Clouds Assisted Learned Image Compression |
| 4011 | -- | 4025 | Xinjian Wei, Yu Qiu, Xiaoxuan Xu, Jing Xu 0008, Jie Mei, Jun Zhang 0003. ECINFusion: A Novel Explicit Channel-Wise Interaction Network for Unified Multi-Modal Medical Image Fusion |
| 4026 | -- | 4038 | Jiayi Lyu, Xing Lan, Guohong Hu, Hanyu Jiang, Wei Gan, Jinbao Wang, Jian Xue. Multimodal Emotional Talking Face Generation Based on Action Units |
| 4039 | -- | 4054 | Junyu Fan, Jie Xu, Jingchun Zhou, Danling Meng, Yi Lin 0006. See Through Water: Heuristic Modeling Toward Color Correction for Underwater Image Enhancement |
| 4055 | -- | 4071 | Qi Zang, Shuang Wang 0001, Dong Zhao, Zhun Zhong, Biao Hou, Licheng Jiao. Joint Style and Layout Synthesizing: Toward Generalizable Remote Sensing Semantic Segmentation |
| 4072 | -- | 4086 | Lanhu Wu, Miao Zhang, Yongri Piao, Zhenyan Yao, Weibing Sun, Feng Tian, Huchuan Lu. CNN-Transformer Rectified Collaborative Learning for Medical Image Segmentation |
| 4087 | -- | 4099 | Wen Zhang, Zhenshan Tan, Li Zhang, Zhijiang Li. Color Decoupling for Multi-Illumination Color Constancy |
| 4100 | -- | 4115 | Xuan Tan, Xun Gong 0002, Yang Xiang. CLIP-Based Camera-Agnostic Feature Learning for Intra-Camera Supervised Person Re-Identification |
| 4116 | -- | 4129 | Anwei Luo, Rizhao Cai, Chenqi Kong, Yakun Ju, Xiangui Kang, Jiwu Huang, Alex C. Kot. Forgery-Aware Adaptive Learning With Vision Transformer for Generalized Face Forgery Detection |
| 4130 | -- | 4143 | Yanlong Yang, Jianan Liu, Tao Huang 0008, Qing-Long Han, Gang Ma, Bing Zhu 0004. RaLiBEV: Radar and LiDAR BEV Fusion Learning for Anchor Box Free Object Detection Systems |
| 4144 | -- | 4157 | Zhaofeng Shi, Heqian Qiu, Lanxiao Wang, Fanman Meng, Qingbo Wu 0001, Hongliang Li 0001. Cognition Transferring and Decoupling for Text-Supervised Egocentric Semantic Segmentation |
| 4158 | -- | 4171 | Qiongjie Cui, Zhenyu Lou, Zhenbo Song, Xiangbo Shu. Toward Physically Stable Motion Generation: A New Paradigm of Human Pose Representation |
| 4172 | -- | 4183 | Hangwei Chen, Feng Shao 0001, Xiongli Chai, Baoyang Mu, Qiuping Jiang. Art Comes From Life: Artistic Image Aesthetics Assessment via Attribute Knowledge Amalgamation |
| 4184 | -- | 4197 | Yuxiang Shao, Feifei Zhang, Changsheng Xu. Text-Video Knowledge Guided Prompting for Weakly Supervised Temporal Action Localization |
| 4198 | -- | 4211 | Dewen Qiao, Xiang Ao, Yu Liu 0021, Xuetao Chen, Fuyuan Song, Zheng Qin 0001, Wenqiang Jin. Tri-AFLLM: Resource-Efficient Adaptive Asynchronous Accelerated Federated LLMs |
| 4212 | -- | 4225 | Haihong Xiao, Wenxiong Kang, Hao Liu 0061, Yuqiong Li, Ying He 0001. Semantic Scene Completion via Semantic-Aware Guidance and Interactive Refinement Transformer |
| 4226 | -- | 4241 | Yuqi Jiang, Jing Li 0010, Yanran Dai, Haidong Qin, Xiaoshi Zhou, Yong Zhang, Hongwei Liu, Kefan Yan, Tao Yang 0006. RT3DHVC: A Real-Time Human Holographic Video Conferencing System With a Consumer RGB-D Camera Array |
| 4242 | -- | 4255 | Shibai Yin, Yiwei Shi, Yibin Wang 0001, Yee-Hong Yang. When Aware Haze Density Meets Diffusion Model for Synthetic-to-Real Dehazing |
| 4256 | -- | 4270 | Fan Xu 0005, Chuibin Chen, Zhigao Shang, Kai-Kuang Ma, QiHui Wu, Zebin Lin, Jie Zhan, Yizhou Shi. Deep Multi-Modal Ship Detection and Classification Network |
| 4271 | -- | 4286 | Yanjie Liang, Qiangqiang Wu, Lin Cheng, Changqun Xia, Jia Li 0003. Progressive Semantic-Visual Alignment and Refinement for Vision-Language Tracking |
| 4287 | -- | 4299 | Dan Song 0006, Xuanpu Zhang, Jianhao Zeng, Pengxin Zhan, Qingguo Chen, Weihua Luo, An-An Liu. Better Fit: Accommodate Variations in Clothing Types for Virtual Try-On |
| 4300 | -- | 4313 | Yiyao Fan, Jun Lin 0003, Changming Sun, Tianhao Wang 0009, Yuehan Qi, Guanyu Zhang, Yang Liu 0333. An Image Terrain Map Model for Texture Filtering |
| 4314 | -- | 4328 | Yuanwei Liu, Nian Liu, Yi Wu, Hisham Cholakkal, Rao Muhammad Anwer, Xiwen Yao, Junwei Han. NTRENet++: Unleashing the Power of Non-Target Knowledge for Few-Shot Semantic Segmentation |
| 4329 | -- | 4340 | Rao Fu, Qian Li, Cheng Wen 0001, Ning An 0002, Fulin Tang. A Novel Framework for Learning Bézier Decomposition From 3D Point Clouds |
| 4341 | -- | 4356 | Sai Yang, Bin Hu 0023, Fan Liu 0003, Xiaoxin Wu 0004, Weiping Ding 0001, Jun Zhou 0001. IPT-ILR: Image Pyramid Transformer Coupled With Information Loss Regularization for All-in-One Image Restoration |
| 4357 | -- | 4369 | Han Zhu 0003, Zhenzhong Chen, Shan Liu 0001. Information Bottleneck Based Self-Distillation: Boosting Lightweight Network for Real-World Super-Resolution |
| 4370 | -- | 4383 | Chenyang Shi, Boyi Wei, Xiucheng Wang, Hanxiao Liu, Yibo Zhang, Wenzhuo Li, Ningfang Song, Jing Jin. Polarity-Focused Denoising for Event Cameras |
| 4384 | -- | 4396 | Hong Zhu, Pingping Zhang, Lei Xue, Guanglin Yuan. Multi-Modal Understanding and Generation for Object Tracking |
| 4397 | -- | 4408 | Guang-yong Chen, Chao-Wei Zheng, Guodong Fan, Jian-Nan Su, Min Gan, C. L. Philip Chen. Real-World Image Reflection Removal: An Ultra-High-Definition Dataset and an Efficient Baseline |
| 4409 | -- | 4422 | Xinmiao Ding, Jinming Lou, Wenyang Luo, Yufan Liu, Bing Li 0001, Weiming Hu. iESTA: Instance-Enhanced Spatial-Temporal Alignment for Video Copy Localization |
| 4423 | -- | 4436 | Qianyu Zhang, Bolun Zheng, Xingying Chen, Quan Chen, Zunjie Zhu, Canjin Wang, Zongpeng Li, Xu Jia 0012, Chengang Yan. Hierarchical Frequency-Based Upsampling and Refining for HEVC Compressed Video Enhancement |
| 4437 | -- | 4449 | Haochen Yu, Weixi Gong, Jiansheng Chen, Huimin Ma 0001. GET3DGS: Generate 3D Gaussians Based on Points Deformation Fields |
| 4450 | -- | 4463 | Shuyang Wang, Kang Liu 0014, Ju Huang, Xuelong Li 0001. FLDet: Faster and Lighter Aerial Object Detector |
| 4464 | -- | 4478 | Yang Yang 0080, Chao Wang 0003, Lei Gong, Min Wu 0008, Zhenghua Chen, Yingxue Gao, Teng Wang, Xuehai Zhou. Uncertainty-Aware Self-Knowledge Distillation |
| 4479 | -- | 4492 | Jie Wang, Xiangji Kong, Nana Yu, Zihao Zhang, Yahong Han. Explicitly Disentangling and Exclusively Fusing for Semi-Supervised Bi-Modal Salient Object Detection |
| 4493 | -- | 4505 | Yicong He, George K. Atia. Scalable and Robust Tensor Ring Decomposition for Large-Scale Data With Missing Data and Outliers |
| 4506 | -- | 4520 | Hao Zhang, Sicheng Li, Yupeng Gui, Zhiyong Li 0016, Shusong Xu, YanHeng Lu, Dimin Niu, Hongzhong Zheng, Yen-Kuang Chen, Yuan Xie 0001, Yibo Fan. A Tightly Coupled AI-ISP Vision Processor |
| 4521 | -- | 4534 | Jie Wang, Nana Yu, Zihao Zhang, Yahong Han. Single-Group Generalized RGB and RGB-D Co-Salient Object Detection |
| 4535 | -- | 4548 | Jinzheng Guang, Shichao Wu, Zhengxi Hu, Qianyi Zhang, Peng Wu, Jingtai Liu. DCCLA: Dense Cross Connections With Linear Attention for LiDAR-Based 3D Pedestrian Detection |
| 4549 | -- | 4559 | Tong Zhao, Qiang Fang, Xin Xu 0001. Denser Teacher: Rethinking Dense Pseudo-Label for Semi-Supervised Oriented Object Detection |
| 4560 | -- | 4575 | Chao You, Licheng Jiao, Lingling Li 0002, Xu Liu 0006, Fang Liu 0001, Wenping Ma 0001, Shuyuan Yang 0001. Contour Knowledge-Aware Perception Learning for Semantic Segmentation |
| 4576 | -- | 4591 | Ting Luo 0001, Yuhang Zhou, Zhouyan He, Gangyi Jiang, Haiyong Xu, Shuren Qi, Yushu Zhang 0001. StegMamba: Distortion-Free Immune-Cover for Multi-Image Steganography With State Space Model |
| 4592 | -- | 4607 | Zeng You, Zhiquan Wen, Yaofo Chen, Xin Li 0034, Runhao Zeng, Yaowei Wang 0001, Mingkui Tan. Toward Long Video Understanding via Fine-Detailed Video Story Generation |
| 4608 | -- | 4618 | Yanfeng Zheng, Zhong Luo, Ying Cao 0001, Xiaosong Yang, Weiwei Xu, Zheng Lin 0005, Nan Yin, Pengjie Wang 0001. Unsupervised Salient Object Detection on Light Field With High-Quality Synthetic Labels |
| 4619 | -- | 4634 | Xiaoyan Yu, Shen Zhou, Huafeng Li, Liehuang Zhu. Multi-Expert Adaptive Selection: Task-Balancing for All-in-One Image Restoration |
| 4635 | -- | 4647 | Huafeng Chen, Pengxu Wei, Guangqian Guo, Shan Gao 0003. SAM-COD+: SAM-Guided Unified Framework for Weakly-Supervised Camouflaged Object Detection |
| 4648 | -- | 4660 | Yue Wu 0004, Jiayi Lei, Yongzhe Yuan, Xiaolong Fan, Maoguo Gong, Wenping Ma 0001, Qiguang Miao, Mingyang Zhang 0002. Equivariance-Based Markov Decision Process for Unsupervised Point Cloud Registration |
| 4661 | -- | 4674 | Qiang Qiao, Meixia Qu, Wenyu Wang, Bin Jiang 0011, Qiang Guo 0003. Effective Global Context Integration for Lightweight 3D Medical Image Segmentation |
| 4675 | -- | 4685 | Hu Ding, Yan Yan 0001, Yang Lu 0009, Jing-Hao Xue, Hanzi Wang. Uncertainty-Aware Label Refinement on Hypergraphs for Personalized Federated Facial Expression Recognition |
| 4686 | -- | 4697 | Zhimao Peng, Enguang Wang, Xialei Liu, Ming-Ming Cheng. Predictive Sample Assignment for Semantically Coherent Out-of-Distribution Detection |
| 4698 | -- | 4712 | Lingyun Yu 0002, Tian Xie, Chuanbin Liu 0001, Guoqing Jin, Zhiguo Ding 0006, Hongtao Xie. Distilling Multi-Level Semantic Cues Across Multi-Modalities for Face Forgery Detection |
| 4713 | -- | 4726 | Shuhan Dong, Weiying Xie, Danian Yang, Yunsong Li, Jiaqing Zhang, Jiayuan Tian, Jie Lei 0001. SeaDATE: Remedy Dual-Attention Transformer With Semantic Alignment via Contrast Learning for Multimodal Object Detection |
| 4727 | -- | 4739 | Wenjie Li, Xiaolong Li 0001, Rongrong Ni, Yao Zhao 0001. Extracting High-Discriminative Features for Detecting Double JPEG Compression With the Same Quantization Matrix |
| 4740 | -- | 4752 | Xin Guo, Xi Wang, Xueyang Fu, Zheng-Jun Zha. Deep Unfolding Network for Image Desnowing With Snow Shape Prior |
| 4753 | -- | 4767 | Ye Zhang, Yifeng Wang 0001, Zijie Fang, Hao Bian, Linghan Cai, Ziyue Wang 0005, Yongbing Zhang 0002. DAWN: Domain-Adaptive Weakly Supervised Nuclei Segmentation via Cross-Task Interactions |
| 4768 | -- | 4783 | Shaohui Li, Shuoyu Ma, Wenrui Dai, Nuowen Kan, Fan Cheng 0002, Chenglin Li, Junni Zou, Hongkai Xiong. Task-Adapted Learnable Embedded Quantization for Scalable Human-Machine Image Compression |
| 4784 | -- | 4800 | Xiaojiao Guo, Xuhang Chen 0002, Shuqiang Wang, Chi-Man Pun. Underwater Image Restoration Through a Prior Guided Hybrid Sense Approach and Extensive Benchmark Analysis |
| 4801 | -- | 4815 | XiangHai Wang 0001, Liyang Song, Yining Feng, Junheng Zhu. S3F2Net: Spatial-Spectral-Structural Feature Fusion Network for Hyperspectral Image and LiDAR Data Classification |
| 4816 | -- | 4830 | Tongbo Wang, Lin Zhu 0012, Hua Huang 0001. Enhancing Real-Time Object Detection With Optical Flow-Guided Streaming Perception |
| 4831 | -- | 4845 | Zhenghua Huang, Cheng Lin, Biyun Xu, Menghan Xia, Qian Li 0019, Yansheng Li 0001, Nong Sang. 2EA: Target-Aware Taylor Expansion Approximation Network for Infrared and Visible Image Fusion |
| 4846 | -- | 4856 | Li Yu 0004, Hongchao Zhong, Longkun Zou, Ke Chen 0004, Pan Gao 0001. Bridging Domain Gap of Point Cloud Representations via Self-Supervised Geometric Augmentation |
| 4857 | -- | 4869 | Ruiheng Zhang, Zhe Cao, Yan Huang 0023, Shuo Yang 0006, Lixin Xu, Min Xu 0001. Visible-Infrared Person Re-Identification With Real-World Label Noise |
| 4870 | -- | 4882 | Zelin Liu, Xinggang Wang, Cheng Wang, Wenyu Liu 0001, Xiang Bai. SparseTrack: Multi-Object Tracking by Performing Scene Decomposition Based on Pseudo-Depth |
| 4883 | -- | 4895 | Yuan Zhao, Jiayu Sun, Lihe Zhang, Huchuan Lu. FocusCLIP: Focusing on Anomaly Regions by Visual-Text Discrepancies |
| 4896 | -- | 4909 | Chunxiao Liu, Zelong Wang, Philip Birch, Xun Wang 0007. Efficient Retinex-Based Framework for Low-Light Image Enhancement Without Additional Networks |
| 4910 | -- | 4922 | Sun'ao Liu, Hongtao Xie, Jiannan Ge, Yongdong Zhang 0001. ReferSAM: Unleashing Segment Anything Model for Referring Image Segmentation |
| 4923 | -- | 4936 | Chaojun Dong, Chengxuan Wang, Yikui Zhai, Ye Li, Jianhong Zhou, Pasquale Coscia, Angelo Genovese, Vincenzo Piuri, Fabio Scotti. GMTNet: Dense Object Detection via Global Dynamically Matching Transformer Network |
| 4937 | -- | 4948 | An-An Liu, Quanhan Wu, Ning Xu 0003, Hongshuo Tian, Lanjun Wang. Enriched Image Captioning Based on Knowledge Divergence and Focus |
| 4949 | -- | 4962 | Jian Wang 0113, Fan Li 0003, Lijun He 0001. A Unified Framework for Adversarial Patch Attacks Against Visual 3D Object Detection in Autonomous Driving |
| 4963 | -- | 4975 | Yaoye Song, Peng Zhang 0005, Wei Huang 0013, Yufei Zha, Yanning Zhang 0001. Flexible Temperature Parallel Distillation for Dense Object Detection: Make Response-Based Knowledge Distillation Great Again |
| 4976 | -- | 4990 | Enki Cho, Jung-Uk Kim, Seong Tae Kim 0001. Spatial Mask-Based Adaptive Robust Training for Video Object Segmentation With Noisy Labels |
| 4991 | -- | 5005 | Shuo Li 0010, Fang Liu 0001, Licheng Jiao, Lingling Li 0002, Puhua Chen, Xu Liu 0006, Wenping Ma 0001. Prompt-Based Concept Learning for Few-Shot Class-Incremental Learning |
| 5006 | -- | 5021 | Taeheon Kim, Sangyun Chung, Damin Yeom, Youngjoon Yu, Hak Gu Kim, Yong Man Ro. MSCoTDet: Language-Driven Multi-Modal Fusion for Improved Multispectral Pedestrian Detection |
| 5022 | -- | 5036 | Zhihao Li, Huaxiang Zhang 0001, Lei Zhu 0002, Jiande Sun 0001, Li Liu 0031. Heterogeneous Generative Tokens and Distance-Aware Recovery Network for Occluded Person Re-Identification |
| 5037 | -- | 5050 | Shilin Xu, Xiangtai Li, Size Wu, Wenwei Zhang, Yunhai Tong, Chen Change Loy. DST-Det: Open-Vocabulary Object Detection via Dynamic Self-Training |
| 5051 | -- | 5066 | Rongshan Chen, Hao Sheng 0001, Da Yang 0001, Zhenglong Cui, Ruixuan Cong. Surface-Continuous Scene Representation for Light Field Depth Estimation via Planarity Prior |
| 5067 | -- | 5077 | Xiangzeng Liu, Jianfeng Guo, Hao Chen, Qiguang Miao, Yue Xi, Ruyi Liu. Adaptive Occlusion-Aware Network for Occluded Person Re-Identification |
| 5078 | -- | 5091 | Shangshu Yu, Meiqing Wu, Siew Kei Lam. VFM-Depth: Leveraging Vision Foundation Model for Self-Supervised Monocular Depth Estimation |
| 5092 | -- | 5108 | Xi Wang, Wei Liu 0004, Shimin Gong, Zhi Liu 0002, Jing Xu 0005, Yuming Fang. Spatial Quality Oriented Rate Control for Volumetric Video Streaming via Deep Reinforcement Learning |
| 5109 | -- | 5122 | Rongyu Zhang, Jiaming Liu 0003, Xiaoqi Li 0009, Xiaowei Chi, Dan Wang 0002, Li Du, Yuan Du, Shanghang Zhang. BEVUDA++: Geometric-Aware Unsupervised Domain Adaptation for Multi-View 3D Object Detection |