| 1973 | -- | 1986 | Yao Wu, Xia Kong, Yuan Xie 0006, Yanyun Qu. RE-GZSL: Relation Extrapolation for Generalized Zero-Shot Learning |
| 1987 | -- | 1998 | Chaocan Xue, Bineng Zhong, Qihua Liang, Haiying Xia, Shuxiang Song 0001. Unifying Motion and Appearance Cues for Visual Tracking via Shared Queries |
| 1999 | -- | 2012 | Xiaowei Fu, Lina Ma, Lei Zhang 0038. Remove to Regenerate: Boosting Adversarial Generalization With Attack Invariance |
| 2013 | -- | 2025 | Ming Ma, Tongzhou Zhang 0001, Ziming Wang, Yue Wang, Taoli Du, Wenhui Li 0002. Global Channel Pruning With Self-Supervised Mask Learning |
| 2026 | -- | 2044 | Xinggang Hu, Yanmin Wu, Mingyuan Zhao, Linghao Yang, Xiangkui Zhang, Xiangyang Ji. PAS-SLAM: A Visual SLAM System for Planar-Ambiguous Scenes |
| 2045 | -- | 2056 | Li Wang, Yunzhou Zhang, Fawei Ge, Wenjing Bai, Jinpeng Zhang, Yifan Wang. Learning Local Features by Jointly Semantic-Guided and Task Rewards |
| 2057 | -- | 2073 | Junduan Huang, Zifeng Li, Sushil Bhattacharjee, Sébastien Marcel, Wenxiong Kang. Mirror-Based Full-View Finger Vein Authentication With Illumination Adaptation |
| 2074 | -- | 2086 | Xize Wu, Jiasong Wu, Lei Zhu 0002, Lotfi Senhadji, Huazhong Shu. Collaborative Aware Bidirectional Semantic Reasoning for Video Question Answering |
| 2087 | -- | 2100 | Jiayu Ye, Yanhong Yu, Lin Lu, Hao Wang, Yunshao Zheng, Yang Liu, Qingxiang Wang. DEP-Former: Multimodal Depression Recognition Based on Facial Expressions and Audio Features via Emotional Changes |
| 2101 | -- | 2117 | Tianming Zhuang, Zhen Qin 0002, Yi Ding 0003, Zhiguang Qin, Ji Geng 0001, Yi Liu, Kim-Kwang Raymond Choo. DSDC-GCN: Decoupled Static-Dynamic Co-Occurrence Graph Convolutional Networks for Skeleton-Based Action Recognition |
| 2118 | -- | 2132 | Tian He, Yang Chen, Xu Gao, Ling Wang, Ting Hu, Hong Cheng 0002. Enhancing Skeleton-Based Action Recognition With Language Descriptions From Pre-Trained Large Multimodal Models |
| 2133 | -- | 2146 | Naisong Luo, Yuan Wang, Rui Sun 0006, Guoxin Xiong, Tianzhu Zhang, Feng Wu 0005. Exploring the Better Correlation for Few-Shot Video Object Segmentation |
| 2147 | -- | 2160 | Jianhui Jin, Qiuping Jiang, Qingyuan Wu, Binwei Xu, Runmin Cong. Underwater Salient Object Detection via Dual-Stage Self-Paced Learning and Depth Emphasis |
| 2161 | -- | 2172 | Xinchen Ye, Yuxiang Ou, Biao Wu, Rui Xu 0002, Haojie Li. Self-Supervised Monocular Depth Estimation From Videos via Adaptive Reconstruction Constraints |
| 2173 | -- | 2187 | Tianyu Yang, Yeqiang Qian, Weihao Yan, Chunxiang Wang, Ming Yang 0002. AdaptiveOcc: Adaptive Octree-Based Network for Multi-Camera 3D Semantic Occupancy Prediction in Autonomous Driving |
| 2188 | -- | 2201 | Yonghao Dong, Le Wang 0003, Sanping Zhou, Gang Hua 0001, Changyin Sun. Recurrent Aligned Network for Generalized Pedestrian Trajectory Prediction |
| 2202 | -- | 2214 | Chang Liu 0071, Ziqi Guan, Simiao Lai, Yang Liu 0066, Huchuan Lu, Dong Wang 0004. EMTrack: Efficient Multimodal Object Tracking |
| 2215 | -- | 2229 | Xin Lu 0007, Yixuan Pan, Yichao Cao, Xin Zhou, Xiaobo Lu. Variational Feature Imitation Conditioned on Visual Descriptions for Few-Shot Fine-Grained Recognition |
| 2230 | -- | 2243 | Qing Liu, Xianlun Tang, Ying Wang, Xingchen Li, Xinyan Jiang, Weisheng Li 0001. Feature Transductive Distribution Optimization for Few-Shot Image Classification |
| 2244 | -- | 2259 | Hengbo Qi, Xuechao Chen, Zhangguo Yu, Chao Li, Yongliang Shi, Qingrui Zhao, Qiang Huang 0002. Semantic-Independent Dynamic SLAM Based on Geometric Re-Clustering and Optical Flow Residuals |
| 2260 | -- | 2275 | Yunfeng Li, Bo Wang 0015, Jiuran Sun, Xueyi Wu, Ye Li. RGB-Sonar Tracking Benchmark and Spatial Cross-Attention Transformer Tracker |
| 2276 | -- | 2288 | Lei Yao, Yi Wang 0068, Moyun Liu, Lap-Pui Chau. SGIFormer: Semantic-Guided and Geometric-Enhanced Interleaving Transformer for 3D Instance Segmentation |
| 2289 | -- | 2300 | Jiahao Nie 0001, Anqi Xu, Zhengyi Bao, Zhiwei He 0001, Xudong Lv, Mingyu Gao 0002. Context Matching-Guided Motion Modeling for 3D Point Cloud Object Tracking |
| 2301 | -- | 2314 | Liyun Yu, Ziyu Guan, Wei Zhao 0019, Yaming Yang 0002, Jiale Tan. Adaptive Task-Aware Refining Network for Few-Shot Fine-Grained Image Classification |
| 2315 | -- | 2327 | Yuping Liang, Guangming Shi, Jinjian Wu. Scene Prior Constrained Self-Paced Learning for Unsupervised Satellite Video Vehicle Detection |
| 2328 | -- | 2340 | Guozhu Jiang, Yongshan Zhang, Xinxin Wang, Xinwei Jiang, Lefei Zhang. Structured Anchor Learning for Large-Scale Hyperspectral Image Projected Clustering |
| 2341 | -- | 2354 | Chenyang Qian, Lingfei Song, Hua Huang 0001. A Destriping Framework With Arbitrary Bounded Image Denoisers |
| 2355 | -- | 2368 | Xiaoqian Huang, Yong Gong, Wenhao Wu, Saike Zhu, Yi Zhao. CSDet: A Compressed Sensing Object Detection Architecture With Lightweight Networks |
| 2369 | -- | 2381 | Yong Zhu, Zhenyu Wen, Xiong Li, Xiufang Shi, Xiang Wu, Hui Dong, Jiming Chen 0001. ChatNav: Leveraging LLM to Zero-Shot Semantic Reasoning in Object Navigation |
| 2382 | -- | 2397 | Zhengyu Zhang, Shishun Tian, Jinjia Zhou, Luce Morin, Lu Zhang 0037. A New Benchmark Database and Objective Metric for Light Field Image Quality Evaluation |
| 2398 | -- | 2409 | Tianxiang Chen, Zhentao Tan, Tao Gong, Qi Chu 0001, Yue Wu, Bin Liu 0016, Nenghai Yu, Le Lu 0001, Jieping Ye. Bootstrapping Audio-Visual Video Segmentation by Strengthening Audio Cues |
| 2410 | -- | 2421 | Dechen Kong, Xi Yang 0011, Nannan Wang 0001, Xinbo Gao 0001. Perspectives of Calibrated Adaptation for Few-Shot Cross-Domain Classification |
| 2422 | -- | 2435 | Jinhui Qin, Yong Ma 0001, Jun Huang 0008, Zhanchuan Cai, Fan Fan 0001, You Du. An End-to-End Network for Rotary Motion Deblurring in the Polar Coordinate System |
| 2436 | -- | 2449 | Kumie Gedamu, Yanli Ji, Yang Yang 0002, Jie Shao 0001, Heng Tao Shen. Visual-Semantic Alignment Temporal Parsing for Action Quality Assessment |
| 2450 | -- | 2460 | Feiyan Wu, Zhunga Liu, Zuowei Zhang 0001, Jiaxiang Liu, Longfei Wang. Collaborative Global-Local Structure Network With Knowledge Distillation for Imbalanced Data Classification |
| 2461 | -- | 2473 | Xin Lin, Jingtong Yue, Sixian Ding, Chao Ren 0002, Lu Qi, Ming-Hsuan Yang 0001. Dual Degradation Representation for Joint Deraining and Low-Light Enhancement in the Dark |
| 2474 | -- | 2486 | Tian-Bao Li, Yu-Ting Su 0001, Dan Song 0006, Wen-Hui Li 0001, Zhiqiang Wei 0002, An-An Liu. Multi-Scale Spatial-Temporal Transformer for Meteorological Variable Forecasting |
| 2487 | -- | 2498 | Wenzhe Zhai, Xianglei Xing, Mingliang Gao 0001, Qilei Li. Zero-Shot Object Counting With Vision-Language Prior Guidance Network |
| 2499 | -- | 2512 | Hairui Ren, Fan Tang, Huangjie Zheng, He Zhao 0001, Dandan Guo, Yi Chang 0001. Modality-Consistent Prompt Tuning With Optimal Transport |
| 2513 | -- | 2525 | Jie Wang, Guoqiang Li, HongJie Yu, Jinwen Xi, Jie Shi, Xueying Wu. Intra-Modality Self-Enhancement Mirror Network for RGB-T Salient Object Detection |
| 2526 | -- | 2540 | Qiang Liu 0004, Yanlong Qiu, Tongqing Zhou, Ming Xu 0002, Jiaohua Qin, Wentao Ma, Fan Zhang, Zhiping Cai. Mitigating Cross-Modal Retrieval Violations With Privacy-Preserving Backdoor Learning |
| 2541 | -- | 2555 | Ching-Nung Yang, Lizhi Xiong, Shu-Yu Liu, Chih-Yueh Tseng, Xiaodan Tai, Wenbo Wan. RP-ASAF: Anonymous Submission of Application Framework Using RDHSI and Polynomial Interpolation |
| 2556 | -- | 2569 | Yamin Han, Mingyu Cai, Jie Wu, Zhixuan Bai, Tao Zhuo, Hongming Zhang 0002, Yanning Zhang 0001. Visual Object Tracking With Multi-Frame Distractor Suppression |
| 2570 | -- | 2585 | Jinyu Zhan, Shiyu Zou, Wei Jiang 0016, Youyuan Zhang, Suidi Peng, Ying Wang 0001. Accelerate Point Cloud Structuring for Deep Neural Networks via Fast Spatial-Searching Tree |
| 2586 | -- | 2602 | Mengda Xie, Yiling He, Zhan Qin, Meie Fang. RetouchUAA: Unconstrained Adversarial Attack via Realistic Image Retouching |
| 2603 | -- | 2615 | Zuoyong Li, Qinghua Lin, Haoyi Fan, Tiesong Zhao, David Zhang 0001. SIAVC: Semi-Supervised Framework for Industrial Accident Video Classification |
| 2616 | -- | 2631 | Zihua Zhao, Zhe Cao, Haonan Xin, Rong Wang 0001, Danyang Wu, Zheng Wang 0037, Feiping Nie 0001. Enhancing Clustering Performance With Tensorized High-Order Bipartite Graphs: A Structured Graph Learning Approach |
| 2632 | -- | 2645 | Gaochang Wu, Yapeng Zhang, Lan Deng, Jingxin Zhang 0001, Tianyou Chai. Cross-Modal Learning for Anomaly Detection in Complex Industrial Process: Methodology and Benchmark |
| 2646 | -- | 2661 | Tianjun Zhang, Lin Zhang 0014, Fengyi Zhang, Shengjie Zhao, Yicong Zhou. I-DACS: Always Maintaining Consistency Between Poses and the Field for Radiance Field Construction Without Pose Prior |
| 2662 | -- | 2674 | Zhe Cao, Yihang Lu, Jinghui Yuan, Haonan Xin, Rong Wang 0001, Feiping Nie 0001. Tensorized Graph Learning for Spectral Ensemble Clustering |
| 2675 | -- | 2688 | Honglin Liu, Qirong Mao, Ming Dong 0001, Yongzhao Zhan 0001. Infrared-Visible Image Fusion Using Dual-Branch Auto-Encoder With Invertible High-Frequency Encoding |
| 2689 | -- | 2700 | Hu Gao, Depeng Dang. Exploring Richer and More Accurate Information via Frequency Selection for Image Restoration |
| 2701 | -- | 2711 | Xiang Li, Xiaolong Li 0001, Shaohai Hu, Yao Zhao 0001. Steganography-Enhanced Prediction-Error Expansion: A Novel Reversible Data Hiding Framework |
| 2712 | -- | 2724 | Xianlei Long, Xiaxin Zhu, Fangming Guo, Chao Chen 0004, Xiangwei Zhu, Fuqiang Gu, Songyu Yuan, Chunlong Zhang. Spike-BRGNet: Efficient and Accurate Event-Based Semantic Segmentation With Boundary Region-Guided Spiking Neural Networks |
| 2725 | -- | 2739 | Weiying Xie, Wenjie Shao, Daixun Li, Yunsong Li, Leyuan Fang. MIFNet: Multi-Scale Interaction Fusion Network for Remote Sensing Image Change Detection |
| 2740 | -- | 2752 | Yuanting Fan, Chengxu Liu, Ruhao Tian, Xueming Qian. InstanceSR: Efficient Reconstructing Small Object With Differential Instance-Level Super-Resolution |
| 2753 | -- | 2767 | Kaihao Lin, Guoqing Wang 0001, Tianyu Li, Yuhui Wu, Chongyi Li, Yang Yang 0002, Heng Tao Shen. Toward Generalized and Realistic Unpaired Image Dehazing via Region-Aware Physical Constraints |
| 2768 | -- | 2781 | Yeming Chen, Siyu Zhang, Yaoru Sun, Jun Yang 0056, Weijian Liang, Haoran Wang. Artificial-Spiking Hierarchical Networks for Vision-Language Representation Learning |
| 2782 | -- | 2793 | Jiebin Yan, Jiale Rao, Junjie Chen 0008, Ziwen Tan, Weide Liu, Yuming Fang. Multitask Auxiliary Network for Perceptual Quality Assessment of Non-Uniformly Distorted Omnidirectional Images |
| 2794 | -- | 2805 | Jiawei Chen, Qi Song, Wenzhong Guo, Rui Huang 0001. DSC3D: Deformable Sampling Constraints in Stereo 3D Object Detection for Autonomous Driving |
| 2806 | -- | 2819 | Longtao Feng, Qian Yin, Siwei Ma. Content-Adaptive Rate Control Method for User-Generated Content Videos |
| 2820 | -- | 2831 | Xin Fang, Xiaolin Wu 0001, Fan Li 0003, Yiping Duan, Xiaoming Tao 0001. Group Image Compression for Dual Use of Machine and Human Vision |
| 2832 | -- | 2843 | Chang Liu, Leilei Huang, Chenyang Zhang, Wei Li, Zhijian Hao, Yibo Fan. Hardware Implementation of a High-Accuracy and High-Throughput Rate Estimation Unit for VVC Residual Coding |
| 2844 | -- | 2853 | Zehan Wang, Yuxuan Wei, Hui Yuan 0001, Wei Zhang 0072, Peng Li. Rate-Distortion Optimized Skip Coding of Region Adaptive Hierarchical Transform Coefficients for MPEG G-PCC |
| 2854 | -- | 2869 | Pekka Astola, Alireza Aminlou, Ramin Ghaznavi Youvalari, Jani Lainema. Convolutional Cross-Component Models for Chroma Prediction in Video Coding |
| 2870 | -- | 2880 | Binzhe Li, Shurun Wang, Shiqi Wang 0001, Yan Ye. High Efficiency Image Compression for Large Visual-Language Models |
| 2881 | -- | 2893 | Shenshen Li, Xing Xu 0001, Chen He, Fumin Shen, Yang Yang 0002, Heng Tao Shen. Cross-Modal Uncertainty Modeling With Diffusion-Based Refinement for Text-Based Person Retrieval |
| 2894 | -- | 2904 | Huilin Ge, Xiaolei Liu, Zihang Guo, Zhiwen Qiu. Learning to Diversify for Robust Video Moment Retrieval |
| 2905 | -- | 2917 | Mingxin Jin, Cong Wang 0033, Yuan Yuan 0001. Dual Heterogeneous Network for Hyperspectral Image Classification |
| 2918 | -- | 2931 | Di-Hua Zhai, Hao Li, Qingyuan Liu, Ke Tian, Yi Yang, Zhenyao Chang, Shuo Wang, Yuanqing Xia. Focus-TransUnet3D: High-Precision Model for 3D Segmentation of Medical Point Targets |
| 2932 | -- | 2944 | Yang Yu, Qingxuan Lv, Yuezun Li, Zhiqiang Wei 0002, Junyu Dong. PhyTracker: An Online Tracker for Phytoplankton |