| 1983 | -- | 1999 | Jianwu Fang, Jiahuan Qiao, Jianru Xue, Zhengguo Li. Vision-Based Traffic Accident Detection and Anticipation: A Survey |
| 2000 | -- | 2009 | Aqi Gao, Yanwei Pang, Jing Nie, Zhuang Shao, Jiale Cao, Yishun Guo, Xuelong Li 0001. ESGN: Efficient Stereo Geometry Network for Fast 3D Object Detection |
| 2010 | -- | 2025 | Zan Gao, Peng Chen, Tao Zhuo, Meng Liu 0006, Lei Zhu 0002, Meng Wang 0001, Shengyong Chen. A Semantic Perception and CNN-Transformer Hybrid Network for Occluded Person Re-Identification |
| 2026 | -- | 2040 | Shenghao Li, Zeyang Xia, Qunfei Zhao. Representing Boundary-Ambiguous Scene Online With Scale-Encoded Cascaded Grids and Radiance Field Deblurring |
| 2041 | -- | 2055 | Yuxin Chen, Ziqi Zhang, Zhongang Qi, Chunfeng Yuan, Jie Wang, Ying Shan, Bing Li 0001, Weiming Hu, Xiaohu Qie, Jianping Wu. DARTScore: DuAl-Reconstruction Transformer for Video Captioning Evaluation |
| 2056 | -- | 2069 | Guanzhou Ke, Guoqing Chao, Xiaoli Wang, Chenyang Xu, Yongqi Zhu, Yang Yu. A Clustering-Guided Contrastive Fusion for Multi-View Representation Learning |
| 2070 | -- | 2082 | Yueming Lyu, Yue Jiang, Bo Peng 0002, Jing Dong 0003. InfoStyler: Disentanglement Information Bottleneck for Artistic Style Transfer |
| 2083 | -- | 2096 | Hao Li, Di-Hua Zhai, Yuanqing Xia. ERDUnet: An Efficient Residual Double-Coding Unet for Medical Image Segmentation |
| 2097 | -- | 2111 | Yanfeng Wang, Lv Tang, Yijie Zhong, Bo Li 0115. From Composited to Real-World: Transformer-Based Natural Image Matting |
| 2112 | -- | 2124 | Daikun Liu, Teng Wang, Changyin Sun. Voxel-Based Multi-Scale Transformer Network for Event Stream Processing |
| 2125 | -- | 2135 | Yaozong Zheng, Bineng Zhong, Qihua Liang, Guorong Li, Rongrong Ji, Xianxian Li. Toward Unified Token Learning for Vision-Language Tracking |
| 2136 | -- | 2150 | Lili Liu, Zhaoqiang Xia, Xiaobiao Zhang, Jinye Peng, Xiaoyi Feng, Guoying Zhao 0001. Information-Enhanced Network for Noncontact Heart Rate Estimation From Facial Videos |
| 2151 | -- | 2165 | Yingbo Tang, Zhiqiang Cao, Yuequan Yang, Jierui Liu, Junzhi Yu. Semi-Supervised Few-Shot Object Detection via Adaptive Pseudo Labeling |
| 2166 | -- | 2178 | Xiaojia Zhao, Qiangqiang Shen, Yongyong Chen, Yongsheng Liang, Junxin Chen, Yicong Zhou. Self-Completed Bipartite Graph Learning for Fast Incomplete Multi-View Clustering |
| 2179 | -- | 2194 | Lian Wu, Chao Huang 0008, Lunke Fei, Shuping Zhao, Jianchuan Zhao, Zhongwei Cui, Yong Xu 0001. Video-Based Fall Detection Using Human Pose and Constrained Generative Adversarial Network |
| 2195 | -- | 2208 | Yuandong Li, Qinglei Hu, Zhenchao Ouyang, Shuhan Shen. Neural Reflectance Decomposition Under Dynamic Point Light |
| 2209 | -- | 2222 | Wei Cong, Yang Cong, Gan Sun, Yuyang Liu, Jiahua Dong 0001. Self-Paced Weight Consolidation for Continual Learning |
| 2223 | -- | 2234 | Lixiong Qin, Mei Wang, Chao Deng, Ke Wang, Xi Chen, Jiani Hu, Weihong Deng. SwinFace: A Multi-Task Transformer for Face Recognition, Expression Recognition, Age Estimation and Attribute Estimation |
| 2235 | -- | 2248 | Bingtao Ma, Yang Cong, Yu Ren. IOSL: Incremental Open Set Learning |
| 2249 | -- | 2260 | Yinyuan Wang, Haowen Du, Zhuo Cheng, Changxin Gao, Longsheng Wei, Bin Fang, Fei Xiao, Dapeng Luo. KRRNet: Keypoint Relational Regression Network for Bottom-Up Anchor-Free Object Detection |
| 2261 | -- | 2273 | Shansi Zhang, Nan Meng, Edmund Y. Lam. Unsupervised Light Field Depth Estimation via Multi-View Feature Matching With Occlusion Prediction |
| 2274 | -- | 2288 | Haomin Liu, Linsheng Zhao, Zhen Peng, Weijian Xie, Mingxuan Jiang, Hongbin Zha, Hujun Bao, Guofeng Zhang 0001. A Low-Cost and Scalable Framework to Build Large-Scale Localization Benchmark for Augmented Reality |
| 2289 | -- | 2303 | Yu Ran, Weijia Wang, Mingjie Li 0004, Lin-Cheng Li, Yuan-Gen Wang, Jin Li 0002. Cross-Shaped Adversarial Patch Attack |
| 2304 | -- | 2317 | Hengmin Zhang, Bihan Wen, Zhiyuan Zha, Bob Zhang 0001, Yang Tang, Guo Yu 0001, Wenli Du. Accelerated PALM for Nonconvex Low-Rank Matrix Recovery With Theoretical Analysis |
| 2318 | -- | 2331 | Yusheng Peng, Gaofeng Zhang, Jun Shi 0006, Xiangyu Li, Liping Zheng. MRGTraj: A Novel Non-Autoregressive Approach for Human Trajectory Prediction |
| 2332 | -- | 2345 | Yu Qiu, Yun Liu 0011, Le Zhang 0001, Haotian Lu, Jing Xu. Boosting Salient Object Detection With Transformer-Based Asymmetric Bilateral U-Net |
| 2346 | -- | 2360 | Yang Yang, Qiang Zhang 0020. Finding Camouflaged Objects Along the Camouflage Mechanisms |
| 2361 | -- | 2373 | Guodong Du 0005, Liyan Zhang 0001. Enhanced Invariant Feature Joint Learning via Modality-Invariant Neighbor Relations for Cross-Modality Person Re-Identification |
| 2374 | -- | 2384 | Kangkai Zhang, Shiming Ge, Ruixin Shi, Dan Zeng 0001. Low-Resolution Object Recognition With Cross-Resolution Relational Contrastive Distillation |
| 2385 | -- | 2398 | Lu Zou, Zhangjin Huang, Naijie Gu, Guoping Wang. GPT-COPE: A Graph-Guided Point Transformer for Category-Level Object Pose Estimation |
| 2399 | -- | 2413 | Junxing Hu, Hongwen Zhang 0001, Yunlong Wang 0003, Min Ren, Zhenan Sun. Personalized Graph Generation for Monocular 3D Human Pose and Shape Estimation |
| 2414 | -- | 2425 | Shaoyu Zhang 0001, Chen Chen 0036, Qiong Xie, Haigang Sun, Fei Dong, Silong Peng. Distribution Unified and Probability Space Aligned Teacher-Student Learning for Imbalanced Visual Recognition |
| 2426 | -- | 2438 | Kai Zeng, Hui Zhang 0023, Wei Wang, Yaonan Wang 0001, Jianxu Mao. Deep Stereo Network With MRF-Based Cost Aggregation |
| 2439 | -- | 2452 | Linqing Zhao, Wenzhao Zheng, Yueqi Duan, Jie Zhou 0001, Jiwen Lu. SPTR: Structure-Preserving Transformer for Unsupervised Indoor Depth Completion |
| 2453 | -- | 2468 | Fei Wang, Jun Cheng 0002. HQDec: Self-Supervised Monocular Depth Estimation Based on a High-Quality Decoder |
| 2469 | -- | 2483 | Weidong Zhang 0007, Ling Zhou, Peixian Zhuang, Guohou Li, Xipeng Pan, Wenyi Zhao, Chongyi Li. Underwater Image Enhancement via Weighted Wavelet Visual Perception Fusion |
| 2484 | -- | 2497 | Zehong Zhou, Fei Zhou 0001, Guoping Qiu. Blind Image Quality Assessment Based on Separate Representations and Adaptive Interaction of Content and Distortion |
| 2498 | -- | 2511 | Zhaoshuai Qi, Jingqi Pang, Yifeng Hao, Rui Hu, Yanning Zhang. A Minimal Solution for Sphere-Based Camera-Projector Pair Calibration |
| 2512 | -- | 2524 | Yakun Ju, Muwei Jian, Cong Wang 0018, Cong Zhang, Junyu Dong, Kin-Man Lam 0001. Estimating High-Resolution Surface Normals via Low-Resolution Photometric Stereo Images |
| 2525 | -- | 2535 | Xingning Dong, Qingpei Guo, Tian Gan, Qing Wang, Jianlong Wu, Xiangyuan Ren, Yuan Cheng, Wei Chu. 3: Shared Network Pre-Training and Significant Semantic Strengthening for Various Video-Text Tasks |
| 2536 | -- | 2549 | Kaiwei Zhang, Dandan Zhu, Xiongkuo Min, Zhongpai Gao, Guangtao Zhai. Synergetic Assessment of Quality and Aesthetic: Approach and Comprehensive Benchmark Dataset |
| 2550 | -- | 2563 | Lanqing Guo, Renjie Wan, Wenhan Yang, Alex C. Kot, Bihan Wen. Cross-Image Disentanglement for Low-Light Enhancement in Real World |
| 2564 | -- | 2576 | Xingyu Miao, Yang Bai, Haoran Duan, Yawen Huang, Fan Wan, Xinxing Xu, Yang Long 0001, Yefeng Zheng 0001. DS-Depth: Dynamic and Static Depth Estimation via a Fusion Cost Volume |
| 2577 | -- | 2590 | Yuan Rao, Wenjie Liu, Kunqian Li, Hao Fan, Sen Wang 0002, Junyu Dong. Deep Color Compensation for Generalized Underwater Image Enhancement |
| 2591 | -- | 2603 | Jingyang Ye, Erzhen Pan, Wenfu Xu. Digital Video Stabilization Method Based on Periodic Jitters of Airborne Vision of Large Flapping Wing Robots |
| 2604 | -- | 2618 | Yuanjian Qiao, Mingwen Shao, Leiquan Wang, Wangmeng Zuo. Learning Depth-Density Priors for Fourier-Based Unpaired Image Restoration |
| 2619 | -- | 2632 | Ziying Song, Caiyan Jia, Lei Yang, Haiyue Wei, Lin Liu. GraphAlign++: An Accurate Feature Alignment by Graph Matching for Multi-Modal 3D Object Detection |
| 2633 | -- | 2647 | Qinglin Liu, Shengping Zhang, Quanling Meng, Bineng Zhong, Peiqiang Liu, Hongxun Yao. End-to-End Human Instance Matting |
| 2648 | -- | 2662 | Jing Li, Liu Yang, Qinghua Hu. Enhancing Multi-Source Open-Set Domain Adaptation Through Nearest Neighbor Classification With Self-Supervised Vision Transformer |
| 2663 | -- | 2676 | Haojun Dai, Rangding Wang, Dawen Xu 0001, Songhan He, Lin Yang. HEVC Video Steganalysis Based on PU Maps and Multi-Scale Convolutional Residual Network |
| 2677 | -- | 2693 | Hao Su, Xuefeng Liu 0001, Jianwei Niu 0002, Jiahe Cui, Ji Wan, Xinghao Wu, Nana Wang. MARVEL: Raster Gray-Level Manga Vectorization via Primitive-Wise Deep Reinforcement Learning |
| 2694 | -- | 2705 | Weihao Zhao, Han Wu 0002, Weidong He, Haoyang Bi, Hao Wang 0076, Chen Zhu, Tong Xu 0001, Enhong Chen. Hierarchical Multi-Modal Attention Network for Time-Sync Comment Video Recommendation |
| 2706 | -- | 2718 | Zhiqi Pang, Chunyu Wang, Lingling Zhao, Yang Liu 0006, Gaurav Sharma 0001. Cross-Modality Hierarchical Clustering and Refinement for Unsupervised Visible-Infrared Person Re-Identification |
| 2719 | -- | 2733 | Yanshan Xiao, Jianwei Zhang, Bo Liu 0002, Liang Zhao, Xiangjun Kong, Zhifeng Hao. Multi-View Maximum Margin Clustering With Privileged Information Learning |
| 2734 | -- | 2748 | Zhicheng Sheng, Liqiang Nie, Min Zhang 0005, Xiaojun Chang, Yan Yan 0002. Stochastic Latent Talking Face Generation Toward Emotional Expressions and Head Poses |
| 2749 | -- | 2762 | Ying Fu 0001, Yang Hong, Yunhao Zou, Qiankun Liu, Yiming Zhang, Ning Liu, Chenggang Yan 0001. Raw Image Based Over-Exposure Correction Using Channel-Guidance Strategy |
| 2763 | -- | 2774 | Bin Ma 0003, Zhongquan Tao, Ruihe Ma, Chunpeng Wang, Jian Li 0034, Xiaolong Li 0001. A High-Performance Robust Reversible Data Hiding Algorithm Based on Polar Harmonic Fourier Moments |
| 2775 | -- | 2788 | Yunzuo Zhang, Yameng Liu, Weili Kang, Ran Tao 0003. VSS-Net: Visual Semantic Self-Mining Network for Video Summarization |
| 2789 | -- | 2802 | Yi Xiao, Qiangqiang Yuan, Kui Jiang, Xianyu Jin, Jiang He, Liangpei Zhang 0001, Chia-Wen Lin. Local-Global Temporal Difference Learning for Satellite Video Super-Resolution |
| 2803 | -- | 2813 | Xiaolong Liu, Yang Yu, Xiaolong Li 0001, Yao Zhao 0001. MCL: Multimodal Contrastive Learning for Deepfake Detection |
| 2814 | -- | 2831 | Suhang Gu, Fu-Lai Chung, Shitong Wang 0001. A Novel Style Takagi-Sugeno-Kang Fuzzy Classifier With Its Fast Training on Style Data |
| 2832 | -- | 2844 | Xiaogang Song, Fuqiang Guo, Lei Zhang 0081, Xiaofeng Lu, Xinhong Hei 0001. Salient Object Detection With Dual-Branch Stepwise Feature Fusion and Edge Refinement |
| 2845 | -- | 2860 | Shengju Yu, Siwei Wang 0001, Yi Wen 0001, Ziming Wang, Zhigang Luo, En Zhu, Xinwang Liu 0002. How to Construct Corresponding Anchors for Incomplete Multiview Clustering |
| 2861 | -- | 2875 | Zhiquan He, Wu Ran, Shulin Liu, Kehua Li, Jiawen Lu, Chang-Yong Xie, Yong Liu, Hong Lu 0001. Low-Light Image Enhancement With Multi-Scale Attention and Frequency-Domain Optimization |
| 2876 | -- | 2890 | Mingxiu Li, Wei Yu 0002, Qinglin Liu, Zonglin Li, Ru Li 0002, Bineng Zhong, Shengping Zhang. Hybrid Transformers With Attention-Guided Spatial Embeddings for Makeup Transfer and Removal |
| 2891 | -- | 2907 | Shaohui Li, Wenrui Dai, Yimian Fang, Ziyang Zheng, Wen Fei, Hongkai Xiong, Wei Zhang 0001. Revisiting Learned Image Compression With Statistical Measurement of Latent Representations |
| 2908 | -- | 2921 | Mu-Jung Chen, Yi-Hsin Chen, Wen-Hsiao Peng. B-CANF: Adaptive B-Frame Coding With Conditional Augmented Normalizing Flows |
| 2922 | -- | 2933 | Dongmei Xue, Haichuan Ma, Li Li 0040, Dong Liu 0002, Zhiwei Xiong, Houqiang Li. DBVC: An End-to-End 3-D Deep Biomedical Video Coding Framework |
| 2934 | -- | 2948 | Dongyi Zhang, Feng Li 0037, Man Liu, Runmin Cong, Huihui Bai 0001, Meng Wang 0001, Yao Zhao 0001. Exploring Resolution Fields for Scalable Image Compression With Uncertainty Guidance |
| 2949 | -- | 2958 | Jing Zhang, Yonghong Hou, Zhaoqing Pan, Bo Peng 0007, Nam Ling, Jianjun Lei. SWGNet: Step-Wise Reference Frame Generation Network for Multiview Video Coding |
| 2959 | -- | 2972 | Shenshen Li, Xing Xu 0001, Xun Jiang, Fumin Shen, Xin Liu 0011, Heng Tao Shen. Multi-Grained Attention Network With Mutual Exclusion for Composed Query-Based Image Retrieval |
| 2973 | -- | 2988 | Kun Zhang 0040, Bo Hu, Huatian Zhang, Zhe Li, Zhendong Mao. Enhanced Semantic Similarity Learning Framework for Image-Text Matching |
| 2989 | -- | 3002 | Zhikai Hu, Yiu-ming Cheung, Mengke Li, Weichao Lan, Donglin Zhang, Qiang Liu 0018. Joint Semantic Preserving Sparse Hashing for Cross-Modal Retrieval |
| 3003 | -- | 3016 | Meng Liu 0006, Di Zhou, Jie Guo, Xin Luo 0006, Zan Gao, Liqiang Nie. Semantic-Aware Contrastive Learning With Proposal Suppression for Video Semantic Role Grounding |
| 3017 | -- | 3029 | Qing Li 0018, Changqing Zhang, Qinghua Hu, Pengfei Zhu 0001, Huazhu Fu, Lei Chen 0011. Stabilizing Multispectral Pedestrian Detection With Evidential Hybrid Fusion |
| 3030 | -- | 3042 | Hang Shao, Lei Luo 0001, Jianjun Qian, Shuo Chen, Chuanfei Hu, Jian Yang 0003. TranPhys: Spatiotemporal Masked Transformer Steered Remote Photoplethysmography Estimation |
| 3043 | -- | 3048 | Yizhuo Song, Pengyang Zhao, Siqi Wang, Qingmin Liao, Wenming Yang. Study of 3D Finger Vein Biometrics on Imaging Device Design and Multi-View Verification |
| 3049 | -- | 3055 | Xiang Gao 0009, Hainan Cui, Wantao Huang, Menghan Li, Shuhan Shen. IRAv3+: Hierarchical Incremental Rotation Averaging via Multiple Connected Dominating Sets |
| 3056 | -- | 3060 | Tianshu Chu, Zuopeng Yang, Xiaolin Huang. Improving the Post-Training Neural Network Quantization by Prepositive Feature Quantization |