| 3063 | -- | 3066 | Dong Liu 0002, Shan Liu 0001, João Ascenso, Dong Tian, Lu Yu 0003. Guest Editorial Special Section on Recent Standardization Efforts for Learning-Based Visual Data Coding |
| 3067 | -- | 3081 | Zhaobin Zhang, Semih Esenlik, Yaojun Wu, Meng Wang 0017, Kai Zhang 0007, Li Zhang 0006. End-to-End Learning-Based Image Compression With a Decoupled Framework |
| 3082 | -- | 3095 | Junqi Shi, Ming Lu, Zhan Ma. Rate-Distortion Optimized Post-Training Quantization for Learned Image Compression |
| 3096 | -- | 3110 | Chuanmin Jia, Feng Ye, Fanke Dong, Kai Lin, Leonardo Chiariglione, Siwei Ma, Huifang Sun, Wen Gao 0001. MPAI-EEV: Standardization Efforts of Artificial Intelligence Based End-to-End Video Coding |
| 3111 | -- | 3124 | Jianghao Jia, Yuantong Zhang, Han Zhu, Zhenzhong Chen, Zizheng Liu, Xiaozhong Xu, Shan Liu 0001. Deep Reference Frame Generation Method for VVC Inter Prediction Enhancement |
| 3125 | -- | 3137 | Dongmei Xue, Li Li 0040, Dong Liu 0002, Houqiang Li. Lightweight Context Model Equipped aiWave in Response to the AVS Call for Evidence on Volumetric Medical Image Coding |
| 3138 | -- | 3155 | SooWoong Kim, Jihoon Do, Jungwon Kang, Hui-Yong Kim. Rate-Rendering Distortion Optimized Preprocessing for Texture Map Compression of 3D Reconstructed Scenes |
| 3156 | -- | 3167 | Yeongwoong Kim, Hyewon Jeong, Janghyun Yu, Younhee Kim, Jooyoung Lee 0004, Seyoon Jeong, Hui-Yong Kim. End-to-End Learnable Multi-Scale Feature Compression for VCM |
| 3168 | -- | 3179 | Yunjian Feng, Kunyang Zhou, Jun Li 0011, MengChu Zhou. Incremental Learning-Based Lane Detection for Automated Rubber-Tired Gantries in a Container Terminal |
| 3180 | -- | 3191 | Yongli Hu, Lincong Feng, Huajie Jiang, Mengting Liu, Baocai Yin. Domain-Aware Prototype Network for Generalized Zero-Shot Learning |
| 3192 | -- | 3203 | Xiaoqin Zhang 0002, Min Li, Sheng Lin, Hang Xu, Guobao Xiao. Transformer-Based Multimodal Emotional Perception for Dynamic Facial Expression Recognition in the Wild |
| 3204 | -- | 3219 | Jiaqi Wang, Huafeng Liu 0001, Liping Jing. Transparent Embedding Space for Interpretable Image Recognition |
| 3220 | -- | 3231 | Zhen Mei, Peng Ye, Baopu Li, Tao Chen 0003, Jiayuan Fan, Wanli Ouyang. DeNKD: Decoupled Non-Target Knowledge Distillation for Complementing Transformer-Based Unsupervised Domain Adaptation |
| 3232 | -- | 3244 | Ziheng Yan, Yuankai Qi, Guorong Li, Xinyan Liu, Weigang Zhang, Ming-Hsuan Yang 0001, Qingming Huang. Progressive Multi-Resolution Loss for Crowd Counting |
| 3245 | -- | 3259 | Dongliang Zhou, Haijun Zhang 0002, Jianghong Ma, Jianyang Shi. BC-GAN: A Generative Adversarial Network for Synthesizing a Batch of Collocated Clothing |
| 3260 | -- | 3270 | Lu Zhou, Yingying Chen 0003, Jinqiao Wang. Dual-Path Transformer for 3D Human Pose Estimation |
| 3271 | -- | 3285 | Danyang Tu, Wei Shen 0002, Wei Sun 0029, Xiongkuo Min, Guangtao Zhai, Changwen Chen. Un-Gaze: A Unified Transformer for Joint Gaze-Location and Gaze-Object Detection |
| 3286 | -- | 3298 | Guanghui Yue 0001, Houlu Xiao, Hai Xie, Tianwei Zhou, Wei Zhou 0021, Weiqing Yan, Baoquan Zhao, Tianfu Wang 0001, Qiuping Jiang. Dual-Constraint Coarse-to-Fine Network for Camouflaged Object Detection |
| 3299 | -- | 3312 | Yuting Mou, Xinghao Jiang, Ke Xu 0003, Tanfeng Sun, Zepeng Wang 0002. Compressed Video Action Recognition With Dual-Stream and Dual-Modal Transformer |
| 3313 | -- | 3326 | Yutong Liu, Zhen Cheng, Zeyu Xiao, Zhiwei Xiong. Light Field Super-Resolution Using Decoupled Selective Matching |
| 3327 | -- | 3339 | Congqi Cao, Yizhe Wang, Yueran Zhang, Yue Lu, Xin Zhang, Yanning Zhang. Co-Occurrence Matters: Learning Action Relation for Temporal Action Localization |
| 3340 | -- | 3352 | Yadang Chen, Dingwei Zhang, Yuhui Zheng, Zhi-Xin Yang 0001, Enhua Wu, Haixing Zhao. Boosting Video Object Segmentation via Robust and Efficient Memory Network |
| 3353 | -- | 3367 | Zheng'ao Wang, Zikun Zhou, Fanglin Chen 0001, Jun Xu, Wenjie Pei, Guangming Lu. Robust Tracking via Fully Exploring Background Prior Knowledge |
| 3368 | -- | 3382 | Dongpan Chen, Dehui Kong, Jinghua Li, Lichun Wang 0002, Junna Gao, Baocai Yin. OASNet: Object Affordance State Recognition Network With Joint Visual Features and Relational Semantic Embeddings |
| 3383 | -- | 3394 | Xianlun Tang, Qiao Yang, Xi Zhang, Wuquan Deng, Huiming Wang 0002, Xinbo Gao 0001. A Refinement Method for Single-Stage Object Detection Based on Progressive Decoupled Task Alignment |
| 3395 | -- | 3408 | Zeyu Ma, Ziqiang Zheng, Jiwei Wei, Yang Yang 0002, Heng Tao Shen. Instance-Dictionary Learning for Open-World Object Detection in Autonomous Driving Scenarios |
| 3409 | -- | 3423 | Peirong Ma, Zhiquan He, Wu Ran, Hong Lu 0001. A Transferable Generative Framework for Multi-Label Zero-Shot Learning |
| 3424 | -- | 3438 | Xiaoqiang Shi, Zhenyu Yin, Guangjie Han, Wenzhuo Liu, Li Qin, Yuanguo Bi, Shurui Li. BSSNet: A Real-Time Semantic Segmentation Network for Road Scenes Inspired From AutoEncoder |
| 3439 | -- | 3450 | Daoheng Li, Xiushan Nie, Rui Gong, Ximing Lin, Hui Yu 0001. Multi-Branch GAN-Based Abnormal Events Detection via Context Learning in Surveillance Videos |
| 3451 | -- | 3464 | Guoqiang Liang, Zhaojie Chen, Zhaoqiang Chen, Shiyu Ji, Yanning Zhang. New Insights on Relieving Task-Recency Bias for Online Class Incremental Learning |
| 3465 | -- | 3480 | Liuchi Xu, Jin Ren, Zhenhua Huang 0001, Wei-Shi Zheng 0001, Yunwen Chen. Improving Knowledge Distillation via Head and Tail Categories |
| 3481 | -- | 3495 | Xin Ding, Zheng Wang, Jing Fang, Zhenyu Shu, Ruimin Hu, Chia-Wen Lin. Watch You Under Low-Resolution and Low-Illumination: Face Enhancement via Bi-Factor Degradation Decoupling |
| 3496 | -- | 3509 | Xiaowei Zhao, Yuqing Ma, Duorui Wang, Yifan Shen, Yixuan Qiao, Xianglong Liu 0001. Revisiting Open World Object Detection |
| 3510 | -- | 3522 | Hao Sheng 0001, Shuai Wang 0027, Haobo Chen, Da Yang, Yang Huang, Jiahao Shen, Wei Ke 0001. Discriminative Feature Learning With Co-Occurrence Attention Network for Vehicle ReID |
| 3523 | -- | 3537 | Bo Liu 0002, Peng Sun, Yanshan Xiao, Shilei Zhao, Xiaokai Li, Tiantian Peng, Zhiyu Zheng, Yongsheng Huang. Dictionary-Based Multi-View Learning With Privileged Information |
| 3538 | -- | 3550 | Lihong Qiao, Shixin Wu, Bin Xiao 0002, Yucheng Shu, Xiao Luan, Sicheng Lu, Weisheng Li 0001, Xinbo Gao 0001. Boosting Robust Multi-Focus Image Fusion With Frequency Mask and Hyperdimensional Computing |
| 3551 | -- | 3562 | Zhao Pei, Jiaqing Zhang, Wenwen Zhang, Miao Wang, Jianing Wang, Yee-Hong Yang. Autofocusing for Synthetic Aperture Imaging Based on Pedestrian Trajectory Prediction |
| 3563 | -- | 3575 | Lanxiao Wang, Heqian Qiu, Benliu Qiu, Fanman Meng, Qingbo Wu 0001, Hongliang Li. TridentCap: Image-Fact-Style Trident Semantic Framework for Stylized Image Captioning |
| 3576 | -- | 3588 | Siqi Lu, Fengxu Guan, Hanyu Zhang, Haitao Lai. Speed-Up DDPM for Real-Time Underwater Image Enhancement |
| 3589 | -- | 3605 | Haiming Yao, Wenyong Yu, Wei Luo, Zhenfeng Qiang, Donghao Luo, Xiaotian Zhang. Learning Global-Local Correspondence With Semantic Bottleneck for Logical Anomaly Detection |
| 3606 | -- | 3618 | Zhaokang Liao, Wengang Zhou, Houqiang Li. DaFIR: Distortion-Aware Representation Learning for Fisheye Image Rectification |
| 3619 | -- | 3632 | Dengdi Sun, Leilei Cheng, Song Chen, Chenglong Li 0002, Yun Xiao, Bin Luo 0001. UAV-Ground Visual Tracking: A Unified Dataset and Collaborative Learning Approach |
| 3633 | -- | 3646 | Sheng Liu, Jinsong Leng, Xi-Le Zhao, Haijin Zeng, Yao Wang 0003, Jing-Hua Yang. Learnable Spatial-Spectral Transform-Based Tensor Nuclear Norm for Multi-Dimensional Visual Data Recovery |
| 3647 | -- | 3662 | Yuanfei Huang, Jie Li 0001, Yanting Hu, Hua Huang 0001, Xinbo Gao 0001. Deep Convolution Modulation for Image Super-Resolution |
| 3663 | -- | 3673 | Jin Liu 0018, Guoxiang Wang, Jialong Xie, Fengyu Zhou, Huijuan Xu 0001. Video Question Answering With Semantic Disentanglement and Reasoning |
| 3674 | -- | 3686 | Jie Geng, Shuai Song, Wen Jiang 0002. Dual-Path Feature Aware Network for Remote Sensing Image Semantic Segmentation |
| 3687 | -- | 3699 | Zheng Xie, Rui Guo, Chencheng Zhang, Xiaohua Qian. A Clinically Guided Graph Convolutional Network for Assessment of Parkinsonian Pronation-Supination Movements of Hands |
| 3700 | -- | 3713 | Wei Wu 0019, Hao Chang, Zhu Li 0001. See SIFT in a Rain |
| 3714 | -- | 3727 | Huan Chen, Wangcai Zhao, Tingfa Xu, Guokai Shi, Shiyun Zhou, Peifu Liu, Jianan Li. Spectral-Wise Implicit Neural Representation for Hyperspectral Image Reconstruction |
| 3728 | -- | 3741 | Jie Wen 0001, Gehui Xu, Zhanyan Tang, Wei Wang 0169, Lunke Fei, Yong Xu 0001. Graph Regularized and Feature Aware Matrix Factorization for Robust Incomplete Multi-View Clustering |
| 3742 | -- | 3754 | Zhaoxin Liu, Jinjian Wu, Guangming Shi, Wen Yang, Weisheng Dong, Qinghang Zhao. Motion-Oriented Hybrid Spiking Neural Networks for Event-Based Motion Deblurring |
| 3755 | -- | 3767 | Kaihao Zhang, Tao Wang 0052, Wenhan Luo, Wenqi Ren, Björn Stenger, Wei Liu 0005, Hongdong Li, Ming-Hsuan Yang 0001. MC-Blur: A Comprehensive Benchmark for Image Deblurring |
| 3768 | -- | 3781 | Zhenghao Wang, Jing Lian, Linhui Li, Jian Zhao 0029. A Novel Framework for Scene Graph Generation via Prior Knowledge |
| 3782 | -- | 3794 | Yonghui Wang, Wengang Zhou, Yunyao Mao, Houqiang Li. Detect Any Shadow: Segment Anything for Video Shadow Detection |
| 3795 | -- | 3805 | JunBin Yuan, Aiqing Zhu, Qingzhen Xu, Kanoksak Wattanachote, Yongyi Gong. CTIF-Net: A CNN-Transformer Iterative Fusion Network for Salient Object Detection |
| 3806 | -- | 3818 | QiHao Zhao, Fan Zhang 0007, Wei Hu, Songhe Feng, Jun Liu 0036. OHD: An Online Category-Aware Framework for Learning With Noisy Labels Under Long-Tailed Distribution |
| 3819 | -- | 3833 | Mengru Ma, Wenping Ma 0001, Licheng Jiao, Xu Liu 0006, Fang Liu 0001, Lingling Li 0002, Shuyuan Yang. MBSI-Net: Multimodal Balanced Self-Learning Interaction Network for Image Classification |
| 3834 | -- | 3845 | Guanghui Yue 0001, Jie Gao, Runmin Cong, Tianwei Zhou, Leida Li, Tianfu Wang 0001. Deep Pyramid Network for Low-Light Endoscopic Image Enhancement |
| 3846 | -- | 3859 | Zhong Wang 0009, Lin Zhang 0014, Shengjie Zhao, Yicong Zhou. Global Localization in Large-Scale Point Clouds via Roll-Pitch-Yaw Invariant Place Recognition and Low-Overlap Global Registration |
| 3860 | -- | 3875 | Bolei Chen, Jiaxu Kang, Ping Zhong 0002, Yongzheng Cui, Siyi Lu, Yixiong Liang, Jianxin Wang 0001. Think Holistically, Act Down-to-Earth: A Semantic Navigation Strategy With Continuous Environmental Representation and Multi-Step Forward Planning |
| 3876 | -- | 3890 | Jianqi Chen, Yilan Zhang, Zhengxia Zou, Keyan Chen, Zhenwei Shi. Dense Pixel-to-Pixel Harmonization via Continuous Image Representation |
| 3891 | -- | 3904 | Jiayin Sun, Hong Wang, Qiulei Dong. Hierarchical Attention Network for Open-Set Fine-Grained Image Recognition |
| 3905 | -- | 3918 | Chunjie Ma, Lina Du, Li Zhuo 0001, Jiafeng Li. MPLA-Net: Multiple Pseudo Label Aggregation Network for Weakly Supervised Video Salient Object Detection |
| 3919 | -- | 3929 | Jinsong Zhang, Lingfeng Gu, Yu-Kun Lai, Xueyang Wang, Kun Li 0001. Toward Grouping in Large Scenes With Occlusion-Aware Spatio-Temporal Transformers |
| 3930 | -- | 3942 | Wujie Zhou, Jiankang Hong, Weiqing Yan, Qiuping Jiang. Modal Evaluation Network via Knowledge Distillation for No-Service Rail Surface Defect Detection |
| 3943 | -- | 3956 | Hongping Gan, Xiaoyang Wang, Lijun He, Jie Liu. Learned Two-Step Iterative Shrinkage Thresholding Algorithm for Deep Compressive Sensing |
| 3957 | -- | 3970 | Qi Zhu 0010, Naishan Zheng, Jie Huang 0017, Man Zhou, Jinghao Zhang, Feng Zhao 0004. Learning Spatio-Temporal Sharpness Map for Video Deblurring |
| 3971 | -- | 3982 | Qinglong Cao, Yuntian Chen, Chao Ma 0004, Xiaokang Yang. Break the Bias: Delving Semantic Transform Invariance for Few-Shot Segmentation |
| 3983 | -- | 3997 | Biao Wang, Wenling Li, Bin Zhang 0023, Yang Liu 0096, Junping Du. Correlation Filters for UAV Online Tracking Based on Complementary Appearance Model and Reversibility Reasoning |
| 3998 | -- | 4010 | Ying Yang, Tao Xiang 0001, Xiao Lv, Shangwei Guo, Tieyong Zeng. The Illusion of Visual Security: Reconstructing Perceptually Encrypted Images |
| 4011 | -- | 4026 | Qingsen Yan, Tao Hu, Yuan Sun, Hao Tang 0005, Yu Zhu 0004, Wei Dong, Luc Van Gool, Yanning Zhang. Toward High-Quality HDR Deghosting With Conditional Diffusion Models |
| 4027 | -- | 4039 | Jiancong Chen, Meng Wang 0017, Pingping Zhang, Shurun Wang, Shiqi Wang 0001. Sparse-to-Dense: High Efficiency Rate Control for End-to-End Scale-Adaptive Video Coding |
| 4040 | -- | 4053 | Cunhui Dong, Haichuan Ma, Zhuoyuan Li, Li Li 0040, Dong Liu 0002. Temporal Wavelet Transform-Based Low-Complexity Perceptual Quality Enhancement of Compressed Video |
| 4054 | -- | 4069 | Zheng Fang, MingKui Zheng, Pingping Chen, Zhifeng Chen, Dapeng Oliver Wu. Camera Pose-Based Background Modeling for Video Coding in Moving Cameras |
| 4070 | -- | 4083 | Birendra Kathariya, Zhu Li 0001, Geert Van Der Auwera. Joint Pixel and Frequency Feature Learning and Fusion via Channel-Wise Transformer for High-Efficiency Learned In-Loop Filter in VVC |
| 4084 | -- | 4094 | Binzhe Li, Bolin Chen, Zhao Wang 0004, Baoliang Chen, Shiqi Wang 0001, Yan Ye. Quality Harmonization for Virtual Composition in Online Video Communications |
| 4095 | -- | 4108 | Hu Cao, Lei Huang 0010, Jie Nie, Zhiqiang Wei 0002. Unsupervised Deep Hashing With Fine-Grained Similarity-Preserving Contrastive Learning for Image Retrieval |
| 4109 | -- | 4119 | Zailong Chen, Lei Wang 0001, Peng Wang 0023, Peng Gao. Question-Aware Global-Local Video Understanding Network for Audio-Visual Question Answering |
| 4120 | -- | 4134 | Yang Yang 0045, Peiling Wen, Wenbo Ye, Beichen Li, Yue Lang. Blind Universal Denoising for Radar Micro-Doppler Spectrograms Using Identical Dual Learning and Reciprocal Adversarial Training |
| 4135 | -- | 4140 | Zhen Yang, Yuanfang Guo, Junfu Wang, Di Huang 0001, Xiuguo Bao, Yunhong Wang. Towards Video Anomaly Detection in the Real World: A Binarization Embedded Weakly-Supervised Network |