Journal: IEEE Trans. Circuits Syst. Video Techn.

Volume 34, Issue 5

3063 -- 3066Dong Liu 0002, Shan Liu 0001, João Ascenso, Dong Tian, Lu Yu 0003. Guest Editorial Special Section on Recent Standardization Efforts for Learning-Based Visual Data Coding
3067 -- 3081Zhaobin Zhang, Semih Esenlik, Yaojun Wu, Meng Wang 0017, Kai Zhang 0007, Li Zhang 0006. End-to-End Learning-Based Image Compression With a Decoupled Framework
3082 -- 3095Junqi Shi, Ming Lu, Zhan Ma. Rate-Distortion Optimized Post-Training Quantization for Learned Image Compression
3096 -- 3110Chuanmin Jia, Feng Ye, Fanke Dong, Kai Lin, Leonardo Chiariglione, Siwei Ma, Huifang Sun, Wen Gao 0001. MPAI-EEV: Standardization Efforts of Artificial Intelligence Based End-to-End Video Coding
3111 -- 3124Jianghao Jia, Yuantong Zhang, Han Zhu, Zhenzhong Chen, Zizheng Liu, Xiaozhong Xu, Shan Liu 0001. Deep Reference Frame Generation Method for VVC Inter Prediction Enhancement
3125 -- 3137Dongmei Xue, Li Li 0040, Dong Liu 0002, Houqiang Li. Lightweight Context Model Equipped aiWave in Response to the AVS Call for Evidence on Volumetric Medical Image Coding
3138 -- 3155SooWoong Kim, Jihoon Do, Jungwon Kang, Hui-Yong Kim. Rate-Rendering Distortion Optimized Preprocessing for Texture Map Compression of 3D Reconstructed Scenes
3156 -- 3167Yeongwoong Kim, Hyewon Jeong, Janghyun Yu, Younhee Kim, Jooyoung Lee 0004, Seyoon Jeong, Hui-Yong Kim. End-to-End Learnable Multi-Scale Feature Compression for VCM
3168 -- 3179Yunjian Feng, Kunyang Zhou, Jun Li 0011, MengChu Zhou. Incremental Learning-Based Lane Detection for Automated Rubber-Tired Gantries in a Container Terminal
3180 -- 3191Yongli Hu, Lincong Feng, Huajie Jiang, Mengting Liu, Baocai Yin. Domain-Aware Prototype Network for Generalized Zero-Shot Learning
3192 -- 3203Xiaoqin Zhang 0002, Min Li, Sheng Lin, Hang Xu, Guobao Xiao. Transformer-Based Multimodal Emotional Perception for Dynamic Facial Expression Recognition in the Wild
3204 -- 3219Jiaqi Wang, Huafeng Liu 0001, Liping Jing. Transparent Embedding Space for Interpretable Image Recognition
3220 -- 3231Zhen Mei, Peng Ye, Baopu Li, Tao Chen 0003, Jiayuan Fan, Wanli Ouyang. DeNKD: Decoupled Non-Target Knowledge Distillation for Complementing Transformer-Based Unsupervised Domain Adaptation
3232 -- 3244Ziheng Yan, Yuankai Qi, Guorong Li, Xinyan Liu, Weigang Zhang, Ming-Hsuan Yang 0001, Qingming Huang. Progressive Multi-Resolution Loss for Crowd Counting
3245 -- 3259Dongliang Zhou, Haijun Zhang 0002, Jianghong Ma, Jianyang Shi. BC-GAN: A Generative Adversarial Network for Synthesizing a Batch of Collocated Clothing
3260 -- 3270Lu Zhou, Yingying Chen 0003, Jinqiao Wang. Dual-Path Transformer for 3D Human Pose Estimation
3271 -- 3285Danyang Tu, Wei Shen 0002, Wei Sun 0029, Xiongkuo Min, Guangtao Zhai, Changwen Chen. Un-Gaze: A Unified Transformer for Joint Gaze-Location and Gaze-Object Detection
3286 -- 3298Guanghui Yue 0001, Houlu Xiao, Hai Xie, Tianwei Zhou, Wei Zhou 0021, Weiqing Yan, Baoquan Zhao, Tianfu Wang 0001, Qiuping Jiang. Dual-Constraint Coarse-to-Fine Network for Camouflaged Object Detection
3299 -- 3312Yuting Mou, Xinghao Jiang, Ke Xu 0003, Tanfeng Sun, Zepeng Wang 0002. Compressed Video Action Recognition With Dual-Stream and Dual-Modal Transformer
3313 -- 3326Yutong Liu, Zhen Cheng, Zeyu Xiao, Zhiwei Xiong. Light Field Super-Resolution Using Decoupled Selective Matching
3327 -- 3339Congqi Cao, Yizhe Wang, Yueran Zhang, Yue Lu, Xin Zhang, Yanning Zhang. Co-Occurrence Matters: Learning Action Relation for Temporal Action Localization
3340 -- 3352Yadang Chen, Dingwei Zhang, Yuhui Zheng, Zhi-Xin Yang 0001, Enhua Wu, Haixing Zhao. Boosting Video Object Segmentation via Robust and Efficient Memory Network
3353 -- 3367Zheng'ao Wang, Zikun Zhou, Fanglin Chen 0001, Jun Xu, Wenjie Pei, Guangming Lu. Robust Tracking via Fully Exploring Background Prior Knowledge
3368 -- 3382Dongpan Chen, Dehui Kong, Jinghua Li, Lichun Wang 0002, Junna Gao, Baocai Yin. OASNet: Object Affordance State Recognition Network With Joint Visual Features and Relational Semantic Embeddings
3383 -- 3394Xianlun Tang, Qiao Yang, Xi Zhang, Wuquan Deng, Huiming Wang 0002, Xinbo Gao 0001. A Refinement Method for Single-Stage Object Detection Based on Progressive Decoupled Task Alignment
3395 -- 3408Zeyu Ma, Ziqiang Zheng, Jiwei Wei, Yang Yang 0002, Heng Tao Shen. Instance-Dictionary Learning for Open-World Object Detection in Autonomous Driving Scenarios
3409 -- 3423Peirong Ma, Zhiquan He, Wu Ran, Hong Lu 0001. A Transferable Generative Framework for Multi-Label Zero-Shot Learning
3424 -- 3438Xiaoqiang Shi, Zhenyu Yin, Guangjie Han, Wenzhuo Liu, Li Qin, Yuanguo Bi, Shurui Li. BSSNet: A Real-Time Semantic Segmentation Network for Road Scenes Inspired From AutoEncoder
3439 -- 3450Daoheng Li, Xiushan Nie, Rui Gong, Ximing Lin, Hui Yu 0001. Multi-Branch GAN-Based Abnormal Events Detection via Context Learning in Surveillance Videos
3451 -- 3464Guoqiang Liang, Zhaojie Chen, Zhaoqiang Chen, Shiyu Ji, Yanning Zhang. New Insights on Relieving Task-Recency Bias for Online Class Incremental Learning
3465 -- 3480Liuchi Xu, Jin Ren, Zhenhua Huang 0001, Wei-Shi Zheng 0001, Yunwen Chen. Improving Knowledge Distillation via Head and Tail Categories
3481 -- 3495Xin Ding, Zheng Wang, Jing Fang, Zhenyu Shu, Ruimin Hu, Chia-Wen Lin. Watch You Under Low-Resolution and Low-Illumination: Face Enhancement via Bi-Factor Degradation Decoupling
3496 -- 3509Xiaowei Zhao, Yuqing Ma, Duorui Wang, Yifan Shen, Yixuan Qiao, Xianglong Liu 0001. Revisiting Open World Object Detection
3510 -- 3522Hao Sheng 0001, Shuai Wang 0027, Haobo Chen, Da Yang, Yang Huang, Jiahao Shen, Wei Ke 0001. Discriminative Feature Learning With Co-Occurrence Attention Network for Vehicle ReID
3523 -- 3537Bo Liu 0002, Peng Sun, Yanshan Xiao, Shilei Zhao, Xiaokai Li, Tiantian Peng, Zhiyu Zheng, Yongsheng Huang. Dictionary-Based Multi-View Learning With Privileged Information
3538 -- 3550Lihong Qiao, Shixin Wu, Bin Xiao 0002, Yucheng Shu, Xiao Luan, Sicheng Lu, Weisheng Li 0001, Xinbo Gao 0001. Boosting Robust Multi-Focus Image Fusion With Frequency Mask and Hyperdimensional Computing
3551 -- 3562Zhao Pei, Jiaqing Zhang, Wenwen Zhang, Miao Wang, Jianing Wang, Yee-Hong Yang. Autofocusing for Synthetic Aperture Imaging Based on Pedestrian Trajectory Prediction
3563 -- 3575Lanxiao Wang, Heqian Qiu, Benliu Qiu, Fanman Meng, Qingbo Wu 0001, Hongliang Li. TridentCap: Image-Fact-Style Trident Semantic Framework for Stylized Image Captioning
3576 -- 3588Siqi Lu, Fengxu Guan, Hanyu Zhang, Haitao Lai. Speed-Up DDPM for Real-Time Underwater Image Enhancement
3589 -- 3605Haiming Yao, Wenyong Yu, Wei Luo, Zhenfeng Qiang, Donghao Luo, Xiaotian Zhang. Learning Global-Local Correspondence With Semantic Bottleneck for Logical Anomaly Detection
3606 -- 3618Zhaokang Liao, Wengang Zhou, Houqiang Li. DaFIR: Distortion-Aware Representation Learning for Fisheye Image Rectification
3619 -- 3632Dengdi Sun, Leilei Cheng, Song Chen, Chenglong Li 0002, Yun Xiao, Bin Luo 0001. UAV-Ground Visual Tracking: A Unified Dataset and Collaborative Learning Approach
3633 -- 3646Sheng Liu, Jinsong Leng, Xi-Le Zhao, Haijin Zeng, Yao Wang 0003, Jing-Hua Yang. Learnable Spatial-Spectral Transform-Based Tensor Nuclear Norm for Multi-Dimensional Visual Data Recovery
3647 -- 3662Yuanfei Huang, Jie Li 0001, Yanting Hu, Hua Huang 0001, Xinbo Gao 0001. Deep Convolution Modulation for Image Super-Resolution
3663 -- 3673Jin Liu 0018, Guoxiang Wang, Jialong Xie, Fengyu Zhou, Huijuan Xu 0001. Video Question Answering With Semantic Disentanglement and Reasoning
3674 -- 3686Jie Geng, Shuai Song, Wen Jiang 0002. Dual-Path Feature Aware Network for Remote Sensing Image Semantic Segmentation
3687 -- 3699Zheng Xie, Rui Guo, Chencheng Zhang, Xiaohua Qian. A Clinically Guided Graph Convolutional Network for Assessment of Parkinsonian Pronation-Supination Movements of Hands
3700 -- 3713Wei Wu 0019, Hao Chang, Zhu Li 0001. See SIFT in a Rain
3714 -- 3727Huan Chen, Wangcai Zhao, Tingfa Xu, Guokai Shi, Shiyun Zhou, Peifu Liu, Jianan Li. Spectral-Wise Implicit Neural Representation for Hyperspectral Image Reconstruction
3728 -- 3741Jie Wen 0001, Gehui Xu, Zhanyan Tang, Wei Wang 0169, Lunke Fei, Yong Xu 0001. Graph Regularized and Feature Aware Matrix Factorization for Robust Incomplete Multi-View Clustering
3742 -- 3754Zhaoxin Liu, Jinjian Wu, Guangming Shi, Wen Yang, Weisheng Dong, Qinghang Zhao. Motion-Oriented Hybrid Spiking Neural Networks for Event-Based Motion Deblurring
3755 -- 3767Kaihao Zhang, Tao Wang 0052, Wenhan Luo, Wenqi Ren, Björn Stenger, Wei Liu 0005, Hongdong Li, Ming-Hsuan Yang 0001. MC-Blur: A Comprehensive Benchmark for Image Deblurring
3768 -- 3781Zhenghao Wang, Jing Lian, Linhui Li, Jian Zhao 0029. A Novel Framework for Scene Graph Generation via Prior Knowledge
3782 -- 3794Yonghui Wang, Wengang Zhou, Yunyao Mao, Houqiang Li. Detect Any Shadow: Segment Anything for Video Shadow Detection
3795 -- 3805JunBin Yuan, Aiqing Zhu, Qingzhen Xu, Kanoksak Wattanachote, Yongyi Gong. CTIF-Net: A CNN-Transformer Iterative Fusion Network for Salient Object Detection
3806 -- 3818QiHao Zhao, Fan Zhang 0007, Wei Hu, Songhe Feng, Jun Liu 0036. OHD: An Online Category-Aware Framework for Learning With Noisy Labels Under Long-Tailed Distribution
3819 -- 3833Mengru Ma, Wenping Ma 0001, Licheng Jiao, Xu Liu 0006, Fang Liu 0001, Lingling Li 0002, Shuyuan Yang. MBSI-Net: Multimodal Balanced Self-Learning Interaction Network for Image Classification
3834 -- 3845Guanghui Yue 0001, Jie Gao, Runmin Cong, Tianwei Zhou, Leida Li, Tianfu Wang 0001. Deep Pyramid Network for Low-Light Endoscopic Image Enhancement
3846 -- 3859Zhong Wang 0009, Lin Zhang 0014, Shengjie Zhao, Yicong Zhou. Global Localization in Large-Scale Point Clouds via Roll-Pitch-Yaw Invariant Place Recognition and Low-Overlap Global Registration
3860 -- 3875Bolei Chen, Jiaxu Kang, Ping Zhong 0002, Yongzheng Cui, Siyi Lu, Yixiong Liang, Jianxin Wang 0001. Think Holistically, Act Down-to-Earth: A Semantic Navigation Strategy With Continuous Environmental Representation and Multi-Step Forward Planning
3876 -- 3890Jianqi Chen, Yilan Zhang, Zhengxia Zou, Keyan Chen, Zhenwei Shi. Dense Pixel-to-Pixel Harmonization via Continuous Image Representation
3891 -- 3904Jiayin Sun, Hong Wang, Qiulei Dong. Hierarchical Attention Network for Open-Set Fine-Grained Image Recognition
3905 -- 3918Chunjie Ma, Lina Du, Li Zhuo 0001, Jiafeng Li. MPLA-Net: Multiple Pseudo Label Aggregation Network for Weakly Supervised Video Salient Object Detection
3919 -- 3929Jinsong Zhang, Lingfeng Gu, Yu-Kun Lai, Xueyang Wang, Kun Li 0001. Toward Grouping in Large Scenes With Occlusion-Aware Spatio-Temporal Transformers
3930 -- 3942Wujie Zhou, Jiankang Hong, Weiqing Yan, Qiuping Jiang. Modal Evaluation Network via Knowledge Distillation for No-Service Rail Surface Defect Detection
3943 -- 3956Hongping Gan, Xiaoyang Wang, Lijun He, Jie Liu. Learned Two-Step Iterative Shrinkage Thresholding Algorithm for Deep Compressive Sensing
3957 -- 3970Qi Zhu 0010, Naishan Zheng, Jie Huang 0017, Man Zhou, Jinghao Zhang, Feng Zhao 0004. Learning Spatio-Temporal Sharpness Map for Video Deblurring
3971 -- 3982Qinglong Cao, Yuntian Chen, Chao Ma 0004, Xiaokang Yang. Break the Bias: Delving Semantic Transform Invariance for Few-Shot Segmentation
3983 -- 3997Biao Wang, Wenling Li, Bin Zhang 0023, Yang Liu 0096, Junping Du. Correlation Filters for UAV Online Tracking Based on Complementary Appearance Model and Reversibility Reasoning
3998 -- 4010Ying Yang, Tao Xiang 0001, Xiao Lv, Shangwei Guo, Tieyong Zeng. The Illusion of Visual Security: Reconstructing Perceptually Encrypted Images
4011 -- 4026Qingsen Yan, Tao Hu, Yuan Sun, Hao Tang 0005, Yu Zhu 0004, Wei Dong, Luc Van Gool, Yanning Zhang. Toward High-Quality HDR Deghosting With Conditional Diffusion Models
4027 -- 4039Jiancong Chen, Meng Wang 0017, Pingping Zhang, Shurun Wang, Shiqi Wang 0001. Sparse-to-Dense: High Efficiency Rate Control for End-to-End Scale-Adaptive Video Coding
4040 -- 4053Cunhui Dong, Haichuan Ma, Zhuoyuan Li, Li Li 0040, Dong Liu 0002. Temporal Wavelet Transform-Based Low-Complexity Perceptual Quality Enhancement of Compressed Video
4054 -- 4069Zheng Fang, MingKui Zheng, Pingping Chen, Zhifeng Chen, Dapeng Oliver Wu. Camera Pose-Based Background Modeling for Video Coding in Moving Cameras
4070 -- 4083Birendra Kathariya, Zhu Li 0001, Geert Van Der Auwera. Joint Pixel and Frequency Feature Learning and Fusion via Channel-Wise Transformer for High-Efficiency Learned In-Loop Filter in VVC
4084 -- 4094Binzhe Li, Bolin Chen, Zhao Wang 0004, Baoliang Chen, Shiqi Wang 0001, Yan Ye. Quality Harmonization for Virtual Composition in Online Video Communications
4095 -- 4108Hu Cao, Lei Huang 0010, Jie Nie, Zhiqiang Wei 0002. Unsupervised Deep Hashing With Fine-Grained Similarity-Preserving Contrastive Learning for Image Retrieval
4109 -- 4119Zailong Chen, Lei Wang 0001, Peng Wang 0023, Peng Gao. Question-Aware Global-Local Video Understanding Network for Audio-Visual Question Answering
4120 -- 4134Yang Yang 0045, Peiling Wen, Wenbo Ye, Beichen Li, Yue Lang. Blind Universal Denoising for Radar Micro-Doppler Spectrograms Using Identical Dual Learning and Reciprocal Adversarial Training
4135 -- 4140Zhen Yang, Yuanfang Guo, Junfu Wang, Di Huang 0001, Xiuguo Bao, Yunhong Wang. Towards Video Anomaly Detection in the Real World: A Binarization Embedded Weakly-Supervised Network