Journal: IEEE Trans. Circuits Syst. Video Techn.

Volume 34, Issue 11

10524 -- 10537Ancong Wu, Chengzhi Lin, Wei-Shi Zheng 0001. Asymmetric Mutual Learning for Unsupervised Transferable Visible-Infrared Re-Identification
10538 -- 10550Kan Guo, Daxin Tian, Yongli Hu, Chunmian Lin, Yanfeng Sun, Jianshan Zhou, Xuting Duan, Junbin Gao, Baocai Yin. CFMMC-Align: Coarse-Fine Multi-Modal Contrastive Alignment Network for Traffic Event Video Question Answering
10551 -- 10563Jiawen Zhu, Xin Chen 0032, Pengyu Zhang, Xinying Wang 0005, Dong Wang 0004, Wenda Zhao, Huchuan Lu. SRRT: Exploring Search Region Regulation for Visual Object Tracking
10564 -- 10577Wei Yao, Hongwen Zhang 0001, Yunlian Sun, Jinhui Tang 0001. STAF: 3D Human Mesh Recovery From Video With Spatio-Temporal Alignment Fusion
10578 -- 10589Jinhua Hu, Yonghong Hou, Zihui Guo, Jiajun Gao. Global and Local Contrastive Learning for Self-Supervised Skeleton-Based Action Recognition
10590 -- 10603Qing Wang, Xulun Ye, Nongxiao Wang. Learning Low-Rank Representation Approximation for Few-Shot Deep Subspace Clustering
10604 -- 10617Chengrui Wei, Meng Yang 0002, Lei He, Nanning Zheng 0001. FS-Depth: Focal-and-Scale Depth Estimation From a Single Image in Unseen Indoor Scene
10618 -- 10631Qingqing Yan, Shu Li, Zongtao He, Mengxian Hu, Chengju Liu, Qijun Chen. DR-Block: Convolutional Dense Reparameterization for CNN Generalization Free Improvement
10632 -- 10645Xuze Hao, Xuhao Jiang, Wenqian Ni, Weimin Tan, Bo Yan 0001. Prompt-Guided Semantic-Aware Distillation for Weakly Supervised Incremental Semantic Segmentation
10646 -- 10661Yinsai Guo, Hang Yu 0006, Liyan Ma, Xiangfeng Luo, Shaorong Xie. DIE-CDK: A Discriminative Information Enhancement Method With Cross-Modal Domain Knowledge for Fine-Grained Ship Detection
10662 -- 10677Yi Luo, Feng Shao 0001, Baoyang Mu, Hangwei Chen, Zhuo Li, Qiuping Jiang. Dynamic Weighted Fusion and Progressive Refinement Network for Visible-Depth-Thermal Salient Object Detection
10678 -- 10691Wenkang Shan, Yuhuai Zhang, Xinfeng Zhang 0001, Shanshe Wang, Xilong Zhou, Siwei Ma, Wen Gao 0001. Diffusion-Based Hypotheses Generation and Joint-Level Hypotheses Aggregation for 3D Human Pose Estimation
10692 -- 10703Yicheng Lin, Yunlong Jiang, Xujia Jiao, Bin Han 0010. Learned Good Features to Track
10704 -- 10717Hanlin Guo, Guobao Xiao, Lumei Su, Tianyou Li, Da-Han Wang, Hanzi Wang. Second-Order Proximity Guided Sampling Consensus for Robust Model Fitting
10718 -- 10731Xingyu Zhu, Xiangbo Shu, Jinhui Tang 0001. Motion-Aware Mask Feature Reconstruction for Skeleton-Based Action Recognition
10732 -- 10742Jie Zhu, Bo Peng 0007, Bingzheng Liu, Qingming Huang, Jianjun Lei. Self-Constructing Stereo Correspondences for Unsupervised Multi-View Stereo
10743 -- 10752Zhuoran Xie, Miao Yang, Mengjiao Shen, Yuquan Qiu, Xinyu Wang. FIOD-VUE: Focusing on Invariant Information in Object Detection of Varying Underwater Environment
10753 -- 10763Chao Zheng, Li Liu, Yu Meng, Xiaorui Peng, Meijun Wang. Few-Shot Point Cloud Semantic Segmentation via Support-Query Feature Interaction
10764 -- 10778Yalong Jiang, Changkang Li, Wenrui Ding, Jinzhi Xiang, Zheru Chi. Reasonable Anomaly Detection Based on Long-Term Sequence Modeling
10779 -- 10792Ruiqiu Wang, Tao Su, Dan Xu 0007, Jianlai Chen, Yuan Liang. MIGA-Net: Multi-View Image Information Learning Based on Graph Attention Network for SAR Target Recognition
10793 -- 10804Weichao Zhao, Hezhen Hu, Wengang Zhou, Yunyao Mao, Min Wang 0019, Houqiang Li. MASA: Motion-Aware Masked Autoencoder With Semantic Alignment for Sign Language Recognition
10805 -- 10816Jin Liu 0018, Jialong Xie, Fengyu Zhou, Shengfeng He. Question Type-Aware Debiasing for Test-Time Visual Question Answering Model Adaptation
10817 -- 10830Xuanyu Zhang, Bin Chen, Wenzhen Zou, Shuai Liu, Yongbing Zhang, Ruiqin Xiong, Jian Zhang 0018. Progressive Content-Aware Coded Hyperspectral Snapshot Compressive Imaging
10831 -- 10844Xiaoqiang Zhou, Chaoyou Fu, Huaibo Huang, Ran He 0001. Dynamic Graph Memory Bank for Video Inpainting
10845 -- 10859Yuanliang Xue, Guodong Jin, Tao Shen, Lining Tan, Nian Wang, Jing Gao, Lianfeng Wang. Consistent Representation Mining for Multi-Drone Single Object Tracking
10860 -- 10873Tao Yan 0001, Xiangjie Zhu, Xianglong Chen, Weijiang He, Chenglong Wang, Yang Yang 0046, YingHui Wang, Xiaojun Chang. GLGFN: Global-Local Grafting Fusion Network for High-Resolution Image Deraining
10874 -- 10887Mingjin He, Bingwen Feng, Yizhi Guo, Jian Weng 0001, Wei Lu 0001. Camera-Shooting Resilient Watermarking on Image Instance Level
10888 -- 10902Lin He, Bingwen Feng, Zecheng Peng, Bing Chen 0004, Zhihua Xia, Wei Lu 0001. Removing Hidden Information by Geometrical Perturbation in Frequency Domain
10903 -- 10916Dongjia Zhao, Lei Qi 0001, Xiao Shi, Yinghuan Shi, Xin Geng 0001. A Novel Cross-Perturbation for Single Domain Generalization
10917 -- 10929Preeti Meena, Himanshu Kumar, Sandeep Kumar Yadav. A Volumetric Saliency Guided Image Summarization for RGB-D Indoor Scene Classification
10930 -- 10943Weidong Zhang 0007, Qingmin Liu, Yikun Feng, Lei Cai, Peixian Zhuang. Underwater Image Enhancement via Principal Component Fusion of Foreground and Background
10944 -- 10958Di Wang 0018, Jinyuan Liu 0001, Long Ma 0002, Risheng Liu, Xin Fan 0001. Improving Misaligned Multi-Modality Image Fusion With One-Stage Progressive Dense Registration
10959 -- 10971Yong Wu, Guang Chen 0001, Linwei Ye, Yuanning Jia, Zhi Liu 0003, Yang Wang 0003. TTAGaze: Self-Supervised Test-Time Adaptation for Personalized Gaze Estimation
10972 -- 10986Zhaobo Qi, Yibo Yuan, Xiaowen Ruan, Shuhui Wang, Weigang Zhang, Qingming Huang. Collaborative Debias Strategy for Temporal Sentence Grounding in Video
10987 -- 10999Lizhi Xiong, Rui Ding, Ching-Nung Yang, Zhangjie Fu. Invertible Secret Image Sharing With Authentication for Embedding Color Palette Image Into True Color Image
11000 -- 11012Asif Raza, Bang Yang, Yuexian Zou. Zero-Shot Temporal Action Detection by Learning Multimodal Prompts and Text-Enhanced Actionness
11013 -- 11025Chunyan Wang, Dong Zhang, Rui Yan. Boosting Weakly-Supervised Image Segmentation via Representation, Transform, and Compensator
11026 -- 11039Duo Qiu, Bei Yang, Xiongjun Zhang. Robust Tensor Completion via Dictionary Learning and Generalized Nonconvex Regularization for Visual Data Recovery
11040 -- 11055Qingxin Sheng, Chong Fu, Ming Tie, Xingwei Wang 0001, Junxin Chen 0001, Chiu-Wing Sham. A Chaos-Based Tunable Selective Encryption Algorithm for H.265/HEVC With Semantic Understanding
11056 -- 11069Meng Liu 0006, Da Li, Yongqiang Li, Xuemeng Song, Liqiang Nie. Audio-Semantic Enhanced Pose-Driven Talking Head Generation
11070 -- 11085Yuxin Feng, Zhuo Su 0001, Long Ma 0002, Xin Li, Risheng Liu, Fan Zhou 0001. Bridging the Gap Between Haze Scenarios: A Unified Image Dehazing Model
11086 -- 11100Linfei Wang, Yibing Zhan, Long Lan, Xu Lin, Dapeng Tao, Xinbo Gao 0001. DeIoU: Toward Distinguishable Box Prediction in Densely Packed Object Detection
11101 -- 11114Rui Guo, Linbin Wang, Chencheng Zhang, Lian Gu, Dianyou Li, Xiaohua Qian. A Causality-Informed Graph Convolutional Network for Video Assessment of Parkinsonian Leg Agility
11115 -- 11127Bokang Wang, Qian Ning, Fangfang Wu, Xin Li 0005, Weisheng Dong, Guangming Shi. Uncertainty Modeling of the Transmission Map for Single Image Dehazing
11128 -- 11141Junhui Li, Xingsong Hou. The Design of an Adaptive Enhanced AMP-Based Image Block Compressed Sensing and Its Application to Image Encryption
11142 -- 11155Chen Yang, Guorong Li, Shuhui Wang, Li Su 0003, Laiyun Qing, Qingming Huang. SpikeODE: Image Reconstruction for Spike Camera With Neural Ordinary Differential Equation
11156 -- 11168Yi-Chen Chen, Wei-Ta Chu. Positive and Negative Set Designs in Contrastive Feature Learning for Temporal Action Segmentation
11169 -- 11183Bolin Ni, Xing Nie, Chenghao Zhang, Shixiong Xu, Xin Zhang 0093, Gaofeng Meng, Shiming Xiang. MoBoo: Memory-Boosted Vision Transformer for Class-Incremental Learning
11184 -- 11197Xiaogang Song 0001, Pengfei Zhang, Xiaofeng Lu, Xinhong Hei 0001, Rongrong Liu. A Universal Multi-View Guided Network for Salient Object and Camouflaged Object Detection
11198 -- 11213Yuqiao Zeng, Tengfei Liang, Yi Jin 0001, Yidong Li. MMI-Det: Exploring Multi-Modal Integration for Visible and Infrared Object Detection
11214 -- 11228Yike Liu, Haipeng Li 0001, Shuaicheng Liu, Bing Zeng. CodingHomo: Bootstrapping Deep Homography With Video Coding
11229 -- 11240Qinghua Sheng, Hongzhao Chen, Changcai Lai, Xiaofang Huang, Yuanyuan Liu, Xiaofeng Huang, Haibing Yin. Fast Linear Equation Solving Algorithm and its Pipelined Hardware Architecture Design for VVC Affine Motion Estimation
11241 -- 11255Jian Wang, Qiang Ling. FDNet: Frequency Decomposition Network for Learned Image Compression
11256 -- 11270Guohao Xu, Leilei Huang, Zhijian Hao, Wei Li, Shiyan Yi, Xiaoyang Zeng, Yibo Fan. A High Compression Efficiency Hardware Encoder for Intra and Inter Coding With 4K@30fps Throughput
11271 -- 11285Siyu Zhou, Fuwei Zhang, Ruomei Wang 0001, Fan Zhou 0001, Zhuo Su 0001. Subtask Prior-Driven Optimized Mechanism on Joint Video Moment Retrieval and Highlight Detection
11286 -- 11298Ming Jin, Wenbo Hu 0001, Lei Zhu 0002, Xiang Wang 0010, Richang Hong. Based on Spatial and Temporal Implicit Semantic Relational Inference for Cross-Modal Retrieval
11299 -- 11312Linshan Hou, Zhongyun Hua, Yuhong Li, Yifeng Zheng, Leo Yu Zhang. M-to-N Backdoor Paradigm: A Multi-Trigger and Multi-Target Attack to Deep Learning Models
11316 -- 11339Wang Xia, Guodao Sun, Tong Li, Baofeng Chang, Jingwei Tang, Gefei Zhang, Ronghua Liang. Video Visualization and Visual Analytics: A Task-Based and Application- Driven Investigation
11340 -- 11359Zehai Niu, Ke Lu 0002, Jian Xue, Xiaoyu Qin, Jinbao Wang, Ling Shao 0001. From Methods to Applications: A Review of Deep 3D Human Motion Capture
11360 -- 11372Jianhan Qi, Yuheng Jia, Hui Liu 0032, Junhui Hou. Superpixel Graph Contrastive Clustering With Semantic-Invariant Augmentations for Hyperspectral Images
11373 -- 11385Bo Miao, Mohammed Bennamoun, Yongsheng Gao 0001, Mubarak Shah, Ajmal Mian. Temporally Consistent Referring Video Object Segmentation With Hybrid Memory
11386 -- 11399Yan Liu 0043, Qingyong Hu, Yulan Guo. BSTS: A Weakly-Supervised Method for Semantic Learning of 3D Point Clouds
11400 -- 11412Yuxi Liu, Guibo Luo, Zhenyu Weng, Yuesheng Zhu. Adaptive Face Recognition for Multi-Type Occlusions
11413 -- 11422Junran Ding, Yunxiang He, Binzhe Yuan, Zhechen Yuan, Pingqiang Zhou, Jingyi Yu, Xin Lou. Ray Reordering for Hardware-Accelerated Neural Volume Rendering
11423 -- 11437Yichen Guo, Mai Xu, Lai Jiang, Xin Deng 0002, Jing Zhou, Gaoxing Chen, Leonid Sigal. Proposal With Alignment: A Bi-Directional Transformer for 360° Video Viewport Proposal
11438 -- 11450Weijia Liu, Shaoming Zhang, Yan Tang, Zhong Wang, Jianmei Wang. Style Reconstruction-Driven Networks for Occlusion-Aware License Plate Recognition
11451 -- 11463De Cheng, Yuxin Zhao, Nannan Wang 0001, Guozhang Li, Dingwen Zhang, Xinbo Gao 0001. Efficient Statistical Sampling Adaptation for Exemplar-Free Class Incremental Learning
11464 -- 11477Xingyu Tong, Yang Xiao 0007, Bo Tan, Jianyu Yang 0002, Zhiguo Cao 0001, Joey Tianyi Zhou, Junsong Yuan 0001. You Will Never Walk Alone: One-Shot 3D Action Recognition With Point Cloud Sequence
11478 -- 11492Yi Shi, Long Qin, Shixuan Zhao 0001, Kaifu Yang, Yuyong Cui, Hongmei Yan. Weakly Supervised Fixated Object Detection in Traffic Videos Based on Driver's Selective Attention Mechanism
11493 -- 11506Peng Zhao, Xiaoming Xi, Qiangchang Wang, Yilong Yin. Characterizing Hierarchical Semantic-Aware Parts With Transformers for Generalized Zero-Shot Learning
11507 -- 11520Ruilin Yao, Yi Rong, Qiangqiang Huang, Shengwu Xiong. CTOD: Cross-Attentive Task-Alignment for One-Stage Object Detection
11521 -- 11534Yizhu Zhang, Jingang Shi, Jiayin Wang, Yuan Zong, Wenming Zheng, Guoying Zhao 0001. MaskFusionNet: A Dual-Stream Fusion Model With Masked Pre-Training Mechanism for rPPG Measurement
11535 -- 11550Sheng Yan, Mengyuan Liu, Yong Wang, Yang Liu 0264, Hong Liu 0008. MLP: Motion Label Prior for Temporal Sentence Localization in Untrimmed 3D Human Motions
11551 -- 11563Xi Yang 0011, Menghui Tian, Nannan Wang 0001, Xinbo Gao 0001. Unleashing the Feature Hierarchy Potential: An Efficient Tri-Hybrid Person Search Model
11564 -- 11578Haihong Xiao, Ying He 0001, Hao Liu, Wenxiong Kang, Yuqiong Li. Point Cloud Completion via Self-Projected View Augmentation and Implicit Field Constraint
11579 -- 11591Xin Liu, Jiamin Wu, Wenfei Yang, Xu Zhou, Tianzhu Zhang. Multi-Modal Attribute Prompting for Vision-Language Models
11592 -- 11604Yue Wang 0038, Lu Zhang 0053, Pingping Zhang, Yunzhi Zhuge, Junfeng Wu, Hong Yu, Huchuan Lu. Learning Local-Global Representation for Scribble-Based RGB-D Salient Object Detection via Transformer
11605 -- 11618Congqi Cao, Ze Sun, Qinyi Lv, Lingtong Min, Yanning Zhang 0001. VS-TransGRU: A Novel Transformer-GRU-Based Framework Enhanced by Visual-Semantic Fusion for Egocentric Action Anticipation
11619 -- 11629Xuxiang Sun 0001, Gong Cheng 0003, Hongda Li, Hongyu Peng, Junwei Han. Task-Specific Importance-Awareness Matters: On Targeted Attacks Against Object Detection
11630 -- 11643Haolin Du, Jingfei He, YuanQing Zhao. CCR: A Counterfactual Causal Reasoning-Based Method for Cross-View Geo-Localization
11644 -- 11656Zhuming Wang, Zun Li 0001, Xianglong Lang, Yihao Zheng, Meng Tian, Lifang Wu, Liang Wang 0001, Changwen Chen. Knowledge Augmented Relation Inference for Group Activity Recognition
11657 -- 11667Ning Xu 0003, Tingting Zhang, Hongshuo Tian, An-An Liu. Rule-Driven News Captioning
11668 -- 11681Jian Wang, Tianhong Dai, Xinqiao Zhao, Ángel F. García-Fernández, Eng Gee Lim, Jimin Xiao. Class Activation Map Calibration for Weakly Supervised Semantic Segmentation
11682 -- 11694Jian Zhu 0006, Hanli Wang, Miaojing Shi. Multi-Modal Large Language Model Enhanced Pseudo 3D Perception Framework for Visual Commonsense Reasoning
11695 -- 11708Zongyi Li, Yuxuan Shi, Hefei Ling, Jiazhong Chen, Runsheng Wang, Chengxin Zhao, Qian Wang 0001, Shijuan Huang. Knowledge Consistency Distillation for Weakly Supervised One Step Person Search
11709 -- 11723Ziheng Jia, Xiongkuo Min, Wei Sun 0029, Guangtao Zhai. Continuous and Overall Quality of Experience Evaluation for Streaming Video Based on Rich Features Exploration and Dual-Stage Attention
11724 -- 11738Liqing Gao, Fan Lyu, Peng Shi, Lei Zhu 0003, Junfu Pu, Liang Wang 0001, Wei Feng 0005. Overcoming Modality Bias in Question-Driven Sign Language Video Translation
11739 -- 11750Xu Yin, Woobin Im, Dongbo Min, Yuchi Huo, Fei Pan, Sung-Eui Yoon. Fine-Grained Background Representation for Weakly Supervised Semantic Segmentation
11751 -- 11767Shuyuan Wang, Qi Li 0005, Huiyuan Luo, Chengkan Lv, Zhengtao Zhang. Produce Once, Utilize Twice for Anomaly Detection
11768 -- 11782Ling Lin 0002, Tao Wang, Hao Liu 0019, Congcong Zhu, Jingrun Chen. Toward Quantifiable Face Age Transformation Under Attribute Unbias
11783 -- 11797Shaocong Long, Qianyu Zhou 0001, Chenhao Ying, Lizhuang Ma, Yuan Luo 0003. Rethinking Domain Generalization: Discriminability and Generalizability
11798 -- 11809Xie Yang, Yuke Wang, Fangjun Huang. CNN-Based Reversible Data Hiding for JPEG Images
11810 -- 11824Quan Chen, Tingyu Wang, Zihao Yang, Haoran Li, Rongfeng Lu, Yaoqi Sun, Bolun Zheng, Chenggang Yan 0001. SDPL: Shifting-Dense Partition Learning for UAV-View Geo-Localization
11825 -- 11837Qianzi Yu, Kai Zhu 0004, Yang Cao 0010, Feijie Xia, Yu Kang 0001. TF²: Few-Shot Text-Free Training-Free Defect Image Generation for Industrial Anomaly Inspection
11838 -- 11850Yichen Chi, Junhao Gu, Jiamiao Zhang, Wenming Yang, Yapeng Tian. EgoVSR: Toward High-Quality Egocentric Video Super-Resolution
11851 -- 11873Junge Peng, Bing Luo, Li Xu, Jun Yang, Chao Zhang 0072, Zheng Pei 0001. Blind Image Deblurring via Minimizing Similarity Between Fuzzy Sets on Image Pixels
11874 -- 11885Yongxian Wei, Zixuan Hu, Li Shen 0008, Zhenyi Wang, Lei Li, Yu Li, Chun Yuan. Meta-Learning Without Data via Unconditional Diffusion Models
11886 -- 11899Haoliang Zhou, Shucheng Huang, Feifei Zhang, Changsheng Xu. CEPrompt: Cross-Modal Emotion-Aware Prompting for Facial Expression Recognition
11900 -- 11913Liya Wang, Haipeng Chen 0002, Yu Liu 0004, Yingda Lyu. Regular Constrained Multimodal Fusion for Image Captioning
11914 -- 11928Chen Feng, Georgios Tzimiropoulos, Ioannis Patras. NoiseBox: Toward More Efficient and Effective Learning With Noisy Labels
11929 -- 11941Hongjun Wu 0003, Chenxi Wang, Luwei Tu, Constantin Patsch, Zhi Jin. CSPN: A Category-Specific Processing Network for Low-Light Image Enhancement
11942 -- 11953Mingfeng Zha, Feiyang Fu, Yunqiang Pei, Guoqing Wang 0001, Tianyu Li, Xiongxin Tang, Yang Yang 0002, Heng Tao Shen. Dual Domain Perception and Progressive Refinement for Mirror Detection
11954 -- 11964Haonan Zhang, Pengpeng Zeng, Lianli Gao, Jingkuan Song, Heng Tao Shen. Ump: Unified Modality-Aware Prompt Tuning for Text-Video Retrieval
11965 -- 11979Feifeng Wang, Liquan Shen, Qi Teng, Zhaoyi Tian. DSCIC: Deep Screen Content Image Compression
11980 -- 11992Yi-Hsin Chen, Hong-Sheng Xie, Cheng-Wei Chen, Zong-Lin Gao, Martin Benjak, Wen-Hsiao Peng, Jörn Ostermann. MaskCRT: Masked Conditional Residual Transformer for Learned Video Compression
11993 -- 12005Tianyi Sun, Yanze Wang, Zhijie Huang, Jun Sun 0012. STRANet: Soft-Target and Restriction-Aware Neural Network for Efficient VVC Intra Coding
12006 -- 12018Chris Henry, Li Song 0001, Zhu Li 0001. Fast Video Deduplication and Localization With Temporal Consistence Re-Ranking
12019 -- 12031Huakai Lai, Wenfei Yang, Tianzhu Zhang, Yongdong Zhang 0001. Reliable Phrase Feature Mining for Hierarchical Video-Text Retrieval
12032 -- 12047Huadong Zhang, Shuli Cheng, Anyu Du. Multi-Stage Auxiliary Learning for Visible-Infrared Person Re-Identification
12048 -- 12058Duc Quang Vu, Trang T. T. Phung, Jia-Ching Wang, Son T. Mai. LCSL: Long-Tailed Classification via Self-Labeling
12059 -- 12072Dengdi Sun, Yajie Pan, Andong Lu, Chenglong Li 0002, Bin Luo 0001. Transformer RGBT Tracking With Spatio-Temporal Multimodal Tokens
12073 -- 12085Xingjie Dai, Ziwen He, Xiang Zhang 0023, Zhangjie Fu. SCGM: Asymmetric Steganographic Embedding Cost Learning With Adaptive Modulation
12086 -- 12091Xi Xie, Meng Wang 0017, Junru Li, Kai Zhang 0007, Li Zhang 0006, Shiqi Wang 0001. Enhanced Motion Compensated Temporal Filter for VVenC
12092 -- 12096Linwei Zhu, Yun Zhang 0002, Na Li 0015, Wenhui Wu, Shiqi Wang 0001, Sam Kwong. Neural Network Based Multi-Level In-Loop Filtering for Versatile Video Coding