Journal: IEEE Trans. Circuits Syst. Video Techn.

Volume 34, Issue 8

6615 -- 6619Wenguan Wang, Tianfei Zhou, Dongfang Liu, Zheng Thomas Tang, Alexander C. Loui. Guest Editorial Introduction to the Special Issue on Label-Efficient Learning on Video Data
6620 -- 6633Wenyi Zhao, Lu Yang 0006, Weidong Zhang 0007, Yongqin Tian, Wenhe Jia, Wei Li, Mu Yang, Xipeng Pan, Huihua Yang. Learning What and Where to Learn: A New Perspective on Self-Supervised Learning
6634 -- 6645Qiuxia Lai, Ailing Zeng, Ye Wang 0011, Lihong Cao, Yu Li 0007, Qiang Xu 0001. Self-Supervised Video Representation Learning via Capturing Semantic Changes Indicated by Saccades
6646 -- 6660Chao Wang, Zheng Tang. The Staged Knowledge Distillation in Video Classification: Harmonizing Student Progress by a Complementary Weakly Supervised Framework
6661 -- 6673Shoubin Yu, Zhongyin Zhao, Haoshu Fang, Andong Deng, Haisheng Su, Dongliang Wang, Weihao Gan, Cewu Lu, Wei Wu 0021. Regularity Learning via Explicit Distribution Modeling for Skeletal Video Anomaly Detection
6674 -- 6685Ruotong Hu, Xianzhi Wang 0001, Xiaojun Chang, Yongle Zhang, Yeqi Hu, Xinyuan Liu, Shusong Yu. CStrCRL: Cross-View Contrastive Learning Through Gated GCN With Strong Augmentations for Skeleton Recognition
6686 -- 6698Xuemei Zhang, Peng Zhao, Jinsheng Ji, Xiankai Lu, Yilong Yin. Video Corpus Moment Retrieval via Deformable Multigranularity Feature Fusion and Adversarial Training
6699 -- 6709Yin Tang, Tao Chen 0012, Xiruo Jiang, Yazhou Yao, Guo-Sen Xie, Heng Tao Shen. Holistic Prototype Attention Network for Few-Shot Video Object Segmentation
6710 -- 6721Yawen Lu, Jie Zhang 0066, Su Sun, Qianyu Guo, Zhiwen Cao, Songlin Fei, Baijian Yang 0001, Yingjie Victor Chen. Label-Efficient Video Object Segmentation With Motion Clues
6722 -- 6734Mingjie Sun, Jimin Xiao, Eng Gee Lim, Cairong Zhao, Yao Zhao 0001. Unified Multi-Modality Video Object Segmentation Using Reinforcement Learning
6735 -- 6748Ruiheng Zhang, Lu Li, Qi Zhang, Jin Zhang, Lixin Xu, Baomin Zhang, Binglu Wang. Differential Feature Awareness Network Within Antagonistic Learning for Infrared-Visible Object Detection
6749 -- 6761Chuangye Guo, Kang Liu 0014, Donghu Deng, Xuelong Li 0001. ViT Spatio-Temporal Feature Fusion for Aerial Object Tracking
6762 -- 6773Zhixiong Nan, Tao Xiang 0001. Third-Person View Attention Prediction in Natural Scenarios With Weak Information Dependency and Human-Scene Interaction Mechanism
6774 -- 6784Xiyue Wang, De Cai, Sen Yang 0006, Yiming Cui, Junyou Zhu, Kanran Wang, Junhan Zhao. SAC-Net: Enhancing Spatiotemporal Aggregation in Cervical Histological Image Classification via Label-Efficient Weakly Supervised Learning
6785 -- 6796Jiake Leng, Yiyan Zhang, Xiang Liu, Yiming Cui, Junhan Zhao, Yongxin Ge. Error-Robust and Label-Efficient Deep Learning for Understanding Tumor Microenvironment From Spatial Transcriptomics
6797 -- 6808Qingxuan Shi, Yihang Li, Huijun Di, Enyi Wu. Self-Supervised Interactive Image Segmentation
6809 -- 6813Shengxi Li, Xuelong Li, Leonardo Chiariglione, Jiebo Luo, Wenwu Wang 0001, Zhengyuan Yang, Danilo P. Mandic, Hamido Fujita. Introduction to the Special Issue on AI-Generated Content for Multimedia
6814 -- 6832Fatemeh Nazarieh, Zhenhua Feng, Muhammad Awais 0001, Wenwu Wang 0001, Josef Kittler. A Survey of Cross-Modal Visual Content Generation
6833 -- 6846Chunyi Li, Zicheng Zhang, Haoning Wu 0001, Wei Sun 0029, Xiongkuo Min, Xiaohong Liu 0001, Guangtao Zhai, Weisi Lin. AGIQA-3K: An Open Database for AI-Generated Image Quality Assessment
6847 -- 6859Yixuan Wang, Wengang Zhou, Jianmin Bao, Weilun Wang, Li Li 0040, Houqiang Li. CLIP2GAN: Toward Bridging Text With the Latent Space of GANs
6860 -- 6873Hong Chen, Yipeng Zhang 0003, Xin Wang 0019, Xuguang Duan, Yuwei Zhou, Wenwu Zhu 0001. DisenDreamer: Subject-Driven Text-to-Image Generation With Sample-Aware Disentangled Tuning
6874 -- 6887Jiyao Pu, Haoran Duan, Junzhe Zhao, Yang Long 0001. Rules for Expectation: Learning to Generate Rules via Social Environment Modeling
6888 -- 6900Jin Liu 0020, Xi Wang 0014, Xiaomeng Fu, Yesheng Chai, Cai Yu, Jiao Dai, Jizhong Han. OSM-Net: One-to-Many One-Shot Talking Head Generation With Spontaneous Head Motions
6901 -- 6912Cong Jin, Ruolin Zhu, Zixing Zhu, Lu Yang 0006, Min Yang, Jiebo Luo. MtArtGPT: A Multi-Task Art Generation System With Pre-Trained Transformer
6913 -- 6925Yang Zhao 0002, Huaen Li, Zhao Zhang 0001, Yuan Chen, Qing Liu 0022, Xiaojuan Zhang. Regional Traditional Painting Generation Based on Controllable Disentanglement Model
6926 -- 6936Yang Yu 0039, Xiaolong Liu, Rongrong Ni, Siyuan Yang, Yao Zhao 0001, Alex C. Kot. PVASS-MDD: Predictive Visual-Audio Alignment Self-Supervision for Multimodal Deepfake Detection
6937 -- 6948Miao Liu, Jing Wang 0037, Xinyuan Qian, Haizhou Li 0001. Audio-Visual Temporal Forgery Detection Using Embedding-Level Fusion and Multi-Dimensional Contrastive Loss
6949 -- 6962Yihao Huang 0001, Felix Juefei-Xu, Qing Guo 0005, Yang Liu 0003, Geguang Pu. Dodging DeepFake Detection via Implicit Spatial-Domain Notch Filtering
6963 -- 6977Qiyuan Du, Yiping Duan, Zhipeng Xie, Xiaoming Tao, Linsu Shi, Zhijuan Jin. Optical Flow-Based Spatiotemporal Sketch for Video Representation: A Novel Framework
6978 -- 6992Junlong Gao, Chuanmin Jia, Zhimeng Huang, Shanshe Wang, Siwei Ma, Wen Gao 0001. Rate-Distortion Optimized Cross Modal Compression With Multiple Domains
6993 -- 7004Fangyuan Gao, Xin Deng 0002, Junpeng Jing, Xin Zou, Mai Xu. Extremely Low Bit-Rate Image Compression via Invertible Image Generation
7005 -- 7016Hefeng Wu, Weifeng Chen, Zhibin Liu, Tianshui Chen, Zhiguang Chen, Liang Lin. Contrastive Transformer Learning With Proximity Data Generation for Text-Based Person Search
7017 -- 7028Zhen Qin 0002, Yujie Chen, Guosong Zhu, Erqiang Zhou, Yingjie Zhou, Yicong Zhou, Ce Zhu. Enhanced Pseudo-Label Generation With Self-Supervised Training for Weakly- Supervised Semantic Segmentation
7029 -- 7040Wenxue Guan, Haobo Li, Dawei Xu, Jiaxin Liu, Shenghua Gong, Jun Liu 0006. Frequency Generation for Real-World Image Super-Resolution
7041 -- 7056Huaizhang Liao, Jingyuan Xia, ZhiXiong Yang, Fulin Pan, Zhen Liu 0004, Yongxiang Liu. Meta-Learning Based Domain Prior With Application to Optical-ISAR Image Translation
7057 -- 7068Ge Shi 0002, Sinuo Deng, Bo Wang, Chong Feng, Yan Zhuang, Xiaomei Wang. One for All: A Unified Generative Framework for Image Emotion Classification
7069 -- 7079Chunwei Tian, Menghua Zheng, Bo Li 0004, Yanning Zhang, Shichao Zhang 0001, David Zhang 0001. Perceptive Self-Supervised Learning Network for Noisy Image Watermark Removal
7080 -- 7094Linjun Li, Tao Jin, Wang Lin, Hao Jiang, Wenwen Pan, Jian Wang, Shuwen Xiao, Yan Xia 0006, Weihao Jiang, Zhou Zhao. Multi-Granularity Relational Attention Network for Audio-Visual Question Answering
7095 -- 7105Feilong Cao, Lingpeng Wang, Hailiang Ye. SharpGConv: A Novel Graph Method With Plug-and-Play Sharpening Convolution for Point Cloud Registration
7106 -- 7120Qinghua Ren, Shijian Lu, Qirong Mao, Ming Dong 0001. Exploring Prototype-Anchor Contrast for Semantic Segmentation
7121 -- 7134Jinxiang Zhu, Qi Wang, Xinyu Dong, Weijian Ruan, Haolin Chen, Liang Lei, Gefei Hao. FSNA: Few-Shot Object Detection via Neighborhood Information Adaption and All Attention
7135 -- 7148Ziye Fang, Xin Jiang, Hao Tang 0007, Zechao Li. Learning Contrastive Self-Distillation for Ultra-Fine-Grained Visual Categorization Targeting Limited Samples
7149 -- 7164Yongxi Li, Wenzhong Tang, Shuai Wang, Shengsheng Qian, Changsheng Xu. Distribution-Guided Hierarchical Calibration Contrastive Network for Unsupervised Person Re-Identification
7165 -- 7175Jiahao Xu, Xinzhu Ma, Lin Zhang, Bo Zhang 0069, Tao Chen 0003. Push-and-Pull: A General Training Framework With Differential Augmentor for Domain Generalized Point Cloud Classification
7176 -- 7189Kaijie He, Canlong Zhang, Sheng Xie, Zhixin Li 0001, Zhi-wen Wang, Rui-Guo Qin. Target-Aware Tracking With Spatial-Temporal Context Attention
7190 -- 7201Han Lin, Yingjian Li, Zheng Zhang 0006, Lei Zhu 0002, Yong Xu 0001. Learning With Noisy Labels by Semantic and Feature Space Collaboration
7202 -- 7215YiBo Zhao, Hua Zhang 0003, Zan Gao, Weili Guan, Meng Wang 0001, Shengyong Chen. A Snippets Relation and Hard-Snippets Mask Network for Weakly-Supervised Temporal Action Localization
7216 -- 7230Xiaoying Yuan, Tingfa Xu, Xincong Liu, Ying Wang, Haolin Qin, Yuqiang Fang, Jianan Li. Multi-Step Temporal Modeling for UAV Tracking
7231 -- 7243Jinhong Deng, Wen Li 0001, Lixin Duan. Balanced Teacher for Source-Free Object Detection
7244 -- 7258Sungjun Jang, Heansung Lee 0001, Woo Jin Kim, Jungho Lee, Sungmin Woo, Sangyoun Lee. Multi-Scale Structural Graph Convolutional Network for Skeleton-Based Action Recognition
7259 -- 7271Baozhen Sun, Zhenhua Wang 0003, Shilei Wang, Yongkang Cheng, Jifeng Ning. Bidirectional Interaction of CNN and Transformer Feature for Visual Tracking
7272 -- 7283Yiming Wang 0007, Dongxia Chang, Zhiqiang Fu, Jie Wen 0001, Yao Zhao 0001. Partially View-Aligned Representation Learning via Cross-View Graph Contrastive Network
7284 -- 7300Yanjie Liang, Haosheng Chen 0001, Qiangqiang Wu, Changqun Xia, Jia Li 0003. Joint Spatio-Temporal Similarity and Discrimination Learning for Visual Tracking
7301 -- 7314Jiale Zhang, Chengxin Liu, Ke Xian, Zhiguo Cao 0001. Hierarchical Feature Warping and Blending for Talking Head Animation
7315 -- 7327Kangdao Liu, Xiaolin Xiao, Jinkun You, Yicong Zhou. Robust Discriminative t-Linear Subspace Learning for Image Feature Extraction
7328 -- 7343Kunchi Li, Hongyang Chen, Jun Wan 0001, Shan Yu. ESDB: Expand the Shrinking Decision Boundary via One-to-Many Information Matching for Continual Learning With Small Memory
7344 -- 7358Kunpeng Wang, Zhengzheng Tu, Chenglong Li 0002, Cheng Zhang 0010, Bin Luo 0001. Learning Adaptive Fusion Bank for Multi-Modal Salient Object Detection
7359 -- 7372Xiaoqiang Lu, Licheng Jiao, Lingling Li 0002, Fang Liu 0001, Xu Liu 0006, Shuyuan Yang. Self Pseudo Entropy Knowledge Distillation for Semi-Supervised Semantic Segmentation
7373 -- 7385Minghua Zhang, Qiuyang Zhang, Wei Song 0007, Dongmei Huang, Qi He 0003. PromptVT: Prompting for Efficient and Accurate Visual Tracking
7386 -- 7400Tianlu Zhang, Xiaoyi He, Qiang Jiao, Qiang Zhang 0020, Jungong Han. AMNet: Learning to Align Multi-Modality for RGB-T Tracking
7401 -- 7416Yimei Liu, Qing Cai, Congcong Wang, Jian Yang, Hao Fan 0004, Junyu Dong, Sheng Chen 0001. Geometry-Enhanced Attentive Multi-View Stereo for Challenging Matching Scenarios
7417 -- 7429Xu Liu, Jianing Li, Jinqiao Shi, Xiaopeng Fan, Yonghong Tian 0001, Debin Zhao. Event-Based Monocular Depth Estimation With Recurrent Transformers
7430 -- 7439Liqun Lin, Guangpeng Wei, Kanglin Liu, Wanjian Feng, Tiesong Zhao. LightViD: Efficient Video Deblurring With Spatial-Temporal Feature Fusion
7440 -- 7453Tao Zhou 0002, Yi Zhou 0007, Guangyu Li, Geng Chen 0001, Jianbing Shen. Uncertainty-Aware Hierarchical Aggregation Network for Medical Image Segmentation
7454 -- 7466Xin Liu, Biao Qian, Haipeng Liu 0004, Dan Guo, Yang Wang 0023, Meng Wang 0001. Seeking False Hard Negatives for Graph Contrastive Learning
7467 -- 7483Qihua Feng, Peiya Li, ZhiXun Lu, Chaozhuo Li 0001, Zefan Wang, Zhiquan Liu, Chunhui Duan, Feiran Huang, Jian Weng 0001, Philip S. Yu. EViT: Privacy-Preserving Image Retrieval via Encrypted Vision Transformer in Cloud Computing
7484 -- 7497Dandan Zhan, Jiahao Wu, Xing Luo, Zhi Jin. Learning From Text: A Multimodal Face Inpainting Network for Irregular Holes
7498 -- 7511Ahmet Burakhan Koyuncu, Panqi Jia, Atanas Boev, Elena Alshina, Eckehard G. Steinbach. Efficient Contextformer: Spatio-Channel Window Attention for Fast Context Modeling in Learned Image Compression
7512 -- 7522Deqian Mao, Shanshan Gao, Zhenyu Li, Honghao Dai, Yunfeng Zhang 0001, Yuanfeng Zhou. Aggregating Global and Local Representations via Hybrid Transformer for Video Deraining
7523 -- 7536Pei-an, Xuzhong Hu, Junfeng Ding, Jun Zhang, Jie Ma, You Yang, Qiong Liu 0001. OL-Reg: Registration of Image and Sparse LiDAR Point Cloud With Object-Level Dense Correspondences
7537 -- 7549Lei He 0010, Yongfang Xie, Shiwen Xie, Zhaohui Jiang 0001, Zhipeng Chen. Iterative Self-Guided Image Filtering
7550 -- 7565Zizhuo Li, Chunbao Su, Fan Fan 0001, Jun Huang 0008, Jiayi Ma 0001. MC-Net: Integrating Multi-Level Geometric Context for Two-View Correspondence Learning
7566 -- 7576Chunwei Tian, Menghua Zheng, Tiancai Jiao, Wangmeng Zuo, Yanning Zhang, Chia-Wen Lin. A Self-Supervised CNN for Image Watermark Removal
7577 -- 7588Tianwei Zhou, Songbai Tan, Baoquan Zhao, Guanghui Yue 0001. Multitask Deep Neural Network With Knowledge-Guided Attention for Blind Image Quality Assessment
7589 -- 7600Yu Tian, Shiqi Wang 0001, Baoliang Chen, Sam Kwong. Causal Representation Learning for GAN-Generated Face Image Quality Assessment
7601 -- 7613Weiwei Zhang, Yufeng Guo, Junhuang Wang, Jianqing Zhu, Huanqiang Zeng. Collaborative Knowledge Distillation
7614 -- 7627Zhengeng Yang, Hongshan Yu, Wei Sun 0028, Li Cheng 0001, Ajmal Mian. Domain-Invariant Prototypes for Semantic Segmentation
7628 -- 7642Mengxin Gong, Xiuli Chai, Yang Lu 0013, Yushu Zhang. Exploiting Four-Dimensional Chaotic Systems With Dissipation and Optimized Logical Operations for Secure Image Compression and Encryption
7643 -- 7656Jiaqing Zhang, Jie Lei 0001, Weiying Xie, Geng Yang, Daixun Li, Yunsong Li. Multimodal Informative ViT: Information Aggregation and Distribution for Hyperspectral and LiDAR Classification
7657 -- 7670Zhuyang Xie, Yan Yang 0001, Jie Wang, Xiaorong Liu, Xiaofan Li. Trustworthy Multimodal Fusion for Sentiment Analysis in Ordinal Sentiment Space
7671 -- 7682Nianzu Qiao, Changyin Sun, Lu Dong 0002, Quanbo Ge. Semi-Supervised Feature Distillation and Unsupervised Domain Adversarial Distillation for Underwater Image Enhancement
7683 -- 7698Tengfei Liang, Yi Jin 0001, Wu Liu, Tao Wang 0011, Songhe Feng, Yidong Li. Bridging the Gap: Multi-Level Cross-Modality Joint Alignment for Visible-Infrared Person Re-Identification
7699 -- 7711Hao Zhang, Yujie Dun, Yixuan Pei, Shenqi Lai, Chengxu Liu, Kaipeng Zhang, Xueming Qian. HF-HRNet: A Simple Hardware Friendly High-Resolution Network
7712 -- 7724Xiao Lu 0002, Yulin Yuan, Xing Liu, Lucai Wang, Xuanyu Zhou, Yimin Yang. Low-Light Salient Object Detection by Learning to Highlight the Foreground Objects
7725 -- 7741Chao Li, Shanzhi Yin, Chuanmin Jia, Fanyang Meng, Yonghong Tian 0001, Yongsheng Liang. Multirate Progressive Entropy Model for Learned Image Compression
7742 -- 7755Guangjin Pan, Shugong Xu, Shunqing Zhang, Xiaojing Chen 0001, Yanzan Sun. Quality of Experience Oriented Cross-Layer Optimization for Real-Time XR Video Transmission
7756 -- 7770Yihang Zhang, Sheng Cheng, Zongming Guo, Xinggong Zhang. Inferring Video Streaming Quality of Real-Time Communication Inside Network
7771 -- 7784Mingcong Lu, Ruifan Li, Fangxiang Feng, Zhanyu Ma, Xiaojie Wang 0006. LGR-NET: Language Guided Reasoning Network for Referring Expression Comprehension
7785 -- 7800Jiang Yu, Fengyong Li, Zichi Wang, Wen Si, Xinpeng Zhang 0001. Diverse Batch Steganography Using Model-Based Selection and Double-Layered Payload Assignment