| 6615 | -- | 6619 | Wenguan Wang, Tianfei Zhou, Dongfang Liu, Zheng Thomas Tang, Alexander C. Loui. Guest Editorial Introduction to the Special Issue on Label-Efficient Learning on Video Data |
| 6620 | -- | 6633 | Wenyi Zhao, Lu Yang 0006, Weidong Zhang 0007, Yongqin Tian, Wenhe Jia, Wei Li, Mu Yang, Xipeng Pan, Huihua Yang. Learning What and Where to Learn: A New Perspective on Self-Supervised Learning |
| 6634 | -- | 6645 | Qiuxia Lai, Ailing Zeng, Ye Wang 0011, Lihong Cao, Yu Li 0007, Qiang Xu 0001. Self-Supervised Video Representation Learning via Capturing Semantic Changes Indicated by Saccades |
| 6646 | -- | 6660 | Chao Wang, Zheng Tang. The Staged Knowledge Distillation in Video Classification: Harmonizing Student Progress by a Complementary Weakly Supervised Framework |
| 6661 | -- | 6673 | Shoubin Yu, Zhongyin Zhao, Haoshu Fang, Andong Deng, Haisheng Su, Dongliang Wang, Weihao Gan, Cewu Lu, Wei Wu 0021. Regularity Learning via Explicit Distribution Modeling for Skeletal Video Anomaly Detection |
| 6674 | -- | 6685 | Ruotong Hu, Xianzhi Wang 0001, Xiaojun Chang, Yongle Zhang, Yeqi Hu, Xinyuan Liu, Shusong Yu. CStrCRL: Cross-View Contrastive Learning Through Gated GCN With Strong Augmentations for Skeleton Recognition |
| 6686 | -- | 6698 | Xuemei Zhang, Peng Zhao, Jinsheng Ji, Xiankai Lu, Yilong Yin. Video Corpus Moment Retrieval via Deformable Multigranularity Feature Fusion and Adversarial Training |
| 6699 | -- | 6709 | Yin Tang, Tao Chen 0012, Xiruo Jiang, Yazhou Yao, Guo-Sen Xie, Heng Tao Shen. Holistic Prototype Attention Network for Few-Shot Video Object Segmentation |
| 6710 | -- | 6721 | Yawen Lu, Jie Zhang 0066, Su Sun, Qianyu Guo, Zhiwen Cao, Songlin Fei, Baijian Yang 0001, Yingjie Victor Chen. Label-Efficient Video Object Segmentation With Motion Clues |
| 6722 | -- | 6734 | Mingjie Sun, Jimin Xiao, Eng Gee Lim, Cairong Zhao, Yao Zhao 0001. Unified Multi-Modality Video Object Segmentation Using Reinforcement Learning |
| 6735 | -- | 6748 | Ruiheng Zhang, Lu Li, Qi Zhang, Jin Zhang, Lixin Xu, Baomin Zhang, Binglu Wang. Differential Feature Awareness Network Within Antagonistic Learning for Infrared-Visible Object Detection |
| 6749 | -- | 6761 | Chuangye Guo, Kang Liu 0014, Donghu Deng, Xuelong Li 0001. ViT Spatio-Temporal Feature Fusion for Aerial Object Tracking |
| 6762 | -- | 6773 | Zhixiong Nan, Tao Xiang 0001. Third-Person View Attention Prediction in Natural Scenarios With Weak Information Dependency and Human-Scene Interaction Mechanism |
| 6774 | -- | 6784 | Xiyue Wang, De Cai, Sen Yang 0006, Yiming Cui, Junyou Zhu, Kanran Wang, Junhan Zhao. SAC-Net: Enhancing Spatiotemporal Aggregation in Cervical Histological Image Classification via Label-Efficient Weakly Supervised Learning |
| 6785 | -- | 6796 | Jiake Leng, Yiyan Zhang, Xiang Liu, Yiming Cui, Junhan Zhao, Yongxin Ge. Error-Robust and Label-Efficient Deep Learning for Understanding Tumor Microenvironment From Spatial Transcriptomics |
| 6797 | -- | 6808 | Qingxuan Shi, Yihang Li, Huijun Di, Enyi Wu. Self-Supervised Interactive Image Segmentation |
| 6809 | -- | 6813 | Shengxi Li, Xuelong Li, Leonardo Chiariglione, Jiebo Luo, Wenwu Wang 0001, Zhengyuan Yang, Danilo P. Mandic, Hamido Fujita. Introduction to the Special Issue on AI-Generated Content for Multimedia |
| 6814 | -- | 6832 | Fatemeh Nazarieh, Zhenhua Feng, Muhammad Awais 0001, Wenwu Wang 0001, Josef Kittler. A Survey of Cross-Modal Visual Content Generation |
| 6833 | -- | 6846 | Chunyi Li, Zicheng Zhang, Haoning Wu 0001, Wei Sun 0029, Xiongkuo Min, Xiaohong Liu 0001, Guangtao Zhai, Weisi Lin. AGIQA-3K: An Open Database for AI-Generated Image Quality Assessment |
| 6847 | -- | 6859 | Yixuan Wang, Wengang Zhou, Jianmin Bao, Weilun Wang, Li Li 0040, Houqiang Li. CLIP2GAN: Toward Bridging Text With the Latent Space of GANs |
| 6860 | -- | 6873 | Hong Chen, Yipeng Zhang 0003, Xin Wang 0019, Xuguang Duan, Yuwei Zhou, Wenwu Zhu 0001. DisenDreamer: Subject-Driven Text-to-Image Generation With Sample-Aware Disentangled Tuning |
| 6874 | -- | 6887 | Jiyao Pu, Haoran Duan, Junzhe Zhao, Yang Long 0001. Rules for Expectation: Learning to Generate Rules via Social Environment Modeling |
| 6888 | -- | 6900 | Jin Liu 0020, Xi Wang 0014, Xiaomeng Fu, Yesheng Chai, Cai Yu, Jiao Dai, Jizhong Han. OSM-Net: One-to-Many One-Shot Talking Head Generation With Spontaneous Head Motions |
| 6901 | -- | 6912 | Cong Jin, Ruolin Zhu, Zixing Zhu, Lu Yang 0006, Min Yang, Jiebo Luo. MtArtGPT: A Multi-Task Art Generation System With Pre-Trained Transformer |
| 6913 | -- | 6925 | Yang Zhao 0002, Huaen Li, Zhao Zhang 0001, Yuan Chen, Qing Liu 0022, Xiaojuan Zhang. Regional Traditional Painting Generation Based on Controllable Disentanglement Model |
| 6926 | -- | 6936 | Yang Yu 0039, Xiaolong Liu, Rongrong Ni, Siyuan Yang, Yao Zhao 0001, Alex C. Kot. PVASS-MDD: Predictive Visual-Audio Alignment Self-Supervision for Multimodal Deepfake Detection |
| 6937 | -- | 6948 | Miao Liu, Jing Wang 0037, Xinyuan Qian, Haizhou Li 0001. Audio-Visual Temporal Forgery Detection Using Embedding-Level Fusion and Multi-Dimensional Contrastive Loss |
| 6949 | -- | 6962 | Yihao Huang 0001, Felix Juefei-Xu, Qing Guo 0005, Yang Liu 0003, Geguang Pu. Dodging DeepFake Detection via Implicit Spatial-Domain Notch Filtering |
| 6963 | -- | 6977 | Qiyuan Du, Yiping Duan, Zhipeng Xie, Xiaoming Tao, Linsu Shi, Zhijuan Jin. Optical Flow-Based Spatiotemporal Sketch for Video Representation: A Novel Framework |
| 6978 | -- | 6992 | Junlong Gao, Chuanmin Jia, Zhimeng Huang, Shanshe Wang, Siwei Ma, Wen Gao 0001. Rate-Distortion Optimized Cross Modal Compression With Multiple Domains |
| 6993 | -- | 7004 | Fangyuan Gao, Xin Deng 0002, Junpeng Jing, Xin Zou, Mai Xu. Extremely Low Bit-Rate Image Compression via Invertible Image Generation |
| 7005 | -- | 7016 | Hefeng Wu, Weifeng Chen, Zhibin Liu, Tianshui Chen, Zhiguang Chen, Liang Lin. Contrastive Transformer Learning With Proximity Data Generation for Text-Based Person Search |
| 7017 | -- | 7028 | Zhen Qin 0002, Yujie Chen, Guosong Zhu, Erqiang Zhou, Yingjie Zhou, Yicong Zhou, Ce Zhu. Enhanced Pseudo-Label Generation With Self-Supervised Training for Weakly-Supervised Semantic Segmentation |
| 7029 | -- | 7040 | Wenxue Guan, Haobo Li, Dawei Xu, Jiaxin Liu, Shenghua Gong, Jun Liu 0006. Frequency Generation for Real-World Image Super-Resolution |
| 7041 | -- | 7056 | Huaizhang Liao, Jingyuan Xia, ZhiXiong Yang, Fulin Pan, Zhen Liu 0004, Yongxiang Liu. Meta-Learning Based Domain Prior With Application to Optical-ISAR Image Translation |
| 7057 | -- | 7068 | Ge Shi 0002, Sinuo Deng, Bo Wang, Chong Feng, Yan Zhuang, Xiaomei Wang. One for All: A Unified Generative Framework for Image Emotion Classification |
| 7069 | -- | 7079 | Chunwei Tian, Menghua Zheng, Bo Li 0004, Yanning Zhang, Shichao Zhang 0001, David Zhang 0001. Perceptive Self-Supervised Learning Network for Noisy Image Watermark Removal |
| 7080 | -- | 7094 | Linjun Li, Tao Jin, Wang Lin, Hao Jiang, Wenwen Pan, Jian Wang, Shuwen Xiao, Yan Xia 0006, Weihao Jiang, Zhou Zhao. Multi-Granularity Relational Attention Network for Audio-Visual Question Answering |
| 7095 | -- | 7105 | Feilong Cao, Lingpeng Wang, Hailiang Ye. SharpGConv: A Novel Graph Method With Plug-and-Play Sharpening Convolution for Point Cloud Registration |
| 7106 | -- | 7120 | Qinghua Ren, Shijian Lu, Qirong Mao, Ming Dong 0001. Exploring Prototype-Anchor Contrast for Semantic Segmentation |
| 7121 | -- | 7134 | Jinxiang Zhu, Qi Wang, Xinyu Dong, Weijian Ruan, Haolin Chen, Liang Lei, Gefei Hao. FSNA: Few-Shot Object Detection via Neighborhood Information Adaption and All Attention |
| 7135 | -- | 7148 | Ziye Fang, Xin Jiang, Hao Tang 0007, Zechao Li. Learning Contrastive Self-Distillation for Ultra-Fine-Grained Visual Categorization Targeting Limited Samples |
| 7149 | -- | 7164 | Yongxi Li, Wenzhong Tang, Shuai Wang, Shengsheng Qian, Changsheng Xu. Distribution-Guided Hierarchical Calibration Contrastive Network for Unsupervised Person Re-Identification |
| 7165 | -- | 7175 | Jiahao Xu, Xinzhu Ma, Lin Zhang, Bo Zhang 0069, Tao Chen 0003. Push-and-Pull: A General Training Framework With Differential Augmentor for Domain Generalized Point Cloud Classification |
| 7176 | -- | 7189 | Kaijie He, Canlong Zhang, Sheng Xie, Zhixin Li 0001, Zhi-wen Wang, Rui-Guo Qin. Target-Aware Tracking With Spatial-Temporal Context Attention |
| 7190 | -- | 7201 | Han Lin, Yingjian Li, Zheng Zhang 0006, Lei Zhu 0002, Yong Xu 0001. Learning With Noisy Labels by Semantic and Feature Space Collaboration |
| 7202 | -- | 7215 | YiBo Zhao, Hua Zhang 0003, Zan Gao, Weili Guan, Meng Wang 0001, Shengyong Chen. A Snippets Relation and Hard-Snippets Mask Network for Weakly-Supervised Temporal Action Localization |
| 7216 | -- | 7230 | Xiaoying Yuan, Tingfa Xu, Xincong Liu, Ying Wang, Haolin Qin, Yuqiang Fang, Jianan Li. Multi-Step Temporal Modeling for UAV Tracking |
| 7231 | -- | 7243 | Jinhong Deng, Wen Li 0001, Lixin Duan. Balanced Teacher for Source-Free Object Detection |
| 7244 | -- | 7258 | Sungjun Jang, Heansung Lee 0001, Woo Jin Kim, Jungho Lee, Sungmin Woo, Sangyoun Lee. Multi-Scale Structural Graph Convolutional Network for Skeleton-Based Action Recognition |
| 7259 | -- | 7271 | Baozhen Sun, Zhenhua Wang 0003, Shilei Wang, Yongkang Cheng, Jifeng Ning. Bidirectional Interaction of CNN and Transformer Feature for Visual Tracking |
| 7272 | -- | 7283 | Yiming Wang 0007, Dongxia Chang, Zhiqiang Fu, Jie Wen 0001, Yao Zhao 0001. Partially View-Aligned Representation Learning via Cross-View Graph Contrastive Network |
| 7284 | -- | 7300 | Yanjie Liang, Haosheng Chen 0001, Qiangqiang Wu, Changqun Xia, Jia Li 0003. Joint Spatio-Temporal Similarity and Discrimination Learning for Visual Tracking |
| 7301 | -- | 7314 | Jiale Zhang, Chengxin Liu, Ke Xian, Zhiguo Cao 0001. Hierarchical Feature Warping and Blending for Talking Head Animation |
| 7315 | -- | 7327 | Kangdao Liu, Xiaolin Xiao, Jinkun You, Yicong Zhou. Robust Discriminative t-Linear Subspace Learning for Image Feature Extraction |
| 7328 | -- | 7343 | Kunchi Li, Hongyang Chen, Jun Wan 0001, Shan Yu. ESDB: Expand the Shrinking Decision Boundary via One-to-Many Information Matching for Continual Learning With Small Memory |
| 7344 | -- | 7358 | Kunpeng Wang, Zhengzheng Tu, Chenglong Li 0002, Cheng Zhang 0010, Bin Luo 0001. Learning Adaptive Fusion Bank for Multi-Modal Salient Object Detection |
| 7359 | -- | 7372 | Xiaoqiang Lu, Licheng Jiao, Lingling Li 0002, Fang Liu 0001, Xu Liu 0006, Shuyuan Yang. Self Pseudo Entropy Knowledge Distillation for Semi-Supervised Semantic Segmentation |
| 7373 | -- | 7385 | Minghua Zhang, Qiuyang Zhang, Wei Song 0007, Dongmei Huang, Qi He 0003. PromptVT: Prompting for Efficient and Accurate Visual Tracking |
| 7386 | -- | 7400 | Tianlu Zhang, Xiaoyi He, Qiang Jiao, Qiang Zhang 0020, Jungong Han. AMNet: Learning to Align Multi-Modality for RGB-T Tracking |
| 7401 | -- | 7416 | Yimei Liu, Qing Cai, Congcong Wang, Jian Yang, Hao Fan 0004, Junyu Dong, Sheng Chen 0001. Geometry-Enhanced Attentive Multi-View Stereo for Challenging Matching Scenarios |
| 7417 | -- | 7429 | Xu Liu, Jianing Li, Jinqiao Shi, Xiaopeng Fan, Yonghong Tian 0001, Debin Zhao. Event-Based Monocular Depth Estimation With Recurrent Transformers |
| 7430 | -- | 7439 | Liqun Lin, Guangpeng Wei, Kanglin Liu, Wanjian Feng, Tiesong Zhao. LightViD: Efficient Video Deblurring With Spatial-Temporal Feature Fusion |
| 7440 | -- | 7453 | Tao Zhou 0002, Yi Zhou 0007, Guangyu Li, Geng Chen 0001, Jianbing Shen. Uncertainty-Aware Hierarchical Aggregation Network for Medical Image Segmentation |
| 7454 | -- | 7466 | Xin Liu, Biao Qian, Haipeng Liu 0004, Dan Guo, Yang Wang 0023, Meng Wang 0001. Seeking False Hard Negatives for Graph Contrastive Learning |
| 7467 | -- | 7483 | Qihua Feng, Peiya Li, ZhiXun Lu, Chaozhuo Li 0001, Zefan Wang, Zhiquan Liu, Chunhui Duan, Feiran Huang, Jian Weng 0001, Philip S. Yu. EViT: Privacy-Preserving Image Retrieval via Encrypted Vision Transformer in Cloud Computing |
| 7484 | -- | 7497 | Dandan Zhan, Jiahao Wu, Xing Luo, Zhi Jin. Learning From Text: A Multimodal Face Inpainting Network for Irregular Holes |
| 7498 | -- | 7511 | Ahmet Burakhan Koyuncu, Panqi Jia, Atanas Boev, Elena Alshina, Eckehard G. Steinbach. Efficient Contextformer: Spatio-Channel Window Attention for Fast Context Modeling in Learned Image Compression |
| 7512 | -- | 7522 | Deqian Mao, Shanshan Gao, Zhenyu Li, Honghao Dai, Yunfeng Zhang 0001, Yuanfeng Zhou. Aggregating Global and Local Representations via Hybrid Transformer for Video Deraining |
| 7523 | -- | 7536 | Pei An, Xuzhong Hu, Junfeng Ding, Jun Zhang, Jie Ma, You Yang, Qiong Liu 0001. OL-Reg: Registration of Image and Sparse LiDAR Point Cloud With Object-Level Dense Correspondences |
| 7537 | -- | 7549 | Lei He 0010, Yongfang Xie, Shiwen Xie, Zhaohui Jiang 0001, Zhipeng Chen. Iterative Self-Guided Image Filtering |
| 7550 | -- | 7565 | Zizhuo Li, Chunbao Su, Fan Fan 0001, Jun Huang 0008, Jiayi Ma 0001. MC-Net: Integrating Multi-Level Geometric Context for Two-View Correspondence Learning |
| 7566 | -- | 7576 | Chunwei Tian, Menghua Zheng, Tiancai Jiao, Wangmeng Zuo, Yanning Zhang, Chia-Wen Lin. A Self-Supervised CNN for Image Watermark Removal |
| 7577 | -- | 7588 | Tianwei Zhou, Songbai Tan, Baoquan Zhao, Guanghui Yue 0001. Multitask Deep Neural Network With Knowledge-Guided Attention for Blind Image Quality Assessment |
| 7589 | -- | 7600 | Yu Tian, Shiqi Wang 0001, Baoliang Chen, Sam Kwong. Causal Representation Learning for GAN-Generated Face Image Quality Assessment |
| 7601 | -- | 7613 | Weiwei Zhang, Yufeng Guo, Junhuang Wang, Jianqing Zhu, Huanqiang Zeng. Collaborative Knowledge Distillation |
| 7614 | -- | 7627 | Zhengeng Yang, Hongshan Yu, Wei Sun 0028, Li Cheng 0001, Ajmal Mian. Domain-Invariant Prototypes for Semantic Segmentation |
| 7628 | -- | 7642 | Mengxin Gong, Xiuli Chai, Yang Lu 0013, Yushu Zhang. Exploiting Four-Dimensional Chaotic Systems With Dissipation and Optimized Logical Operations for Secure Image Compression and Encryption |
| 7643 | -- | 7656 | Jiaqing Zhang, Jie Lei 0001, Weiying Xie, Geng Yang, Daixun Li, Yunsong Li. Multimodal Informative ViT: Information Aggregation and Distribution for Hyperspectral and LiDAR Classification |
| 7657 | -- | 7670 | Zhuyang Xie, Yan Yang 0001, Jie Wang, Xiaorong Liu, Xiaofan Li. Trustworthy Multimodal Fusion for Sentiment Analysis in Ordinal Sentiment Space |
| 7671 | -- | 7682 | Nianzu Qiao, Changyin Sun, Lu Dong 0002, Quanbo Ge. Semi-Supervised Feature Distillation and Unsupervised Domain Adversarial Distillation for Underwater Image Enhancement |
| 7683 | -- | 7698 | Tengfei Liang, Yi Jin 0001, Wu Liu, Tao Wang 0011, Songhe Feng, Yidong Li. Bridging the Gap: Multi-Level Cross-Modality Joint Alignment for Visible-Infrared Person Re-Identification |
| 7699 | -- | 7711 | Hao Zhang, Yujie Dun, Yixuan Pei, Shenqi Lai, Chengxu Liu, Kaipeng Zhang, Xueming Qian. HF-HRNet: A Simple Hardware Friendly High-Resolution Network |
| 7712 | -- | 7724 | Xiao Lu 0002, Yulin Yuan, Xing Liu, Lucai Wang, Xuanyu Zhou, Yimin Yang. Low-Light Salient Object Detection by Learning to Highlight the Foreground Objects |
| 7725 | -- | 7741 | Chao Li, Shanzhi Yin, Chuanmin Jia, Fanyang Meng, Yonghong Tian 0001, Yongsheng Liang. Multirate Progressive Entropy Model for Learned Image Compression |
| 7742 | -- | 7755 | Guangjin Pan, Shugong Xu, Shunqing Zhang, Xiaojing Chen 0001, Yanzan Sun. Quality of Experience Oriented Cross-Layer Optimization for Real-Time XR Video Transmission |
| 7756 | -- | 7770 | Yihang Zhang, Sheng Cheng, Zongming Guo, Xinggong Zhang. Inferring Video Streaming Quality of Real-Time Communication Inside Network |
| 7771 | -- | 7784 | Mingcong Lu, Ruifan Li, Fangxiang Feng, Zhanyu Ma, Xiaojie Wang 0006. LGR-NET: Language Guided Reasoning Network for Referring Expression Comprehension |
| 7785 | -- | 7800 | Jiang Yu, Fengyong Li, Zichi Wang, Wen Si, Xinpeng Zhang 0001. Diverse Batch Steganography Using Model-Based Selection and Double-Layered Payload Assignment |