Journal: IEEE Trans. Circuits Syst. Video Techn.

Volume 33, Issue 9

4448 -- 4451Liqiang Nie, Jianlong Wu, Nicu Sebe, Kiyoharu Aizawa. Guest Editorial Introduction to the Special Issue on Video Transformers
4452 -- 4461Wei Wang, Xin Yang 0008, Jinhui Tang 0001. Vision Transformer With Hybrid Shifted Windows for Gastrointestinal Endoscopy Image Classification
4462 -- 4471Yang Yu, Rongrong Ni, Yao Zhao 0001, Siyuan Yang, Fen Xia, Ning Jiang, Guoqing Zhao. MSVT: Multiple Spatiotemporal Views Transformer for DeepFake Video Detection
4472 -- 4483Weili Guan, Xuemeng Song, Kejie Wang, Haokun Wen, Hongda Ni, Yaowei Wang, Xiaojun Chang. Egocentric Early Action Prediction via Multimodal Transformer-Based Dual Action Prediction
4484 -- 4495Bofeng Wu, Buyu Liu, Peng Huang, Jun Bao, Peng Xi, Jun Yu 0002. Concept Parser With Multimodal Graph Learning for Video Captioning
4496 -- 4506Fan Zhang 0045, Gongguan Chen, Hua Wang, Jinjiang Li, Caiming Zhang 0001. Multi-Scale Video Super-Resolution Transformer With Polynomial Approximation
4507 -- 4517Feng Xue, Yu Li, Deyin Liu, Yincen Xie, Lin Wu 0001, Richang Hong. LipFormer: Learning to Lipread Unseen Speakers Based on Visual-Landmark Transformers
4518 -- 4528Mingqi Gao 0003, Jinyu Yang, Jungong Han, Ke Lu, Feng Zheng, Giovanni Montana. Decoupling Multimodal Transformers for Referring Video Object Segmentation
4529 -- 4541Rong Wang, Zongheng Tang, Qianli Zhou, Xiaoqian Liu, Tianrui Hui, Quange Tan, Si Liu 0001. Unified Transformer With Isomorphic Branches for Natural Language Tracking
4542 -- 4551Yuhui Zheng, Yan Zhang, Bin Xiao 0002. Target-Aware Transformer Tracking
4552 -- 4563Guanlin Chen, Pengfei Zhu, Bing Cao, Xing Wang, Qinghua Hu. Cross-Drone Transformer Network for Robust Single Object Tracking
4564 -- 4576Di Gai, Runyang Feng, Weidong Min, Xiaosong Yang, Pengxiang Su, Qi Wang, Qing Han. Spatiotemporal Learning Transformer for Video-Based Human Pose Estimation
4577 -- 4587Haipeng Chen 0002, Jiahui Hu, Wenyin Zhang, Pengxiang Su. Spatiotemporal Consistency Learning From Momentum Cues for Human Motion Prediction
4588 -- 4602Wenfei Wan, Dengjia Huang, Bin Shang, Shengyu Wei, Hong Ren Wu, Jinjian Wu, Guangming Shi. Depth Perception Assessment of 3D Videos Based on Stereoscopic and Spatial Orientation Structural Features
4603 -- 4615Yujie Hu, Yinhuai Wang, Jian Zhang 0018. DEAR-GAN: Degradation-Aware Face Restoration With GAN Prior
4616 -- 4629Chengcheng Ma, Yang Liu, Jiankang deng, Lingxi Xie, Weiming Dong, Changsheng Xu. Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models
4630 -- 4644Jiaxin Yao, Yongqiang Zhao 0001, Yuanyang Bu, Seong G. Kong, Jonathan Cheung-Wai Chan. Laplacian Pyramid Fusion Network With Hierarchical Guidance for Infrared and Visible Image Fusion
4645 -- 4659Ying Yang, Tao Xiang, Shangwei Guo, Xiao Lv, Hantao Liu, Xiaofeng Liao 0001. EHNQ: Subjective and Objective Quality Evaluation of Enhanced Night-Time Images
4660 -- 4674Ying Huang, Hu Guan, Jie Liu 0028, Shuwu Zhang, Baoning Niu, Guixuan Zhang. Robust Texture-Aware Local Adaptive Image Watermarking With Perceptual Guarantee
4675 -- 4688Wei Wu 0019, Yong Liu, Zhu Li 0001. Subband Differentiated Learning Network for Rain Streak Removal
4689 -- 4702Yining Su, Lin Teng, Pengbo Liu, Salahuddin Unar, Xingyuan Wang 0001, XianPing Fu. Visualized Multiple Image Selection Encryption Based on Log Chaos System and Multilayer Cellular Automata Saliency Detection
4703 -- 4714Yuyuan Zeng, Bowen Zhao, Shanzhao Qiu, Tao Dai 0001, Shu-Tao Xia. Toward Effective Image Manipulation Detection With Proposal Contrastive Learning
4715 -- 4727Lu Sun, Yichen Wang, Fangfang Wu, Xin Li 0005, Weisheng Dong, Guangming Shi. Deep Unfolding Network for Efficient Mixed Video Noise Removal
4728 -- 4740Xin Zhou, Xiao-wen Liu, Gong Zhang, Luliang Jia, Xu Wang 0015, Zhiyuan Zhao. An Iterative Threshold Algorithm of Log-Sum Regularization for Sparse Problem
4741 -- 4753Bo Jiang, Yao Lu, Bob Zhang 0001, Guangming Lu. Few-Shot Learning for Image Denoising
4754 -- 4768Pei Geng, Xuequan Lu, Chunyu Hu, Hong Liu 0013, Lei Lyu. Focusing Fine-Grained Action by Self-Attention-Enhanced Graph Neural Networks With Contrastive Learning
4769 -- 4783Alejandro López-Cifuentes, Marcos Escudero-Viñolo, Jesús Bescós, Juan C. SanMiguel. Attention-Based Knowledge Distillation in Scene Recognition: The Impact of a DCT-Driven Loss
4784 -- 4797Zheng Zhou, Yongyong Chen, Yicong Zhou. Deep Dynamic Memory Augmented Attentional Dictionary Learning for Image Denoising
4798 -- 4811Leida Li, Yipo Huang, Jinjian Wu, Yuzhe Yang, Yaqian Li, Yandong Guo, Guangming Shi. Theme-Aware Visual Attribute Reasoning for Image Aesthetics Assessment
4812 -- 4824Ninghui Xu, Lihui Wang 0003, Jiajia Zhao, Zhiting Yao. Denoising for Dynamic Vision Sensor Based on Augmented Spatiotemporal Correlation
4825 -- 4839Runzhe Zhu, Ling Yin, Mingze Yang, Fei Wu 0006, Yuncheng Yang, Wenbo Hu. SUES-200: A Multi-Height Multi-Scene Cross-View Image Benchmark Across Drone and Satellite
4840 -- 4854Haoning Wu, Chaofeng Chen, Liang Liao, Jingwen Hou, Wenxiu Sun, Qiong Yan, Weisi Lin. DisCoVQA: Temporal Distortion-Content Transformers for Video Quality Assessment
4855 -- 4867Zicheng Feng, Wenlong Zhang, Shunkun Liang, Qifeng Yu. Deep Video Super-Resolution Using Hybrid Imaging System
4868 -- 4880Zhi-Yong Wang, Xiao-peng Li, Hing-Cheung So, Abdelhak M. Zoubir. Adaptive Rank-One Matrix Completion Using Sum of Outer Products
4881 -- 4892Huapeng Wu, Jie Gui, Jun Zhang 0024, James T. Kwok, Zhihui Wei. Feedback Pyramid Attention Networks for Single Image Super-Resolution
4893 -- 4906Kai Zeng, Kejiang Chen, Weiming Zhang 0001, Yaofei Wang, Nenghai Yu. Robust Steganography for High Quality Images
4907 -- 4920Zenan Shi, Haipeng Chen 0002, Dong Zhang. Transformer-Auxiliary Neural Networks for Image Manipulation Localization by Operator Inductions
4921 -- 4933Shihao Zou, Yuanlu Xu, Chao Li 0021, Lingni Ma, Li Cheng 0001, Minh Vo. Snipper: A Spatiotemporal Transformer for Simultaneous Multi-Person 3D Pose Estimation Tracking and Forecasting on a Video Snippet
4934 -- 4947Yu Liu 0023, Haihang Li, Juan Cheng, Xun Chen 0001. MSCAF-Net: A General Framework for Camouflaged Object Detection via Learning Multi-Scale Context-Aware Features
4948 -- 4961Jun Li 0043, Yuquan Bi, Sumei Wang, Qiming Li. CFRLA-Net: A Context-Aware Feature Representation Learning Anchor-Free Network for Pedestrian Detection
4962 -- 4972Huafeng Li, Minghui Liu, Zhanxuan Hu, Feiping Nie 0001, Zhengtao Yu 0001. Intermediary-Guided Bidirectional Spatial-Temporal Aggregation Network for Video-Based Visible-Infrared Person Re-Identification
4973 -- 4984Shuang Li, Lichun Wang 0002, Shaofan Wang, Dehui Kong, Baocai Yin. Hierarchical Coupled Discriminative Dictionary Learning for Zero-Shot Learning
4985 -- 4996Zhuoxu Huang, Zhiyou Zhao, Banghuai Li, Jungong Han. LCPFormer: Towards Effective 3D Point Cloud Analysis via Local Context Propagation in Transformers
4997 -- 5008Hao Zhang, Shenqi Lai, Yaxiong Wang, Zongyang Da, Yujie Dun, Xueming Qian. SCGNet: Shifting and Cascaded Group Network
5009 -- 5021Ruyi Ji, Jiaying Li, Libo Zhang 0001, Jing Liu, Yanjun Wu. Dual Transformer With Multi-Grained Assembly for Fine-Grained Visual Classification
5022 -- 5035Hao Ren, Ziqiang Zheng, Yang Wu 0001, Hong Lu 0001, Yang Yang 0002, Ying Shan, Sai Kit Yeung. ACNet: Approaching-and-Centralizing Network for Zero-Shot Sketch-Based Image Retrieval
5036 -- 5048Weiping Xiao, Yan Peng 0001, Chang Liu, Jiantao Gao, Yiqiang Wu, Xiaomao Li. Balanced Sample Assignment and Objective for Single-Model Multi-Class 3D Object Detection
5049 -- 5061Chenglong Zhao, Yunxiang Zhang, Bingbing Ni. Exploiting Channel Similarity for Network Pruning
5062 -- 5075Fei Zhou, Wei Wei 0008, Lei Zhang 0054, Yanning Zhang. Learning to Class-Adaptively Manipulate Embeddings for Few-Shot Learning
5076 -- 5088Zexing Du, Xue Wang 0006, Qing Wang 0006. Self-Supervised Global Spatio-Temporal Interaction Pre-Training for Group Activity Recognition
5089 -- 5101Xinchen Ye, Jinyi Zhang, Yazhi Yuan, Rui Xu 0002, Zhihui Wang, Haojie Li. Underwater Depth Estimation via Stereo Adaptation Networks
5102 -- 5116Chuanming Tang, Xiao Wang 0014, Yuanchao Bai, Zhe Wu, Jianlin Zhang, Yongmei Huang. Learning Spatial-Frequency Transformer for Visual Object Tracking
5117 -- 5132Yan Jin, Fang Gao, Jun Yu 0001, Jiabao Wang, Feng Shuang 0002. Multi-Object Tracking: Decoupling Features to Solve the Contradictory Dilemma of Feature Requirements
5133 -- 5147Jing Li, Liu Yang, Qilong Wang, Qinghua Hu. WDAN: A Weighted Discriminative Adversarial Network With Dual Classifiers for Fine-Grained Open-Set Domain Adaptation
5148 -- 5159Huanjie Tao, Qianyue Duan, Jianfeng An. An Adaptive Interference Removal Framework for Video Person Re-Identification
5160 -- 5173Haihong Xiao, Yuqiong Li, Wenxiong Kang, Qiuxia Wu. Distinguishing and Matching-Aware Unsupervised Point Cloud Completion
5174 -- 5185Zhilei Li, Jun Li 0072, Yuqing Ma, Rui Wang 0024, Zhi-Ping Shi 0002, Yifu Ding, Xianglong Liu 0001. Spatio-Temporal Adaptive Network With Bidirectional Temporal Difference for Action Recognition
5186 -- 5199Yi Hou, Shanghang Zhang, Rui Ma, Huizhu Jia, Xiaodong Xie. Frame-Recurrent Video Crowd Counting
5200 -- 5211Mingrui Zhu, Zicheng Wu, Nannan Wang 0001, Heng Yang, Xinbo Gao 0001. Dual Conditional Normalization Pyramid Network for Face Photo-Sketch Synthesis
5212 -- 5226Sheng Cheng, Han Hu, Xinggong Zhang. ABRF: Adaptive BitRate-FEC Joint Control for Real-Time Video Streaming
5227 -- 5241Yi Chen, Meng Wang 0017, Shiqi Wang 0001, Zhangkai Ni, Sam Kwong. A CTU-Level Screen Content Rate Control for Low-Delay Versatile Video Coding
5242 -- 5256Yunlong Li, Xinfeng Zhang 0001, Chen Cui, Shanshe Wang, Siwei Ma. Fleet: Improving Quality of Experience for Low-Latency Live Video Streaming
5257 -- 5270Huaiwen Zhang, Yang Yang, Fan Qi, Shengsheng Qian, Changsheng Xu. Debiased Video-Text Retrieval via Soft Positive Sample Calibration
5271 -- 5280Jiwei Wei, Yang Yang 0002, Xing Xu 0001, Jingkuan Song, Guoqing Wang 0001, Heng Tao Shen. Less is Better: Exponential Loss for Cross-Modal Matching
5281 -- 5295Xin Sun, Jialin Gao, Yizhe Zhu, Xuan Wang, Xi Zhou. Video Moment Retrieval via Comprehensive Relation-Aware Network
5296 -- 5308Rong-Cheng Tu, Jie Jiang, Qinghong Lin, Chengfei Cai, Shangxuan Tian, Hongfa Wang, Wei Liu 0005. Unsupervised Cross-Modal Hashing With Modality-Interaction
5309 -- 5317Sung-Jun Min, Kyeongbo Kong, Suk-Ju Kang. Out-of-Focus Image Deblurring for Mobile Display Vision Inspection
5318 -- 5329Mixiao Hou, Zheng Zhang 0006, Chang Liu, Guangming Lu. Semantic Alignment Network for Multi-Modal Emotion Recognition

Volume 33, Issue 8

3560 -- 3569Xiaofeng Yang, Fengmao Lv, Fayao Liu, Guosheng Lin. Self-Training Vision Language BERTs With a Unified Conditional Model
3570 -- 3584Zhiqiang Bao, Shunzhi Yang, Zhenhua Huang, MengChu Zhou, Yunwen Chen. A Lightweight Block With Information Flow Enhancement for Convolutional Neural Networks
3585 -- 3595Yue Wu 0004, Xidao Hu, Yue Zhang, Maoguo Gong, Wenping Ma 0001, Qiguang Miao. SACF-Net: Skip-Attention Based Correspondence Filtering Network for Point Cloud Registration
3596 -- 3607Peng Xing, Zechao Li. Visual Anomaly Detection via Partition Memory Bank Module and Error Estimation
3608 -- 3621Yuzhen Niu, Zhihua Lin, Wenxi Liu, Wenzhong Guo. Progressive Moire Removal and Texture Complementation for Image Demoireing
3622 -- 3637Mengyao Li, Kun Wang, Liquan Shen, Yufei Lin, Zhengyong Wang, Qijie Zhao. UIALN: Enhancement for Underwater Image With Artificial Light
3638 -- 3648Zixuan Hu, Li Shen 0008, Shenqi Lai, Chun Yuan. Task-Adaptive Feature Disentanglement and Hallucination for Few-Shot Classification
3649 -- 3662Qiang Qi, Tianxiang Hou, Yan Yan 0001, Yang Lu 0009, Hanzi Wang. TCNet: A Novel Triple-Cooperative Network for Video Object Detection
3663 -- 3676Guilin Pang, Baopeng Zhang, Zhu Teng, Zige Qi, Jianping Fan 0001. MRE-Net: Multi-Rate Excitation Network for Deepfake Video Detection
3677 -- 3688Zhishe Wang, Wenyu Shao, Yanlin Chen, Jiawei Xu 0004, Lei Zhang 0168. A Cross-Scale Iterative Attentional Adversarial Fusion Network for Infrared and Visible Images
3689 -- 3700Jin Tang, Jin Zhang, Rui Ding, Baoxuan Gu, Jianqin Yin. Collaborative Multi-Dynamic Pattern Modeling for Human Motion Prediction
3701 -- 3712Nanfeng Jiang, Junhong Lin, Ting Zhang, Haifeng Zheng, Tiesong Zhao. Low-Light Image Enhancement via Stage-Transformer-Guided Network
3713 -- 3725Zhi Han, Shaojie Zhang, Zhiyu Liu, Yanmei Wang, Junping Yao, Yao Wang 0003. Tensor Robust Principal Component Analysis With Side Information: Models and Applications
3726 -- 3736Praveen Kandula, Maitreya Suin, A. N. Rajagopalan 0001. Illumination-Adaptive Unpaired Low-Light Enhancement
3737 -- 3746Xuepeng Chang, Huihui Pan, Weichao Sun, Huijun Gao. A Multi-Phase Camera-LiDAR Fusion Network for 3D Semantic Segmentation With Weak Supervision
3747 -- 3758Zhuohang Dang, Minnan Luo, Chengyou Jia, Caixia Yan, Xiaojun Chang, Qinghua Zheng. Counterfactual Generation Framework for Few-Shot Learning
3759 -- 3773Binbin Song, Jiantao Zhou 0001, Xiangyu Chen, Shile Zhang. Real-Scene Reflection Removal With RAW-RGB Image Pairs
3774 -- 3785Hongzu Su, Jingjing Li 0001, Ke Lu 0001, Lei Zhu 0002, Heng Tao Shen. Dual-Aligned Feature Confusion Alleviation for Generalized Zero-Shot Learning
3786 -- 3798Xu Zhang, Nian Cai, Huan Zhang, Yun Zhang 0002, Jianglei Di, Weisi Lin. AFD-Former: A Hybrid Transformer With Asymmetric Flow Division for Synthesized View Quality Enhancement
3799 -- 3810Feng Gao, Yeyun Cai, Fang Deng, Chengpu Yu, Jie Chen 0003. Feature Alignment in Anchor-Free Object Detection
3811 -- 3821Neng Zhang, Ebroul Izquierdo. A Four-Point Camera Calibration Method for Sport Videos
3822 -- 3832Yihong Cao, Hui Zhang 0023, Xiao Lu, Yurong Chen 0003, Zheng Xiao, Yaonan Wang 0001. Adaptive Refining-Aggregation-Separation Framework for Unsupervised Domain Adaptation Semantic Segmentation
3833 -- 3847Ying He 0013, Dongheng Zhang, Yan Chen 0007. 3D Radio Imaging Under Low-Rank Constraint
3848 -- 3859Dongliang Chen, Guihua Wen, Huihui Li, Rui Chen, Cheng Li. Multi-Relations Aware Network for In-the-Wild Facial Expression Recognition
3860 -- 3871Yuecong Xu, Jianfei Yang, Haozhi Cao, Keyu Wu 0002, Min Wu 0008, Zhengguo Li, Zhenghua Chen. Multi-Source Video Domain Adaptation With Temporal Attentive Moment Alignment Network
3872 -- 3887Hao Yang 0016, Zihan Yang, Anyong Hu, Che Liu, Tie Jun Cui, Jungang Miao. Unifying Convolution and Transformer for Efficient Concealed Object Detection in Passive Millimeter-Wave Images
3888 -- 3898Liangpeng Hu, Yating Kong, Jide Li, Xiaoqiang Li. Effective Local-Global Transformer for Natural Image Matting
3899 -- 3911Zengmao Wang, Zixi Chen, Bo Du 0001. Active Learning With Co-Auxiliary Learning and Multi-Level Diversity for Image Classification
3912 -- 3923Yatong Chen, Hongwei Ge, Yuxuan Liu, Xinye Cai, Liang Sun 0003. AGPN: Action Granularity Pyramid Network for Video Action Recognition
3924 -- 3934Yuanqi Chen, Cece Jin, Ge Li 0002, Thomas H. Li, Wei Gao 0003. Mitigating Label Noise in GANs via Enhanced Spectral Normalization
3935 -- 3946Junliang Chen, Weizeng Lu, Yuexiang Li, LinLin Shen, Jinming Duan 0001. Adversarial Learning of Object-Aware Activation Map for Weakly-Supervised Semantic Segmentation
3947 -- 3961Zican Zha, Hao Tang 0007, Yunlian Sun, Jinhui Tang 0001. Boosting Few-Shot Fine-Grained Recognition With Background Suppression and Foreground Alignment
3962 -- 3975Chongben Tao, Jiecheng Cao, Chen Wang 0041, Zufeng Zhang, Zhen Gao. Pseudo-Mono for Monocular 3D Object Detection in Autonomous Driving
3976 -- 3988Shubo Zhou, Liang Hu, Yunlong Wang, Zhenan Sun, Kunbo Zhang, Xueqin Jiang 0001. AIF-LFNet: All-in-Focus Light Field Super-Resolution Method Considering the Depth-Varying Defocus
3989 -- 4001Chunlan Zhang, Chunyu Lin, Kang Liao, Lang Nie, Yao Zhao 0001. As-Deformable-As-Possible Single-Image-Based View Synthesis Without Depth Prior
4002 -- 4010Guangli Ren, Wenjie Geng, Peiyu Guan, Zhiqiang Cao, Junzhi Yu. Pixel-Wise Grasp Detection via Twin Deconvolution and Multi-Dimensional Attention
4011 -- 4026Zuyi Wang, Wenjun Zhu, Wei Zhao, Li Xu. Balanced One-Stage Object Detection by Enhancing the Effect of Positive Samples
4027 -- 4040Hanyi Wang, Zihan Liu, Shilin Wang. Exploiting Complementary Dynamic Incoherence for DeepFake Video Detection
4041 -- 4053Bo Peng 0007, Mingliang Zhang, Jianjun Lei, Huazhu Fu, Haifeng Shen, Qingming Huang. RGB-D Human Matting: A Real-World Benchmark Dataset and a Baseline Method
4054 -- 4069Keyu Deng, Congxuan Zhang, Zhen Chen 0004, Weiming Hu, Bing Li 0001, Feng Lu. Jointing Recurrent Across-Channel and Spatial Attention for Multi-Object Tracking With Block-Erasing Data Augmentation
4070 -- 4082Jian-Xun Mi, Yun Gao, Shiyao Yuan, Weisheng Li 0001. Accurate and Robust Eye Center Localization by Deep Voting
4083 -- 4095Fukun Yin, Zilong Huang, Tao Chen 0003, Guozhong Luo, Gang Yu, Bin Fu. DCNet: Large-Scale Point Cloud Semantic Segmentation With Discriminative and Efficient Feature Aggregation
4096 -- 4107Guoqing Zhang, Hongwei Zhang, Weisi Lin, Arun Kumar Chandran, Xuan Jing. Camera Contrast Learning for Unsupervised Person Re-Identification
4108 -- 4121Zhihao Duan, Zhan Ma, Fengqing Zhu 0001. Unified Architecture Adaptation for Compressed Domain Semantic Inference
4122 -- 4136Yifei Xu, Xiangshun Li, Litong Pan, Weiguang Sang, Pingping Wei, Li Zhu. Self-Supervised Adversarial Video Summarizer With Context Latent Sequence Learning
4137 -- 4148Haowei Liu, Yongcheng Liu, Yuxin Chen, Chunfeng Yuan, Bing Li 0001, Weiming Hu. TranSkeleton: Hierarchical Spatial-Temporal Transformer for Skeleton-Based Action Recognition
4149 -- 4163Zhengxuan Xie, Feng Shao, Gang Chen, Hangwei Chen, Qiuping Jiang, Xiangchao Meng, Yo-Sung Ho. Cross-Modality Double Bidirectional Interaction and Fusion Network for RGB-T Salient Object Detection
4164 -- 4176Zaiyu Pan, Jun Wang, Zhengwen Shen, Shuyu Han. Disentangled Representation and Enhancement Network for Vein Recognition
4177 -- 4189Yin-Ping Zhao, Xiangfeng Dai, Zhen Wang 0004, Xuelong Li 0001. Subspace Clustering via Adaptive Non-Negative Representation Learning and Its Application to Image Segmentation
4190 -- 4203Xin Wang, Yue Zhan, Yang Zhao, Tangwen Yang, Qiuqi Ruan. Semi-Supervised Crowd Counting With Spatial Temporal Consistency and Pseudo-Label Filter
4204 -- 4216Haoxuan Ding, Junyu Gao 0001, Yuan Yuan 0001, Qi Wang 0009. Boosting One-Stage License Plate Detector via Self-Constrained Contrastive Aggregation
4217 -- 4231Gang Yan, Zijin Wang, Shuze Geng, Yang Yu 0022, Yingchun Guo. Part-Based Representation Enhancement for Occluded Person Re-Identification
4232 -- 4243Zhiqi Yu, Jingjing Li 0001, Lei Zhu 0002, Ke Lu 0001, Heng Tao Shen. Classification Certainty Maximization for Unsupervised Domain Adaptation
4244 -- 4256De Cheng, Gerong Wang, Nannan Wang 0001, Dingwen Zhang, Qiang Zhang 0020, Xinbo Gao 0001. Discriminative and Robust Attribute Alignment for Zero-Shot Learning
4257 -- 4268Jing Zhang 0041, Yingshuai Xie, Weichao Ding, Zhe Wang 0002. Cross on Cross Attention: Deep Fusion Transformer for Image Captioning
4269 -- 4278Stefano Battista, Guido Meardi, Simone Ferrara, Lorenzo Ciccarelli, Florian Maurer 0005, Massimo Conti, Simone Orcioni. Verification Test of the Low Complexity Enhancement Video Coding (LCEVC) Standard
4279 -- 4293Alexandre Tissier, Wassim Hamidouche, Souhaiel Belhadj Dit Mdalsi, Jarno Vanne, Franck Galpin, Daniel Ménard. Machine Learning Based Efficient QT-MTT Partitioning Scheme for VVC Intra Encoders
4294 -- 4308Fei Song, Ge Li 0002, Xiaodong Yang, Wei Gao 0003, Shan Liu 0001. Block-Adaptive Point Cloud Attribute Coding With Region-Aware Optimized Transform
4309 -- 4321Haisheng Fu, Feng Liang 0001, Jie Liang 0001, Binglin Li, Guohe Zhang, Jingning Han. Asymmetric Learned Image Compression With Multi-Scale Residual Block, Importance Scaling, and Post-Quantization Filtering
4322 -- 4336Chuntao Wang, Juan Hu, Shan Bian, Jiangqun Ni, Xinpeng Zhang 0001. A Customized Deep Network Based Encryption-Then-Lossy-Compression Scheme of Color Images Achieving Arbitrary Compression Ratios
4337 -- 4348Dat Thanh Nguyen, André Kaup. Lossless Point Cloud Geometry and Attribute Compression Using a Learned Conditional Probability Model
4349 -- 4361Adrian Dziembowski, Dawid Mieloch, Jun Young Jeong, Gwangsoon Lee. Immersive Video Postprocessing for Efficient Video Coding
4362 -- 4374Jinkuan Zhu, Pengpeng Zeng, Lianli Gao, Gongfu Li, Dongliang Liao, Jingkuan Song. Complementarity-Aware Space Learning for Video-Text Retrieval
4375 -- 4387Jing-Ming Guo, Sankarasrinivasan Seshathiri. Visually Encrypted Watermarking for Ordered-Dithered Clustered-Dot Halftones
4388 -- 4400Suo Gao, Rui Wu, Xingyuan Wang 0001, Jiafeng Liu, Qi Li 0029, Chunpeng Wang, Xianglong Tang. Asynchronous Updating Boolean Network Encryption Algorithm
4401 -- 4414Dongheng Zhang, Jia Meng, Jian Zhang, Xinzhe Deng, Shouhong Ding, Man Zhou, Qian Wang 0002, Qi Li 0002, Yan Chen 0007. SonarGuard: Ultrasonic Face Liveness Detection on Mobile Devices
4415 -- 4428Zhenyu Cui, Jiahuan Zhou, Yuxin Peng, Shiliang Zhang, Yaowei Wang. DCR-ReID: Deep Component Reconstruction for Cloth-Changing Person Re-Identification
4429 -- 4434A. Venkata Subramanyam. Meta Generative Attack on Person Reidentification
4435 -- 4440Yue Liu, Zhangkai Ni, Shiqi Wang 0001, Hanli Wang, Sam Kwong. High Dynamic Range Image Quality Assessment Based on Frequency Disparity
4441 -- 4445Pingping Zhang, Shiqi Wang 0001, Meng Wang 0017, Jiguo Li 0002, Xu Wang 0006, Sam Kwong. Rethinking Semantic Image Compression: Scalable Representation With Cross-Modality Transfer

Volume 33, Issue 7

3043 -- 3054Taotao Lai, Yizhang Liu, Jie Chang, Lifang Wei, Zuoyong Li, Hamido Fujita. Guided Sampling for Multistructure Data via Neighborhood Consensus and Residual Sorting
3055 -- 3070Hangwei Chen, Feng Shao, Xiongli Chai, Yuese Gu, Qiuping Jiang, Xiangchao Meng, Yo-Sung Ho. Quality Evaluation of Arbitrary Style Transfer: Subjective Study and Objective Metric
3071 -- 3086Jianshuang Xu, Christian Brauers, Johannes Klein, Jörn Jochims, Rüdiger Kays. Symbol Position Recovery for Optical Camera Communication With High-Density Matrix Codes
3087 -- 3103Xu Huang, Xutao Li, Yunming Ye, Shanshan Feng, Chuyao Luo, Bowen Zhang 0005. On Understanding of Spatiotemporal Prediction Model
3104 -- 3118Kechen Song, Liming Huang, Aojun Gong, Yunhui Yan. Multiple Graph Affinity Interactive Network and a Variable Illumination Dataset for RGBT Image Salient Object Detection
3119 -- 3132Tingting Xu, Xiaoyu Kong, Qiangqiang Shen, Yongyong Chen, Yicong Zhou. Deep and Low-Rank Quaternion Priors for Color Image Processing
3133 -- 3144Yuhui Quan, Xiaoheng Tan, Yan Huang 0031, Yong Xu 0007, Hui Ji. Image Desnowing via Deep Invertible Separation
3145 -- 3158Satoshi Kosugi, Toshihiko Yamasaki. Crowd-Powered Photo Enhancement Featuring an Active Learning Based Local Filter
3159 -- 3172Wei Tang, Fazhi He, Yu Liu 0023, Yansong Duan, Tongzhen Si. DATFuse: Infrared and Visible Image Fusion via Dual Attention Transformer
3173 -- 3184Ze Zhou, Yinghui Sun, Quansen Sun, Chaobo Li, Zhenwen Ren. Only Once Attack: Fooling the Tracker With Adversarial Template
3185 -- 3198Guowen Xu, Guanlin Li, Shangwei Guo, Tianwei Zhang 0004, Hongwei Li 0001. Secure Decentralized Image Classification With Multiparty Homomorphic Encryption
3199 -- 3213Ze Fu, Changmeng Zheng, Junhao Feng, Yi Cai 0001, Xiao-Yong Wei, Yaowei Wang, Qing Li 0001. DRAKE: Deep Pair-Wise Relation Alignment for Knowledge-Enhanced Multimodal Scene Graph Generation in Social Media Posts
3214 -- 3228Yaxian Wang, Bifan Wei, Jun Liu 0002, Qika Lin, Lingling Zhang, Yaqiang Wu. Spatial-Semantic Collaborative Graph Network for Textbook Question Answering
3229 -- 3242Chao Shang, Hongliang Li 0001, Heqian Qiu, Qingbo Wu 0001, Fanman Meng, Taijin Zhao, King Ngi Ngan. Cross-Modal Recurrent Semantic Comprehension for Referring Image Segmentation
3243 -- 3256Md. Moniruzzaman, Zhaozheng Yin, Zhihai He, Ming C. Leu, Ruwen Qin. Jointly-Learnt Networks for Future Action Anticipation via Self-Knowledge Distillation and Cycle Consistency
3257 -- 3269Yueyi Zhu, Yongqiang Zhang 0007, Mingli Ding, Wangmeng Zuo. Uncertainty-Aware Graph-Guided Weakly Supervised Object Detection
3270 -- 3283Guiyu Xia, Dong Luo, Zeyuan Zhang, Yubao Sun, Qingshan Liu 0001. 3D Information Guided Motion Transfer via Sequential Image Based Human Model Refinement and Face-Attention GAN
3284 -- 3295Yuanjie Shao, Wenxiao Wu, Xinge You, Changxin Gao, Nong Sang. Improving the Generalization of MAML in Few-Shot Classification via Bi-Level Constraint
3296 -- 3307Zhihao Peng 0002, Hui Liu 0032, Yuheng Jia, Junhui Hou. Deep Attention-Guided Graph Clustering With Dual Self-Supervision
3308 -- 3318Dingyuan Zheng, Jimin Xiao, Mingjie Sun, Huihui Bai, Junhui Hou. Plausible Proxy Mining With Credibility for Unsupervised Person Re-Identification
3319 -- 3332Mingkang Tang, Zhanyu Wang, Zhaoyang Zeng, Xiu Li, Luping Zhou. Stay in Grid: Improving Video Captioning via Fully Grid-Level Representation
3333 -- 3342Zhengcen Li, Yueran Li, Linlin Tang, Tong Zhang, Jingyong Su. Two-Person Graph Convolutional Network for Skeleton-Based Human Interaction Recognition
3343 -- 3357Lin Wang, Xiangmin Xu, Kailing Guo, Bolun Cai, Fang Liu 0030. Reflective Learning With Label Noise
3358 -- 3368Daoyong Fu, Songchen Han, Binbin Liang, Wei Li 0075. The 6D Pose Estimation of the Aircraft Using Geometric Property
3369 -- 3382Wei Yu, Yanping Li, Rui Wang 0034, Wenming Cao 0001, Wei Xiang 0001. PCFN: Progressive Cross-Modal Fusion Network for Human Pose Transfer
3383 -- 3397Xiaolin Zhu, Yan Zhou 0003, Dongli Wang, Wanli Ouyang, Rui Su. MLST-Former: Multi-Level Spatial-Temporal Transformer for Group Activity Recognition
3398 -- 3408Zhiqi Yu, Jingjing Li 0001, Lei Zhu 0002, Ke Lu 0001, Heng Tao Shen. Uneven Bi-Classifier Learning for Domain Adaptation
3409 -- 3424Yuhao Tang, Liyan Zhang 0001, Ye Yuan, Zhixian Chen. Describe Fashion Products via Local Sparse Self-Attention Mechanism and Attribute-Based Re-Sampling Strategy
3425 -- 3440Han Wang, Jing Liu 0002, Yuting Su 0001, Xiaokang Yang. Trajectory Guided Robust Visual Object Tracking With Selective Remedy
3441 -- 3454Liuyi Wang, Zongtao He, Ronghao Dang, Huiyi Chen, Chengju Liu, Qijun Chen. RES-StS: Referring Expression Speaker via Self-Training With Scorer for Goal-Oriented Vision-Language Navigation
3455 -- 3461Yuheng Jia, Guanxing Lu, Hui Liu 0032, Junhui Hou. Semi-Supervised Subspace Clustering via Tensor Low-Rank Representation
3462 -- 3476Yunqiu Lv, Jing Zhang 0052, Yuchao Dai, Aixuan Li, Nick Barnes, Deng-Ping Fan. Toward Deeper Understanding of Camouflaged Object Detection
3477 -- 3487Jielian Lin, Aiping Huang, Tiesong Zhao, Xu Wang 0006, Sam Kwong. λ-Domain VVC Rate Control Based on Nash Equilibrium
3488 -- 3501Yunhui Shi, Kangfu Zhang, Jin Wang, Nam Ling, Baocai Yin. Variable-Rate Image Compression Based on Side Information Compensation and R-λ Model Rate Control
3502 -- 3515Kai Lin, Chuanmin Jia, Xinfeng Zhang 0001, Shanshe Wang, Siwei Ma, Wen Gao 0001. DMVC: Decomposed Motion Modeling for Learned Video Compression
3516 -- 3528Wenrui Li, Zhengyu Ma, Liang-Jian Deng, Xiaopeng Fan, Yonghong Tian 0001. Neuron-Based Spiking Transmission and Reasoning Network for Robust Image-Text Retrieval
3529 -- 3541Zhang Xi, Xiumei Wang, Peitao Cheng. Unsupervised Hashing Retrieval via Efficient Correlation Distillation
3542 -- 3558Laijin Meng, Xinghao Jiang, Zhenzhen Zhang, Zhaohong Li, Tanfeng Sun. A Robust Coverless Image Steganography Based on an End-to-End Hash Generation Model

Volume 33, Issue 6

2533 -- 2545Yuanzhi Wang, Tao Lu 0001, Yanduo Zhang, Zhongyuan Wang 0001, Junjun Jiang, Zixiang Xiong. FaceFormer: Aggregating Global and Local Representation for Face Hallucination
2546 -- 2560Chongzhen Tian, Feng Shao, Xiongli Chai, Qiuping Jiang, Long Xu, Yo-Sung Ho. Viewport-Sphere-Branch Network for Blind Quality Assessment of Stitched 360° Omnidirectional Images
2561 -- 2576Kunqian Li, Li Wu, Qi Qi, Wenjie Liu, Xiang Gao 0009, Liqin Zhou 0001, Dalei Song. Beyond Single Reference for Training: Underwater Image Enhancement via Comparative Learning
2577 -- 2589Pengfei Chen, Leida Li, Haoliang Li, Jinjian Wu, Weisheng Dong, Guangming Shi. Dynamic Expert-Knowledge Ensemble for Generalizable Video Quality Assessment
2590 -- 2599Xingzheng Wang, Jiehao Liu, Songwei Chen, Guoyao Wei. Effective Light Field De-Occlusion Network Based on Swin Transformer
2600 -- 2615Yanliang Ge, Qiao Zhang, Tian-Zhu Xiang, Cong Zhang, Hongbo Bi. TCNet: Co-Salient Object Detection via Parallel Interaction of Transformers and CNNs
2616 -- 2626Liqun Lin, Zheng Wang, Jiachen He, Weiling Chen, Yiwen Xu, Tiesong Zhao. Deep Quality Assessment of Compressed Videos: A Subjective and Objective Study
2627 -- 2641Zeyu Wang, Xiongfei Li, Shuang Yu, Haoran Duan, Xiaoli Zhang, Jizheng Zhang, Shiping Chen 0001. VSP-Fuse: Multifocus Image Fusion Model Using the Knowledge Transferred From Visual Salience Priors
2642 -- 2655Zhiwei Hao, Shan Gai, Pengcheng Li. Multi-Scale Self-Calibrated Dual-Attention Lightweight Residual Dense Deraining Network Based on Monogenic Wavelets
2656 -- 2671Yixuan Gao, Xiongkuo Min, Wenhan Zhu, Xiao-Ping Zhang 0002, Guangtao Zhai. Image Quality Score Distribution Prediction via Alpha Stable Model
2672 -- 2682Han Huang, Li Shen 0008, Chaoyang He 0001, Weisheng Dong, Wei Liu 0005. Differentiable Neural Architecture Search for Extremely Lightweight Image Super-Resolution
2683 -- 2695Yuanwei Li, En Zhu, Hang Chen, Jiyong Tan, Li Shen. Dense Crosstalk Feature Aggregation for Classification and Localization in Object Detection
2696 -- 2712Wei Xiong, Zhenyu Xiong, Yaqi Cui, Linzhou Huang, Ruining Yang. An Interpretable Fusion Siamese Network for Multi-Modality Remote Sensing Ship Image Retrieval
2713 -- 2723Dongchen Han, Weifeng Liu 0001, Mingchen Zou, Baodi Liu. Non-Contrastive Nearest Neighbor Identity-Guided Method for Unsupervised Object Re-Identification
2724 -- 2737Zhipu Liu, Lei Zhang 0038, David Zhang 0001. Neural Image Parts Group Search for Person Re-Identification
2738 -- 2752Zhenyu Weng, Huiping Zhuang, Haizhou Li 0001, Balakrishnan Ramalingam, Rajesh Elara Mohan, Zhiping Lin. Online Multi-Face Tracking With Multi-Modality Cascaded Matching
2753 -- 2766Zhiliang Wu, Changchang Sun, Hanyu Xuan, Kang Zhang, Yan Yan 0002. Divide-and-Conquer Completion Network for Video Inpainting
2767 -- 2782Huafeng Qin, Rongshan Hu, Mounim A. El-Yacoubi, Yantao Li, Xinbo Gao 0001. Local Attention Transformer-Based Full-View Finger-Vein Identification
2783 -- 2797Changchen Zhao, Hongsheng Wang, Huiling Chen 0001, Weiwei Shi 0003, Yuanjing Feng. JAMSNet: A Remote Pulse Extraction Network Based on Joint Attention and Multi-Scale Fusion
2798 -- 2812Min Wang, Peng Zhao, Xin Lu, Fan Min, Xizhao Wang. Fine-Grained Visual Categorization: A Spatial-Frequency Feature Fusion Perspective
2813 -- 2825Xin Huang, Yutao Hu, Xiaoyan Luo, Jungong Han, Baochang Zhang 0001, Xianbin Cao 0001. Boosting Variational Inference With Margin Learning for Few-Shot Scene-Adaptive Anomaly Detection
2826 -- 2838Baoquan Zhang, Hao Jiang, Xutao Li, Shanshan Feng, Yunming Ye, Chen Luo, Rui Ye. MetaDT: Meta Decision Tree With Class Hierarchy for Interpretable Few-Shot Learning
2839 -- 2851Lifang Wu, Xianglong Lang, Ye Xiang, Changwen Chen, Zun Li, Zhuming Wang. Active Spatial Positions Based Hierarchical Relation Inference for Group Activity Recognition
2852 -- 2863Igor Morawski, Wen-Nung Lie, Lee Aing, Jui-Chiu Chiang, Kuan-Ting Chen. Deep-Learning Technique for Risk-Based Action Prediction Using Extremely Low-Resolution Thermopile Sensor Array
2864 -- 2876Xingfeng Li 0004, Yinghui Sun, Quansen Sun, Zhenwen Ren. Consensus Cluster Center Guided Latent Multi-Kernel Clustering
2877 -- 2891Weihuang Chen, Zhigang Yang, Lingyang Xue, Jinghai Duan, Hongbin Sun 0001, Nanning Zheng 0001. Multimodal Pedestrian Trajectory Prediction Using Probabilistic Proposal Network
2892 -- 2905Shenlu Zhao, Qiang Zhang 0020. A Feature Divide-and-Conquer Network for RGB-T Semantic Segmentation
2906 -- 2919Jinshi Liu, Zhaohui Jiang 0001, Ting Cao, Zhiwen Chen 0001, Chaobo Zhang, Weihua Gui 0001. Generated Pseudo-Labels Guided by Background Skeletons for Overcoming Under-Segmentation in Overlapping Particle Objects
2920 -- 2934Liqiang He, Xiaohai He, Shuhua Xiong, Zeming Zhao, Hang Xiao, Honggang Chen. Efficient Rate Control in Versatile Video Coding With Adaptive Spatial-Temporal Bit Allocation and Parameter Updating
2935 -- 2949Lichen Zhao, Daigang Cai, Jing Zhang 0017, Lu Sheng, Dong Xu, Rui Zheng, Yinjie Zhao, Lipeng Wang, Xibo Fan. Toward Explainable 3D Grounded Visual Question Answering: A New Benchmark and Strong Baseline
2950 -- 2961Zhengkai Fang, Liquan Shen, Mengyao Li, Zhengyong Wang, Yanliang Jin. Prior-Guided Contrastive Image Compression for Underwater Machine Vision
2962 -- 2978Shaohui Li, Han Li, Wenrui Dai, Chenglin Li, Junni Zou, Hongkai Xiong. Learned Progressive Image Compression With Dead-Zone Quantizers
2979 -- 2989Shurun Wang, Zhao Wang 0004, Shiqi Wang 0001, Yan Ye. Deep Image Compression Toward Machine Vision: A Unified Optimization Framework
2990 -- 3002Mengge He, Wenjing Du, Zhiquan Wen, Qing Du, Yutong Xie, Qi Wu 0001. Multi-Granularity Aggregation Transformer for Joint Video-Audio-Text Representation Learning
3003 -- 3016Tongbao Chen, Wenmin Wang, Kangrui Han, Huijuan Xu 0001. SaGCN: Semantic-Aware Graph Calibration Network for Temporal Sentence Grounding
3017 -- 3029Zhiying Zhu, Ping Wei 0004, Zhenxing Qian, Sheng Li 0006, Xinpeng Zhang 0001. Image Sanitization in Online Social Networks: A General Framework for Breaking Robust Information Hiding
3030 -- 3042Pauline Puteaux, Felix Yriarte, William Puech. M) Galois Fields

Volume 33, Issue 5

2009 -- 2018Renlong Hang, Xuwei Qian, Qingshan Liu 0001. MSNet: Multi-Resolution Synergistic Networks for Adaptive Inference
2019 -- 2032Yulan Zhang, Guopu Zhu, Xing Wang, Xiangyang Luo, Yicong Zhou, Hongli Zhang 0001, Ligang Wu. CNN-Transformer Based Generative Adversarial Network for Copy-Move Source/ Target Distinguishment
2033 -- 2047Yu Gu 0003, Huan Yan, Xiang Zhang 0011, Yantong Wang, Yusheng Ji, Fuji Ren. Toward Facial Expression Recognition in the Wild via Noise-Tolerant Network
2048 -- 2060Zhe Zhang, Bo Peng 0007, Jianjun Lei, Haifeng Shen, Qingming Huang. Recurrent Interaction Network for Stereoscopic Image Super-Resolution
2061 -- 2074Lei Ma, Yuhui Zheng, Zhao Zhang 0001, Yazhou Yao, Xijian Fan, Qiaolin Ye. Motion Stimulation for Compositional Action Recognition
2075 -- 2087Tongtong Su, Qiyu Liang, Jinsong Zhang, Zhaoyang Yu, Ziyue Xu, Gang Wang 0001, Xiaoguang Liu 0001. Deep Cross-Layer Collaborative Learning Network for Online Knowledge Distillation
2088 -- 2101Xianrui Luo, Juewen Peng, Weiyue Zhao, Ke Xian, Hao Lu 0003, Zhiguo Cao 0001. Point-and-Shoot All-in-Focus Photo Synthesis From Smartphone Camera Pair
2102 -- 2115Zewen Zheng, Guoheng Huang, Xiaochen Yuan, Chi-Man Pun, Hongrui Liu, Wing-kuen Ling. Quaternion-Valued Correlation Learning for Few-Shot Semantic Segmentation
2116 -- 2128Yuantong Zhang, Huairui Wang, Han Zhu, Zhenzhong Chen. Optical Flow Reusing for High-Efficiency Space-Time Video Super Resolution
2129 -- 2146Jun Chen 0013, Hui Duan, Yuanxin Song, Zemin Cai, Guangguang Yang, Tianshu Liu. Motion Estimation for Complex Fluid Flows Using Helmholtz Decomposition
2147 -- 2161Peng Zhou 0010, Lingxi Xie, Bingbing Ni, Lin Liu 0016, Qi Tian 0001. HRInversion: High-Resolution GAN Inversion for Cross-Domain Image Synthesis
2162 -- 2175Mengyang Zhang, Guohui Tian, Ying Zhang, Hong Liu 0013. Sequential Learning for Ingredient Recognition From Images
2176 -- 2189Qihang Zhou, Shibo He, Haoyu Liu, Tao Chen 0003, Jiming Chen 0001. Pull & Push: Leveraging Differential Knowledge Distillation for Efficient Unsupervised Anomaly Detection and Localization
2190 -- 2201Linfeng Zhang, Kaisheng Ma. A Good Data Augmentation Policy is not All You Need: A Multi-Task Learning Perspective
2202 -- 2216Yikang Wei, Liu Yang, Yahong Han, Qinghua Hu. Multi-Source Collaborative Contrastive Learning for Decentralized Domain Adaptation
2217 -- 2232Huiwei Lin, Shanshan Feng, Xutao Li, Wentao Li, Yunming Ye. Anchor Assisted Experience Replay for Online Class-Incremental Learning
2233 -- 2244Qianwen Cao, Heyan Huang, Mucheng Ren, Changsen Yuan. Concept-Enhanced Relation Network for Video Visual Relation Inference
2245 -- 2258Wei Lin, Xiaoyu Liu, Yihong Zhuang, Xinghao Ding, Xiaotong Tu, Yue Huang 0001, Huanqiang Zeng. Unsupervised Video-Based Action Recognition With Imagining Motion and Perceiving Appearance
2259 -- 2274Xinfeng Zhang 0003, Jinpeng Fang, Baoqing Yang, Shuhan Chen, Bin Li 0006. Hybrid Attention and Motion Constraint for Anomaly Detection in Crowded Scenes
2275 -- 2289Wenyi Zhao, Chongyi Li, Weidong Zhang 0007, Lu Yang 0006, Peixian Zhuang, Lingqiao Li, Kefeng Fan, Huihua Yang. Embedding Global Contrastive and Local Location in Self-Supervised Learning
2290 -- 2301Sheng-Ye Wang, Zhong Qu, Cui-jin Li. A Dense-Aware Cross-splitNet for Object Detection and Recognition
2302 -- 2316Limin Sun, Dongyang Ma, Xiao Pan, Yuanfeng Zhou. Weak-Boundary Sensitive Superpixel Segmentation Based on Local Adaptive Distance
2317 -- 2329Boying Wang, Ruyi Ji, Libo Zhang 0001, Yanjun Wu. Bridging Multi-Scale Context-Aware Representation for Object Detection
2330 -- 2341Jie Gao, Bineng Zhong, Yan Chen 0017. Robust Tracking via Learning Model Update With Unsupervised Anomaly Detection Philosophy
2342 -- 2356Linhui Dai, Hong Liu 0008, Hao Tang 0005, Zhiwei Wu, Pinhao Song. AO2-DETR: Arbitrary-Oriented Object Detection Transformer
2357 -- 2369Shuai Shao 0006, Lei Xing 0005, Yanjiang Wang 0001, Baodi Liu, Weifeng Liu 0001, Yicong Zhou. Attention-Based Multi-View Feature Collaboration for Decoupled Few-Shot Learning
2370 -- 2380Keyu Wu 0002, Min Wu 0008, Zhenghua Chen, Ruibing Jin, Wei Cui 0002, Zhiguang Cao, Xiaoli Li 0001. Reinforced Adaptation Network for Partial Domain Adaptation
2381 -- 2395Han Yan, Haijun Zhang 0002, Jianyang Shi, Jianghong Ma. Texture Brush for Fashion Inspiration Transfer: A Generative Adversarial Network With Heatmap-Guided Semantic Disentanglement
2396 -- 2409Meng Lei, Jiaqi Zhang, Shiqi Wang 0001, Shanshe Wang, Siwei Ma. Deep Intra Prediction by Jointly Exploiting Local and Non-Local Similarities
2410 -- 2423Ren Yang, Radu Timofte, Luc Van Gool. Advancing Learned Video Compression With In-Loop Frame Prediction
2424 -- 2438Pan Gao, Shengzhou Luo, Manoranjan Paul. Rate-Distortion Modeling for Bit Rate Constrained Point Cloud Compression
2439 -- 2450Chaofan He, Shuyuan Zhu, Bing Zeng. NOMA-Based Uncoded Video Transmission With Optimization of Joint Resource Allocation
2451 -- 2464Wenyuan Zhong, Huaxiong Li, Qinghua Hu, Yang Gao 0001, Chunlin Chen. Multi-Level Cascade Sparse Representation Learning for Small Data Classification
2465 -- 2476Zejun Liu, Fanglin Chen 0001, Jun Xu, Wenjie Pei, Guangming Lu. Image-Text Retrieval With Cross-Modal Semantic Importance Consistency
2477 -- 2490Dayoung Lee, Joonho Lee, Minseok Song 0002. Video File Allocation for Wear-Leveling in Distributed Storage Systems With Heterogeneous Solid-State-Disks (SSDs)
2491 -- 2505Daizong Liu, Pan Zhou, Zichuan Xu, Haozhao Wang, Ruixuan Li. Few-Shot Temporal Sentence Grounding via Memory-Guided Semantic Learning
2506 -- 2519Pengbo Liu, Xing-Yuan Wang 0001, Yining Su. Image Encryption via Complementary Embedding Algorithm and New Spatiotemporal Chaotic System
2520 -- 2532Yizhuo Song, Pengyang Zhao, Wenming Yang, Qingmin Liao, Jie Zhou 0001. EIFNet: An Explicit and Implicit Feature Fusion Network for Finger Vein Verification

Volume 33, Issue 4

1493 -- 1506Lei He, Yongfang Xie, Shiwen Xie, Zhipeng Chen. Structure-Preserving Texture Smoothing via Scale-Aware Bilateral Total Variation
1507 -- 1520Meiqin Liu, Shuo Jin, Chao Yao, Chunyu Lin, Yao Zhao 0001. Temporal Consistency Learning of Inter-Frames for Video Super-Resolution
1521 -- 1534Zhi-Yong Wang, Xiao-peng Li, Hing-Cheung So. Robust Matrix Completion Based on Factorization and Truncated-Quadratic Loss Function
1535 -- 1548Bobo Xi, Jiaojiao Li, Yan Diao, Yunsong Li, Zan Li 0001, Yan Huang 0018, Jocelyn Chanussot. DGSSC: A Deep Generative Spectral-Spatial Classifier for Imbalanced Hyperspectral Imagery
1549 -- 1563Weixiang Xu, Fanrong Li, Yingying Jiang, Yong A, Xiangyu He, Peisong Wang, Jian Cheng 0001. Improving Extreme Low-Bit Quantization With Soft Threshold
1564 -- 1576Baoqiang Shi, Zhenhong Jia, Jie Yang 0002, Nikola K. Kasabov. Unsupervised Change Detection in Wide-Field Video Images Under Low Illumination
1577 -- 1592Chang Xu, Qingwu Li, Xiongbiao Jiang, Dabing Yu, Yaqin Zhou. Dual-Space Graph-Based Interaction Network for RGB-Thermal Semantic Segmentation in Electric Power Scene
1593 -- 1609Yichao Tang, Shuai Wang, Chuntao Wang, Shijun Xiang, Yiu-ming Cheung. A Highly Robust Reversible Watermarking Scheme Using Embedding Optimization and Rounded Error Compensation
1610 -- 1623Guanglin Li, Bin Li 0011, Shunquan Tan, Guoping Qiu. Learning Deep Co-Occurrence Features
1624 -- 1642Chenxin Wang, Zhenwei Zhang, Zhichang Guo, Tieyong Zeng, Yuping Duan. Efficient SAV Algorithms for Curvature Minimization Problems
1643 -- 1657Yun Liu 0002, Zhongsheng Yan, Jinge Tan, Yuche Li. Multi-Purpose Oriented Single Nighttime Image Haze Removal Based on Unified Variational Retinex Model
1658 -- 1670Xin Li, Rongrong Ni, Pengpeng Yang, Zhiqiang Fu, Yao Zhao 0001. Artifacts-Disentangled Adversarial Learning for Deepfake Detection
1671 -- 1683Yaozong Zheng, Bineng Zhong, Qihua Liang, Zhenjun Tang, Rongrong Ji, Xianxian Li. Leveraging Local and Global Cues for Visual Tracking via Parallel Interaction Network
1684 -- 1696Lixiang Lin, Jianke Zhu, Yisu Zhang. Multiview Textured Mesh Recovery by Differentiable Rendering
1697 -- 1709Youjie Wang, Yanmin Luo 0001, Guihu Bai, Jing-Ming Guo. UformPose: A U-Shaped Hierarchical Multi-Scale Keypoint-Aware Framework for Human Pose Estimation
1710 -- 1724Yanghong Zhou, P. Y. Mok. A Pose-Aware Global Representation Network for Human Parsing
1725 -- 1739Yunfan Liu, Qi Li 0005, Qiyao Deng, Zhenan Sun. Towards Spatially Disentangled Manipulation of Face Images With Pre-Trained StyleGANs
1740 -- 1751Jie Ma, Xiangyuan Lan, Bineng Zhong, Guorong Li, Zhenjun Tang, Xianxian Li, Rongrong Ji. Robust Tracking via Uncertainty-Aware Semantic Consistency
1752 -- 1761Peixia Li, Boyu Chen, Lei Bai 0001, Lei Qiao, Bo Li 0114, Wanli Ouyang. SiamSampler: Video-Guided Sampling for Siamese Visual Tracking
1762 -- 1775Congju Du, Zengqiang Yan, Han Yu, Li Yu 0003, Zixiang Xiong. Hierarchical Associative Encoding and Decoding for Bottom-Up Human Pose Estimation
1776 -- 1786Guangming Wang 0001, Jiquan Zhong, Shijie Zhao, Wenhua Wu, Zhe Liu 0022, Hesheng Wang 0001. 3D Hierarchical Refinement and Augmentation for Unsupervised Learning of Depth and Pose From Monocular Video
1787 -- 1801Gang Chen, Feng Shao, Xiongli Chai, Hangwei Chen, Qiuping Jiang, Xiangchao Meng, Yo-Sung Ho. Modality-Induced Transfer-Fusion Network for RGB-D and RGB-T Salient Object Detection
1802 -- 1815Yunyang Xu, Xifeng Gao, Caiming Zhang 0001, Jianchao Tan, Xuemei Li 0001. High Quality Superpixel Generation Through Regional Decomposition
1816 -- 1826Qinghai Lang, Lei Zhang 0038, Wenxu Shi, Weijie Chen, Shiliang Pu. Exploring Implicit Domain-Invariant Features for Domain Adaptive Object Detection
1827 -- 1838Yamin Cheng, Zhi Wang, Wenhan Zhan, Hancong Duan. Multi-Scale Human-Object Interaction Detector
1839 -- 1853Shihua Li, Haobin Chen, Shijie Yu, Zhiqun He, Feng Zhu, Rui Zhao 0001, Jie Chen, Yu Qiao 0001. COCAS+: Large-Scale Clothes-Changing Person Re-Identification With Clothes Templates
1854 -- 1867Lin Zhao, Wenbing Tao. JSNet++: Dynamic Filters and Pointwise Correlation for 3D Point Cloud Instance and Semantic Segmentation
1868 -- 1883Zengxi Huang, Yusong Qin, Xiaobing Lin, Tianlin Liu, Zhen-Hua Feng, Yiguang Liu. Motion-Driven Spatial and Temporal Adaptive High-Resolution Graph Convolutional Networks for Skeleton-Based Action Recognition
1884 -- 1898Li Ma, Peixi Peng, Guangyao Chen, Yifan Zhao, Siwei Dong, Yonghong Tian 0001. Picking Up Quantization Steps for Compressed Image Classification
1899 -- 1910Yuting Yang 0008, Licheng Jiao, Xu Liu 0006, Fang Liu, Shuyuan Yang, Lingling Li 0002, Puhua Chen, Xiufang Li, Zhongjian Huang. Dual Wavelet Attention Networks for Image Classification
1911 -- 1921Bo Peng 0007, Renjie Chang, Zhaoqing Pan, Ge Li, Nam Ling, Jianjun Lei. Deep In-Loop Filtering via Multi-Domain Correlation Learning and Partition Constraint for Multiview Video Coding
1922 -- 1936Youneng Bao, Fanyang Meng, Chao Li, Siwei Ma, Yonghong Tian 0001, Yongsheng Liang. Nonlinear Transforms in Learned Image Compression From a Communication Perspective
1937 -- 1951Mengyao Li, Liquan Shen, Yufei Lin, Kun Wang, Jinbo Chen. Extreme Underwater Image Compression Using Physical Priors
1952 -- 1965Pandeng Li, Hongtao Xie, Yan Jiang, Jiannan Ge, Yongdong Zhang 0001. Neighborhood-Adaptive Multi-Cluster Ranking for Deep Metric Learning
1966 -- 1978Jiajia Tang, Dongjun Liu, Xuanyu Jin, Yong Peng 0001, Qibin Zhao, Yu Ding 0001, Wanzeng Kong. BAFN: Bi-Direction Attention Based Fusion Network for Multimodal Sentiment Analysis
1979 -- 1993Mingsong Li, Yikun Liu, Guangkuo Xue, Yuwen Huang, Gongping Yang. Exploring the Relationship Between Center and Neighborhoods: Central Vector Oriented Self-Similarity Network for Hyperspectral Image Classification
1994 -- 2000Yizhong Pan, Chao Ren 0002, Xiaohong Wu, Jie Huang, Xiaohai He. Real Image Denoising via Guided Residual Estimation and Noise Correction
2001 -- 2006Xiang Gao 0009, Hainan Cui, Menghan Li, Zexiao Xie, Shuhan Shen. IRAv3: Hierarchical Incremental Rotation Averaging on the Fly

Volume 33, Issue 3

977 -- 987Xiaoxiao Sheng, Kunchang Li, Zhiqiang Shen, Gang Xiao. A Progressive Difference Method for Capturing Visual Tempos on Action Recognition
988 -- 1002Yaozu Kang, Qiuping Jiang, Chongyi Li, Wenqi Ren, Hantao Liu, Pengjun Wang. A Perception-Aware Decomposition and Fusion Framework for Underwater Image Enhancement
1003 -- 1018Zhongyun Hua, Ziyi Wang, Yifeng Zheng, Yongyong Chen, Yuanman Li. Enabling Large-Capacity Reversible Data Hiding Over Encrypted JPEG Bitstreams
1019 -- 1030Ziyang Wang, Yunhao Gou, Jingjing Li 0001, Lei Zhu 0002, Heng Tao Shen. Language-Augmented Pixel Embedding for Generalized Zero-Shot Learning
1031 -- 1042Hanwei Zhu, Baoliang Chen, Lingyu Zhu 0006, Shiqi Wang 0001. Learning Spatiotemporal Interactions for User-Generated Video Quality Assessment
1043 -- 1054Yongxu Liu, Jinjian Wu, Leida Li, Weisheng Dong, Guangming Shi. Quality Assessment of UGC Videos Based on Decomposition and Recomposition
1055 -- 1068Yuehai Chen, Jing Yang 0014, Badong Chen, Shaoyi Du. Counting Varying Density Crowds Through Density Guided Adaptive Selection CNN and Transformer Estimation
1069 -- 1081Wen Sun, Jian Jin, Weisi Lin. Minimum Noticeable Difference-Based Adversarial Privacy Preserving Image Generation
1082 -- 1092Zhong Ji, Jiacheng Hou, Yimu Su, Yanwei Pang, Xuelong Li 0001. G2LP-Net: Global to Local Progressive Video Inpainting Network
1093 -- 1108Heqian Qiu, Hongliang Li 0001, Qingbo Wu 0001, Jianhua Cui, Zichen Song, Lanxiao Wang, Minjian Zhang 0003. CrossDet++: Growing Crossline Representation for Object Detection
1109 -- 1122Yuwei Wang, Yuanying Qiu, Peitao Cheng, Junyu Zhang. Hybrid CNN-Transformer Features for Visual Place Recognition
1123 -- 1139Zheyin Wang, Liquan Shen, Zhengyong Wang, Yufei Lin, Yanliang Jin. Generation-Based Joint Luminance-Chrominance Learning for Underwater Image Quality Assessment
1140 -- 1156Bo Tang, Cheng Yang, Yana Zhang. A Format Compliant Framework for HEVC Selective Encryption After Encoding
1157 -- 1167Jingjing Ren, Xiaowei Hu 0001, Lei Zhu 0003, Xuemiao Xu, Yangyang Xu, Weiming Wang, Zijun Deng, Pheng-Ann Heng. Deep Texture-Aware Features for Camouflaged Object Detection
1168 -- 1180Zelin Chen, Kun-Yu Lin, Wei-Shi Zheng 0001. Consistent Intra-Video Contrastive Learning With Asynchronous Long-Term Memory Bank
1181 -- 1197Guo-Sen Xie, Zheng Zhang 0006, Huan Xiong, Ling Shao 0001, Xuelong Li 0001. Towards Zero-Shot Learning: A Brief Review and an Attention-Based Embedding Network
1198 -- 1208Siyu Ren, Yiming Zeng, Junhui Hou, Xiaodong Chen. CorrI2P: Deep Image-to-Point Cloud Registration via Dense Correspondence
1209 -- 1222Zi Wang, Zhiheng Fu, Yulan Guo, Zhang Li, Qifeng Yu. Local-to-Global Cost Aggregation for Semantic Correspondence
1223 -- 1235Gongyang Li, Yike Wang, Zhi Liu 0003, Xinpeng Zhang 0001, Dan Zeng 0001. RGB-T Semantic Segmentation With Location, Activation, and Sharpening
1236 -- 1246Hairui Yang, Baoli Sun, Baopu Li, Caifei Yang, Zhihui Wang, Jenhui Chen, Lei Wang 0005, Haojie Li. Iterative Class Prototype Calibration for Transductive Zero-Shot Learning
1247 -- 1261Linsen Song, Wayne Wu, Chaoyou Fu, Chen Change Loy, Ran He. Audio-Driven Dubbing for User Generated Contents via Style-Aware Semi-Parametric Synthesis
1262 -- 1275Christos Papaioannidis, Ioannis Mademlis, Ioannis Pitas. Fast CNN-Based Single-Person 2D Human Pose Estimation for Autonomous Systems
1276 -- 1290Xingtao Wang, Xiaopeng Fan, Debin Zhao. PointFilterNet: A Filtering Network for Point Cloud Denoising
1291 -- 1304Xiang Xu, Jian Zhao 0017, Jianmin Wu, Furao Shen. Switch and Refine: A Long-Term Tracking and Segmentation Framework
1305 -- 1319Anyang Tong, Chao Tang, Wenjian Wang. Semi-Supervised Action Recognition From Temporal Augmentation Using Curriculum Learning
1320 -- 1334Jiaxu Leng, Mengjingcheng Mo, Yinghua Zhou, Chenqiang Gao, Weisheng Li 0001, Xinbo Gao 0001. Pareto Refocusing for Drone-View Object Detection
1335 -- 1348Yujie Fu, Pengju Zhang, Bingxi Liu, Zheng Rong, Yihong Wu 0002. Learning to Reduce Scale Differences for Large-Scale Invariant Image Matching
1349 -- 1362Ziqi Jin, Jinheng Xie, Bizhu Wu, LinLin Shen. Weakly Supervised Pedestrian Segmentation for Person Re-Identification
1363 -- 1373Shuiping Gou, Xinlin Wang, Shasha Mao, Licheng Jiao, Zhen Liu, Yinghai Zhao. Weakly-Supervised Semantic Feature Refinement Network for MMW Concealed Object Detection
1374 -- 1385Kun Wu, Lei Zhu 0010, Weihang Shi, Wenwu Wang 0008, Jin Wu. Self-Attention Memory-Augmented Wavelet-CNN for Anomaly Detection
1386 -- 1397Qiuyu Zhu, Xuewen Zu. A Softmax-Free Loss Function Based on Predefined Optimal-Distribution of Latent Features for Deep Learning Classifier
1398 -- 1412Shixiang Su, Songlin Du, Xuan Wei, Xiaobo Lu. RFS-Net: Railway Track Fastener Segmentation Network With Shape Guidance
1413 -- 1426Yue Wu 0004, Yue Zhang, Xiaolong Fan, Maoguo Gong, Qiguang Miao, Wenping Ma 0001. INENet: Inliers Estimation Network With Similarity Learning for Partial Overlapping Registration
1427 -- 1437Youguang Yu, Wei Zhang 0072, Ge Li 0002, Fuzheng Yang. A Regularized Projection-Based Geometry Compression Scheme for LiDAR Point Cloud
1438 -- 1453Zerun Feng, Zhimin Zeng, Caili Guo, Zheng Li 0014. Temporal Multimodal Graph Transformer With Global-Local Alignment for Video-Text Retrieval
1454 -- 1467Fuwei Zhang, Ruomei Wang 0001, Fan Zhou 0001, Yuanmao Luo. ERM: Energy-Based Refined-Attention Mechanism for Video Question Answering
1468 -- 1480Han Chen, Yuzhen Lin, Bin Li 0011, Shunquan Tan. Learning Features of Intra-Consistency and Inter-Diversity: Keys Toward Generalizable Deepfake Detection
1481 -- 1486Renwei Yang, Hewei Liu, Shuyuan Zhu, Xiaozhen Zheng, Bing Zeng. DFCE: Decoder-Friendly Chrominance Enhancement for HEVC Intra Coding
1487 -- 1492Yuzhen Niu, Shanshan Chen, Bingrui Song, Zhixian Chen, Wenxi Liu. Comment-Guided Semantics-Aware Image Aesthetics Assessment

Volume 33, Issue 2

457 -- 477Chenglizhao Chen, Mengke Song, Wenfeng Song, Li Guo, Muwei Jian. A Comprehensive Survey on Video Saliency Detection With Auditory Information: The Audio-Visual Consistency Perceptual is the Key!
478 -- 490Yang Zhao 0002, Wei Jia 0001, Yuan Chen, Ronggang Wang. Fast Blind Decontouring Network
491 -- 504Xiuli Bi, Yixuan Shang, Bo Liu, Bin Xiao 0002, Weisheng Li 0001, Xinbo Gao 0001. A Versatile Detection Method for Various Contrast Enhancement Manipulations
505 -- 520Yifan Zuo, Jiacheng Xie, Hao Wang, Yuming Fang, Deyang Liu, Wenying Wen. Gradient-Guided Single Image Super-Resolution Based on Joint Trilateral Feature Filtering
521 -- 533Fengli Yang, Long Zhao 0004. Closed-Form Solution of Principal Line for Camera Calibration Based on Orthogonal Vanishing Points
534 -- 548Runmin Cong, Qi Qin, Chen Zhang, Qiuping Jiang, Shiqi Wang 0001, Yao Zhao 0001, Sam Kwong. A Weakly Supervised Learning Framework for Salient Object Detection via Hybrid Labels
549 -- 561Yuan Gao, Shiwei Ma, Jingjing Liu 0004. DCDR-GAN: A Densely Connected Disentangled Representation Generative Adversarial Network for Infrared and Visible Image Fusion
562 -- 574Jiawei Zhang, Jinwei Wang, Hao Wang 0060, Xiangyang Luo. Self-Recoverable Adversarial Examples: A New Effective Protection Mechanism in Social Networks
575 -- 588Xibin Song, Dingfu Zhou, Wei Li 0143, Haodong Ding, Yuchao Dai, Liangjun Zhang. WSAMF-Net: Wavelet Spatial Attention-Based MultiStream Feedback Network for Single Image Dehazing
589 -- 601Elena Luna, Juan C. SanMiguel, José María Martínez Sanchez, Pablo Carballeira. Graph Neural Networks for Cross-Camera Data Association
602 -- 617Tong Wu 0008. Online Tensor Low-Rank Representation for Streaming Data Clustering
618 -- 633Lei Cai, Yuli Fu 0001, Wanliang Huo, Youjun Xiang, Tao Zhu, Ying Zhang, Huanqiang Zeng, Delu Zeng. Multiscale Attentive Image De-Raining Networks via Neural Architecture Search
634 -- 647Mingqin Chen, Yuhui Quan, Yong Xu 0007, Hui Ji. Self-Supervised Blind Image Deconvolution via Deep Generative Ensemble Learning
648 -- 660Mingji Yu, Heng Yao, Chuan Qin 0001, Xinpeng Zhang 0001. Reversible Data Hiding in Palette Images
661 -- 676Haitao Zhang, Beijing Chen, Jinwei Wang, Guoying Zhao 0001. A Local Perturbation Generation Method for GAN-Generated Face Anti-Forensics
677 -- 688Lingtong Kong, Jie Yang 0002. MDFlow: Unsupervised Optical Flow Learning by Reliable Mutual Knowledge Distillation
689 -- 700Shan Xu, Huaidong Zhang, Xuemiao Xu, Xiaowei Hu 0001, Yangyang Xu, Liangui Dai, Kup-Sze Choi, Pheng-Ann Heng. Representative Feature Alignment for Adaptive Object Detection
701 -- 712Zhiwen Chen, Jinjian Wu, Junhui Hou, Leida Li, Weisheng Dong, Guangming Shi. ECSNet: Spatio-Temporal Feature Learning for Event Camera
713 -- 727Jiawei Li, Jinyuan Liu, Shihua Zhou, Qiang Zhang, Nikola K. Kasabov. Learning a Coordinated Network for Detail-Refinement Multiexposure Image Fusion
728 -- 742Bin Tang, Zhengyi Liu, Yacheng Tan, Qian He. HRTransNet: HRFormer-Driven Two-Modality Salient Object Detection
743 -- 755Huajun Zhou, Peijia Chen, Lingxiao Yang, Xiaohua Xie, Jianhuang Lai. Activation to Saliency: Forming High-Quality Labels for Unsupervised Salient Object Detection
756 -- 770Jianchuan Ding, Lingping Gao, Wenxi Liu, Haiyin Piao, Jia Pan, Zhenjun Du, Xin Yang 0011, Baocai Yin. Monocular Camera-Based Complex Obstacle Avoidance via Efficient Deep Reinforcement Learning
771 -- 781Lei Tan, Xue Lin, Dongmei Niu, Daole Wang, Miao Yin, Xiuyang Zhao. Projected Generative Adversarial Network for Point Cloud Completion
782 -- 792Liguang Zang, Yuancheng Li, Hui Chen. Multilabel Recognition Algorithm With Multigraph Structure
793 -- 803Cong Cao, Tianwei Lin, Dongliang He, Fu Li, Huanjing Yue, Jing-Yu Yang 0002, Errui Ding. Adversarial Dual-Student With Differentiable Spatial Warping for Semi-Supervised Semantic Segmentation
804 -- 817Qianyu Zhou 0001, Zhengyang Feng, Qiqi Gu, Jiangmiao Pang, Guangliang Cheng, Xuequan Lu, Jianping Shi, Lizhuang Ma. Context-Aware Mixup for Domain Adaptive Semantic Segmentation
818 -- 829Yuyang Liu, Yang Cong, Gan Sun, Zhengming Ding. Lifelong Visual-Tactile Spectral Clustering for Robotic Object Perception
830 -- 846Runze Li, Pan Ji, Yi Xu, Bir Bhanu. MonoIndoor++: Towards Better Practice of Self-Supervised Monocular Depth Estimation for Indoor Environments
847 -- 860Hua Bao, Ping Shu, Hongchao Zhang, Xiaobai Liu. Siamese-Based Twin Attention Network for Visual Tracking
861 -- 871Lei Zhao, Junlin Li, Lianli Gao, Yunbo Rao, Jingkuan Song, Heng Tao Shen. Heterogeneous Knowledge Network for Visual Dialog
872 -- 885Peining Zhen, Xiaotao Yan, Wei Wang, Tianshu Hou, Hao Wei, Hai-Bao Chen. Toward Compact Transformers for End-to-End Object Detection With Decomposed Chain Tensor Structure
886 -- 896Zhecheng Wang, Shuai Wan, Lei Wei. Local Geometry-Based Intra Prediction for Octree-Structured Geometry Coding of Point Clouds
897 -- 908Yang Zhao, Xinlong Wang, Xiaohan Yu 0001, Chunlei Liu 0001, Yongsheng Gao 0001. Gait-Assisted Video Person Retrieval
909 -- 919Ming Jin, Huaxiang Zhang 0001, Lei Zhu, Jiande Sun, Li Liu 0031. Video Sampled Frame Category Aggregation and Consistent Representation for Cross-Modal Retrieval
920 -- 934Lei Liao, Meng Yang 0001, Bob Zhang. Deep Supervised Dual Cycle Adversarial Network for Cross-Modal Retrieval
935 -- 951Jiaxin Chen, Xin Liao, Wei Wang 0025, Zhenxing Qian, Zheng Qin 0001, Yaonan Wang. SNIS: A Signal Noise Separation-Based Network for Post-Processed Image Forgery Detection
952 -- 962Wei Lu 0001, Wenbo Xu, Ziqi Sheng. An Interpretable Image Tampering Detection Approach Based on Cooperative Game
963 -- 976Yue Lu, Congqi Cao, Yifan Zhang 0001, Yanning Zhang. Learnable Locality-Sensitive Hashing for Video Anomaly Detection

Volume 33, Issue 12

7082 -- 7083Feng Wu. Editor-in-Chief Message
7084 -- 7095Ming Liu 0022, Changchun Zhou, Siyuan Qiu, Yifan He, Hailong Jiao. CNN Accelerator at the Edge With Adaptive Zero Skipping and Sparsity-Driven Data Flow
7096 -- 7108Wujie Zhou, Han Zhang, Weiqing Yan, Weisi Lin. MMSMCNet: Modal Memory Sharing and Morphological Complementary Networks for RGB-T Urban Scene Semantic Segmentation
7109 -- 7120Aojun Gong, Junfei Nie, Chen Niu, Yuan Yu, Jun Li 0009, Lianbo Guo. Edge and Skeleton Guidance Network for Salient Object Detection in Optical Remote Sensing Images
7121 -- 7130Ying Zhang 0043, Maoliang Yin, Heyong Wang, Changchun Hua. Cross-Level Multi-Modal Features Learning With Transformer for RGB-D Object Recognition
7131 -- 7143Lijun He, Ziqing Wang, Liejun Wang, Fan Li 0003. Multimodal Mutual Attention-Based Sentiment Analysis Framework Adapted to Complicated Contexts
7144 -- 7155Bilian Chen, Jiewen Guan, Zhening Li, Zhehao Zhou. ∞-Norm Based Nonnegative Tucker Decomposition
7156 -- 7169Wenyu Hao, Shanmin Pang, Xiuxiu Bai, Jianru Xue. Tensor-Based Incomplete Multi-View Clustering With Low-Rank Data Reconstruction and Consistency Guidance
7170 -- 7182Huiming Sun, Jin Ma, Qing Guo 0005, Qin Zou 0001, Shaoyue Song, Yuewei Lin, Hongkai Yu. Coarse-to-Fine Task-Driven Inpainting for Geoscience Images
7183 -- 7196Honghu Pan, Qiao Liu 0001, Yongyong Chen, Yunqi He, Yuan Zheng 0002, Feng Zheng, Zhenyu He 0001. Pose-Aided Video-Based Person Re-Identification via Recurrent Graph Convolutional Network
7197 -- 7211Liu Yang, Shiqiao Gu, Chenyang Shen, Xile Zhao, Qinghua Hu. Skeleton Neural Networks via Low-Rank Guided Filter Pruning
7212 -- 7223Ningxiong Mao, HongJie He, Fan Chen 0003, Yuan Yuan, Lingfeng Qu. Reversible Data Hiding of JPEG Image Based on Adaptive Frequency Band Length
7224 -- 7235Jin Chen, Aiping Huang, Wei Gao 0003, Yuzhen Niu, Tiesong Zhao. Joint Shared-and-Specific Information for Deep Multi-View Clustering
7236 -- 7251Xin Liao, Yumei Wang, Tianyi Wang 0006, Juan Hu, Xiaoshuai Wu. FAMM: Facial Muscle Motions for Detecting Compressed Deepfake Videos Over Social Networks
7252 -- 7266Hao Sheng 0001, Sizhe Wang, Da Yang, Ruixuan Cong, Zhenglong Cui, Rongshan Chen. Cross-View Recurrence-Based Self-Supervised Super-Resolution of Light Field
7267 -- 7281Longteng Kong, Wanting Zhou, Duoxuan Pei, Zhaofeng He, Di Huang 0001. Group Activity Representation Learning With Long-Short States Predictive Transformer
7282 -- 7295Mengxian Hu, Chengju Liu, Shu Li, Qingqing Yan, Qin Fang, Qijun Chen. A Geometric Knowledge Oriented Single-Frame 2D-to-3D Human Absolute Pose Estimation Method
7296 -- 7309Shoukai Xu, Shuhai Zhang, Jing Liu 0048, Bohan Zhuang, Yaowei Wang 0001, Mingkui Tan. Generative Data Free Model Quantization With Knowledge Matching for Classification
7310 -- 7326Dahao Fu, Xiaoyi Zhou, Liaoran Xu, Kaiyue Hou, Xianyi Chen. Robust Reversible Watermarking by Fractional Order Zernike Moments and Pseudo-Zernike Moments
7327 -- 7341Shengyu Hou, Mengyin Fu, Wenjie Song 0001. Joint Learning of Image Deblurring and Depth Estimation Through Adversarial Multi-Task Network
7342 -- 7353Shiyi Chen, Asad Malik, Xinpeng Zhang 0001, Guorui Feng, Hanzhou Wu. A Fast Method for Robust Video Watermarking Based on Zernike Moments
7354 -- 7369Xing Luo, Guizhong Fu, Jiangxin Yang, Yanlong Cao, Yanpeng Cao. Multi-Modal Image Fusion via Deep Laplacian Pyramid Hybrid Network
7370 -- 7384Xiaofei Yang, Weijia Cao, Yao Lu, Yicong Zhou. QTN: Quaternion Transformer Network for Hyperspectral Image Classification
7385 -- 7397Zhenyu Wang, Xuemei Xie, Qinghang Zhao, Guangming Shi. Filter Clustering for Compressing CNN Model With Better Feature Diversity
7398 -- 7412Qingzhe Pan, Zhifu Zhao, Xuemei Xie, Jianan Li, Yuhan Cao, Guangming Shi. View-Normalized and Subject-Independent Skeleton Generation for Action Recognition
7413 -- 7424Leida Li, Tong Zhu, Pengfei Chen, Yuzhe Yang, Yaqian Li, Weisi Lin. Image Aesthetics Assessment With Attribute-Assisted Multimodal Memory Network
7425 -- 7437Mengqi Rong, Shuhan Shen. 3D Semantic Segmentation of Aerial Photogrammetry Models Based on Orthographic Projection
7438 -- 7451Jiadong Liang, Wenjie Pei, Feng Lu 0005. Layout-Bridging Text-to-Image Synthesis
7452 -- 7465Jiaxing Sun, Xiaobo Shen 0001, Quansen Sun. 2,1-Norm Regularization for Few-Shot Classification
7466 -- 7479Jinping Wang, Xiaojun Tan. Mutually Beneficial Transformer for Multimodal Data Fusion
7480 -- 7490Peirui Cheng, YuZhong Zhao, Weiqiang Wang. Detect Arbitrary-Shaped Text via Adaptive Thresholding and Localization Quality Estimation
7491 -- 7502Hanyu Shi 0002, Ruibo Li, Fayao Liu, Guosheng Lin. Temporal Feature Matching and Propagation for Semantic Segmentation on 3D Point Cloud Sequences
7503 -- 7514Yu Zhang 0004, JunJie Zhao, Zhengjie Chen, Siya Mi, Hongyuan Zhu, Xin Geng 0001. A Closer Look at Video Sampling for Sequential Action Recognition
7515 -- 7529Hengmin Zhang, Shuyi Li, Jing Qiu 0002, Yang Tang, Jie Wen 0001, Zhiyuan Zha, Bihan Wen. Efficient and Effective Nonconvex Low-Rank Subspace Clustering via SVT-Free Operators
7530 -- 7540Xiaoxu Li, Qi Song, Jijie Wu, Rui Zhu 0006, Zhanyu Ma, Jing-Hao Xue. Locally-Enriched Cross-Reconstruction for Few-Shot Fine-Grained Image Classification
7541 -- 7553Dong Chen, Hao Shen 0005, Yuchen Shen. JDT-NAS: Designing Efficient Multi-Object Tracking Architectures for Non-GPU Computers
7554 -- 7564Zihang Feng, Liping Yan, Yuanqing Xia, Bo Xiao 0006. Multi-Task Probabilistic Regression With Overlap Maximization for Visual Tracking
7565 -- 7577Zhong Liu, Ran Li, Shuwei Shao, Xingming Wu, Weihai Chen. Self-Supervised Monocular Depth Estimation With Self-Reference Distillation and Disparity Offset Refinement
7578 -- 7590Tangfei Liao, Xiaoqin Zhang 0002, Yuewang Xu, Ziwei Shi, Guobao Xiao. SGA-Net: A Sparse Graph Attention Network for Two-View Correspondence Learning
7591 -- 7603Hualian Sheng, Sijia Cai, Na Zhao 0004, Bing Deng, Min-Jian Zhao 0001, Gim Hee Lee. PDR: Progressive Depth Regularization for Monocular 3D Object Detection
7604 -- 7615ZiCheng Wang, Wen Li 0001, Dong Xu 0001. Domain Adaptive Sampling for Cross-Domain Point Cloud Recognition
7616 -- 7629Chuxin Wang, Jiacheng Deng, Jianfeng He, Tianzhu Zhang, Zhe Zhang, Yongdong Zhang 0001. Long-Short Range Adaptive Transformer With Dynamic Sampling for 3D Object Detection
7630 -- 7644Chenchen Li, Liyang Zhou, Hanqing Jiang, Zhuang Zhang, Xiaojun Xiang, Han Sun, Qing Luan, Hujun Bao, Guofeng Zhang 0001. Hybrid-MVS: Robust Multi-View Reconstruction With Hybrid Optimization of Visual and Depth Cues
7645 -- 7657Shaomeng Wang, Rui Yan, Peng Huang, Guangzhao Dai, Yan Song 0005, Xiangbo Shu. Com-STAL: Compositional Spatio-Temporal Action Localization
7658 -- 7670Xinhui Liu, Wei Xi, Wen Li 0001, Dong Xu 0001, Gairui Bai, Jizhong Zhao. Co-MDA: Federated Multisource Domain Adaptation on Black-Box Models
7671 -- 7683Shuangqing Zhang, Chenglong Li 0002, Zhen Jia, Lei Liu, Zhang Zhang 0001, Liang Wang 0001. Diag-IoU Loss for Object Detection
7684 -- 7695Yuanyuan Wang, Meng Liu 0006, Jianlong Wu, Liqiang Nie. Multi-Granularity Interaction and Integration Network for Video Question Answering
7696 -- 7707Xiaofei Zhou, Songhe Wu, Ran Shi, Bolun Zheng, Shuai Wang 0003, Haibing Yin, Jiyong Zhang, Chenggang Yan 0001. Transformer-Based Multi-Scale Feature Integration Network for Video Saliency Prediction
7708 -- 7722Tingting Su, Dazheng Feng, Meng Wang, MoHan Chen. Dual Discriminative Low-Rank Projection Learning for Robust Image Classification
7723 -- 7736Yongyi Su, Xun Xu 0002, Kui Jia. Weakly Supervised 3D Point Cloud Segmentation via Multi-Prototype Learning
7737 -- 7748Yike Wang 0003, Gongyang Li, Zhi Liu 0003. SGFNet: Semantic-Guided Fusion Network for RGB-Thermal Semantic Segmentation
7749 -- 7763Kanglei Zhou, Yue Ma, Hubert P. H. Shum, Xiaohui Liang. Hierarchical Graph Convolutional Networks for Action Quality Assessment
7764 -- 7773Zhengyi Liu, Qian He, Linbo Wang, Xianyong Fang, Bin Tang. LFTransNet: Light Field Salient Object Detection via a Learnable Weight Descriptor
7774 -- 7788Xianyuan Liu, Shuo Zhou, Tao Lei, Ping Jiang, Zhixiang Chen 0003, Haiping Lu. First-Person Video Domain Adaptation With Multi-Scene Cross-Site Datasets and Attention-Based Methods
7789 -- 7802Xixi Wang, Xiao Wang 0014, Bo Jiang 0002, Bin Luo 0001. Few-Shot Learning Meets Transformer: Unified Query-Support Transformers for Few-Shot Classification
7803 -- 7818Duoxuan Pei, Di Huang 0001, Longteng Kong, Yunhong Wang. Key Role Guided Transformer for Group Activity Recognition
7819 -- 7831Leiping Jie, Hui Zhang 0062. RMLANet: Random Multi-Level Attention Network for Shadow Detection and Removal
7832 -- 7841Fatih Kamisli. Learned Lossless Image Compression Through Interpolation With Low Complexity
7842 -- 7856Tong Chen 0004, Zhan Ma. Toward Robust Neural Image Compression: Adversarial Attack and Model Finetuning
7857 -- 7869Tingyu Fan, Linyao Gao, Yiling Xu, Dong Wang, Zhu Li 0001. Multiscale Latent-Guided Entropy Model for LiDAR Point Cloud Compression
7870 -- 7883Jie Li 0015, Huiyu Wang, Zhi Liu 0002, Pengyuan Zhou, Xianfu Chen, Qiyue Li, Richang Hong. Toward Optimal Real-Time Volumetric Video Streaming: A Rolling Optimization and Deep Reinforcement Learning Based Approach
7884 -- 7899Liying Gao, Kai Niu 0002, Bingliang Jiao, Peng Wang 0015, Yanning Zhang. Addressing Information Inequality for Text-Based Person Search via Pedestrian-Centric Visual Denoising and Bias-Aware Alignments
7900 -- 7913Yan Zhang, Zhong Ji, Yanwei Pang, Xuelong Li 0001. Consensus Knowledge Exploitation for Partial Query Based Image Retrieval
7914 -- 7927Qibing Qin, Lei Huang 0010, Kezhen Xie, Zhiqiang Wei 0002, Chengduan Wang, Wenfeng Zhang. Deep Adaptive Quadruplet Hashing With Probability Sampling for Large-Scale Image Retrieval
7928 -- 7942Xiaoyue Ji, Zhekang Dong, Yifeng Han, Chun Sing Lai, Donglian Qi. A Brain-Inspired Hierarchical Interactive In-Memory Computing System and Its Application in Video Sentiment Analysis
7943 -- 7956Yukai Wang, Chunlei Peng, Decheng Liu, Nannan Wang 0001, Xinbo Gao 0001. Spatial-Temporal Frequency Forgery Clue for Video Forgery Detection in VIS and NIR Scenario

Volume 33, Issue 11

6200 -- 6212Baojie Fan, Kexin Chen, Guoping Jiang, Jiandong Tian. Two-Way Complementary Tracking Guidance
6213 -- 6224Wen Wu, Wenya Yang, Weiyin Ma, Xiao-Diao Chen. How Many Annotations Do We Need for Generalizing New-Coming Shadow Images?
6225 -- 6235Junhong Lin, Nanfeng Jiang, Zhentao Zhang, Weiling Chen, Tiesong Zhao. LMQFormer: A Laplace-Prior-Guided Mask Query Transformer for Lightweight Snow Removal
6236 -- 6248Zhanchen Zhu, Daokun Zhang, Zhikang Wang, Siyuan Feng, Peibo Duan. Spectral Dual-Channel Encoding for Image Dehazing
6249 -- 6259Yiming Huang, Haisong Xu, Zhengnan Ye, Yuemin Li, Minhang Yang, Weige Lv. Luminance and Detail Enhancement for HDR Images Based on Surround-Aware Perceptual Quantization Under Ambient Illumination
6260 -- 6272Baowei Wang, Yufeng Wu, Guiling Wang. Adaptor: Improving the Robustness and Imperceptibility of Watermarking by the Adaptive Strength Factor
6273 -- 6287Detian Huang, Xiancheng Zhu, Xiaorui Li, Huanqiang Zeng. CLSR: Cross-Layer Interaction Pyramid Super-Resolution Network
6288 -- 6301Xingtao Wang, Wenxue Cui, Ruiqin Xiong, Xiaopeng Fan, Debin Zhao. FCNet: Learning Noise-Free Features for Point Cloud Denoising
6302 -- 6316Shuqin Wang, Yongyong Chen, Zhiping Lin 0001, Yigang Cen, Qi Cao 0002. Robustness Meets Low-Rankness: Unified Entropy and Tensor Learning for Multi-View Subspace Clustering
6317 -- 6330Rui Shu, Cairong Zhao, Shuyang Feng, Liang Zhu, Duoqian Miao. Text-Enhanced Scene Image Super-Resolution via Stroke Mask and Orthogonal Attention
6331 -- 6346Heng Wang, Cong Wang 0033, Yuan Yuan 0026. Asymmetric Dual-Direction Quasi-Recursive Network for Single Hyperspectral Image Super-Resolution
6347 -- 6359Qiuyu Ren, Zhiying Lu, Haopeng Wu, Jianfeng Zhang, Zijian Dong. HR-Net: A Landmark Based High Realistic Face Reenactment Network
6360 -- 6373Jingwei Li, Yuan Li, Huanjie Wang, Chengbao Liu, Jie Tan. Exploring Explicitly Disentangled Features for Domain Generalization
6374 -- 6389Yukun Hao, Feihong Yu. Super-Resolution Degradation Model: Converting High-Resolution Datasets to Optical Zoom Datasets
6390 -- 6403Dengyun Xu, Xuanjing Shen, Yingda Lyu. UP-Net: Uncertainty-Supervised Parallel Network for Image Manipulation Localization
6404 -- 6417Zhenyu Wang, Yunzhou Zhang, Yan Liu, Delong Zhu, Sonya A. Coleman, Dermot Kerr. ELWNet: An Extremely Lightweight Approach for Real-Time Salient Object Detection
6418 -- 6430Jianghao Wu, Baopeng Zhang, Zhaoyang Li, Guilin Pang, Zhu Teng, Jianping Fan 0007. Interactive Two-Stream Network Across Modalities for Deepfake Detection
6431 -- 6442Xinxin Zhang 0004, Kaixin Xing, Qifang Liu, Da Chen 0002, Yilong Yin. Single Image Reflection Removal Based on Dark Channel Sparsity Prior
6443 -- 6458Chunqiang Yu, Xianquan Zhang, Chuan Qin 0001, Zhenjun Tang. Reversible Data Hiding in Encrypted Images With Secret Sharing and Hybrid Coding
6459 -- 6473Zhijian Wu, Wenhui Liu, Jun Li 0033, Chang Xu 0002, Dingjiang Huang. SFHN: Spatial-Frequency Domain Hybrid Network for Image Super-Resolution
6474 -- 6486Shuwei Dong, Xiaoyu Kong, Xingjia Pan, Fan Tang, Wei Li, Yi Chang, Weiming Dong. Semantic-Context Graph Network for Point-Based 3D Object Detection
6487 -- 6502Shunzhi Yang, Liuchi Xu, MengChu Zhou, Xiong Yang, Jinfeng Yang, Zhenhua Huang. Skill-Transferring Knowledge Distillation Method
6503 -- 6518Xiaotian Wu, Zishuo Xu, Wei Qi Yan 0001. Sharing Visual Secrets Among Multiple Groups With Enhanced Performance
6519 -- 6530Jiajun Gao, Yonghong Hou, Zihui Guo, Haochun Zheng. Learning Spatio-Temporal Semantics and Cluster Relation for Zero-Shot Action Recognition
6531 -- 6543Fei Peng, Tongxin Liao, Min Long, Jin Li 0002, Wensheng Zhang 0002, Yicong Zhou. Semi-Fragile Reversible Watermarking for 3D Models Using Spherical Crown Volume Division
6544 -- 6557Zhaoyi Yan, Pengyu Li, Biao Wang, Dongwei Ren, Wangmeng Zuo. Towards Learning Multi-Domain Crowd Counting
6558 -- 6570Shijie Wang, Zhihui Wang, Haojie Li, Jianlong Chang, Wanli Ouyang, Qi Tian 0001. Semantic-Guided Information Alignment Network for Fine-Grained Image Recognition
6571 -- 6594Mengjie Hu, Xiaotong Zhu, Haotian Wang, Shixiang Cao, Chun Liu 0004, Qing Song 0006. STDFormer: Spatial-Temporal Motion Transformer for Multiple Object Tracking
6595 -- 6608Jinguang Cheng, Zongwei Wu, Shuo Wang 0010, Cédric Demonceaux, Qiuping Jiang. Bidirectional Collaborative Mentoring Network for Marine Organism Detection and Beyond
6609 -- 6621Lingling Zhang, Xinyu Zhang, QianYing Wang, Wenjun Wu, Xiaojun Chang, Jun Liu 0002. RPMG-FSS: Robust Prior Mask Guided Few-Shot Semantic Segmentation
6622 -- 6634Lihua Zhou, Siying Xiao, Mao Ye 0001, Xiatian Zhu, Shuaifeng Li. Adaptive Mutual Learning for Unsupervised Domain Adaptation
6635 -- 6648Lei Ma 0004, Fan Zhao, Hanyu Hong, Lei Wang 0068, Ying Zhu 0002. Complementary Parts Contrastive Learning for Fine-Grained Weakly Supervised Object Co-Localization
6649 -- 6660Haoran Li, Yulan Guo, Zhenwen Ren, F. Richard Yu, Jiali You 0002, Xiaojian You. Explicit Local Coupling Global Structure Clustering
6661 -- 6678Qi Zhao, Shuchang Lyu, Lijiang Chen, Binghao Liu, Ting-Bing Xu, Guangliang Cheng, Wenquan Feng. Learn by Oneself: Exploiting Weight-Sharing Potential in Knowledge Distillation Guided Ensemble Network
6679 -- 6692Maregu Assefa, Wei Jiang 0016, Kumie Alemu Gedamu, Getinet Yilma, Deepak Adhikari, Melese Ayalew, Abegaz Mohammed Seid, Aiman Erbad. Actor-Aware Self-Supervised Learning for Semi-Supervised Video Representation Learning
6693 -- 6707Yuan Rao, Yakun Ju, Cong Li, Eric Rigall, Jian Yang, Hao Fan, Junyu Dong. Learning General Descriptors for Image Matching With Regression Feedback
6708 -- 6720Yuxin Sun, Li Su, Shouzheng Yuan, Hao Meng. DANet: Dual-Branch Activation Network for Small Object Instance Segmentation of Ship Images
6721 -- 6732Li He 0002, Hong Zhang 0013. Doubly Stochastic Distance Clustering
6733 -- 6746Qingxuan Lv, Yuezun Li, Junyu Dong, Ziqian Guo. LaFea: Learning Latent Representation Beyond Feature for Universal Domain Adaptation
6747 -- 6763Rong Zhao, Xie Han, Xindong Guo, Liqun Kuang, Xiaowen Yang, Fusheng Sun. Exploring the Point Feature Relation on Point Cloud for Multi-View Stereo
6764 -- 6776Zehua Chai, Yongguo Ling, Zhiming Luo, Dazhen Lin, Min Jiang 0005, Shaozi Li. Dual-Stream Transformer With Distribution Alignment for Visible-Infrared Person Re-Identification
6777 -- 6787Guosong Jiang, Pengfei Zhu, Yu Wang 0106, Qinghua Hu. OpenMix+: Revisiting Data Augmentation for Open Set Recognition
6788 -- 6803Yanan Wu, Songhe Feng, Yang Wang 0003. Semantic-Aware Graph Matching Mechanism for Multi-Label Image Recognition
6804 -- 6818Wenqian Dong, Teng Yang, Jiahui Qu, Tian Zhang, Song Xiao 0001, Yunsong Li. Joint Contextual Representation Model-Informed Interpretable Network With Dictionary Aligning for Hyperspectral and LiDAR Classification
6819 -- 6831Lulu Tian, Hongxun Yao, Ming Li. FakePoI: A Large-Scale Fake Person of Interest Video Detection Benchmark and a Strong Baseline
6832 -- 6844Lei Yang, Xinyu Zhang 0001, Jun Li 0082, Li Wang, Minghan Zhu, Chuang Zhang, Huaping Liu 0001. Mix-Teaching: A Simple, Unified and Effective Semi-Supervised Learning Framework for Monocular 3D Object Detection
6845 -- 6859Yuejian Wu, Linqing Zhao, Jiwen Lu, Haibin Yan. Dense Hybrid Proposal Modulation for Lane Detection
6860 -- 6871Yizhe Ma, Fangjian Lin, Sitong Wu, Shengwei Tian, Long Yu 0001. PRSeg: A Lightweight Patch Rotate MLP Decoder for Semantic Segmentation
6872 -- 6886Yilei Zhang, Yi Tian, Sihui Zhang, Yaping Huang. Dual-Uncertainty Guided Cycle-Consistent Network for Zero-Shot Learning
6887 -- 6896Chenping Fu, Xin Fan 0001, Jiewen Xiao, Wanqi Yuan, Risheng Liu, Zhongxuan Luo. Learning Heavily-Degraded Prior for Underwater Object Detection
6897 -- 6911Zechu Zhou, Xinyu Zhou, Zhaoyu Chen, Pinxue Guo, Qian-Yu Liu, Wenqiang Zhang. Memory Network With Pixel-Level Spatio-Temporal Learning for Visual Object Tracking
6912 -- 6923Junying Huang, Junhao Cao, Liang Lin, Dongyu Zhang. IRA-FSOD: Instant-Response and Accurate Few-Shot Object Detector
6924 -- 6938Haoyang Cheng, Hongliang Li 0001, Qingbo Wu 0001, Heqian Qiu, Xiaoliang Zhang 0002, Fanman Meng, Taijin Zhao. Disturbed Augmentation Invariance for Unsupervised Visual Representation Learning
6939 -- 6951Md. Moniruzzaman, Zhaozheng Yin. Collaborative Foreground, Background, and Action Modeling Network for Weakly Supervised Temporal Action Localization
6952 -- 6964Jingwei Xin, Zikai Wei, Nannan Wang 0001, Jie Li 0001, Xiaoyu Wang 0002, Xinbo Gao 0001. Learning a High Fidelity Identity Representation for Face Frontalization
6965 -- 6980Minjie Ren, Xiangdong Huang, Jing Liu 0002, Ming Liu, Xuanya Li, An-An Liu. MALN: Multimodal Adversarial Learning Network for Conversational Emotion Recognition
6981 -- 6995Yue Qian, Junhui Hou, Qijian Zhang, Yiming Zeng, Sam Kwong, Ying He 0001. Task-Oriented Compact Representation of 3D Point Clouds via A Matrix Optimization-Driven Network
6996 -- 7008Mingxuan Li, Wen Ji. Lightweight Multiattention Recursive Residual CNN-Based In-Loop Filter Driven by Neuron Diversity
7009 -- 7023Bolin Chen, Zhao Wang 0004, Binzhe Li, Shiqi Wang 0001, Yan Ye. Compact Temporal Trajectory Representation for Talking Face Video Compression
7024 -- 7035Xu Wang 0028, Dezhong Peng, Peng Hu 0002, Yunhong Gong, Yong Chen. Cross-Domain Alignment for Zero-Shot Sketch-Based Image Retrieval
7036 -- 7049Tian-Bao Li, An-An Liu, Dan Song 0006, Wenhui Li 0001, Xuanya Li, Yuting Su 0001. Focus on Hard Samples: Hierarchical Unbiased Constraints for Cross-Domain 3D Model Retrieval
7050 -- 7065Jing-Ming Guo, Alim Wicaksono Hari Prayuda, Heri Prasetyo, Sankarasrinivasan Seshathiri. Deep Learning-Based Image Retrieval With Unsupervised Double Bit Hashing
7066 -- 7079Xiaolin Yin, Shaowu Wu, Ke Wang, Wei Lu 0001, Yicong Zhou, Jiwu Huang. Anti-Rounding Image Steganography With Separable Fine-Tuned Network

Volume 33, Issue 10

5332 -- 5344Ziyang Liu, Zhengguo Li, Weihai Chen, Xingming Wu, Zhong Liu. Unsupervised Optical Flow Estimation for Differently Exposed Images in LDR Domain
5345 -- 5359Xiangyang Wang, Yupan Lin, Yixuan Shen, Panpan Niu. UDTCWT-PHFMs Domain Statistical Image Watermarking Using Vector BW-Type R Distribution
5360 -- 5374Mingde Yao, Dongliang He, Xin Li, Fu Li, Zhiwei Xiong. Toward Interactive Self-Supervised Denoising
5375 -- 5390Ho Sub Lee, Sung In Cho. Locally Adaptive Channel Attention-Based Spatial-Spectral Neural Network for Image Deblurring
5391 -- 5405Nianzu Qiao, Jia Sun, Quanbo Ge, Changyin Sun. UIE-FSMC: Underwater Image Enhancement Based on Few-Shot Learning and Multi-Color Space
5406 -- 5419Guancheng Chen, Junli Lin, Huabiao Qin. UAMD-Net: A Unified Adaptive Multimodal Neural Network for Dense Depth Completion
5420 -- 5432Jun Li 0043, Yuxuan Han, Yin Gao, Qiming Li, Sumei Wang. An Enhance Relative Total Variation With BF Model for Edge-Preserving Image Smoothing
5433 -- 5443Arbish Akram, Nazar Khan. SARGAN: Spatial Attention-Based Residuals for Facial Expression Manipulation
5444 -- 5457Haozhe Xing, Shuyong Gao, Yan Wang 0068, Xujun Wei, Hao Tang 0005, Wenqiang Zhang. Go Closer to See Better: Camouflaged Object Detection via Object Area Amplification and Figure-Ground Conversion
5458 -- 5469Weiwei Cai, Huaidong Zhang, Xuemiao Xu, Shengfeng He, Kun Zhang 0001, Jing Qin 0001. Contextual-Assisted Scratched Photo Restoration
5470 -- 5485Yuxin Feng, Xiaozhe Meng, Fan Zhou 0001, Weisi Lin, Zhuo Su 0001. Real-World Non-Homogeneous Haze Removal by Sliding Self-Attention Wavelet Network
5486 -- 5497Wentao Ma, Qingchao Chen, Tongqing Zhou, Shan Zhao 0002, Zhiping Cai. Using Multimodal Contrastive Knowledge Distillation for Video-Text Retrieval
5498 -- 5509Wuyang Luo, Su Yang, Weishan Zhang. Reference-Guided Large-Scale Face Inpainting With Identity and Texture Control
5510 -- 5524Siyuan Peng, Jingxing Yin, Zhijing Yang, Badong Chen, Zhiping Lin 0001. Multiview Clustering via Hypergraph Induced Semi-Supervised Symmetric Nonnegative Matrix Factorization
5525 -- 5537Yanni Zhang, Qiang Li, Miao Qi, Di Liu 0004, Jun Kong, Jianzhong Wang. Multi-Scale Frequency Separation Network for Image Deblurring
5538 -- 5548Zheng Wang 0044, Zhenwei Gao, Guoqing Wang 0001, Yang Yang 0002, Heng Tao Shen. Visual Embedding Augmentation in Fourier Domain for Deep Metric Learning
5549 -- 5561Guanghui Yue 0001, Di Cheng, Tianwei Zhou, Jingwen Hou, Weide Liu, Long Xu, Tianfu Wang 0001, Jun Cheng 0003. Perceptual Quality Assessment of Enhanced Colonoscopy Images: A Benchmark Dataset and an Objective Method
5562 -- 5575Xin Wen, Weizhi Nie, Jing Liu 0002, Yuting Su 0001. MRFT: Multiscale Recurrent Fusion Transformer Based Prior Knowledge for Bit-Depth Enhancement
5576 -- 5586Jin Luo, ZhaoHui Tang, Hu Zhang, Ying Fan, Yongfang Xie, Weihua Gui 0001. A Binocular Camera Calibration Method in Froth Flotation Based on Key Frame Sequences and Weighted Normalized Tilt Difference
5587 -- 5604Lihao Zhuang, Liquan Shen, Zhengyong Wang, Yinyi Li. UCSNet: Priors Guided Adaptive Compressive Sensing Framework for Underwater Images
5605 -- 5616Xiangqing Liu, Gang Li, Zhenyang Zhao, Qi Cao, Zijun Zhang, Shaoan Yan, Jianbin Xie, Minghua Tang. EAF-WGAN: Enhanced Alignment Fusion-Wasserstein Generative Adversarial Network for Turbulent Image Restoration
5617 -- 5630Xiao-Qian Liu, Xue-Ying Ding, Xin Luo 0006, Xin-Shun Xu. Unsupervised Domain Adaptation via Class Aggregation for Text Recognition
5631 -- 5644Renjie Xu, Xinghao Yang, Xingxing Yao, Dapeng Tao, Weijia Cao, Xiaoping Lu, Weifeng Liu 0001. Self-Paced Hard Task-Example Mining for Few-Shot Classification
5645 -- 5654Jianqin Sun, Xianchao Xiu, Ziyan Luo, Wanquan Liu. Learning High-Order Multi-View Representation by New Tensor Canonical Correlation Analysis
5655 -- 5663Fangjian Lin, Zhanhao Liang, Sitong Wu, Junjun He, Kai Chen, Shengwei Tian. StructToken: Rethinking Semantic Segmentation With Structural Prior
5664 -- 5678Junyao Sun, Jingkai Zhou, Qiong Liu. PoiseNet: Dealing With Data Imbalance in DensePose
5679 -- 5691Jinxian Liu, Ye Chen, Bingbing Ni, Zhenbo Yu. Joint Global and Dynamic Pseudo Labeling for Semi-Supervised Point Cloud Sequence Segmentation
5692 -- 5706Si Chen 0002, Xueyan Zhu, Yan Yan 0001, Shunzhi Zhu, Shao-Zi Li, Da-Han Wang. Identity-Aware Contrastive Knowledge Distillation for Facial Attribute Recognition
5707 -- 5720Hua Yu, Xuanzhe Fan, Yaqing Hou, WenBin Pei, Hongwei Ge, Xin Yang 0011, Dongsheng Zhou, Qiang Zhang 0008, Mengjie Zhang 0001. Toward Realistic 3D Human Motion Prediction With a Spatio-Temporal Cross- Transformer Approach
5721 -- 5733Tianyu Sun, Guodong Zhang 0004, Wenming Yang, Jing-Hao Xue, Guijin Wang. TROSD: A New RGB-D Dataset for Transparent and Reflective Object Segmentation in Practice
5734 -- 5749Zhenzhen Quan, Qingshan Chen, Moyan Zhang, Weifeng Hu, Qiang Zhao, Jiangang Hou, Yujun Li, Zhi Liu 0004. MAWKDN: A Multimodal Fusion Wavelet Knowledge Distillation Approach Based on Cross-View Attention for Action Recognition
5750 -- 5763Yiming Yang, Weipeng Hu, Haifeng Hu 0001. Neutral Face Learning and Progressive Fusion Synthesis Network for NIR-VIS Face Recognition
5764 -- 5777Zhengzheng Sun, Lianfang Tian, Qiliang Du, Wenzhi Liao, Zhaolin Wang. Adaptive Anchor Matching Strategy for Face Detection
5778 -- 5789Tao Jing, Ming Zeng 0001, Qing-Hao Meng. SmokePose: End-to-End Smoke Keypoint Detection
5790 -- 5801Ming Yuan, Dong Xu. Spatio-Temporal Feature Pyramid Interactive Attention Network for Egocentric Gaze Prediction
5802 -- 5813Jinjia Peng, Guangqi Jiang, Huibing Wang. Adaptive Memorization With Group Labels for Unsupervised Person Re-Identification
5814 -- 5827Ning Wang, Guangming Zhu 0001, Hongsheng Li, Mingtao Feng, Xia Zhao, Lan Ni, Peiyi Shen, Lin Mei 0001, Liang Zhang 0010. Exploring Spatio-Temporal Graph Convolution for Video-Based Human-Object Interaction Recognition
5828 -- 5843Yuhongze Zhou, Liguang Zhou, Tin Lun Lam, Yangsheng Xu. Sampling Propagation Attention With Trimap Generation Network for Natural Image Matting
5844 -- 5854Xiaoyu Tian, Ming Yang, Qian Yu, Junhai Yong, Dong Xu 0001. MedoidsFormer: A Strong 3D Object Detection Backbone by Exploiting Interaction With Adjacent Medoid Tokens
5855 -- 5867Wenyu Liu 0005, Wentong Li, Jianke Zhu, Miaomiao Cui, Xuansong Xie, Lei Zhang 0006. Improving Nighttime Driving-Scene Segmentation via Dual Image-Adaptive Learnable Filters
5868 -- 5881Zhen Mei, Peng Ye, Hancheng Ye, Baopu Li, Jinyang Guo, Tao Chen 0003, Wanli Ouyang. Automatic Loss Function Search for Adversarial Unsupervised Domain Adaptation
5882 -- 5893Lorenzo Papa, Paolo Russo 0001, Irene Amerini. METER: A Mobile Vision Transformer Architecture for Monocular Depth Estimation
5894 -- 5907Yuhang Zhou, Fuxiang Huang, Weijie Chen, Shiliang Pu, Lei Zhang 0038. Stochastic Gradient Perturbation: An Implicit Regularizer for Person Re-Identification
5908 -- 5920Zeqi Chen, Zhichao Cui, Chi Zhang 0020, Jiahuan Zhou, Yuehu Liu. Dual Clustering Co-Teaching With Consistent Sample Mining for Unsupervised Person Re-Identification
5921 -- 5931Shaokun Wang, Weiwei Shi 0003, Songlin Dong, Xinyuan Gao, Xiang Song, Yihong Gong. Semantic Knowledge Guided Class-Incremental Learning
5932 -- 5946Xiao Wang, Weirong Ye, Zhongang Qi, Guangge Wang, Jianping Wu, Ying Shan, Xiaohu Qie, Hanzi Wang. Task-Aware Dual-Representation Network for Few-Shot Action Recognition
5947 -- 5958Tiantian Gong, Kaixiang Chen, Liyan Zhang 0001, Junsheng Wang. Debiased Contrastive Curriculum Learning for Progressive Generalizable Person Re-Identification
5959 -- 5972Mingyao Hong, Xinfeng Zhang 0001, Guorong Li, Qingming Huang. Multi-Modal Multi-Grained Embedding Learning for Generalized Zero-Shot Video Classification
5973 -- 5985Li Chen, Cong Peng 0001, Bingchao Zhao. Novel Multi-Task Learning for Motion Magnification
5986 -- 5998Hao Wang 0073, Tong Jia, Bowen Ma, Qilong Wang, Wangmeng Zuo. Fully Cascade Consistency Learning for One-Stage Object Detection
5999 -- 6012Hui Li 0085, Mingjie Sun, Jimin Xiao, Eng Gee Lim, Yao Zhao 0001. Fully and Weakly Supervised Referring Expression Segmentation With End-to-End Learning
6013 -- 6025Chuanchuan Chen, Dongrui Liu, Changqing Xu, Trieu-Kien Truong. SAKS: Sampling Adaptive Kernels From Subspace for Point Cloud Graph Convolution
6026 -- 6040Xin Deng 0002, Yufan Deng, Ren Yang, Wenzhe Yang, Radu Timofte, Mai Xu. MASIC: Deep Mask Stereo Image Compression
6041 -- 6056Jeeyoon Park, Jeehwan Lee, Bumyoon Kim, Byeungwoo Jeon. Learning-Based Early Transform Skip Mode Decision for VVC Screen Content Coding
6057 -- 6071Dandan Ding, Junjie Wang, Guangkun Zhen, Debargha Mukherjee, Urvang Joshi, Zhan Ma. Neural Adaptive Loop Filtering for Video Coding: Exploring Multi-Hypothesis Sample Refinement
6072 -- 6085Yunhao Mao, Meng Wang 0017, Zhangkai Ni, Shiqi Wang 0001, Sam Kwong. Neural Network Based Rate Control for Versatile Video Coding
6086 -- 6100Xin Fang, Yiping Duan, Qiyuan Du, Xiaoming Tao, Fan Li 0003. Sketch Assisted Face Image Coding for Human and Machine Vision: A Joint Training Approach
6101 -- 6116Zheng Liu, Tianyi Li, Ying Chen, Kaijin Wei, Mai Xu, Honggang Qi. Deep Multi-Task Learning Based Fast Intra-Mode Decision for Versatile Video Coding
6117 -- 6130Hao Hao, Changqiao Xu, Wei Zhang 0049, Shujie Yang, Gabriel-Miro Muntean. Computing Offloading With Fairness Guarantee: A Deep Reinforcement Learning Method
6131 -- 6143Hongguang Zhu, Chunjie Zhang, Yunchao Wei, Shujuan Huang, Yao Zhao 0001. ESA: External Space Attention Aggregation for Image-Text Retrieval
6144 -- 6158Yan Wang, Yuting Su 0001, Wenhui Li 0001, Jun Xiao 0001, Xuanya Li, An-An Liu. Dual-Path Rare Content Enhancement Network for Image and Text Matching
6159 -- 6172Tianshi Wang, Lei Zhu 0002, Zheng Zhang 0006, Huaxiang Zhang 0001, Junwei Han. Targeted Adversarial Attack Against Deep Cross-Modal Hashing Retrieval
6173 -- 6184Yijun Cao, Xian-Shi Zhang, Fuya Luo, Chuan Lin 0003, Yong-Jie Li. Unsupervised Visual Odometry and Action Integration for PointGoal Navigation in Indoor Environment
6185 -- 6190Chaofan He, Roberto Gerson De Albuquerque Azevedo, Jiacheng Chen, Shuyuan Zhu, Bing Zeng, Pascal Frossard. Quality-Constrained Encoding Optimization for Omnidirectional Video Streaming
6191 -- 6197Mengke Huang, Gongyang Li, Zhi Liu 0003, Linchao Zhu. Lightweight Distortion-Aware Network for Salient Object Detection in Omnidirectional Images

Volume 33, Issue 1

1 -- 15Wenzhu Yan, Ming Yang, Yanmeng Li. Robust Low Rank and Sparse Representation for Multiple Kernel Dimensionality Reduction
16 -- 29Xijie Xiang, Lin Zhu 0012, Jianing Li, Yixuan Wang, Tie-Jun Huang 0001, Yonghong Tian 0001. Learning Super-Resolution Reconstruction for High Temporal Resolution Spike Stream
30 -- 42Jing-Hui Shi, Qing Zhang 0004, Yu-Hao Tang, Zhong-Qun Zhang. Polyp-Mixer: An Efficient Context-Aware MLP-Based Paradigm for Polyp Segmentation
43 -- 58Wenhua Zhang, Licheng Jiao, Fang Liu 0001, Shuyuan Yang, Jia Liu 0020. DFAT: Dynamic Feature-Adaptive Tracking
59 -- 73Angfan Zhu, Yang Xiao 0007, Chengxin Liu, Zhiguo Cao 0001. Robust LiDAR-Camera Alignment With Modality Adapted Local-to-Global Representation
74 -- 87Liangchen Hu, Zhenlei Dai, Lei Tian 0007, Wensheng Zhang 0002. Class-Oriented Self-Learning Graph Embedding for Image Compact Representation
88 -- 103Xiaotian Wu, Na An, Zishuo Xu. Sharing Multiple Secrets in XOR-Based Visual Cryptography by Non-Monotonic Threshold Property
104 -- 117Lei Wang 0018, Xun-Yu Liu, Xiaoliang Ma, Jiaji Wu, Jun Cheng 0002, MengChu Zhou. A Progressive Quadric Graph Convolutional Network for 3D Human Mesh Recovery
118 -- 131Lifang Zhou, Jiaqi Li, Bangjun Lei, Weisheng Li 0001, Jiaxu Leng. Correlation Filter Tracker With Sample-Reliability Awareness and Self-Guided Update
132 -- 145Feng Zhang, Xiaoyue Jiang, Zhaoqiang Xia, Moncef Gabbouj, Jinye Peng, Xiaoyi Feng. Non-Local Color Compensation Network for Intrinsic Image Decomposition
146 -- 159Qinghai Zheng, Jihua Zhu, Zhongyu Li, Haoyu Tang. Graph-Guided Unsupervised Multiview Representation Learning
160 -- 171Mingdeng Cao, Yanbo Fan, Yong Zhang 0034, Jue Wang 0001, Yujiu Yang. VDTR: Video Deblurring With Transformer
172 -- 185Yuxin Mao, Zhexiong Wan, Yuchao Dai, Xin Yu 0002. Deep Idempotent Network for Efficient Single Image Blind Deblurring
186 -- 199Chao Fan, Hongyuan Yu, Yan Huang 0008, Caifeng Shan, Liang Wang 0001, Chenglong Li 0002. SiamON: Siamese Occlusion-Aware Network for Visual Tracking
200 -- 212Xianlin Zeng, Yalong Jiang, Wenrui Ding, Hongguang Li, Yafeng Hao, Zifeng Qiu. A Hierarchical Spatio-Temporal Graph Convolutional Neural Network for Anomaly Detection in Videos
213 -- 227Longrong Yang, Hongliang Li 0001, Fanman Meng, Qingbo Wu 0001, King Ngi Ngan. Task-Specific Loss for Robust Instance Segmentation With Noisy Class Labels
228 -- 241Zhe Wu, Xinfeng Zhang 0001, Geng Tian, Yaowei Wang, Qingming Huang. Spatial-Temporal Graph Network for Video Crowd Counting
242 -- 256Longbin Yan, Yunxiao Qin, Jie Chen 0022. Scale-Balanced Real-Time Object Detection With Varying Input-Image Resolution
257 -- 268Jinjing Gu, Hanli Wang, Ruichao Fan. Coherent Visual Storytelling via Parallel Top-Down Visual and Topic Attention
269 -- 282Jian Xu, Bo Liu 0002, Yanshan Xiao. A Variational Inference Method for Few-Shot Learning
283 -- 297Shengrong Yang, Weihong Liu, Yangbin Yu, Haifeng Hu 0001, Dihu Chen, Tao Su. Diverse Feature Learning Network With Attention Suppression and Part Level Background Suppression for Person Re-Identification
298 -- 311Qing Zhang 0004, Rui Zhao, Liqian Zhang. TCRNet: A Trifurcated Cascaded Refinement Network for Salient Object Detection
312 -- 325Jiayin Sun, Hong Wang, Qiulei Dong. MoEP-AE: Autoencoding Mixtures of Exponential Power Distributions for Open-Set Recognition
326 -- 341Yuxuan Liu, Hongwei Ge, Liang Sun 0003, Yaqing Hou. Complementary Attention-Driven Contrastive Learning With Hard-Sample Exploring for Unsupervised Domain Adaptive Person Re-ID
342 -- 353Xin Xiong, Weidong Min, Qi Wang 0061, Cheng Zha. Human Skeleton Feature Optimizer and Adaptive Structure Enhancement Graph Convolution Network for Action Recognition
354 -- 366Weiqi Sun, Rui Su, Qian Yu, Dong Xu 0001. Slow Motion Matters: A Slow Motion Enhanced Network for Weakly Supervised Temporal Action Localization
367 -- 378Yiming Wang, Dongxia Chang, Zhiqiang Fu, Jie Wen 0001, Yao Zhao 0001. Incomplete Multiview Clustering via Cross-View Relation Transfer
379 -- 392Yu Ren, Yang Cong, Jiahua Dong, Gan Sun. Uni3DA: Universal 3D Domain Adaptation for Object Recognition
393 -- 406Liqi Yan, Qifan Wang, Siqi Ma, Jingang Wang, Changbin Yu. Solve the Puzzle of Instance Segmentation in Videos: A Weakly Supervised Framework With Spatio-Temporal Collaboration
407 -- 420Ziwei Wei, Benben Niu, Haodong Xiao, Yun He. Isolated Points Prediction via Deep Neural Network on Point Cloud Lossless Geometry Compression
421 -- 433Zhisen Tang, Hanli Wang, Xiaokai Yi, Yun Zhang 0002, Sam Kwong, C. C. Jay Kuo. Joint Graph Attention and Asymmetric Convolutional Neural Network for Deep Image Compression
434 -- 444Hui Lan, Zhe Ji, Cheolkon Jung, Dan Zou, Ming Li. Multisensor Collaboration Network for Video Compression Based on Wavelet Decomposition
445 -- 456Ye Yuan, Jiawan Zhang. Unsupervised Video Summarization via Deep Reinforcement Learning With Shot-Level Semantics