Journal: IEEE Trans. Circuits Syst. Video Techn.

Volume 34, Issue 7

5215 -- 5228YinLong Liu, Guang Chen 0001, Alois Knoll. Absolute Pose Estimation With a Known Direction by Motion Decoupling
5229 -- 5241Pengcheng Li, Chenqiang Gao, Fangcen Liu, Deyu Meng, Yan Yan 0002. THISNet: Tooth Instance Segmentation on 3D Dental Models via Highlighting Tooth Regions
5242 -- 5254Xiao Kang, Xingbo Liu, Xuening Zhang, Xiushan Nie, Yilong Yin. Online Discriminative Cross-Modal Hashing
5255 -- 5265Yihuan Zhu, Yunan Liu, Chunpeng Wang, Simiao Wang, Mingyu Lu. Intermediate Domain-Based Meta Learning Framework for Adaptive Object Detection
5266 -- 5281Jingyu Li, Lei Zhang 0119, Kun Zhang 0040, Bo Hu, Hongtao Xie, Zhendong Mao. Cascade Semantic Prompt Alignment Network for Image Captioning
5282 -- 5294Hao Qi, Huiyu Zhou 0001, Junyu Dong, Xinghui Dong. Small Sample Image Segmentation by Coupling Convolutions and Transformers
5295 -- 5305Hongbo Xu, Lichun Wang 0002, Kai Xu 0012, Fangyu Fu, Baocai Yin, Qingming Huang. A New Training Data Organization Form and Training Mode for Unbiased Scene Graph Generation
5306 -- 5320Yong Luo, Hongwei Ge, Yuxuan Liu, Chunguo Wu. Representation Robustness and Feature Expansion for Exemplar-Free Class-Incremental Learning
5321 -- 5334Qi Gao, Mingfeng Yin, Xiang Wu, Di Liu, Yuming Bo. Online Multi-Scale Classification and Global Feature Modulation for Robust Visual Tracking
5335 -- 5349Yu Liu 0040, Sitong Su, Junchen Zhu, Feng Zheng, Lianli Gao, Jingkuan Song. Allowing Supervision in Unsupervised Deformable- Instances Image-to-Image Translation
5350 -- 5360Cheng Wang, Yuxin Fang, Jiemin Fang, Peng Guo, Rui Wu 0018, He Huang, Xinggang Wang, Chang Huang, Wenyu Liu 0001. Efficient Task-Specific Feature Re-Fusion for More Accurate Object Detection and Instance Segmentation
5361 -- 5375Shikun Zhang, Jiaqi Yang 0002, Zhaoshuai Qi, Yanning Zhang 0001. Toward Meta-Shape-Based Multi-View 3D Point Cloud Registration: An Evaluation
5376 -- 5388Jinyu Yang, Mingqi Gao 0003, Runmin Cong, Chengjie Wang, Feng Zheng, Ales Leonardis. Unveiling the Power of Visible-Thermal Video Object Segmentation
5389 -- 5399Sen Xu, Shikui Wei, Tao Ruan, Yao Zhao 0001. ESNet: An Efficient Framework for Superpixel Segmentation
5400 -- 5413Hongchen Tan, Baocai Yin, Kaiqiang Xu, Huasheng Wang, Xiuping Liu, Xin Li 0003. Attention-Bridged Modal Interaction for Text-to-Image Generation
5414 -- 5423Zhizhe Liu, Shuai Zheng 0005, Xiaoyi Sun, Zhenfeng Zhu, Yawei Zhao, Xuebing Yang, Yao Zhao 0001. The Devil Is in the Boundary: Boundary-Enhanced Polyp Segmentation
5424 -- 5439Hanqing Yang 0002, Sijia Cai, Bing Deng, Jieping Ye, Guosheng Lin, Yu Zhang 0018. Context-Aware and Semantic-Consistent Spatial Interactions for One-Shot Object Detection Without Fine-Tuning
5440 -- 5451Weijia Liu, Jiuxin Cao, Ran Wei, Xuelin Zhu, Bo Liu 0004. Enhancing Micro-Video Venue Recognition via Multi-Modal and Multi-Granularity Object Relations
5452 -- 5465Xihang Hu, Xiaoli Zhang 0001, Fasheng Wang, Jing Sun, Fuming Sun. Efficient Camouflaged Object Detection Network Based on Global Localization Perception and Local Guidance Refinement
5466 -- 5479Zonglin Li, Zhaoxin Zhang, Shengfeng He, Quanling Meng, Shengping Zhang, Bineng Zhong, Rongrong Ji. Identity-Aware Variational Autoencoder for Face Swapping
5480 -- 5492Yidan Fan, Yongxin Yu, Wenhuan Lu, Yahong Han. Weakly-Supervised Video Anomaly Detection With Snippet Anomalous Attention
5493 -- 5504Jiashuo Li, Songlin Dong, Yihong Gong, Yuhang He, Xing Wei 0001. Analogical Learning-Based Few-Shot Class-Incremental Learning
5505 -- 5518An Tao, Yueqi Duan, Yingqi Wang, Jiwen Lu, Jie Zhou 0001. Dynamics-Aware Adversarial Attack of Adaptive Neural Networks
5519 -- 5532Yuhang He, Zhiheng Ma, Xing Wei 0001, Yihong Gong. Knowledge Synergy Learning for Multi-Modal Tracking
5533 -- 5545Mengzhu Wang, Shanshan Wang 0008, Xun Yang 0001, Jianlong Yuan, Wenju Zhang. Equity in Unsupervised Domain Adaptation by Nuclear Norm Maximization
5546 -- 5559Jin Liu, Huiyuan Fu, Xin Wang 0001, Huadong Ma. SwinIT: Hierarchical Image-to-Image Translation Framework Without Cycle Consistency
5560 -- 5574Zhaobin Chang, Xiong Gao, Na Li, Huiyu Zhou 0001, Yonggang Lu. DRNet: Disentanglement and Recombination Network for Few-Shot Semantic Segmentation
5575 -- 5588Minghao Zou, Qingtian Zeng, Xue Zhang. Weakly-Supervised Action Learning in Procedural Task Videos via Process Knowledge Decomposition
5589 -- 5602Zaiyang Yu, Lusi Li, Jinlong Xie, Changshuo Wang 0001, Weijun Li, Xin Ning 0001. Pedestrian 3D Shape Understanding for Person Re-Identification via Multi-View Learning
5603 -- 5615Yuxiao Wang 0003, Qi Liu 0005, Yu Lei. TED-Net: Dispersal Attention for Perceiving Interaction Region in Indirectly-Contact HOI Detection
5616 -- 5629Zhifu Zhao, Ziwei Chen, Jianan Li 0003, Xiaotian Wang 0001, Xuemei Xie, Lei Huang, Wanxin Zhang, Guangming Shi. Glimpse and Zoom: Spatio-Temporal Focused Dynamic Network for Skeleton-Based Action Recognition
5630 -- 5640Zhuohao Sun, Yiqiao Qiu, Zhijun Tan, Weishi Zheng 0001, Ruixuan Wang. Classifier-Head Informed Feature Masking and Prototype-Based Logit Smoothing for Out-of-Distribution Detection
5641 -- 5652Mengyu Gao, Qiulei Dong. Adaptive Conditional Denoising Diffusion Model With Hybrid Affinity Regularizer for Generalized Zero-Shot Learning
5653 -- 5664Ren Wang, Tae Sung Kim, Jin-Sung Kim, Hyuk-Jae Lee. Toward Real-World Multi-View Object Classification: Dataset, Benchmark, and Analysis
5665 -- 5676Saihui Hou, Panjian Huang, Xu Liu 0008, Chunshui Cao, Yongzhen Huang. Cloth-Imbalanced Gait Recognition via Hallucination
5677 -- 5692Doyoung Kim, Taewan Kim, Inwoong Lee, Sanghoon Lee 0001. Kinematic Diversity and Rhythmic Alignment in Choreographic Quality Transformers for Dance Quality Assessment
5693 -- 5703Yang Xu, Yan Yan 0001, Jing-Hao Xue, Yang Hua, Hanzi Wang. Unpaired Caricature-Visual Face Recognition via Feature Decomposition-Restoration-Decomposition
5704 -- 5715Yunbo Rao, Qingsong Lv, Andrei Sharf, Zhanglin Cheng. RWS: Refined Weak Slice for Semantic Segmentation Enhancement
5716 -- 5727Mochu Xiang, Jing Zhang 0052, Nick Barnes, Yuchao Dai. Measuring and Modeling Uncertainty Degree for Monocular Depth Estimation
5728 -- 5741Fan Qi, Huaiwen Zhang, Xiaoshan Yang, Changsheng Xu. A Versatile Multimodal Learning Framework for Zero-Shot Emotion Recognition
5742 -- 5752Mingzhi Yuan, Kexue Fu, Zhihao Li, Yucong Meng, Ao Shen, Manning Wang. Robust Point Cloud Registration via Random Network Co-Ensemble
5753 -- 5764Jian Wang 0113, Fan Li 0003, Yi An, Xuchong Zhang, Hongbin Sun 0001. Toward Robust LiDAR-Camera Fusion in BEV Space via Mutual Deformable Attention and Temporal Aggregation
5765 -- 5775Zhepeng Gong, Guobao Xiao, Ziwei Shi, Riqing Chen, Jun Yu 0002. MSGA-Net: Progressive Feature Matching via Multi-Layer Sparse Graph Attention
5776 -- 5789Dawei Zhang 0002, Xin Xiao, Zhonglong Zheng, Yunliang Jiang, Yi Yang. Probabilistic Assignment With Decoupled IoU Prediction for Visual Tracking
5790 -- 5804Ruixuan Cong, Hao Sheng 0001, Dazhi Yang, Da Yang 0001, Rongshan Chen, Sizhe Wang, Zhenglong Cui. End-to-End Semantic Segmentation Utilizing Multi-Scale Baseline Light Field
5805 -- 5817Jiashan Wu, Chunbo Lang, Gong Cheng 0003, Xingxing Xie, Junwei Han. Retentive Compensation and Personality Filtering for Few-Shot Remote Sensing Object Detection
5818 -- 5829Bowei Yan, Chunbo Lang, Gong Cheng 0003, Junwei Han. Understanding Negative Proposals in Generic Few-Shot Object Detection
5830 -- 5842Zhihao Chen 0004, Liang Wan, Yefan Xiao, Lei Zhu 0003, Huazhu Fu. Learning Physical-Spatio-Temporal Features for Video Shadow Removal
5843 -- 5855Jinwei Ren, Jianke Zhu. Pyramid Deep Fusion Network for Two-Hand Reconstruction From RGB-D Images
5856 -- 5867Junwei Zhao, Shiliang Zhang, Zhaofei Yu, Tiejun Huang 0001. SpiReco: Fast and Efficient Recognition of High-Speed Moving Objects With Spike Camera
5868 -- 5883Bicheng Guo, Lilin Xu, Tao Chen 0003, Peng Ye, Shibo He, Haoyu Liu, Jiming Chen 0001. Latency-Aware Neural Architecture Performance Predictor With Query-to-Tier Technique
5884 -- 5896Tianshu Song, Leida Li, Deqiang Cheng, Pengfei Chen 0003, Jinjian Wu. Active Learning-Based Sample Selection for Label-Efficient Blind Image Quality Assessment
5897 -- 5907Keke Zhang, Tiesong Zhao, Weiling Chen, Yuzhen Niu, Jinsong Hu, Weisi Lin. Perception-Driven Similarity-Clarity Tradeoff for Image Super-Resolution Quality Assessment
5908 -- 5920Zhaoshui He, Hao Liang, Senquan Yang, Wenqing Su, Peitao Wang, Zhijie Lin, Beihai Tan, Shengli Xie. Accelerating Robust-Object-Tracking via Level-3 BLAS-Based Sparse Learning
5921 -- 5934Pan Mu, Guanyao Wu, Jinyuan Liu 0001, Yuduo Zhang, Xin Fan 0001, Risheng Liu. Learning to Search a Lightweight Generalized Network for Medical Image Fusion
5935 -- 5950ShiJie Wen, Li Yang 0014, Mai Xu, Minglang Qiao, Tao Xu, Lin Bai 0001. Saliency Prediction on Mobile Videos: A Fixation Mapping-Based Dataset and A Transformer Approach
5951 -- 5962Haoyang Peng, Baopu Li, Bo Zhang 0069, Xin Chen 0040, Tao Chen 0003, Hongyuan Zhu. Multi-View Vision Fusion Network: Can 2D Pre-Trained Model Boost 3D Point Cloud Data-Scarce Learning?
5963 -- 5976Han Chen, Qi Wang, Kailin Xie, Liang Lei, Matthieu Gaetan Lin, Tian Lv, Yongjin Liu 0001, Jiebo Luo 0001. SD-FSOD: Self-Distillation Paradigm via Distribution Calibration for Few-Shot Object Detection
5977 -- 5994De Han, Xing Cheng, Nan Guo, Xiaochun Ye, Benjamin Rainer, Peter Priller. Momentum Cross-Modal Contrastive Learning for Video Moment Retrieval
5995 -- 6008Yuantong Zhang, Baoxin Teng, Daiqin Yang, Zhenzhong Chen, Haichuan Ma, Gang Li, Wenpeng Ding. Learning a Single Convolutional Layer Model for Low Light Image Enhancement
6009 -- 6023Huafeng Li, Dan Wang, Yuxin Huang, Yafei Zhang, Zhengtao Yu 0001. Generation and Recombination for Multifocus Image Fusion With Free Number of Inputs
6024 -- 6038Guangfei Li, Wenbing Liu, Quanxue Gao, Qianqian Wang 0001, Jungong Han, Xinbo Gao 0001. Self-Supervised Edge Perceptual Learning Framework for High-Resolution Remote Sensing Images Classification
6039 -- 6050Haixin Wang, Lu Zhou, Yingying Chen 0003, Zhiyang Chen 0002, Ming Tang 0001, Jinqiao Wang. EFCPose: End-to-End Multi-Person Pose Estimation With Fully Convolutional Heads
6051 -- 6062Guangxing Wang, Gong Cheng 0003, Peicheng Zhou, Junwei Han. Cross-Level Attentive Feature Aggregation for Change Detection
6063 -- 6076Zixiao Wang 0002, Hongtao Xie, Yuxin Wang 0002, Hai Xu, Guoqing Jin. DCFP: Distribution Calibrated Filter Pruning for Lightweight and Accurate Long-Tail Semantic Segmentation
6077 -- 6091Lin Yuan 0002, Kai Liang, Xiao Pu 0002, Yan Zhang 0108, Jiaxu Leng, Tao Wu 0003, Nannan Wang 0001, Xinbo Gao 0001. Invertible Image Obfuscation for Facial Privacy Protection via Secure Flow
6092 -- 6104Yanting Liu, Hui Yin, Ai-Xin Chong, Jin Wan. Reference-Based Image Dehazing With Internal and External Contrastive Learning
6105 -- 6115Lanqing Guo, Siyu Huang, Haosen Liu 0001, Bihan Wen. Toward Robust Image Denoising via Flow-Based Joint Image and Noise Model
6116 -- 6127Yuan Zhou 0006, Axin Guo, Shuwei Huo, Yu Liu 0004, Sun-Yuan Kung. Weakly Supervised Video Re-Localization Through Multi-Agent-Reinforced Switchable Network
6128 -- 6143Xinjue Hu, Zhangjie Fu, Xiang Zhang 0023, Yanyu Chen. Invisible and Steganalysis-Resistant Deep Image Hiding Based on One-Way Adversarial Invertible Networks
6144 -- 6155Zhenhao Sun, Meng Wang 0017, Peilin Chen, Xu Wang 0006, Shiqi Wang 0001, Sam Kwong. Revisiting All-Zero Block Detection for Versatile Video Coding
6156 -- 6166Shifei Ding, Qidong Wang, Lili Guo, Xuan Li, Ling Ding 0001, Xindong Wu 0001. Wavelet and Adaptive Coordinate Attention Guided Fine-Grained Residual Network for Image Denoising
6167 -- 6180Zhiwen Zuo, Ailin Li, Zhizhong Wang, Lei Zhao 0011, Jianfeng Dong, Xun Wang 0007, Meng Wang 0001. Statistics Enhancement Generative Adversarial Networks for Diverse Conditional Image Synthesis
6181 -- 6193Hao Wei 0005, Chenyang Ge, Zhiyuan Li, Xin Qiao, Pengchao Deng. Toward Extreme Image Rescaling With Generative Prior and Invertible Prior
6194 -- 6206Xiaohui Chen, Lin Chen, Lingjun Chen, Peng Chen, Guanqun Sheng, Xiaosheng Yu, Yaobin Zou. Modeling Thermal Infrared Image Degradation and Real-World Super-Resolution Under Background Thermal Noise and Streak Interference
6207 -- 6223Xiongli Chai, Feng Shao 0001, Baoyang Mu, Hangwei Chen, Qiuping Jiang, Yo-Sung Ho. Plain-PCQA: No-Reference Point Cloud Quality Assessment by Analysis of Plain Visual and Geometrical Components
6224 -- 6237Zhu Liu 0004, Jinyuan Liu 0001, Guanyao Wu, Zihang Chen, Xin Fan 0001, Risheng Liu. Searching a Compact Architecture for Robust Multi-Exposure Image Fusion
6238 -- 6252Dan Guo, Kun Li 0008, Bin Hu 0001, Yan Zhang 0053, Meng Wang 0001. Benchmarking Micro-Action Recognition: Dataset, Methods, and Applications
6253 -- 6264Jiahao Wang 0002, Fang Liu 0001, Licheng Jiao, Yingjia Gao, Hao Wang 0211, Lingling Li 0002, Puhua Chen, Xu Liu 0006, Shuo Li 0010. Satellite Video Object Tracking Based on Location Prompts
6265 -- 6278Jie Gui, Xiaofeng Cong, Chengwei Peng, Yuan Yan Tang, James Tin-Yau Kwok. Fooling the Image Dehazing Models by First Order Gradient
6279 -- 6290Zihan Zhou 0007, Jing Li 0026, Dexiang Zhong, Yong Xu 0007, Patrick Le Callet. Deep Blind Image Quality Assessment Using Dynamic Neural Model With Dual-Order Statistics
6291 -- 6302Jie Zhao, Shikui Wei, Yakun Chang, Tao Ruan, Yao Zhao 0001. Model-Free Rectification via Cascaded Distortion Model and Enhanced Backward Flow Network
6303 -- 6317Shenglun Chen, Hong Zhang, Xinzhu Ma, Zhihui Wang 0001, Haojie Li. Learning Pixel-Wise Continuous Depth Representation via Clustering for Depth Completion
6318 -- 6333Wenbin Yan, Xiaogang Zhang, Hua Chen 0008. Occlusion-Aware Unsupervised Light Field Depth Estimation Based on Multi-Scale GANs
6334 -- 6346Jilong Wang 0002, Wei Gao 0003, Ge Li 0002. Zoom to Perceive Better: No-Reference Point Cloud Quality Assessment via Exploring Effective Multiscale Feature
6347 -- 6362Zijian Chen 0001, Wei Sun 0029, Jun Jia, Fangfang Lu, Zicheng Zhang, Jing Liu 0002, Ru Huang 0002, Xiongkuo Min, Guangtao Zhai. BAND-2k: Banding Artifact Noticeable Database for Banding Detection and Quality Assessment
6363 -- 6375Yinuo Jiang, Beitong Zhou, Xiaoyu Liu, Qingyi Li, Cheng Cheng. GTINet: Global Topology-Aware Interactions for Unsupervised Point Cloud Registration
6376 -- 6390Tengfei Liu, Yongli Hu, Junbin Gao, Yanfeng Sun, Baocai Yin. Hierarchical Multi-Modal Prompting Transformer for Multi-Modal Long Document Classification
6391 -- 6402Xin Li, Guopu Zhu, Shen Wang 0004, Yicong Zhou, Xinpeng Zhang 0001. Deep Reverse Attack on SIFT Features With a Coarse-to-Fine GAN Model
6403 -- 6415Hanwei Zhu, Baoliang Chen, Lingyu Zhu 0006, Peilin Chen, Linqi Song, Shiqi Wang 0001. Video Quality Assessment for Spatio-Temporal Resolution Adaptive Coding
6416 -- 6429Mingyi Yang, Fei Yang 0004, Luka Murn, Marc Górriz Blanch, Juil Sock, Shuai Wan, FuZheng Yang 0001, Luis Herranz. Task-Switchable Pre-Processor for Image Compression for Multiple Machine Vision Tasks
6430 -- 6444Xiaofeng Huang, Ran Tang, Rui Pan, Haibing Yin, Zhao Wang 0004, Shiqi Wang 0001, Siwei Ma. Parallelized RDOQ Algorithm and Fully Pipelined Hardware Architecture for AVS3 Video Coding
6445 -- 6459Fabian Brand, Jürgen Seiler, André Kaup. Conditional Residual Coding: A Remedy for Bottleneck Problems in Conditional Inter Frame Coding
6460 -- 6473Xihua Sheng, Li Li 0040, Dong Liu 0002, Houqiang Li. Spatial Decomposition and Temporal Fusion Based Inter Prediction for Learned Video Compression
6474 -- 6488Mengyao Li, Liquan Shen, Xia Hua, Zhaoyi Tian. EUICN: An Efficient Underwater Image Compression Network
6489 -- 6502Hao Liu 0044, Hui Yuan 0001, Raouf Hamzaoui, Qi Liu 0029, Shuai Li 0005. PU-Mask: 3D Point Cloud Upsampling via an Implicit Virtual Mask
6503 -- 6516Qingrong Cheng, Zhenshan Tan, Keyu Wen, Cheng Chen, Xiaodong Gu 0001. Semantic Pre-Alignment and Ranking Learning With Unified Framework for Cross-Modal Retrieval
6517 -- 6529Xuening Zhang, Xingbo Liu, Xiushan Nie, Xiao Kang, Yilong Yin. Semi-Supervised Semi-Paired Cross-Modal Hashing
6530 -- 6541Jiaxing Li, Wai-Keung Wong, Lin Jiang, Xiaozhao Fang, Shengli Xie, Yong Xu 0001. CKDH: CLIP-Based Knowledge Distillation Hashing for Cross-Modal Retrieval
6542 -- 6558Zhe Li, Lei Zhang 0119, Kun Zhang 0040, Yongdong Zhang 0001, Zhendong Mao. Fast, Accurate, and Lightweight Memory-Enhanced Embedding Learning Framework for Image-Text Retrieval
6559 -- 6575Lei Chen, Zhen Deng, Libo Liu, Shibai Yin. Multilevel Semantic Interaction Alignment for Video-Text Cross-Modal Retrieval
6576 -- 6589Qinghang Su, Dayan Wu, Chenming Wu, Bo Li 0063, Weiping Wang 0005. From Data to Optimization: Data-Free Deep Incremental Hashing With Data Disambiguation and Adaptive Proxies
6590 -- 6607Zhe Li, Lei Zhang 0119, Kun Zhang 0040, Yongdong Zhang 0001, Zhendong Mao. Improving Image-Text Matching With Bidirectional Consistency of Cross-Modal Alignment
6608 -- 6612Hongliang Lei, Tianlei Wang, Xianfu Bao, Huafei Huang, Jiuwen Cao. Auxiliary Label Classification Based Multi-Label Limb Movement Recognition of Preterm Infant