Journal: IEEE Trans. Circuits Syst. Video Techn.

Volume 35, Issue 5

3924 -- 3939Yafeng Li, Yuehan Chen, Jiqing Zhang, Yudong Li, XianPing Fu. An Underwater Image Restoration Method With Polarization Imaging Optimization Model for Poor Visible Conditions
3940 -- 3954Runhao Zeng, Yishen Zhuo, Jialiang Li, Yunjin Yang, Huisi Wu, Qi Chen 0014, Xiping Hu 0001, Victor C. M. Leung. Improving Video Moment Retrieval by Auxiliary Moment-Query Pairs With Hyper-Interaction
3955 -- 3968Xun Jiang 0001, Liqing Zhu, Xing Xu 0001, Fumin Shen, Yang Yang 0002, Heng Tao Shen. Query as Supervision: Toward Low-Cost and Robust Video Moment and Highlight Retrieval
3969 -- 3982Xinyi Zhang, Haoran Xu 0004, Chenyun Yu, Guang Tan. PCTrack: Accurate Object Tracking for Live Video Analytics on Resource-Constrained Edge Devices
3983 -- 3999Lei Zhu 0005, Runbing Wu, Xinghui Zhu, Chengyuan Zhang 0001, Lin Wu 0001, Shichao Zhang 0001, Xuelong Li 0001. Bi-Direction Label-Guided Semantic Enhancement for Cross-Modal Hashing
4000 -- 4010Yiheng Jiang, Haotian Zhang, Li Li 0040, Dong Liu 0002, Zhu Li 0001. Sparse Point Clouds Assisted Learned Image Compression
4011 -- 4025Xinjian Wei, Yu Qiu, Xiaoxuan Xu, Jing Xu 0008, Jie Mei, Jun Zhang 0003. ECINFusion: A Novel Explicit Channel-Wise Interaction Network for Unified Multi-Modal Medical Image Fusion
4026 -- 4038Jiayi Lyu, Xing Lan, Guohong Hu, Hanyu Jiang, Wei Gan, Jinbao Wang, Jian Xue. Multimodal Emotional Talking Face Generation Based on Action Units
4039 -- 4054Junyu Fan, Jie Xu, Jingchun Zhou, Danling Meng, Yi Lin 0006. See Through Water: Heuristic Modeling Toward Color Correction for Underwater Image Enhancement
4055 -- 4071Qi Zang, Shuang Wang 0001, Dong Zhao, Zhun Zhong, Biao Hou, Licheng Jiao. Joint Style and Layout Synthesizing: Toward Generalizable Remote Sensing Semantic Segmentation
4072 -- 4086Lanhu Wu, Miao Zhang, Yongri Piao, Zhenyan Yao, Weibing Sun, Feng Tian, Huchuan Lu. CNN-Transformer Rectified Collaborative Learning for Medical Image Segmentation
4087 -- 4099Wen Zhang, Zhenshan Tan, Li Zhang, Zhijiang Li. Color Decoupling for Multi-Illumination Color Constancy
4100 -- 4115Xuan Tan, Xun Gong 0002, Yang Xiang. CLIP-Based Camera-Agnostic Feature Learning for Intra-Camera Supervised Person Re-Identification
4116 -- 4129Anwei Luo, Rizhao Cai, Chenqi Kong, Yakun Ju, Xiangui Kang, Jiwu Huang, Alex C. Kot. Forgery-Aware Adaptive Learning With Vision Transformer for Generalized Face Forgery Detection
4130 -- 4143Yanlong Yang, Jianan Liu, Tao Huang 0008, Qing-Long Han, Gang Ma, Bing Zhu 0004. RaLiBEV: Radar and LiDAR BEV Fusion Learning for Anchor Box Free Object Detection Systems
4144 -- 4157Zhaofeng Shi, Heqian Qiu, Lanxiao Wang, Fanman Meng, Qingbo Wu 0001, Hongliang Li 0001. Cognition Transferring and Decoupling for Text-Supervised Egocentric Semantic Segmentation
4158 -- 4171Qiongjie Cui, Zhenyu Lou, Zhenbo Song, Xiangbo Shu. Toward Physically Stable Motion Generation: A New Paradigm of Human Pose Representation
4172 -- 4183Hangwei Chen, Feng Shao 0001, Xiongli Chai, Baoyang Mu, Qiuping Jiang. Art Comes From Life: Artistic Image Aesthetics Assessment via Attribute Knowledge Amalgamation
4184 -- 4197Yuxiang Shao, Feifei Zhang, Changsheng Xu. Text-Video Knowledge Guided Prompting for Weakly Supervised Temporal Action Localization
4198 -- 4211Dewen Qiao, Xiang Ao, Yu Liu 0021, Xuetao Chen, Fuyuan Song, Zheng Qin 0001, Wenqiang Jin. Tri-AFLLM: Resource-Efficient Adaptive Asynchronous Accelerated Federated LLMs
4212 -- 4225Haihong Xiao, Wenxiong Kang, Hao Liu 0061, Yuqiong Li, Ying He 0001. Semantic Scene Completion via Semantic-Aware Guidance and Interactive Refinement Transformer
4226 -- 4241Yuqi Jiang, Jing Li 0010, Yanran Dai, Haidong Qin, Xiaoshi Zhou, Yong Zhang, Hongwei Liu, Kefan Yan, Tao Yang 0006. RT3DHVC: A Real-Time Human Holographic Video Conferencing System With a Consumer RGB-D Camera Array
4242 -- 4255Shibai Yin, Yiwei Shi, Yibin Wang 0001, Yee-Hong Yang. When Aware Haze Density Meets Diffusion Model for Synthetic-to-Real Dehazing
4256 -- 4270Fan Xu 0005, Chuibin Chen, Zhigao Shang, Kai-Kuang Ma, QiHui Wu, Zebin Lin, Jie Zhan, Yizhou Shi. Deep Multi-Modal Ship Detection and Classification Network
4271 -- 4286Yanjie Liang, Qiangqiang Wu, Lin Cheng, Changqun Xia, Jia Li 0003. Progressive Semantic-Visual Alignment and Refinement for Vision-Language Tracking
4287 -- 4299Dan Song 0006, Xuanpu Zhang, Jianhao Zeng, Pengxin Zhan, Qingguo Chen, Weihua Luo, An-An Liu. Better Fit: Accommodate Variations in Clothing Types for Virtual Try-On
4300 -- 4313Yiyao Fan, Jun Lin 0003, Changming Sun, Tianhao Wang 0009, Yuehan Qi, Guanyu Zhang, Yang Liu 0333. An Image Terrain Map Model for Texture Filtering
4314 -- 4328Yuanwei Liu, Nian Liu, Yi Wu, Hisham Cholakkal, Rao Muhammad Anwer, Xiwen Yao, Junwei Han. NTRENet++: Unleashing the Power of Non-Target Knowledge for Few-Shot Semantic Segmentation
4329 -- 4340Rao Fu, Qian Li, Cheng Wen 0001, Ning An 0002, Fulin Tang. A Novel Framework for Learning Bézier Decomposition From 3D Point Clouds
4341 -- 4356Sai Yang, Bin Hu 0023, Fan Liu 0003, Xiaoxin Wu 0004, Weiping Ding 0001, Jun Zhou 0001. IPT-ILR: Image Pyramid Transformer Coupled With Information Loss Regularization for All-in-One Image Restoration
4357 -- 4369Han Zhu 0003, Zhenzhong Chen, Shan Liu 0001. Information Bottleneck Based Self-Distillation: Boosting Lightweight Network for Real-World Super-Resolution
4370 -- 4383Chenyang Shi, Boyi Wei, Xiucheng Wang, Hanxiao Liu, Yibo Zhang, Wenzhuo Li, Ningfang Song, Jing Jin. Polarity-Focused Denoising for Event Cameras
4384 -- 4396Hong Zhu, Pingping Zhang, Lei Xue, Guanglin Yuan. Multi-Modal Understanding and Generation for Object Tracking
4397 -- 4408Guang-yong Chen, Chao-Wei Zheng, Guodong Fan, Jian-Nan Su, Min Gan, C. L. Philip Chen. Real-World Image Reflection Removal: An Ultra-High-Definition Dataset and an Efficient Baseline
4409 -- 4422Xinmiao Ding, Jinming Lou, Wenyang Luo, Yufan Liu, Bing Li 0001, Weiming Hu. iESTA: Instance-Enhanced Spatial-Temporal Alignment for Video Copy Localization
4423 -- 4436Qianyu Zhang, Bolun Zheng, Xingying Chen, Quan Chen, Zunjie Zhu, Canjin Wang, Zongpeng Li, Xu Jia 0012, Chengang Yan. Hierarchical Frequency-Based Upsampling and Refining for HEVC Compressed Video Enhancement
4437 -- 4449Haochen Yu, Weixi Gong, Jiansheng Chen, Huimin Ma 0001. GET3DGS: Generate 3D Gaussians Based on Points Deformation Fields
4450 -- 4463Shuyang Wang, Kang Liu 0014, Ju Huang, Xuelong Li 0001. FLDet: Faster and Lighter Aerial Object Detector
4464 -- 4478Yang Yang 0080, Chao Wang 0003, Lei Gong, Min Wu 0008, Zhenghua Chen, Yingxue Gao, Teng Wang, Xuehai Zhou. Uncertainty-Aware Self-Knowledge Distillation
4479 -- 4492Jie Wang, Xiangji Kong, Nana Yu, Zihao Zhang, Yahong Han. Explicitly Disentangling and Exclusively Fusing for Semi-Supervised Bi-Modal Salient Object Detection
4493 -- 4505Yicong He, George K. Atia. Scalable and Robust Tensor Ring Decomposition for Large-Scale Data With Missing Data and Outliers
4506 -- 4520Hao Zhang, Sicheng Li, Yupeng Gui, Zhiyong Li 0016, Shusong Xu, YanHeng Lu, Dimin Niu, Hongzhong Zheng, Yen-Kuang Chen, Yuan Xie 0001, Yibo Fan. A Tightly Coupled AI-ISP Vision Processor
4521 -- 4534Jie Wang, Nana Yu, Zihao Zhang, Yahong Han. Single-Group Generalized RGB and RGB-D Co-Salient Object Detection
4535 -- 4548Jinzheng Guang, Shichao Wu, Zhengxi Hu, Qianyi Zhang, Peng Wu, Jingtai Liu. DCCLA: Dense Cross Connections With Linear Attention for LiDAR-Based 3D Pedestrian Detection
4549 -- 4559Tong Zhao, Qiang Fang, Xin Xu 0001. Denser Teacher: Rethinking Dense Pseudo-Label for Semi-Supervised Oriented Object Detection
4560 -- 4575Chao You, Licheng Jiao, Lingling Li 0002, Xu Liu 0006, Fang Liu 0001, Wenping Ma 0001, Shuyuan Yang 0001. Contour Knowledge-Aware Perception Learning for Semantic Segmentation
4576 -- 4591Ting Luo 0001, Yuhang Zhou, Zhouyan He, Gangyi Jiang, Haiyong Xu, Shuren Qi, Yushu Zhang 0001. StegMamba: Distortion-Free Immune-Cover for Multi-Image Steganography With State Space Model
4592 -- 4607Zeng You, Zhiquan Wen, Yaofo Chen, Xin Li 0034, Runhao Zeng, Yaowei Wang 0001, Mingkui Tan. Toward Long Video Understanding via Fine-Detailed Video Story Generation
4608 -- 4618Yanfeng Zheng, Zhong Luo, Ying Cao 0001, Xiaosong Yang, Weiwei Xu, Zheng Lin 0005, Nan Yin, Pengjie Wang 0001. Unsupervised Salient Object Detection on Light Field With High-Quality Synthetic Labels
4619 -- 4634Xiaoyan Yu, Shen Zhou, Huafeng Li, Liehuang Zhu. Multi-Expert Adaptive Selection: Task-Balancing for All-in-One Image Restoration
4635 -- 4647Huafeng Chen, Pengxu Wei, Guangqian Guo, Shan Gao 0003. SAM-COD+: SAM-Guided Unified Framework for Weakly-Supervised Camouflaged Object Detection
4648 -- 4660Yue Wu 0004, Jiayi Lei, Yongzhe Yuan, Xiaolong Fan, Maoguo Gong, Wenping Ma 0001, Qiguang Miao, Mingyang Zhang 0002. Equivariance-Based Markov Decision Process for Unsupervised Point Cloud Registration
4661 -- 4674Qiang Qiao, Meixia Qu, Wenyu Wang, Bin Jiang 0011, Qiang Guo 0003. Effective Global Context Integration for Lightweight 3D Medical Image Segmentation
4675 -- 4685Hu Ding, Yan Yan 0001, Yang Lu 0009, Jing-Hao Xue, Hanzi Wang. Uncertainty-Aware Label Refinement on Hypergraphs for Personalized Federated Facial Expression Recognition
4686 -- 4697Zhimao Peng, Enguang Wang, Xialei Liu, Ming-Ming Cheng. Predictive Sample Assignment for Semantically Coherent Out-of-Distribution Detection
4698 -- 4712Lingyun Yu 0002, Tian Xie, Chuanbin Liu 0001, Guoqing Jin, Zhiguo Ding 0006, Hongtao Xie. Distilling Multi-Level Semantic Cues Across Multi-Modalities for Face Forgery Detection
4713 -- 4726Shuhan Dong, Weiying Xie, Danian Yang, Yunsong Li, Jiaqing Zhang, Jiayuan Tian, Jie Lei 0001. SeaDATE: Remedy Dual-Attention Transformer With Semantic Alignment via Contrast Learning for Multimodal Object Detection
4727 -- 4739Wenjie Li, Xiaolong Li 0001, Rongrong Ni, Yao Zhao 0001. Extracting High-Discriminative Features for Detecting Double JPEG Compression With the Same Quantization Matrix
4740 -- 4752Xin Guo, Xi Wang, Xueyang Fu, Zheng-Jun Zha. Deep Unfolding Network for Image Desnowing With Snow Shape Prior
4753 -- 4767Ye Zhang, Yifeng Wang 0001, Zijie Fang, Hao Bian, Linghan Cai, Ziyue Wang 0005, Yongbing Zhang 0002. DAWN: Domain-Adaptive Weakly Supervised Nuclei Segmentation via Cross-Task Interactions
4768 -- 4783Shaohui Li, Shuoyu Ma, Wenrui Dai, Nuowen Kan, Fan Cheng 0002, Chenglin Li, Junni Zou, Hongkai Xiong. Task-Adapted Learnable Embedded Quantization for Scalable Human-Machine Image Compression
4784 -- 4800Xiaojiao Guo, Xuhang Chen 0002, Shuqiang Wang, Chi-Man Pun. Underwater Image Restoration Through a Prior Guided Hybrid Sense Approach and Extensive Benchmark Analysis
4801 -- 4815XiangHai Wang 0001, Liyang Song, Yining Feng, Junheng Zhu. S3F2Net: Spatial-Spectral-Structural Feature Fusion Network for Hyperspectral Image and LiDAR Data Classification
4816 -- 4830Tongbo Wang, Lin Zhu 0012, Hua Huang 0001. Enhancing Real-Time Object Detection With Optical Flow-Guided Streaming Perception
4831 -- 4845Zhenghua Huang, Cheng Lin, Biyun Xu, Menghan Xia, Qian Li 0019, Yansheng Li 0001, Nong Sang. 2EA: Target-Aware Taylor Expansion Approximation Network for Infrared and Visible Image Fusion
4846 -- 4856Li Yu 0004, Hongchao Zhong, Longkun Zou, Ke Chen 0004, Pan Gao 0001. Bridging Domain Gap of Point Cloud Representations via Self-Supervised Geometric Augmentation
4857 -- 4869Ruiheng Zhang, Zhe Cao, Yan Huang 0023, Shuo Yang 0006, Lixin Xu, Min Xu 0001. Visible-Infrared Person Re-Identification With Real-World Label Noise
4870 -- 4882Zelin Liu, Xinggang Wang, Cheng Wang, Wenyu Liu 0001, Xiang Bai. SparseTrack: Multi-Object Tracking by Performing Scene Decomposition Based on Pseudo-Depth
4883 -- 4895Yuan Zhao, Jiayu Sun, Lihe Zhang, Huchuan Lu. FocusCLIP: Focusing on Anomaly Regions by Visual-Text Discrepancies
4896 -- 4909Chunxiao Liu, Zelong Wang, Philip Birch, Xun Wang 0007. Efficient Retinex-Based Framework for Low-Light Image Enhancement Without Additional Networks
4910 -- 4922Sun'ao Liu, Hongtao Xie, Jiannan Ge, Yongdong Zhang 0001. ReferSAM: Unleashing Segment Anything Model for Referring Image Segmentation
4923 -- 4936Chaojun Dong, Chengxuan Wang, Yikui Zhai, Ye Li, Jianhong Zhou, Pasquale Coscia, Angelo Genovese, Vincenzo Piuri, Fabio Scotti. GMTNet: Dense Object Detection via Global Dynamically Matching Transformer Network
4937 -- 4948An-An Liu, Quanhan Wu, Ning Xu 0003, Hongshuo Tian, Lanjun Wang. Enriched Image Captioning Based on Knowledge Divergence and Focus
4949 -- 4962Jian Wang 0113, Fan Li 0003, Lijun He 0001. A Unified Framework for Adversarial Patch Attacks Against Visual 3D Object Detection in Autonomous Driving
4963 -- 4975Yaoye Song, Peng Zhang 0005, Wei Huang 0013, Yufei Zha, Yanning Zhang 0001. Flexible Temperature Parallel Distillation for Dense Object Detection: Make Response-Based Knowledge Distillation Great Again
4976 -- 4990Enki Cho, Jung-Uk Kim, Seong Tae Kim 0001. Spatial Mask-Based Adaptive Robust Training for Video Object Segmentation With Noisy Labels
4991 -- 5005Shuo Li 0010, Fang Liu 0001, Licheng Jiao, Lingling Li 0002, Puhua Chen, Xu Liu 0006, Wenping Ma 0001. Prompt-Based Concept Learning for Few-Shot Class-Incremental Learning
5006 -- 5021Taeheon Kim, Sangyun Chung, Damin Yeom, Youngjoon Yu, Hak Gu Kim, Yong Man Ro. MSCoTDet: Language-Driven Multi-Modal Fusion for Improved Multispectral Pedestrian Detection
5022 -- 5036Zhihao Li, Huaxiang Zhang 0001, Lei Zhu 0002, Jiande Sun 0001, Li Liu 0031. Heterogeneous Generative Tokens and Distance-Aware Recovery Network for Occluded Person Re-Identification
5037 -- 5050Shilin Xu, Xiangtai Li, Size Wu, Wenwei Zhang, Yunhai Tong, Chen Change Loy. DST-Det: Open-Vocabulary Object Detection via Dynamic Self-Training
5051 -- 5066Rongshan Chen, Hao Sheng 0001, Da Yang 0001, Zhenglong Cui, Ruixuan Cong. Surface-Continuous Scene Representation for Light Field Depth Estimation via Planarity Prior
5067 -- 5077Xiangzeng Liu, Jianfeng Guo, Hao Chen, Qiguang Miao, Yue Xi, Ruyi Liu. Adaptive Occlusion-Aware Network for Occluded Person Re-Identification
5078 -- 5091Shangshu Yu, Meiqing Wu, Siew Kei Lam. VFM-Depth: Leveraging Vision Foundation Model for Self-Supervised Monocular Depth Estimation
5092 -- 5108Xi Wang, Wei Liu 0004, Shimin Gong, Zhi Liu 0002, Jing Xu 0005, Yuming Fang. Spatial Quality Oriented Rate Control for Volumetric Video Streaming via Deep Reinforcement Learning
5109 -- 5122Rongyu Zhang, Jiaming Liu 0003, Xiaoqi Li 0009, Xiaowei Chi, Dan Wang 0002, Li Du, Yuan Du, Shanghang Zhang. BEVUDA++: Geometric-Aware Unsupervised Domain Adaptation for Multi-View 3D Object Detection