Journal: IEEE Trans. Circuits Syst. Video Techn.

Volume 35, Issue 12

11740 -- 11753 Wei Feng, Chang Tang, Cheng Zeng, Xinwang Liu 0002, Junjun Jiang, Xianju Li, Xinzhong Zhu. Diversity Learning Guided Dual Graph Autoencoder for Unsupervised Hyperspectral Band Selection
11754 -- 11769 Yutang Jin, Shiming Chen 0002, Tianle Tong, Weiping Ding 0001, Yisong Wang. Multi-Modal Prompts With Primitives Enhancement for Compositional Zero-Shot Learning
11770 -- 11782 Wenjie Liu 0001, Zhijie Ren. DM-MKGC: Multimodal Knowledge Graph Completion Based on Dynamic Prompt Learning and Multi-Granularity Aggregation
11783 -- 11796 Jian Yang, Yuan Rao 0001, Hao Fan 0004, Junyu Dong, Hui Yu 0001. Learning Semantic-Aware Point-Line Features for Localization and Reconstruction
11797 -- 11809 Yujia Sun, Weisheng Dong, Shuaibo Wang, Peng Wu 0015, Mingtao Feng, Xin Li 0005, Guangming Shi. Distilling Hierarchical Knowledge From Multimodal Fusion for Unimodal Image Segmentation
11810 -- 11821 Haoxing Chen, Yaohui Li, Zizheng Huang, Yan Hong 0001, Zhuoer Xu, Zhangxuan Gu, Jun Lan, Huijia Zhu, Weiqiang Wang 0002. Conditional Prototype Rectification Prompt Learning
11822 -- 11833 Zhenglai Li, Yuqi Shi, Xiao He 0010, Chang Tang. Mask-Informed Deep Contrastive Incomplete Multi-View Clustering
11834 -- 11848 Anqi Zhao, Ruitao Feng, Xinghua Li 0002. ThiefCloud: A Thickness Fused Thin Cloud Removal Network for Optical Remote Sensing Image With Self-Supervised Learnable Cloud Prior
11849 -- 11862 Yangpeng Liu, Junjian Huang, Shiping Wen 0001, Xing He 0001, Wei Zhang 0102, Zhao Feng. CTIGEN-CDM: Controlled Text-to-Image Generation Using Cropped Diffusion Models
11863 -- 11876 Shu Jiang, Dong Zhang, Rui Yan 0010, Xiangbo Shu, Pingcheng Dong, Long Chen 0016, Xiaoyu Du 0002. Eliminating Semantic Ambiguity in Human Pose Estimation via Stable Feature Upsampling
11877 -- 11892 Guohua Lv, Xiang Gao, Aimei Dong, Zhonghe Wei, Jinyong Cheng. SLFusion: A Structure-Aware Infrared and Visible Image Fusion Network for Low-Light Scenes
11893 -- 11907 Shuai Han, Jingwei Xin, Jie Li 0001, Nannan Wang 0001, Xinbo Gao 0001. Unsupervised Face Super-Resolution via Integrating Faithful 3D Facial Priors
11908 -- 11921 Qingguo Meng, Andong Lu, Zhe Jin 0001. BR-MoE: Blind Multi-Modal Tracking With Route-Dynamic Mixture of Experts
11922 -- 11934 Mingzhu Xu, Zhengyu Sun, Yijun Hu, Haoyu Tang 0002, Yupeng Hu 0003, Xuemeng Song, Liqiang Nie. Superpixel Segmentation With Edge Guided Local-Global Attention Network
11935 -- 11949 Dingli Hua, Qingmao Chen, Zhiliang Wu, Yifan Zuo, Wenying Wen, Yuming Fang. Perceptual Transform Fusion of Infrared and Visible Images
11950 -- 11964 Kui Liu, Bart Goossens, Tom De Schepper, Wilfried Philips. Improving Post-Training Quantization via Probabilistic Programming
11965 -- 11977 Lvwei Zhu, Eric Rigall, Ying Gao 0005, Zongshuai Zhang, Yafei Bai, Junyu Dong. Region-Aware Driven Distribution Optimization for Stereo Matching
11978 -- 11992 Zuojie Xie, Hao Ren 0002, Junjian Huang, Zhiquan He, Hong Lu 0001, Yong Liu, Jiawen Lu, Lvfan Yuan, Shulin Liu, Changyong Xie. Low-Light Image Enhancement via Multi-Exposure Progressive Contrastive Regularization
11993 -- 12006 Rong Zhou, Simin Yu. Breaking a New Image Cryptosystem From Three Perspectives
12007 -- 12022 Shuang Li, Ganggang Dong, Hongwei Liu 0001. ImagingNet: A New Learnable SAR Imaging Method via Hierarchical U-Shaped Network
12023 -- 12037 Ying Zhu, Hong Liu 0008, Guoliang Hua, Hao Tang 0005, Yidi Li, Weibo Huang. Dual Attention Guidance Network for Self-Supervised Monocular Depth Estimation
12038 -- 12051 Yuxiang Zhang 0005, Wei Li 0032, Wen Jia, Mengmeng Zhang 0005, Ran Tao 0003, Shunlin Liang. Cross-Domain Hyperspectral Image Classification Based on Bi-Directional Domain Adaptation
12052 -- 12065 Peng Yang, Ming Liu 0029, Liquan Dong, Lingqin Kong, Yuejin Zhao. Polynomial Fitting-Based Estimation of Spatially Varying Point Spread Function From a Single Image
12066 -- 12081 Haoyuan Li, Qi Hu, Binjia Zhou, You Yao, Jiacheng Lin, Kailun Yang 0001, Peng Chen 0008. CFMW: Cross-Modality Fusion Mamba for Robust Object Detection Under Adverse Weather
12082 -- 12095 Zhongling Huang, Long Liu, Shuxin Yang, Zhirui Wang 0003, Gong Cheng 0003, Junwei Han 0001. Physics-Guided Detector for SAR Airplanes
12096 -- 12108 Xu Han, Qi Wang. Compensating for the Incomplete With the Complete: An Efficient Scene Text Detector
12109 -- 12124 Kunpeng Wang 0005, Zhengzheng Tu, Chenglong Li 0002, Zhengyi Liu, Bin Luo 0001. Unified-Modal Salient Object Detection via Adaptive Prompt Learning
12125 -- 12137 Yu Liu 0021, Chun Luo, Wanglong Wan, Wenqiang Jin, Zheng Qin 0001. A Secure Medical Image Encryption Scheme Based on Cross-Ring Josephus Scrambling and Two-Dimensional Cellular Automata
12138 -- 12151 Na Zheng, Xuemeng Song, Wai Teng Tang, See-Kiong Ng, Liqiang Nie, Roger Zimmermann. Unsupervised Few-Shot Food Recognition With Intra-Class Variation and Inter-Class Similarity Modeling
12152 -- 12166 Mingyue Chen, Xin Liao 0001, Han Fang, Jinlin Guo, Yanxiang Chen, Xiaoshuai Wu. Flexible Partial Screen-Shooting Watermarking With Provable Robustness
12167 -- 12181 Meng Li, Bo Ma 0012, Yulin Zhang. Lightweight Image Super-Resolution With Pyramid Clustering Transformer
12182 -- 12195 Jia Wang 0054, Zhiguo Qu, Lingshuang Kong, Wentao Yuan, Encai Liu, Rui Zhang, Ruigang Fu. Learning a Perspective-Invariant Descriptor for Remote Sensing Image Matching
12196 -- 12211 Piotr Kopa Ostrowski, Daniel Wesierski, Anna Jezierska, Tomasz P. Stefanski. Lifting Deep Image Denoisers to Video With Frame Interpolation Pre-Training
12212 -- 12226 Hengyue Bi, Long Chen 0019, Jingchao Cao, Jingyang Wang, Jinghao Sun, Yuan Rao 0001, Junyu Dong. SeaDiff: Underwater Image Enhancement With Degradation-Aware Diffusion Model
12227 -- 12237 Bo Hu 0008, Wei Wang, Leida Li, Lihuo He, Wen Lu 0004, Xinbo Gao 0001. Blind Quality Assessment of Wide-Angle Videos Based on Deformation Representation Learning and Multi-Dimensional Feature Fusion
12238 -- 12250 Yunnan Wang, Ziqiang Li, Wenyao Zhang, Lexiang Lv, Zequn Zhang, Xiaoyu Shen, Xin Jin 0014, Wenjun Zeng 0001. Canvas: Compositional Generation for Art Painting With Seamless Subject-Driven Infusion
12251 -- 12264 Dongshuai Duan, Honglei Su, Qi Liu 0029, Hui Yuan 0001, Zhou Wang 0001. DQP-PCQA: Deep Quantization Parameters Bring New Insight to Point Cloud Quality Assessment
12265 -- 12277 Ting Zhou, Siyuan Chen, Siyao Wan, Hanyun Lv, Zheng Luo, Jianhui Wu 0002. GEDR: Gaussian-Enhanced Detail Reconstruction for Real-Time High-Fidelity 3D Scene Reconstruction
12278 -- 12291 Xiao Xu 0005, Libo Qin 0001, Wanxiang Che, Min-Yen Kan. Manager: Aggregating Insights From Unimodal Experts in Two-Tower VLMs and MLLMs
12292 -- 12305 Runhao Zeng, Qi Deng, Ronghao Zhang, Shuaicheng Niu, Jian Chen 0011, Xiping Hu, Victor C. M. Leung. Exploring Audio Cues for Enhanced Test-Time Video Model Adaptation
12306 -- 12316 Guangsheng Xu, Guoyi Zhang, Lejia Ye, Shuwei Gan, Xiaohu Zhang, Xia Yang. Optimizing Local-Global Dependencies for Accurate 3D Human Pose Estimation
12317 -- 12328 Shanshan Han, Shuang Li, Shuodi Wang, Lin Yuan 0002, Yan Zhang 0108, Xinbo Gao 0001. Deepfake Detection Leveraging Self-Blended Artifacts Guided by Facial Embedding Discrepancy
12329 -- 12340 Yongqi Huang, Peng Ye 0006, Chongjun Tu, Tao Chen 0003, Tong He 0001, Wanli Ouyang. Sparse-to-Dense Training: A Novel Training Scheme to Enhance Vision Transformers
12341 -- 12354 Cheng Lin, Hong Hu, Jie Zou 0001, Lujun Li, Jun Liu, Yipeng Gao, Yang Yang 0002, Heng Tao Shen. Distilling Grounding DINO for an Edge-Cloud Collaborative Advanced Driver Assistance System
12355 -- 12368 Zhifeng Wang 0004, Qixuan Zhang, Peter Zhang, Wenjia Niu, Kaihao Zhang, Ramesh S. Sankaranarayana, Sabrina B. Caldwell, Tom Gedeon. Visual and Textual Prompts in VLLMs for Enhancing Emotion Recognition
12369 -- 12381 Jianing Wang 0003, Shengjia Hao, Zheng Hua, Yuqiong Yao, Qiong Xu, Bo Liu 0009, Maoguo Gong. TBGA-Net: Trigonometric Bilinear Attention and Global-Aware Aggregation Network for Large-Scale 3D Point Cloud Segmentation
12382 -- 12395 Bo Pang, Deming Zhai, Jianan Zhen, Long Wang, Xu Han, Guofeng Zhang 0001, Xianming Liu 0005. Zero6DOT: Zero-Shot 6D Object Pose Tracking With Monocular RGB Video
12396 -- 12409 Kehua Chen, Zhenlong Yuan, Haihong Xiao, Tianlu Mao, Zhaoqi Wang. Learning Multi-View Stereo With Geometry-Aware Prior
12410 -- 12425 Jianping Zhong, Zhaobo Qi, Kaiwen Duan, Yuanrong Xu, Weigang Zhang, Qingming Huang. VPA: Multi-Modal Virtual Point Augmentation for 3D Object Detection
12426 -- 12440 Yanbo Gao, Huibin Bai, Huasong Zhou, Xingyu Gao 0001, Shuai Li 0005, Xun Cai, Hui Yuan 0001, Wei Hua 0002, Tian Xie 0011. Adaptive Depth-Converted-Scale Convolution for Self-Supervised Monocular Depth Estimation
12441 -- 12454 Zijian Zhang, Muqing Wu, Honghao Qi, Min Zhao 0002. EFMK: Extrinsic Parameters-Free Multi-View 3D Human Skeleton Estimation
12455 -- 12466 Yiqiang Wu, Yu Qin, Jiacheng Sun, Chang Liu 0082, Yunfei Bai, Chenghai Mao, Xiaomao Li. SampleDet3D: Sample Enhanced 3D Object Detection
12467 -- 12476 Junyi Hou, Zihao Pan, Changjun Xu, Lei Yu 0007. Low Texture 3D Reconstruction System Based on Manhattan Axis and 2D/3D Line Features
12477 -- 12491 Baoyang Mu, Feng Shao 0001, Hangwei Chen, Xuejin Wang, Qiuping Jiang. A Mutual Head Knowledge Distillation Framework for Lightweight RGB-T Crowd Counting
12492 -- 12507 Aihua Mao, Shuyi Wen, Feng Chen, Ran Yi 0002, Yong-Jin Liu 0001. Robust 3D Visual Question Answering via Bias Learning
12508 -- 12523 Shengjun Zhu, Jiaxin Cai, Runqing Xiong, Liping Zheng, Duo Ma. Singular Pooling: A Spectral Pooling Paradigm for Second-Trimester Prenatal Level II Ultrasound Standard Fetal Plane Identification
12524 -- 12537 Tianshi Luo, Hao Li 0009, Maoguo Gong, Yu Zhou 0051, A. Kai Qin. STEAM: Style Transfer Enabled Adversarial Attack With Attention Mechanism on Remote Sensing Image Scene Classification
12538 -- 12549 Yang Liu 0069, Jiale Du, Xinbo Gao 0001, Jungong Han, Ling Shao 0001. Relation-Aware Meta-Learning for Zero-Shot Sketch-Based Image Retrieval
12550 -- 12563 Xuecheng Li, Yuanjie Zheng. Inpaint-Outpaint Synergy: Mask Refinement for Trimap-Free Matting
12564 -- 12578 Lei Song 0010, Huaibo Song, Bo Jiang 0017. Adaptive Clustering and Frequency Division Network for Efficient Monocular Depth Estimation
12579 -- 12591 Gee-Sern Jison Hsu, Wei-Jun Lin, Wei-Chun Hsieh, Wei-Zhe Jian, Sheng-Luen Chung, Marina L. Gavrilova. Style-Preserving Generator for Synthetic License Plate Recognition
12592 -- 12606 Wenbin Yan, Hua Chen 0008, Qingwei Wu, Xiaogang Zhang, Qiu Fang, Shengjie Hu, Yaonan Wang 0001. LFSSMam: Efficient Aggregation of Multi-Spatial-Angular-Modal Information Using Selective SSM for Light Field Semantic Segmentation
12607 -- 12621 Yang Li, Songlin Yang, Wei Wang 0025, Jing Dong 0003. Beyond Inserting: Learning Subject Embedding for Semantic-Fidelity Personalized Diffusion Generation
12622 -- 12635 Wenjia Meng, Huimin Han, Xiankai Lu, Yilong Yin, Gang Pan 0001, Qian Zheng. LAC-PS: A Light Direction Selection Policy Under the Accuracy Constraint for Photometric Stereo
12636 -- 12651 Gang He 0002, Long Gao, Langkun Chen, Yan Jiang, Weiying Xie, Yunsong Li 0001. Hyperspectral Object Tracking With Spectral Information Prompt
12652 -- 12665 Yao Chen, Guancheng Jia, Yufei Zha, Peng Zhang 0005, Yanning Zhang 0001. LINR: A Plug-and-Play Local Implicit Neural Representation Module for Visual Object Tracking
12666 -- 12679 Ye Wang 0020, Mingyang Ma 0004, Ge Zhang 0006, Yuheng Liu, Tao Gao 0001, Shaohui Mei. Hyperspectral Tracker With Constrained Object Adaptive Learning and Trajectory Construction
12680 -- 12691 Shou Feng, Jinghe Zhang, Yuanze Fan, Xinyao Liu, Chunhui Zhao 0003, Wei Li 0032, Ran Tao 0003. Cross-Domain Few-Shot Learning Method Based on Fractional Domain Information for Hyperspectral Image Multi-Class Change Detection
12692 -- 12706 Xuting Lan, Weizhi Xian, Mingliang Zhou 0001, Jielu Yan, Xuekai Wei, Jun Luo 0006, Weijia Jia 0001, Sam Kwong. No-Reference Image Quality Assessment: Exploring Intrinsic Distortion Characteristics via Generative Noise Estimation With Mamba
12707 -- 12718 Jinglin Xu, Yaqi Zhang, Wenhao Zhou, Hongmin Liu 0001. BFSTAL: Bidirectional Feature Splitting With Cross-Layer Fusion for Temporal Action Localization
12719 -- 12733 Guanqi Ding, Xinzhe Han, Shuhui Wang, Xin Jin 0004, Qingming Huang. Stable Attribute Group Editing for Reliable Few-Shot Image Generation
12734 -- 12746 Jingqian Wu, Shuo Zhu, Chutian Wang, Boxin Shi, Edmund Y. Lam. SweepEvGS: Event-Based 3D Gaussian Splatting for Macro and Micro Radiance Field Rendering From a Single Sweep
12747 -- 12759 Honglin Guo, Ruidong Chen, Weizhi Nie, Lanjun Wang, Anan Liu. CompCraft: Foreground-Driven Image Synthesis With Customized Layouts
12760 -- 12771 Yiqian Wu, Hao Xu 0049, Xiangjun Tang, Yue Shangguan, Hongbo Fu 0001, Xiaogang Jin 0001. 3DPortraitGAN: Learning One-Quarter Headshot 3D GANs From a Single-View Portrait Dataset With Diverse Body Poses
12772 -- 12787 Alessandro Gnutti, Fabrizio Guerrini, Riccardo Leonardi, Antonio Ortega. Variable-Size Symmetry-Based Graph Fourier Transforms for Image Compression
12788 -- 12801 Shiwei Wang 0005, Liquan Shen, Peiying Wu, Zhaoyi Tian, Feifeng Wang. DRLN: Disparity-Aware Rescaling Learning Network for Multi-View Video Coding Optimization
12802 -- 12815 Jian Xiong 0005, Junhao Wu, Wang Luo, Jiucheng Xie, Hui Yuan 0001, Hao Gao 0005. Multi-Task Learning Model for V-PCC Geometry Compression Artifact Removal
12816 -- 12829 Jie Li 0015, Zhixin Li, Zhi Liu 0002, Peng Yuan Zhou, Richang Hong, Qiyue Li 0001, Han Hu 0003. Viewport Prediction for Volumetric Video Streaming by Exploring Video Saliency and User Trajectory Information
12830 -- 12845 Guquan Jing, Peng Gao, Yujian Lee, Yiyang Hu, Hui Zhang 0062. 3D-Aided Pedestrian Representation Learning for Video-Based Person Re-Identification
12846 -- 12860 Lingchen Gu, Xiaojuan Shen, Jiande Sun 0001, Yan Liu, Jing Li 0046, Zhihui Li, Sen-Ching S. Cheung, Wenbo Wan. Dual Prototypes-Based Personalized Federated Adversarial Cross-Modal Hashing
12861 -- 12873 Fengling Li 0001, Zequn Wang, Tianshi Wang 0001, Lei Zhu 0002, Xiaojun Chang. Generative Augmentation Hashing for Few-Shot Cross-Modal Retrieval
12874 -- 12889 Yating Liu, Yaowei Li 0001, Xiangyuan Lan, Wenming Yang, Zimo Liu, Qingmin Liao. UP-Person: Unified Parameter-Efficient Transfer Learning for Text-Based Person Retrieval
12890 -- 12903 Pujun Zhou, Guanchao Qiao, Qi Yu 0002, M. Chen, Y. C. Wang, Y.-C. Chen, J. J. Wang, Ning Ning 0002, Y. Liu, Shaogang Hu. A 0.96 pJ/SOP Heterogeneous Neuromorphic Chip Toward Energy-Efficient Edge Visual Applications
12904 -- 12917 Lixin Zhang, Qian Wang 0046. Synergistic Fusion Network of Microscopic Hyperspectral and RGB Images for Multi-Perspective Segmentation
12918 -- 12924 Ziwen He, Xingjie Dai, Xiang Zhang 0023, Zhangjie Fu. MMDStegNet: An Adversarial Steganography Framework With Maximum Mean Discrepancy Regularization