| 2947 | -- | 2950 | Wenguan Wang, Hengshuang Zhao, Xinggang Wang, Fisher Yu 0001, David Crandall. Guest Editorial Introduction to the Special Issue on Segment Anything for Videos and Beyond |
| 2951 | -- | 2962 | Shanghong Li, Yongquan Chen, Long Xu, Jun Luo, Rui Huang 0001, Feng Wu 0001, Yingliang Miao. ClickAdapter: Integrating Details Into Interactive Segmentation Model With Adapter |
| 2963 | -- | 2974 | Hao Fang 0010, Tong Zhang, Xiaofei Zhou, Xinxin Zhang 0004. Learning Better Video Query With SAM for Video Instance Segmentation |
| 2975 | -- | 2986 | Yuhang Ding, Hongmin Liu 0001. Barely-Supervised Brain Tumor Segmentation via Employing Segment Anything Model |
| 2987 | -- | 2998 | Binwei Xu, Qiuping Jiang, Xing Zhao 0001, Chenyang Lu 0002, Haoran Liang 0001, Ronghua Liang. Multidimensional Exploration of Segment Anything Model for Weakly Supervised Video Salient Object Detection |
| 2999 | -- | 3012 | Xingyu Gao 0001, Zuolei Li, Hailong Shi, Zhenyu Chen 0003, Peilin Zhao. Scribble-Supervised Video Object Segmentation via Scribble Enhancement |
| 3013 | -- | 3023 | Ziqi Zhang, Siduo Pan, Kun Wei, Jiapeng Ji, Xu Yang 0019, Cheng Deng. Few-Shot Generative Model Adaption via Optimal Kernel Modulation |
| 3024 | -- | 3038 | Zhengqing Fang, Zhouhang Yuan, Ziyu Li, Jingyuan Chen, Kun Kuang, Yu-Feng Yao, Fei Wu 0001. Cross-Modality Image Interpretation via Concept Decomposition Vector of Visual-Language Models |
| 3039 | -- | 3053 | Peng Huang, Xiangbo Shu, Rui Yan 0010, Zhewei Tu, Jinhui Tang 0001. Appearance-Agnostic Representation Learning for Compositional Action Recognition |
| 3054 | -- | 3080 | Lingyan Ran, Yali Li, Guoqiang Liang 0001, Yanning Zhang 0001. Pseudo Labeling Methods for Semi-Supervised Semantic Segmentation: A Review and Future Perspectives |
| 3081 | -- | 3093 | Zhifan Gao, Saidi Guo, Chenchu Xu, Jinglin Zhang, Mingming Gong, Javier Del Ser, Shuo Li 0001. Multi-Domain Adversarial Variational Bayesian Inference for Domain Generalization |
| 3094 | -- | 3103 | Xiaoxu Li, Peiyu Lu, Rui Zhu 0006, Zhanyu Ma, Jie Cao 0014, Jing-Hao Xue. Rise by Lifting Others: Interacting Features to Uplift Few-Shot Fine-Grained Classification |
| 3104 | -- | 3118 | Haoran Gao, Fasheng Wang, Mengyin Wang, Fuming Sun, Haojie Li. Highly Efficient RGB-D Salient Object Detection With Adaptive Fusion and Attention Regulation |
| 3119 | -- | 3133 | Zhiyang Guo, Wengang Zhou 0001, Li Li 0040, Min Wang 0019, Houqiang Li. Motion-Aware 3D Gaussian Splatting for Efficient Dynamic Scene Reconstruction |
| 3134 | -- | 3145 | Luoying Hao, Yan Hu, Yang Yue, Li Wu, Huazhu Fu, Jinming Duan 0001, Jiang Liu 0001. Hierarchical Context Transformer for Multi-Level Semantic Scene Understanding |
| 3146 | -- | 3159 | Songlin Dong, Xinyuan Gao, Yuhang He, Zhengdong Zhou, Alex C. Kot, Yihong Gong. CEAT: Continual Expansion and Absorption Transformer for Non-Exemplar Class-Incremental Learning |
| 3160 | -- | 3171 | Guoqing Zhang 0002, Jin Li, Yuhui Zheng, Ruili Wang. InfinitePerson: Innovating Synthetic Data Creation for Generalization Person Re-Identification |
| 3172 | -- | 3184 | Jiang Xin, Sheng Yue, Jinrui Zhang, Ju Ren 0001, Feng Qian, Yaoxue Zhang. MAML-RAL: Learning Domain-Invariant HOI Rules for Real-Time Video Matting |
| 3185 | -- | 3195 | Zicheng Zhang, Wei Ke 0003, Yi Zhu 0004, Xiaodan Liang, Jianzhuang Liu, Qixiang Ye, Tong Zhang 0001. Language-Driven Visual Consensus for Zero-Shot Semantic Segmentation |
| 3196 | -- | 3208 | Huan Liu, Jian Sun 0009. UniSTAD: An Unified Triple-Tower Student-Teacher Model for Multi-Class Anomaly Detection and Localization |
| 3209 | -- | 3221 | Jingkai Ma, Shuang Bai. SGFNet: Structure-Guided Few-Shot Object Detection |
| 3222 | -- | 3233 | Wujie Zhou, Hongping Wu, Qiuping Jiang. MDNet: Mamba-Effective Diffusion-Distillation Network for RGB-Thermal Urban Dense Prediction |
| 3234 | -- | 3249 | Mingzhu Xu, Tianxiang Xiao, Yutong Liu, Haoyu Tang, Yupeng Hu, Liqiang Nie. CMIRNet: Cross-Modal Interactive Reasoning Network for Referring Image Segmentation |
| 3250 | -- | 3261 | Weiqing Yan, Kanglong Liu, Wujie Zhou, Chang Tang. Deep Incomplete Multi-View Clustering via Dynamic Imputation and Triple Alignment With Dual Optimization |
| 3262 | -- | 3275 | Jiaming Li, Lingyun Yu 0002, Runxin Liu, Hongtao Xie. A Detail-Aware Transformer to Generalizable Face Forgery Detection |
| 3276 | -- | 3289 | Gang Wang, Chaoran Zhu, Qian Xu, Tongzhou Zhang 0001, Hai Zhang 0003, Xiaopeng Fan, Jue Hu. CCTNet: A Circular Convolutional Transformer Network for LiDAR-Based Place Recognition Handling Movable Objects Occlusion |
| 3290 | -- | 3302 | Yanqing Yao, Gong Cheng 0003, Chunbo Lang, Xingxing Xie, Junwei Han. Centric Probability-Based Sample Selection for Oriented Object Detection |
| 3303 | -- | 3314 | Liang Zhao 0005, Xiao Wang, Zhenjiao Liu, Ziyue Wang, Zhikui Chen. Learnable Graph Guided Deep Multi-View Representation Learning via Information Bottleneck |
| 3315 | -- | 3327 | Yibo Zhao 0001, Zan Gao, Chunjie Ma, Weili Guan, Riwei Wang, Shengyong Chen. Fine-Grained Modality Relation-Aware Network for Video Moment Retrieval |
| 3328 | -- | 3341 | Huangxing Lin, Yunlong Lin, Jingyuan Xia, Linyu Fan, Feifei Li, Yingying Wang 0005, Xinghao Ding. Fusion2Void: Unsupervised Multi-Focus Image Fusion Based on Image Inpainting |
| 3342 | -- | 3354 | Zhongyang Li, Faming Fang, Tingting Wang, Guixu Zhang. Homography Estimation With Adaptive Query Transformer and Gated Interaction Module |
| 3355 | -- | 3367 | Wenhui Jiang, Linxin Liu, Yuming Fang, Yibo Cheng, Yuxin Peng, Yang Liu 0293. Learning Comprehensive Visual Grounding for Video Captioning |
| 3368 | -- | 3382 | Yan Gan, Chengqian Wu, Deqiang Ouyang, Song Tang 0001, Mao Ye 0001, Tao Xiang 0001. LESEP: Boosting Adversarial Transferability via Latent Encoding and Semantic Embedding Perturbations |
| 3383 | -- | 3395 | Tingting Han 0003, Yaochen Xu, Jun Yu 0002, Zhou Yu 0001, Sicheng Zhao. Action-Driven Semantic Representation and Aggregation for Video Captioning |
| 3396 | -- | 3409 | Yan Huang 0031, Xiaoshan Liao, Jinxiu Liang, Boxin Shi, Yong Xu 0007, Patrick Le Callet. Detail-Preserving Diffusion Models for Low-Light Image Enhancement |
| 3410 | -- | 3425 | Yulin Wang, Yueming Ma, Yuanyuan Li, Jiqing Zhang, Zetian Mi, XianPing Fu. Underwater Vignetting Image Correction Based on Binary Polynomial Regularization and Latent Low-Rank Representation |
| 3426 | -- | 3437 | Siqi Wang, Yehu Shen, Wenming Yang. Touchless Finger Vein and Fingerprint Verification via Exploiting Attention-Based Cross-Domain Fusion |
| 3438 | -- | 3449 | Qi Zhang, Long Chen 0001, Wanfeng Shang. Cross Dense Feature Learning With Task Guidance for Few-Shot Classification |
| 3450 | -- | 3461 | Xinbo Wu, Jianxun Lou, Yingying Wu, Wan'an Liu, Paul L. Rosin, Gualtiero B. Colombo, Stuart M. Allen, Roger M. Whitaker, Hantao Liu. Image Manipulation Quality Assessment |
| 3462 | -- | 3474 | Yu Zhou 0027, Wei Xie, Huisi Wu, Lei Huang 0001, Sam Kwong, Jianmin Jiang. Denoiser-Regulated Deep Unfolding Compressed Sensing With Learnable Fixed-Point Projections |
| 3475 | -- | 3485 | Jinyang Liu 0004, Shutao Li, Renwei Dian, Ze Song, Lishan Tan. Asymptotic Spectral Mapping for Hyperspectral Image Fusion |
| 3486 | -- | 3497 | Zhefei Cai, Yingle Fan, Minwei Zhu, Tao Fang. Ultra-Lightweight Network for Medical Image Segmentation Inspired by Bio-Visual Interaction |
| 3498 | -- | 3511 | Haowen Bai, Zixiang Zhao, Jiangshe Zhang 0001, Baisong Jiang, Lilun Deng, Yukun Cui, Shuang Xu, Chunxia Zhang 0002. Deep Unfolding Multi-Modal Image Fusion Network via Attribution Analysis |
| 3512 | -- | 3526 | Ye Yao, Detong Wang, Yanzhao Shen, Dawen Xu 0001, Ching-Chun Chang, Chinchen Chang 0001. PVO-Based Reversible Data Hiding Using Two-Stage Embedding and FPM Mode Selection |
| 3527 | -- | 3540 | Yuchao Zheng, Huimin Lu 0001, Jingyi Wang, Weidong Zhang 0007, Mohsen Guizani. High-Turbidity Underwater Image Enhancement via Turbidity Suppression Fusion |
| 3541 | -- | 3556 | Qingshan Hou, Yaqi Wang, Linqi Lan, Peng Cao 0001, Jinzhu Yang, Xiaoli Liu 0001, Meng Wang, Yih Chung Tham, Osmar R. Zaïane. A Reference-Free Quality Enhancement Framework for Low-Quality Fundus Images |
| 3557 | -- | 3572 | Mingyang Zhang 0002, Xiangyu Wang, Shuang Wu, Zhaoyang Wang, Maoguo Gong, Yu Zhou, Fenlong Jiang, Yue Wu 0004. Spatial-Spectral Aggregation Transformer With Diffusion Prior for Hyperspectral Image Super-Resolution |
| 3573 | -- | 3588 | Zijian Chen 0001, Wei Sun 0029, Haoning Wu 0001, Zicheng Zhang, Jun Jia, Ru Huang 0002, Xiongkuo Min, Guangtao Zhai, Wenjun Zhang 0001. Study of Subjective and Objective Naturalness Assessment of AI-Generated Images |
| 3589 | -- | 3602 | Mengjiao Shen, Liuyi Wang, Xianyou Zhong, Chengju Liu, Qijun Chen. FoggyDepth: Leveraging Channel Frequency and Non-Local Features for Depth Estimation in Fog |
| 3603 | -- | 3618 | Linbo Fu, Xin Liao, Jinlin Guo, Li Dong 0006, Zheng Qin 0001. WaveRecovery: Screen-Shooting Watermarking Based on Wavelet and Recovery |
| 3619 | -- | 3632 | Huibin Lin, Chun-Yang Zhang, C. L. Philip Chen. Contextual Distribution Alignment via Correlation Contrasting for Domain Generalization |
| 3633 | -- | 3648 | Zhi Yu, Zhiyong Huang 0004, Mingyang Hou, Jiaming Pei, Yan Yan 0022, Yushi Liu 0001, Daming Sun. Representation Selective Coupling via Token Sparsification for Multi-Spectral Object Re-Identification |
| 3649 | -- | 3663 | Zhaodi Ge, Hanning Chen, Xiaodan Liang, Lianbo Ma. Gated Mechanism Attention Transformer Based on Wavelet Enhanced Optical Flow Field Estimation for Foreground Detection |
| 3664 | -- | 3678 | Shuwei Shao, Zhongcai Pei, Weihai Chen, Dingchi Sun, Peter C. Y. Chen, Zhengguo Li. MonoDiffusion: Self-Supervised Monocular Depth Estimation Using Diffusion Model |
| 3679 | -- | 3692 | Zilu Guo, Liuyang Bian, Hu Wei, Jingyu Li, Huasheng Ni, Xuan Huang. DSNet: A Novel Way to Use Atrous Convolutions in Semantic Segmentation |
| 3693 | -- | 3705 | Yujie Zhang, Qi Yang 0003, Ziyu Shan, Yiling Xu. Asynchronous Feedback Network for Perceptual Point Cloud Quality Assessment |
| 3706 | -- | 3718 | Guang-yong Chen, Wei Dong, Guodong Fan, Jian-Nan Su, Min Gan, C. L. Philip Chen. LPFSformer: Location Prior Guided Frequency and Spatial Interactive Learning for Nighttime Flare Removal |
| 3719 | -- | 3731 | Kristian Fischer 0001, Fabian Brand, André Kaup. Boosting Neural Image Compression for Machines Using Latent Space Masking |
| 3732 | -- | 3744 | Yangang Cai, Peiyin Xing, Xuesong Gao. High Efficient 3D Convolution Feature Compression |
| 3745 | -- | 3756 | Pingping Zhang, Meng Wang 0017, Baoliang Chen, Rongqun Lin, Xu Wang 0006, Shiqi Wang 0001, Sam Kwong. Learning-Based Compression for Noisy Images in the Wild |
| 3757 | -- | 3769 | Zhimeng Huang, Chuanmin Jia, Shanshe Wang, Siwei Ma. HMFVC: A Human-Machine Friendly Video Compression Scheme |
| 3770 | -- | 3785 | Maida Cao, Wenrui Dai, Shaohui Li, Chenglin Li, Junni Zou, Ying Chen, Hongkai Xiong. End-to-End Optimized Image Compression With Deep Gaussian Process Regression |
| 3786 | -- | 3797 | Hadi Amirpour, M. Ghanbari 0001, Christian Timmerer. DeepStream: Video Streaming Enhancements Using Compressed Deep Neural Networks |
| 3798 | -- | 3811 | Heming Sun, Lu Yu 0003, Jiro Katto. Q-LIC: Quantizing Learned Image Compression With Channel Splitting |
| 3812 | -- | 3824 | Wenhan Yang, Haofeng Huang, Jiaying Liu 0001, Alex C. Kot. Facial Image Compression via Neural Image Manifold Compression |
| 3825 | -- | 3836 | Yuefeng Zhang, Chuanmin Jia, Jianhui Chang, Siwei Ma. Machine Perception-Driven Facial Image Compression: A Layered Generative Approach |
| 3837 | -- | 3852 | ShuShi Chen, Leilei Huang, Zhao Zan, Zhijian Hao, Hao Zhang, Xiaoxiang Chen, Minge Jing, Xiaoyang Zeng, Yibo Fan. Affine Motion Estimation Hardware Implementation With 51.7%/67.5% Internal Bandwidth Reduction for Versatile Video Coding |
| 3853 | -- | 3866 | Congkai An, Huanhuan Zhang, Jingyang Kang, Zhuo Liu, Anfu Zhou, Liang Liu 0001, Huadong Ma. Enhancing QoE of Adaptive Video Streaming by Generating Fine-Grained Throughput |
| 3867 | -- | 3881 | Zitong Li, Changqiao Xu, Han Xiao, Chuxing Fang, Lujie Zhong, Shujie Yang, Gabriel-Miro Muntean. Harmony: An Eco-Friendly Adaptive Rate Control Scheme for Video-on-Demand in Low Earth Orbit Satellite Internet |
| 3882 | -- | 3892 | Xingyu Gao 0001, Zhenyu Chen 0003, Boshen Zhang, Jianze Wei. Deep Learning to Hash With Application to Cross-View Nearest Neighbor Search |
| 3893 | -- | 3906 | Mingyang Lei, Jingfan Fan, Long Shao, Hong Song, Deqiang Xiao, Danni Ai, Tianyu Fu 0003, Yucong Lin, Ying Gu, Jian Yang 0009. Double-Shot 3D Shape Measurement With a Dual-Branch Network for Structured Light Projection Profilometry |
| 3907 | -- | 3920 | Bin Ma 0003, Haocheng Wang, Jian Xu, Xiao-Yu Wang 0011, Xiaolong Li 0001, Jian Li 0034. Color Image High-Capacity Differential Steganography Algorithm Based on Multiple Adversarial Networks |