| 11740 | -- | 11753 | Wei Feng, Chang Tang, Cheng Zeng, Xinwang Liu 0002, Junjun Jiang, Xianju Li, Xinzhong Zhu. Diversity Learning Guided Dual Graph Autoencoder for Unsupervised Hyperspectral Band Selection |
| 11754 | -- | 11769 | Yutang Jin, Shiming Chen 0002, Tianle Tong, Weiping Ding 0001, Yisong Wang. Multi-Modal Prompts With Primitives Enhancement for Compositional Zero-Shot Learning |
| 11770 | -- | 11782 | Wenjie Liu 0001, Zhijie Ren. DM-MKGC: Multimodal Knowledge Graph Completion Based on Dynamic Prompt Learning and Multi-Granularity Aggregation |
| 11783 | -- | 11796 | Jian Yang, Yuan Rao 0001, Hao Fan 0004, Junyu Dong, Hui Yu 0001. Learning Semantic-Aware Point-Line Features for Localization and Reconstruction |
| 11797 | -- | 11809 | Yujia Sun, Weisheng Dong, Shuaibo Wang, Peng Wu 0015, Mingtao Feng, Xin Li 0005, Guangming Shi. Distilling Hierarchical Knowledge From Multimodal Fusion for Unimodal Image Segmentation |
| 11810 | -- | 11821 | Haoxing Chen, Yaohui Li, Zizheng Huang, Yan Hong 0001, Zhuoer Xu, Zhangxuan Gu, Jun Lan, Huijia Zhu, Weiqiang Wang 0002. Conditional Prototype Rectification Prompt Learning |
| 11822 | -- | 11833 | Zhenglai Li, Yuqi Shi, Xiao He 0010, Chang Tang. Mask-Informed Deep Contrastive Incomplete Multi-View Clustering |
| 11834 | -- | 11848 | Anqi Zhao, Ruitao Feng, Xinghua Li 0002. ThiefCloud: A Thickness Fused Thin Cloud Removal Network for Optical Remote Sensing Image With Self-Supervised Learnable Cloud Prior |
| 11849 | -- | 11862 | Yangpeng Liu, Junjian Huang, Shiping Wen 0001, Xing He 0001, Wei Zhang 0102, Zhao Feng. CTIGEN-CDM: Controlled Text-to-Image Generation Using Cropped Diffusion Models |
| 11863 | -- | 11876 | Shu Jiang, Dong Zhang, Rui Yan 0010, Xiangbo Shu, Pingcheng Dong, Long Chen 0016, Xiaoyu Du 0002. Eliminating Semantic Ambiguity in Human Pose Estimation via Stable Feature Upsampling |
| 11877 | -- | 11892 | Guohua Lv, Xiang Gao, Aimei Dong, Zhonghe Wei, Jinyong Cheng. SLFusion: A Structure-Aware Infrared and Visible Image Fusion Network for Low-Light Scenes |
| 11893 | -- | 11907 | Shuai Han, Jingwei Xin, Jie Li 0001, Nannan Wang 0001, Xinbo Gao 0001. Unsupervised Face Super-Resolution via Integrating Faithful 3D Facial Priors |
| 11908 | -- | 11921 | Qingguo Meng, Andong Lu, Zhe Jin 0001. BR-MoE: Blind Multi-Modal Tracking With Route-Dynamic Mixture of Experts |
| 11922 | -- | 11934 | Mingzhu Xu, Zhengyu Sun, Yijun Hu, Haoyu Tang 0002, Yupeng Hu 0003, Xuemeng Song, Liqiang Nie. Superpixel Segmentation With Edge Guided Local-Global Attention Network |
| 11935 | -- | 11949 | Dingli Hua, Qingmao Chen, Zhiliang Wu, Yifan Zuo, Wenying Wen, Yuming Fang. Perceptual Transform Fusion of Infrared and Visible Images |
| 11950 | -- | 11964 | Kui Liu, Bart Goossens, Tom De Schepper, Wilfried Philips. Improving Post-Training Quantization via Probabilistic Programming |
| 11965 | -- | 11977 | Lvwei Zhu, Eric Rigall, Ying Gao 0005, Zongshuai Zhang, Yafei Bai, Junyu Dong. Region-Aware Driven Distribution Optimization for Stereo Matching |
| 11978 | -- | 11992 | Zuojie Xie, Hao Ren 0002, Junjian Huang, Zhiquan He, Hong Lu 0001, Yong Liu, Jiawen Lu, Lvfan Yuan, Shulin Liu, Changyong Xie. Low-Light Image Enhancement via Multi-Exposure Progressive Contrastive Regularization |
| 11993 | -- | 12006 | Rong Zhou, Simin Yu. Breaking a New Image Cryptosystem From Three Perspectives |
| 12007 | -- | 12022 | Shuang Li, Ganggang Dong, Hongwei Liu 0001. ImagingNet: A New Learnable SAR Imaging Method via Hierarchical U-Shaped Network |
| 12023 | -- | 12037 | Ying Zhu, Hong Liu 0008, Guoliang Hua, Hao Tang 0005, Yidi Li, Weibo Huang. Dual Attention Guidance Network for Self-Supervised Monocular Depth Estimation |
| 12038 | -- | 12051 | Yuxiang Zhang 0005, Wei Li 0032, Wen Jia, Mengmeng Zhang 0005, Ran Tao 0003, Shunlin Liang. Cross-Domain Hyperspectral Image Classification Based on Bi-Directional Domain Adaptation |
| 12052 | -- | 12065 | Peng Yang, Ming Liu 0029, Liquan Dong, Lingqin Kong, Yuejin Zhao. Polynomial Fitting-Based Estimation of Spatially Varying Point Spread Function From a Single Image |
| 12066 | -- | 12081 | Haoyuan Li, Qi Hu, Binjia Zhou, You Yao, Jiacheng Lin, Kailun Yang 0001, Peng Chen 0008. CFMW: Cross-Modality Fusion Mamba for Robust Object Detection Under Adverse Weather |
| 12082 | -- | 12095 | Zhongling Huang, Long Liu, Shuxin Yang, Zhirui Wang 0003, Gong Cheng 0003, Junwei Han 0001. Physics-Guided Detector for SAR Airplanes |
| 12096 | -- | 12108 | Xu Han, Qi Wang. Compensating for the Incomplete With the Complete: An Efficient Scene Text Detector |
| 12109 | -- | 12124 | Kunpeng Wang 0005, Zhengzheng Tu, Chenglong Li 0002, Zhengyi Liu, Bin Luo 0001. Unified-Modal Salient Object Detection via Adaptive Prompt Learning |
| 12125 | -- | 12137 | Yu Liu 0021, Chun Luo, Wanglong Wan, Wenqiang Jin, Zheng Qin 0001. A Secure Medical Image Encryption Scheme Based on Cross-Ring Josephus Scrambling and Two-Dimensional Cellular Automata |
| 12138 | -- | 12151 | Na Zheng, Xuemeng Song, Wai Teng Tang, See-Kiong Ng, Liqiang Nie, Roger Zimmermann. Unsupervised Few-Shot Food Recognition With Intra-Class Variation and Inter-Class Similarity Modeling |
| 12152 | -- | 12166 | Mingyue Chen, Xin Liao 0001, Han Fang, Jinlin Guo, Yanxiang Chen, Xiaoshuai Wu. Flexible Partial Screen-Shooting Watermarking With Provable Robustness |
| 12167 | -- | 12181 | Meng Li, Bo Ma 0012, Yulin Zhang. Lightweight Image Super-Resolution With Pyramid Clustering Transformer |
| 12182 | -- | 12195 | Jia Wang 0054, Zhiguo Qu, Lingshuang Kong, Wentao Yuan, Encai Liu, Rui Zhang, Ruigang Fu. Learning a Perspective-Invariant Descriptor for Remote Sensing Image Matching |
| 12196 | -- | 12211 | Piotr Kopa Ostrowski, Daniel Wesierski, Anna Jezierska, Tomasz P. Stefanski. Lifting Deep Image Denoisers to Video With Frame Interpolation Pre-Training |
| 12212 | -- | 12226 | Hengyue Bi, Long Chen 0019, Jingchao Cao, Jingyang Wang, Jinghao Sun, Yuan Rao 0001, Junyu Dong. SeaDiff: Underwater Image Enhancement With Degradation-Aware Diffusion Model |
| 12227 | -- | 12237 | Bo Hu 0008, Wei Wang, Leida Li, Lihuo He, Wen Lu 0004, Xinbo Gao 0001. Blind Quality Assessment of Wide-Angle Videos Based on Deformation Representation Learning and Multi-Dimensional Feature Fusion |
| 12238 | -- | 12250 | Yunnan Wang, Ziqiang Li, Wenyao Zhang, Lexiang Lv, Zequn Zhang, Xiaoyu Shen, Xin Jin 0014, Wenjun Zeng 0001. Canvas: Compositional Generation for Art Painting With Seamless Subject-Driven Infusion |
| 12251 | -- | 12264 | Dongshuai Duan, Honglei Su, Qi Liu 0029, Hui Yuan 0001, Zhou Wang 0001. DQP-PCQA: Deep Quantization Parameters Bring New Insight to Point Cloud Quality Assessment |
| 12265 | -- | 12277 | Ting Zhou, Siyuan Chen, Siyao Wan, Hanyun Lv, Zheng Luo, Jianhui Wu 0002. GEDR: Gaussian-Enhanced Detail Reconstruction for Real-Time High-Fidelity 3D Scene Reconstruction |
| 12278 | -- | 12291 | Xiao Xu 0005, Libo Qin 0001, Wanxiang Che, Min-Yen Kan. Manager: Aggregating Insights From Unimodal Experts in Two-Tower VLMs and MLLMs |
| 12292 | -- | 12305 | Runhao Zeng, Qi Deng, Ronghao Zhang, Shuaicheng Niu, Jian Chen 0011, Xiping Hu, Victor C. M. Leung. Exploring Audio Cues for Enhanced Test-Time Video Model Adaptation |
| 12306 | -- | 12316 | Guangsheng Xu, Guoyi Zhang, Lejia Ye, Shuwei Gan, Xiaohu Zhang, Xia Yang. Optimizing Local-Global Dependencies for Accurate 3D Human Pose Estimation |
| 12317 | -- | 12328 | Shanshan Han, Shuang Li, Shuodi Wang, Lin Yuan 0002, Yan Zhang 0108, Xinbo Gao 0001. Deepfake Detection Leveraging Self-Blended Artifacts Guided by Facial Embedding Discrepancy |
| 12329 | -- | 12340 | Yongqi Huang, Peng Ye 0006, Chongjun Tu, Tao Chen 0003, Tong He 0001, Wanli Ouyang. Sparse-to-Dense Training: A Novel Training Scheme to Enhance Vision Transformers |
| 12341 | -- | 12354 | Cheng Lin, Hong Hu, Jie Zou 0001, Lujun Li, Jun Liu, Yipeng Gao, Yang Yang 0002, Heng Tao Shen. Distilling Grounding DINO for an Edge-Cloud Collaborative Advanced Driver Assistance System |
| 12355 | -- | 12368 | Zhifeng Wang 0004, Qixuan Zhang, Peter Zhang, Wenjia Niu, Kaihao Zhang, Ramesh S. Sankaranarayana, Sabrina B. Caldwell, Tom Gedeon. Visual and Textual Prompts in VLLMs for Enhancing Emotion Recognition |
| 12369 | -- | 12381 | Jianing Wang 0003, Shengjia Hao, Zheng Hua, Yuqiong Yao, Qiong Xu, Bo Liu 0009, Maoguo Gong. TBGA-Net: Trigonometric Bilinear Attention and Global-Aware Aggregation Network for Large-Scale 3D Point Cloud Segmentation |
| 12382 | -- | 12395 | Bo Pang, Deming Zhai, Jianan Zhen, Long Wang, Xu Han, Guofeng Zhang 0001, Xianming Liu 0005. Zero6DOT: Zero-Shot 6D Object Pose Tracking With Monocular RGB Video |
| 12396 | -- | 12409 | Kehua Chen, Zhenlong Yuan, Haihong Xiao, Tianlu Mao, Zhaoqi Wang. Learning Multi-View Stereo With Geometry-Aware Prior |
| 12410 | -- | 12425 | Jianping Zhong, Zhaobo Qi, Kaiwen Duan, Yuanrong Xu, Weigang Zhang, Qingming Huang. VPA: Multi-Modal Virtual Point Augmentation for 3D Object Detection |
| 12426 | -- | 12440 | Yanbo Gao, Huibin Bai, Huasong Zhou, Xingyu Gao 0001, Shuai Li 0005, Xun Cai, Hui Yuan 0001, Wei Hua 0002, Tian Xie 0011. Adaptive Depth-Converted-Scale Convolution for Self-Supervised Monocular Depth Estimation |
| 12441 | -- | 12454 | Zijian Zhang, Muqing Wu, Honghao Qi, Min Zhao 0002. EFMK: Extrinsic Parameters-Free Multi-View 3D Human Skeleton Estimation |
| 12455 | -- | 12466 | Yiqiang Wu, Yu Qin, Jiacheng Sun, Chang Liu 0082, Yunfei Bai, Chenghai Mao, Xiaomao Li. SampleDet3D: Sample Enhanced 3D Object Detection |
| 12467 | -- | 12476 | Junyi Hou, Zihao Pan, Changjun Xu, Lei Yu 0007. Low Texture 3D Reconstruction System Based on Manhattan Axis and 2D/3D Line Features |
| 12477 | -- | 12491 | Baoyang Mu, Feng Shao 0001, Hangwei Chen, Xuejin Wang, Qiuping Jiang. A Mutual Head Knowledge Distillation Framework for Lightweight RGB-T Crowd Counting |
| 12492 | -- | 12507 | Aihua Mao, Shuyi Wen, Feng Chen, Ran Yi 0002, Yong-Jin Liu 0001. Robust 3D Visual Question Answering via Bias Learning |
| 12508 | -- | 12523 | Shengjun Zhu, Jiaxin Cai, Runqing Xiong, Liping Zheng, Duo Ma. Singular Pooling: A Spectral Pooling Paradigm for Second-Trimester Prenatal Level II Ultrasound Standard Fetal Plane Identification |
| 12524 | -- | 12537 | Tianshi Luo, Hao Li 0009, Maoguo Gong, Yu Zhou 0051, A. Kai Qin. STEAM: Style Transfer Enabled Adversarial Attack With Attention Mechanism on Remote Sensing Image Scene Classification |
| 12538 | -- | 12549 | Yang Liu 0069, Jiale Du, Xinbo Gao 0001, Jungong Han, Ling Shao 0001. Relation-Aware Meta-Learning for Zero-Shot Sketch-Based Image Retrieval |
| 12550 | -- | 12563 | Xuecheng Li, Yuanjie Zheng. Inpaint-Outpaint Synergy: Mask Refinement for Trimap-Free Matting |
| 12564 | -- | 12578 | Lei Song 0010, Huaibo Song, Bo Jiang 0017. Adaptive Clustering and Frequency Division Network for Efficient Monocular Depth Estimation |
| 12579 | -- | 12591 | Gee-Sern Jison Hsu, Wei-Jun Lin, Wei-Chun Hsieh, Wei-Zhe Jian, Sheng-Luen Chung, Marina L. Gavrilova. Style-Preserving Generator for Synthetic License Plate Recognition |
| 12592 | -- | 12606 | Wenbin Yan, Hua Chen 0008, Qingwei Wu, Xiaogang Zhang, Qiu Fang, Shengjie Hu, Yaonan Wang 0001. LFSSMam: Efficient Aggregation of Multi-Spatial-Angular-Modal Information Using Selective SSM for Light Field Semantic Segmentation |
| 12607 | -- | 12621 | Yang Li, Songlin Yang, Wei Wang 0025, Jing Dong 0003. Beyond Inserting: Learning Subject Embedding for Semantic-Fidelity Personalized Diffusion Generation |
| 12622 | -- | 12635 | Wenjia Meng, Huimin Han, Xiankai Lu, Yilong Yin, Gang Pan 0001, Qian Zheng. LAC-PS: A Light Direction Selection Policy Under the Accuracy Constraint for Photometric Stereo |
| 12636 | -- | 12651 | Gang He 0002, Long Gao, Langkun Chen, Yan Jiang, Weiying Xie, Yunsong Li 0001. Hyperspectral Object Tracking With Spectral Information Prompt |
| 12652 | -- | 12665 | Yao Chen, Guancheng Jia, Yufei Zha, Peng Zhang 0005, Yanning Zhang 0001. LINR: A Plug-and-Play Local Implicit Neural Representation Module for Visual Object Tracking |
| 12666 | -- | 12679 | Ye Wang 0020, Mingyang Ma 0004, Ge Zhang 0006, Yuheng Liu, Tao Gao 0001, Shaohui Mei. Hyperspectral Tracker With Constrained Object Adaptive Learning and Trajectory Construction |
| 12680 | -- | 12691 | Shou Feng, Jinghe Zhang, Yuanze Fan, Xinyao Liu, Chunhui Zhao 0003, Wei Li 0032, Ran Tao 0003. Cross-Domain Few-Shot Learning Method Based on Fractional Domain Information for Hyperspectral Image Multi-Class Change Detection |
| 12692 | -- | 12706 | Xuting Lan, Weizhi Xian, Mingliang Zhou 0001, Jielu Yan, Xuekai Wei, Jun Luo 0006, Weijia Jia 0001, Sam Kwong. No-Reference Image Quality Assessment: Exploring Intrinsic Distortion Characteristics via Generative Noise Estimation With Mamba |
| 12707 | -- | 12718 | Jinglin Xu, Yaqi Zhang, Wenhao Zhou, Hongmin Liu 0001. BFSTAL: Bidirectional Feature Splitting With Cross-Layer Fusion for Temporal Action Localization |
| 12719 | -- | 12733 | Guanqi Ding, Xinzhe Han, Shuhui Wang, Xin Jin 0004, Qingming Huang. Stable Attribute Group Editing for Reliable Few-Shot Image Generation |
| 12734 | -- | 12746 | Jingqian Wu, Shuo Zhu, Chutian Wang, Boxin Shi, Edmund Y. Lam. SweepEvGS: Event-Based 3D Gaussian Splatting for Macro and Micro Radiance Field Rendering From a Single Sweep |
| 12747 | -- | 12759 | Honglin Guo, Ruidong Chen, Weizhi Nie, Lanjun Wang, Anan Liu. CompCraft: Foreground-Driven Image Synthesis With Customized Layouts |
| 12760 | -- | 12771 | Yiqian Wu, Hao Xu 0049, Xiangjun Tang, Yue Shangguan, Hongbo Fu 0001, Xiaogang Jin 0001. 3DPortraitGAN: Learning One-Quarter Headshot 3D GANs From a Single-View Portrait Dataset With Diverse Body Poses |
| 12772 | -- | 12787 | Alessandro Gnutti, Fabrizio Guerrini, Riccardo Leonardi, Antonio Ortega. Variable-Size Symmetry-Based Graph Fourier Transforms for Image Compression |
| 12788 | -- | 12801 | Shiwei Wang 0005, Liquan Shen, Peiying Wu, Zhaoyi Tian, Feifeng Wang. DRLN: Disparity-Aware Rescaling Learning Network for Multi-View Video Coding Optimization |
| 12802 | -- | 12815 | Jian Xiong 0005, Junhao Wu, Wang Luo, Jiucheng Xie, Hui Yuan 0001, Hao Gao 0005. Multi-Task Learning Model for V-PCC Geometry Compression Artifact Removal |
| 12816 | -- | 12829 | Jie Li 0015, Zhixin Li, Zhi Liu 0002, Peng Yuan Zhou, Richang Hong, Qiyue Li 0001, Han Hu 0003. Viewport Prediction for Volumetric Video Streaming by Exploring Video Saliency and User Trajectory Information |
| 12830 | -- | 12845 | Guquan Jing, Peng Gao, Yujian Lee, Yiyang Hu, Hui Zhang 0062. 3D-Aided Pedestrian Representation Learning for Video-Based Person Re-Identification |
| 12846 | -- | 12860 | Lingchen Gu, Xiaojuan Shen, Jiande Sun 0001, Yan Liu, Jing Li 0046, Zhihui Li, Sen-Ching S. Cheung, Wenbo Wan. Dual Prototypes-Based Personalized Federated Adversarial Cross-Modal Hashing |
| 12861 | -- | 12873 | Fengling Li 0001, Zequn Wang, Tianshi Wang 0001, Lei Zhu 0002, Xiaojun Chang. Generative Augmentation Hashing for Few-Shot Cross-Modal Retrieval |
| 12874 | -- | 12889 | Yating Liu, Yaowei Li 0001, Xiangyuan Lan, Wenming Yang, Zimo Liu, Qingmin Liao. UP-Person: Unified Parameter-Efficient Transfer Learning for Text-Based Person Retrieval |
| 12890 | -- | 12903 | Pujun Zhou, Guanchao Qiao, Qi Yu 0002, M. Chen, Y. C. Wang, Y.-C. Chen, J. J. Wang, Ning Ning 0002, Y. Liu, Shaogang Hu. A 0.96 pJ/SOP Heterogeneous Neuromorphic Chip Toward Energy-Efficient Edge Visual Applications |
| 12904 | -- | 12917 | Lixin Zhang, Qian Wang 0046. Synergistic Fusion Network of Microscopic Hyperspectral and RGB Images for Multi-Perspective Segmentation |
| 12918 | -- | 12924 | Ziwen He, Xingjie Dai, Xiang Zhang 0023, Zhangjie Fu. MMDStegNet: An Adversarial Steganography Framework With Maximum Mean Discrepancy Regularization |