| 3 | -- | 18 | Xijie Cheng, Xiaohui He 0001, Mengjia Qiao, Panle Li, Peng Chang, Tianhao Zhang, Xiaoyu Guo, Jinyong Wang, Zhihui Tian, Guangsheng Zhou. Multi-View Graph Convolutional Network With Spectral Component Decompose for Remote Sensing Images Classification |
| 19 | -- | 32 | Junbin Zhuang 0001, Yan Zheng, Baolong Guo 0001, Yunyi Yan. Globally Deformable Information Selection Transformer for Underwater Image Enhancement |
| 33 | -- | 44 | Jianan Li 0001, Xiaoying Yuan, Haolin Qin, Ying Wang 0064, Xincong Liu, Tingfa Xu. CVT-Track: Concentrating on Valid Tokens for One-Stream Tracking |
| 45 | -- | 61 | Minglei Li 0002, Wushuang Gong, Pengfei Yan, Xiang Li 0084, Yuchen Jiang, Hao Luo 0003, Hang Zhou, Shen Yin. Joint Lesion Detection and Classification of Breast Ultrasound Video via a Clinical Knowledge-Aware Framework |
| 62 | -- | 74 | Yihui Liang, Qian Fu, Kun Zou, Guisong Liu, Han Huang 0002. Enhancing Transparent Object Matting Using Predicted Definite Foreground and Background |
| 75 | -- | 90 | Chunlei Wang, Wenquan Feng, Shuchang Lyu, Guangliang Cheng, Xiangtai Li, Binghao Liu, Qi Zhao 0001. A Masked Reference Token Supervision-Based Iterative Visual-Language Framework for Robust Visual Grounding |
| 91 | -- | 102 | Jianjie Luo, Yehao Li, Yingwei Pan, Ting Yao, Jianlin Feng, Hongyang Chao, Tao Mei 0001. Exploring Vision-Language Foundation Model for Novel Object Captioning |
| 103 | -- | 117 | Jiabing Xiong, Qiang Ling 0001. Mask-Guided Siamese Tracking With a Frequency-Spatial Hybrid Network |
| 118 | -- | 133 | Qinglei Li, Qi Wang, Yongbin Qin, Xinyu Dong, Xingcai Wu, Shiming Chen 0002, Wu Liu, Yong-Jin Liu 0001, Jiebo Luo 0001. DRC: Discrete Representation Classifier With Salient Features via Fixed-Prototype |
| 134 | -- | 147 | Pengxiang Li 0002, Chengtang Yao, Yunde Jia, Yuwei Wu 0001. Inter-Scale Similarity Guided Cost Aggregation for Stereo Matching |
| 148 | -- | 161 | Xiao Wang 0014, Jiandong Jin, Chenglong Li 0002, Jin Tang 0001, Cheng Zhang 0010, Wei Wang 0115. Pedestrian Attribute Recognition via CLIP-Based Prompt Vision-Language Fusion |
| 162 | -- | 177 | Zongyang Zhao, Jiehu Kang, Luyuan Feng, Jian Liang, Yuqi Ren, Bin Wu. LFA-Net: Enhanced PointNet and Assignable Weights Transformer Network for Partial-to-Partial Point Cloud Registration |
| 178 | -- | 194 | Jiehua Zhang, Liang Li 0003, Chenggang Yan 0001, Wei Ke 0003, Yihong Gong. Monocular Depth Estimation on Adverse Weathers With Curriculum Domain Distribution Alignment |
| 195 | -- | 206 | Junrui Xiao, Zhikai Li, Jianquan Li, Lianwei Yang, Qingyi Gu. BinaryViT: Toward Efficient and Accurate Binary Vision Transformers |
| 207 | -- | 218 | Minjun Shen, Guobao Xiao, Changcai Yang, Junwen Guo, Lei Zhu 0002. CLG-Net: Rethinking Local and Global Perception in Lightweight Two-View Correspondence Learning |
| 219 | -- | 231 | Zhuomin Liang, Liang Bai, Jinyu Fan, Xian Yang 0001, Jiye Liang. Contrastive Learning With Enhancing Detailed Information for Pre-Training Vision Transformer |
| 232 | -- | 244 | Bo Liu, Chengrong Yang, Jing Guo, Yun Yang 0003. A Novel Semi-Supervised Object Detection Approach via Scale Rebalancing and Global Proposal Contrast Consistency |
| 245 | -- | 258 | Yijin Yang, Xiaodong Gu 0001. Attention-Based Gating Network for Robust Segmentation Tracking |
| 259 | -- | 272 | Zhuang Luo, Yang Xiao 0007, Feng Yang 0012, Joey Tianyi Zhou, Zhiwen Fang. Rhythmer: Ranking-Based Skill Assessment With Rhythm-Aware Transformer |
| 273 | -- | 286 | Anjun Chen, Xiangyu Wang, Kun Shi 0003, Yuchi Huo, Jiming Chen 0001, Qi Ye. Toward Weather-Robust 3D Human Body Reconstruction: Millimeter-Wave Radar-Based Dataset, Benchmark, and Multi-Modal Fusion |
| 287 | -- | 299 | Fei Wu 0001, Jun Yin, Xiaochuan Li, Jianfeng Wu, Da Jin, Jiamin Yang. CoNet: A Consistency-Oriented Network for Camouflaged Object Segmentation |
| 300 | -- | 314 | Tongtong Yuan, Xuange Zhang, Bo Liu 0011, Kun Liu, Jian Jin, Zhenzhen Jiao. Surveillance Video-and-Language Understanding: From Small to Large Multimodal Models |
| 315 | -- | 328 | Hao Liu, Yong Zhou 0003, Bing Liu 0016, Ming Yan 0007, Joey Tianyi Zhou. L2A: Learning Affinity From Attention for Weakly Supervised Continual Semantic Segmentation |
| 329 | -- | 342 | Min Xie, Jieyu Zhao, Kedi Shen. A Novel SO(3) Rotational Equivariant Masked Autoencoder for 3D Mesh Object Analysis |
| 343 | -- | 356 | Yuwen Pan, Rui Sun 0006, Yuan Wang, Wenfei Yang, Tianzhu Zhang, Yongdong Zhang 0001. Purify Then Guide: A Bi-Directional Bridge Network for Open-Vocabulary Semantic Segmentation |
| 357 | -- | 366 | Shanaka Ramesh Gunasekara, Wanqing Li 0001, Jack Yang 0003, Philip O. Ogunbona. Asynchronous Joint-Based Temporal Pooling for Skeleton-Based Action Recognition |
| 367 | -- | 379 | Peiyu Guan, Zhiqiang Cao, Shengxuan Fan, Yuequan Yang, Junzhi Yu, Shuo Wang 0001. Hardness-Aware Metric Learning With Cluster-Guided Attention for Visual Place Recognition |
| 380 | -- | 393 | Jinfan Liu, Yichao Yan, Junjie Li, Weiming Zhao, Pengzhi Chu, Xingdong Sheng, Yunhui Liu 0006, Xiaokang Yang. IPAD: Industrial Process Anomaly Detection Dataset |
| 394 | -- | 404 | Tianyu Sun, Dingchang Hu, Yixiang Dai, Guijin Wang. Diffusion-Based Depth Inpainting for Transparent and Reflective Objects |
| 405 | -- | 417 | Xiaomin Li, Qinghe Wang, Dezhuang Li, Mengmeng Ge, Xu Jia 0012, You He, Huchuan Lu. MoBox: Enhancing Video Object Segmentation With Motion-Augmented Box Supervision |
| 418 | -- | 430 | Kexing Ding, Ting Lu 0002, Wei Fu 0003, Leyuan Fang. Cross-Scene Hyperspectral Image Classification With Consistency-Aware Customized Learning |
| 431 | -- | 444 | Pan Liu, Yuanyang Bu, Yong-Qiang Zhao 0001, Seong G. Kong. Enhancing Visual Data Completion With Pseudo Side Information Regularization |
| 445 | -- | 460 | Fan Yang 0032, Sosuke Yamao, Ikuo Kusajima, Atsunori Moteki, Shoichi Masui, Shan Jiang 0006. YOWO: You Only Walk Once to Jointly Map an Indoor Scene and Register Ceiling-Mounted Cameras |
| 461 | -- | 476 | HuaQing Hao, Weibin Liu, Weiwei Xing. Prior-Structure Driven Weakly-Supervised Learning for Fine-Grained Human Parsing |
| 477 | -- | 491 | Ke Gu 0001, Hongyan Liu 0004, Yuchen Liu, Junfei Qiao 0001, Guangtao Zhai, Wenjun Zhang 0001. Perceptual Information Fidelity for Quality Estimation of Industrial Images |
| 492 | -- | 507 | Linfeng Ma, Han Fang, Zehua Ma, Zhaoyang Jia, Weiming Zhang 0001, Nenghai Yu. C³shartMark: A Chart Watermarking Scheme With Consecutive-Encoding and Concurrent-Decoding |
| 508 | -- | 519 | Jiawei Mao, Guangyi Zhao, Xuesong Yin, Yuanqi Chang. SwinStyleformer is a Favorable Choice for Image Inversion |
| 520 | -- | 533 | Wenhao Xu, Changwei Wang 0001, Xuxiang Feng, Rongtao Xu, Longzhao Huang, Zherui Zhang, Li Guo 0004, Shibiao Xu. Generalization Boosted Adapter for Open-Vocabulary Segmentation |
| 534 | -- | 546 | Weijia Wu 0001, Zhuang Li, Yuanqiang Cai, Hong Zhou, Mike Zheng Shou. A Bilingual, Open World Video Text Dataset and Real-Time Video Text Spotting With Contrastive Learning |
| 547 | -- | 560 | Fan Yang, Binbin Liang, Wei Li 0075, Jianwei Zhang 0013. Multidimensional Fusion Network for Multispectral Object Detection |
| 561 | -- | 576 | Yuxuan Gu, Yi Jin 0002, Ben Wang 0005, Zhixiang Wei, Xiaoxiao Ma 0006, Haoxuan Wang, Pengyang Ling, Huaian Chen, Enhong Chen. Seed Optimization With Frozen Generator for Superior Zero-Shot Low-Light Image Enhancement |
| 577 | -- | 588 | Jiacheng Hou, Zhong Ji, Jinyu Yang, Feng Zheng. Bidirectional Error-Aware Fusion Network for Video Inpainting |
| 589 | -- | 600 | Yongkang Zhang 0001, Han Zhang, Jun Li 0072, Zhiping Shi 0002, Jian Yang 0030, Kaixin Yang, Shuo Yin, Qiuyan Liang, Xianglong Liu 0001. Bullet-Screen-Emoji Attack With Temporal Difference Noise for Video Action Recognition |
| 601 | -- | 616 | Yuanyuan Li, Zetian Mi, Yulin Wang, Shuaiyong Jiang, XianPing Fu. TAFormer: A Transmission-Aware Transformer for Underwater Image Enhancement |
| 617 | -- | 631 | Junfei Shi, Shanshan Ji, Haiyan Jin, Junhuai Li, Maoguo Gong, Weisi Lin. Content-Adaptive Multi-Region Deep Network for Polarimetric SAR Image Classification |
| 632 | -- | 642 | Kaihui Cheng, Chule Yang, Xiao Liu, Naiyang Guan, Zhiyuan Wang. LPN: Language-Guided Prototypical Network for Few-Shot Classification |
| 643 | -- | 656 | Yu Xie, Lianhang Luo, Tianpei Cao, Bin Yu 0011, A. Kai Qin. Contrastive Learning Network for Unsupervised Graph Matching |
| 657 | -- | 669 | Chen Yang 0020, Junxiao Wang, Huixiao Meng, Shuyuan Yang, Zhixi Feng. Negative Class Guided Spatial Consistency Network for Sparsely Supervised Semantic Segmentation of Remote Sensing Images |
| 670 | -- | 683 | Chang Wan, Ming-Hsuan Yang 0001, Minglu Li 0001, Yunliang Jiang, Zhonglong Zheng. Nested Annealed Training Scheme for Generative Adversarial Networks |
| 684 | -- | 697 | Xiao Jiang, Yiyuan Xie, Yushu Zhang 0001, Yichen Ye, Fang Xu, Lili Li, Ye Su, Zhuang Chen. Reversible Data Hiding in Encrypted Images Using Reservoir Computing-Based Data Fusion Strategy |
| 698 | -- | 712 | Xueli Geng, Lingling Li 0002, Licheng Jiao, Xu Liu 0006, Fang Liu 0001, Shuyuan Yang. Knowledge-Aware Geometric Contourlet Semantic Learning for Hyperspectral Image Classification |
| 713 | -- | 727 | Yaowu Fan, Jia Wan 0001, Andy J. Ma. Learning Crowd Scale and Distribution for Weakly Supervised Crowd Counting and Localization |
| 728 | -- | 740 | Zhishe Wang, Zhuoqun Zhang, Wuqiang Qi, Fengbao Yang, Jiawei Xu 0004. FreqGAN: Infrared and Visible Image Fusion via Unified Frequency Adversarial Learning |
| 741 | -- | 753 | Shi Chen, Lefei Zhang, Liangpei Zhang 0001. Cyclic Cross-Modality Interaction for Hyperspectral and Multispectral Image Fusion |
| 754 | -- | 768 | Heqian Qiu, Lanxiao Wang, Taijin Zhao, Fanman Meng, Qingbo Wu 0001, Hongliang Li 0001. MCCE-REC: MLLM-Driven Cross-Modal Contrastive Entropy Model for Zero-Shot Referring Expression Comprehension |
| 769 | -- | 782 | Bin Fan 0002, Ying Guo, Yuchao Dai, Chao Xu 0006, Boxin Shi. Self-Supervised Learning for Rolling Shutter Temporal Super-Resolution |
| 783 | -- | 799 | Guanbo Wang, Haiyan Li, Victor S. Sheng, Yujun Ma, Hongwei Ding, Hongzhi Zhao. DPMNet: A Remote Sensing Forest Fire Real-Time Detection Network Driven by Dual Pathways and Multidimensional Interactions of Features |
| 800 | -- | 810 | Lei Qi 0001, Dongjia Zhao, Yinghuan Shi, Xin Geng 0001. Patch-Aware Batch Normalization for Improving Cross-Domain Robustness |
| 811 | -- | 822 | Yusong Hu, Zichen Liang, Xialei Liu, Qibin Hou, Ming-Ming Cheng. Reformulating Classification as Image-Class Matching for Class Incremental Learning |
| 823 | -- | 837 | Wanyu Wu, Wei Wang 0170, Zheng Wang 0007, Kui Jiang, Zhengguo Li. For Overall Nighttime Visibility: Integrate Irregular Glow Removal With Glow-Aware Enhancement |
| 838 | -- | 856 | Mingye Ju, Chunming He, Can Ding, Wenqi Ren, Lin Zhang 0014, Kai-Kuang Ma. All-Inclusive Image Enhancement for Degraded Images Exhibiting Low-Frequency Corruption |
| 857 | -- | 873 | Yuxin Kong, Peng Yang 0004, Yan Cheng. Adaptive On-Device Model Update for Responsive Video Analytics in Adverse Environments |
| 874 | -- | 887 | Bobiao Guo, Ping Ping, Junyuan Huo. CRDH: Compatible Reversible Data Hiding With High Capacity and Generalization |
| 888 | -- | 899 | Zhiyuan Li, Yanhui Zhou, Hao Wei 0005, Chenyang Ge, Jingwen Jiang. Toward Extreme Image Compression With Latent Feature Guidance and Diffusion Prior |
| 900 | -- | 910 | Yili Jin 0001, Xize Duan, Kaiyuan Hu, Fangxin Wang 0001, Xue Liu 0001. 3D Video Conferencing via On-Hand Devices |
| 911 | -- | 921 | Wenhui Li 0001, Chao Pang, Weizhi Nie, Hongshuo Tian, An-An Liu. Bidirectional Mask Selection for Zero-Shot Referring Image Segmentation |
| 922 | -- | 937 | Laijin Meng, Fan Li, Xinghao Jiang, Qiang Xu 0007. A Universal Framework for Improving the Robustness of Coverless Image Steganography Based on Image Restoration |
| 938 | -- | 952 | Jia-Run Du, Jia-Chang Feng, Kun-Yu Lin, Fa-Ting Hong, Zhongang Qi, Ying Shan, Jian-Fang Hu, Wei-Shi Zheng 0001. Weakly-Supervised Temporal Action Localization by Progressive Complementary Learning |
| 953 | -- | 966 | Kuiyuan Zhang, Zeming Hou, Zhongyun Hua, Yifeng Zheng, Leo Yu Zhang. Boosting Deepfake Detection Generalizability via Expansive Learning and Confidence Judgement |
| 967 | -- | 980 | Jing Lian, Zhenghao Wang, Dongfang Yang, Wen Zheng, Linhui Li, Yibin Zhang. Pedestrian Facial Attention Detection Using Deep Fusion and Multi-Modal Fusion Classifier |
| 981 | -- | 985 | Yaning Zhang, Yingqian Wang 0002, Tianhao Wu, Jungang Yang 0001, Wei An. Fixed Relative Pose Prior for Camera Array Self-Calibration |