| 8436 | -- | 8451 | Chenyu Wang, Shuo Yan, Yixuan Chen 0003, Xianwei Wang, Yujiang Wang 0001, Mingzhi Dong, Xiaochen Yang, Dongsheng Li 0002, Rui Zhu 0006, David A. Clifton, Robert P. Dick, Qin Lv, Fan Yang 0001, Tun Lu, Ning Gu, Li Shang. Denoising Reuse: Exploiting Inter-Frame Motion Consistency for Efficient Video Generation |
| 8452 | -- | 8465 | Qingsen Yan, Tao Hu, Peng Wu 0015, Duwei Dai, Shuhang Gu, Wei Dong 0010, Yanning Zhang 0001. Efficient Image Enhancement With a Diffusion-Based Frequency Prior |
| 8466 | -- | 8477 | Jiaming Liu, Linghe Kong, Yue Wu 0004, Maoguo Gong, Hao Li 0009, Qiguang Miao, Wenping Ma 0001, Can Qin. Triple Point Masking |
| 8478 | -- | 8494 | Jiajie Yu, Xing Lu, Lijun Guo, Chong Wang 0001, Guoqi Li, Jiangbo Qian. Event-Based Video Reconstruction Via Spatial-Temporal Heterogeneous Spiking Neural Network |
| 8495 | -- | 8507 | Bangzhen Liu, Yangyang Xu, Cheng Xu, Xuemiao Xu, Shengfeng He. Open-Set Mixed Domain Adaptation via Visual-Linguistic Focal Evolving |
| 8508 | -- | 8519 | Nuo Chen, Chushu Zhang, Wei An 0003, Longguang Wang, Miao Li, Qiang Ling 0002. Event-Based Motion Deblurring With Blur-Aware Reconstruction Filter |
| 8520 | -- | 8532 | Fang-Yi Liang, Yu-Wei Zhan, Jiale Liu, Chong-Yu Zhang, Zhen-Duo Chen 0001, Xin Luo 0006, Xin-Shun Xu. Class-Aware Prompting for Federated Few-Shot Class-Incremental Learning |
| 8533 | -- | 8544 | Zhigang Chen, Benjia Zhou, Yiqing Huang, Jun Wan 0001, Yibo Hu 0001, Hailin Shi, Yanyan Liang 0001, Zhen Lei 0001, Du Zhang. C²RL: Content and Context Representation Learning for Gloss-Free Sign Language Translation and Retrieval |
| 8545 | -- | 8558 | Pin-Hung Kuo, Jinshan Pan, Shao-Yi Chien, Ming-Hsuan Yang 0001. Efficient Non-Blind Image Deblurring With Discriminative Shrinkage Deep Networks |
| 8559 | -- | 8573 | Jun Chen 0013, He Wang, Zhifeng Hao, Zemin Cai, Ling Mei, Tianshu Liu. Flow Visualization for Complex Fluid Flows via a Structure-Enhanced Motion Estimator |
| 8574 | -- | 8585 | Keyi Zhou, Li Li 0040, Wengang Zhou 0001, Yonghui Wang, Hao Feng 0009, Houqiang Li. LaneTCA: Enhancing Video Lane Detection With Temporal Context Aggregation |
| 8586 | -- | 8597 | Zezeng Li, Zhihui Qi, Weimin Wang 0007, Ziliang Wang, Junyi Duan, Na Lei. Point2Quad: Generating Quad Meshes From Point Clouds via Face Prediction |
| 8598 | -- | 8613 | Shuze Geng, Yifan Liu, Zijin Wang, Gang Yan 0001, Yang Yu 0022, Yingchun Guo. Pose-Skeleton Guided Cross-Attention Representation Fusion for Occluded Pedestrian Re-Identification |
| 8614 | -- | 8626 | Pengfei Fang, Qiang Xu, Zixuan Lin, Hui Xue 0002. On Modulating Motion-Aware Visual-Language Representation for Few-Shot Action Recognition |
| 8627 | -- | 8638 | Quan Wan, Maofa Wang, Weifeng Shan, Bin Wang, Lu Zhang, Zhixiong Leng, Bingchen Yan, Yanlin Xu, Huiling Chen. Meta-Learning With Task-Adaptive Selection |
| 8639 | -- | 8652 | Tong Ning, Ke Lu 0002, Xirui Jiang, Hongjuan Pei, Jian Xue 0002. DinoQuery: Promoting Small 3D Object Detection With Textual Prompt |
| 8653 | -- | 8665 | Ping Li 0006, Xingchao Ye, Lingfeng He. Coarse-to-Fine Hypergraph Network for Spatiotemporal Action Detection |
| 8666 | -- | 8678 | Chao Qu, Zewei Chen, Jingyuan Zhang, Xiaoyu Chen 0003, Jing Han 0009. Self-BSR: Self-Supervised Image Denoising and Destriping Based on Blind-Spot Regularization |
| 8679 | -- | 8691 | Mianzhao Wang, Fan Shi 0001, Xu Cheng 0003, Shengyong Chen. Prior Knowledge-Driven Hybrid Prompter Learning for RGB-Event Tracking |
| 8692 | -- | 8706 | Wuzhen Shi, Zibang Xue, Yang Wen. Keypoints and Action Units Jointly Drive Talking Head Generation for Video Conferencing |
| 8707 | -- | 8722 | Kong Li, Zhe Dai, Hua Cui, Xuan Wang 0021, Huansheng Song. VRAR: Video-Radar Automatic Registration Method Based on Trajectory Spatiotemporal Features and Bidirectional Mapping |
| 8723 | -- | 8737 | Ying Zhang 0063, Puhong Duan, Lianhui Liang, Xudong Kang, Jun Li 0009, Antonio Plaza. PFS3F: Probabilistic Fusion of Superpixel-Wise and Semantic-Aware Structural Features for Hyperspectral Image Classification |
| 8738 | -- | 8753 | Lei Fan, Qi Yang 0002, Hongqiang Wang 0001, Yuliang Qin, Bin Deng 0002. Sequential Ground Moving Target Imaging Based on Hybrid ViSAR-ISAR Image Formation in Terahertz Band |
| 8754 | -- | 8766 | Kangdao Liu, Tianhao Sun, Hao Zeng 0005, Yongshan Zhang, Chi-Man Pun, Chi-Man Vong. Spatial-Aware Conformal Prediction for Trustworthy Hyperspectral Image Classification |
| 8767 | -- | 8779 | Dan Song 0006, Xinwei Fu, Ning Liu, Wei-Zhi Nie, Wenhui Li 0001, Lanjun Wang, You Yang, An-An Liu. MV-CLIP: Multi-View CLIP for Zero-Shot 3D Shape Recognition |
| 8780 | -- | 8793 | Xiao Cui, Weicai Ye, Yifan Wang, Guofeng Zhang 0001, Wengang Zhou 0001, Tong He 0001, Houqiang Li. StreetSurfGS: Scalable Urban Street Surface Reconstruction With Planar-Based Gaussian Splatting |
| 8794 | -- | 8806 | Yun Liu 0009, Sifan Li, Huiyu Duan, Yu Zhou 0009, Daoxin Fan, Guangtao Zhai. Multi-Task Guided No-Reference Omnidirectional Image Quality Assessment With Feature Interaction |
| 8807 | -- | 8818 | Qibo Qiu, Honghui Yang, Jian Jiang, Shun Zhang, Haochao Ying, Haiming Gao, Wenxiao Wang 0001, Xiaofei He 0001. M³CS: Multi-Target Masked Point Modeling With Learnable Codebook and Siamese Decoders |
| 8819 | -- | 8834 | Yishi Li, Fanhong Zeng, Rui Lai, Tong Wu, Juntao Guan, Anfu Zhu, Zhangming Zhu. TinyFusionDet: Hardware-Efficient LiDAR-Camera Fusion Framework for 3D Object Detection at Edge |
| 8835 | -- | 8848 | Tong Jin, Feng Lu, ShuYu Hu, Chun Yuan, Yunpeng Liu 0001. EDTformer: An Efficient Decoder Transformer for Visual Place Recognition |
| 8849 | -- | 8863 | Zhangdong Wang, Zhihuang Liu, Yuanjing Luo, Tongqing Zhou, Jiaohua Qin, Zhiping Cai. PPIDM: Privacy-Preserving Inference for Diffusion Model in the Cloud |
| 8864 | -- | 8879 | Yakun Ma, Xiuli Chai, Guoqiang Long, Zhihua Gan, Yushu Zhang 0001. TPE for JPEG Images With Dynamic M-Ary Decomposition and Adaptive Threshold Constraints |
| 8880 | -- | 8892 | Lizhi Xiong, Rui Ding, Ching-Nung Yang, Zhangjie Fu. Robust Secret Image Sharing Scheme Based on Polynomial k-Consistency |
| 8893 | -- | 8906 | Hongfei Xiao, Ying Yang 0019, Tao Xiang 0001. Visual Content Revealing From Perceptually Encrypted Images |
| 8907 | -- | 8916 | Jingyuan Jiang, Zichi Wang, Zihan Yuan, Xinpeng Zhang 0001. Generative Image Steganography Based on Text-to-Image Multimodal Generative Model |
| 8917 | -- | 8928 | Jingchao Cao, Shuai Zhang, Yutao Liu 0002, Feng Gao 0005, Ke Gu 0001, Guangtao Zhai, Junyu Dong, Sam Kwong. Multi-Scale Local and Global Feature Fusion for Blind Quality Assessment of Enhanced Images |
| 8929 | -- | 8944 | Huanjing Yue, Cong Cao 0005, Lei Liao, Jing-Yu Yang 0002. RViDeformer: Efficient Raw Video Denoising Transformer With a Larger Benchmark Dataset |
| 8945 | -- | 8957 | Chengxing Xie, Xiaoming Zhang 0008, Linze Li 0001, Yuqian Fu, Biao Gong, Tianrui Li 0001, Kai Zhang. MAT: Multi-Range Attention Transformer for Efficient Image Super-Resolution |
| 8958 | -- | 8972 | Jingchao Cao, Wangzhen Peng, Yutao Liu 0002, Junyu Dong, Patrick Le Callet, Sam Kwong. ERD: Encoder-Residual-Decoder Neural Network for Underwater Image Enhancement |
| 8973 | -- | 8988 | Jinbao Wei, Gang Yang, Wei Wei 0068, Aiping Liu, Xun Chen 0001. Multi-Contrast MRI Arbitrary-Scale Super-Resolution via Dynamic Implicit Network |
| 8989 | -- | 9002 | Patrik Patera, Yie-Tarng Chen, Wen-Hsien Fang. A Multi-Modal Architecture With Spatio-Temporal-Text Adaptation for Video-Based Traffic Accident Anticipation |
| 9003 | -- | 9016 | Yixin Qin, Lei Zhao 0017, Lianli Gao, Haonan Zhang, Pengpeng Zeng, Heng Tao Shen. Temporal-Guided Mixture-of-Experts for Zero-Shot Video Question Answering |
| 9017 | -- | 9029 | Chunzheng Zhu, Jialin Shao, Jianxin Lin, Yijun Wang 0002, Jing Wang 0113, Jinhui Tang 0001, Kenli Li 0001. fMRI2GES: Co-Speech Gesture Reconstruction From fMRI Signal With Dual Brain Decoding Alignment |
| 9030 | -- | 9045 | Yao Wu, Mingwei Xing, Yachao Zhang 0001, Yuan Xie 0006, Yanyun Qu. Fusion-Then-Distillation: Toward Cross-Modal Positive Distillation for Domain Adaptive 3D Semantic Segmentation |
| 9046 | -- | 9059 | Zhirui Gao, Renjiao Yi, Chenyang Zhu 0002, Ke Zhuang, Wei Chen 0009, Kai Xu 0004. Generic Objects as Pose Probes for Few-Shot View Synthesis |
| 9060 | -- | 9074 | Chengchao Huang, Feng Shao 0001, Hangwei Chen, Baoyang Mu, Long Xu 0001. GADFNet: Geometric Priors Assisted Dual-Projection Fusion Network for Monocular Panoramic Depth Estimation |
| 9075 | -- | 9089 | Ce Zhou, Qiang Ling 0001. Dual Geometry Learning and Adaptive Sparse Attention for Point Cloud Analysis |
| 9090 | -- | 9104 | Xuzhi Wang, Wei Feng 0005, Lingdong Kong, Liang Wan. NUC-Net: Non-Uniform Cylindrical Partition Network for Efficient LiDAR Semantic Segmentation |
| 9105 | -- | 9118 | Mingyue Cui, Yuyang Zhong, Mingjian Feng, Junhua Long, Yehua Ling, Jiahao Xu, Kai Huang 0001. GAEM: Graph-Driven Attention-Based Entropy Model for LiDAR Point Cloud Compression |
| 9119 | -- | 9131 | Wan Li, Xiao Pan, Jiaxin Lin, Ping Lu, Daquan Feng, Wenzhe Shi. FRPGS: Fast, Robust, and Photorealistic Monocular Dynamic Scene Reconstruction With Deformable 3D Gaussians |
| 9132 | -- | 9147 | Zhenjiang Du, Zhitao Liu, Guan Wang, Jiwei Wei, Sophyani Banaamwini Yussif, Zheng Wang 0044, Ning Xie 0003, Yang Yang 0002. CMNet: Cross-Modal Coarse-to-Fine Network for Point Cloud Completion Based on Patches |
| 9148 | -- | 9160 | Le Han, Kai-Xuan Chen 0001, Lei Zhao 0026, Yangbo Jiang, Pengfei Wang, Nenggan Zheng. Cross-Domain Animal Pose Estimation With Skeleton Anomaly-Aware Learning |
| 9161 | -- | 9174 | Xiao-dong Xie, Yu-Wei Zhan, Zhen-Xiang Ma, Hong-Mei Liu, Zhen-Duo Chen 0001, Xin Luo 0006, Xin-Shun Xu. Distributed Learning for Privacy-Preserving Semi-Supervised Video Anomaly Detection |
| 9175 | -- | 9189 | Chenghu Du, Junyin Wang, Kai Liu, Shengwu Xiong 0001. GLV: Geometric Correlation Distillation for Latent Diffusion-Enhanced Parser-Free Virtual Try-On |
| 9190 | -- | 9204 | Yuxuan Luo, Jinpeng Chen 0003, Runmin Cong, Horace Ho-Shing Ip, Sam Kwong. Concept-Level Semantic Transfer and Context-Level Distribution Modeling for Few-Shot Segmentation |
| 9205 | -- | 9217 | Ling-An Zeng, Gaojie Wu, Ancong Wu, Jian-Fang Hu, Wei-Shi Zheng 0001. Progressive Human Motion Generation Based on Text and Few Motion Frames |
| 9218 | -- | 9233 | Xintao Pang, Fengjuan Yao, YanMing Zhang, Yue Sun, Edmundo Patricio Lopes Lao, Chuan Lin 0003, Patrick Cheong-Iao Pang, Wei Wang 0181, Wei Li, Zhifan Gao, Tao Tan 0002. BLENet: A Bio-Inspired Lightweight and Efficient Network for Left Ventricle Segmentation in Echocardiography |
| 9234 | -- | 9245 | Chuang Yang 0003, Xu Han, Tao Han 0002, Han Han, Bingxuan Zhao, Qi Wang 0009. Edge Approximation Text Detector |
| 9246 | -- | 9260 | Jiaxin Chen, Jiawen Peng, Yanzuo Lu, Jian-Huang Lai, Andy J. Ma. Vision-Language Adaptive Clustering and Meta-Adaptation for Unsupervised Few-Shot Action Recognition |
| 9261 | -- | 9275 | Jiahao Huang, Xiaochen Yuan, Chan-Tong Lam, Sio Kei Im, Fangyuan Lei, Xiuli Bi. TransHFC: Joints Hypergraph Filtering Convolution and Transformer Framework for Temporal Forgery Localization |
| 9276 | -- | 9286 | Donghai Liao, Xiu Shu, Zhihui Li 0001, Qiao Liu 0001, Di Yuan 0002, Xiaojun Chang, Zhenyu He 0001. Fine-Grained Feature and Template Reconstruction for TIR Object Tracking |
| 9287 | -- | 9299 | ChengAo Zong, Jie Zhao 0014, Xin Chen 0032, Huchuan Lu, Dong Wang 0004. Learning Language Prompt for Vision-Language Tracking |
| 9300 | -- | 9311 | Liangtao Shi, Bineng Zhong, Qihua Liang, Xiantao Hu, Zhiyi Mo, Shuxiang Song 0001. Mamba Adapter: Efficient Multi-Modal Fusion for Vision-Language Tracking |
| 9312 | -- | 9323 | Simiao Lai, Chang Liu 0071, Jiawen Zhu, Ben Kang, Yang Liu 0066, Dong Wang 0004, Huchuan Lu. MambaVT: Spatio-Temporal Contextual Modeling for Robust RGB-T Tracking |
| 9324 | -- | 9336 | Tianxu Wu, Zhimeng Xin, Shiming Chen 0002, Yixiong Zou, Xinge You. Adversarial Feature Training for Few-Shot Object Detection |
| 9337 | -- | 9350 | Jinpeng Dong, Dingyi Yao, Yufeng Hu, Sanping Zhou, Nanning Zheng 0001. A Novel Dense Object Detector With Scale Balanced Sample Assignment and Refinement |
| 9351 | -- | 9364 | Juexiao Feng, Yuhong Yang 0008, Mengyao Lyu, Tianxiang Hao, Yi-Jie Huang, Yanchun Xie, Yaqian Li, Jungong Han, Liuyu Xiang, Guiguang Ding. Toward Realistic Hierarchical Object Detection: Problem, Benchmark, and Solution |
| 9365 | -- | 9383 | Jingchun Gao, Lei Zhang 0119, Jingyu Li, Zhendong Mao 0001. Fully Semantic Gap Recovery for End-to-End Image Captioning |
| 9384 | -- | 9397 | Yuqing Zhu, Yuan Gao 0015, Tianle Ding, Xiang Liu 0020, Wenfei Yang, Tianzhu Zhang 0001. Spatio-Temporal Pyramid Keypoint Detection With Event Cameras |
| 9398 | -- | 9413 | Nanhua Chen, Dongshuo Zhang, Kai Jiang, Meng Yu, Yeqing Zhu, Tai-Shan Lou, Liangyu Zhao. SHAA: Spatial Hybrid Attention Network With Adaptive Cross-Entropy Loss Function for UAV-View Geo-Localization |
| 9414 | -- | 9427 | Yingwen Zhang, Meng Wang 0017, Junru Li, Kai Zhang 0007, Li Zhang 0006, Shiqi Wang 0001. A Theoretical and Experimental Study for Dependent Learned Rate-Distortion Optimization |
| 9428 | -- | 9442 | Youneng Bao, Wen Tan, Chuanmin Jia, Mu Li 0005, Yongsheng Liang 0001, Yonghong Tian 0001. ShiftLIC: Lightweight Learned Image Compression With Spatial-Channel Shift Operations |
| 9443 | -- | 9459 | Maida Cao, Wenrui Dai, Shaohui Li, Chenglin Li, Junni Zou, Weisheng Hu, Hongkai Xiong. Generative Probabilistic Entropy Modeling With Conditional Diffusion for Learned Image Compression |
| 9460 | -- | 9474 | Panqi Jia, Fabian Brand, Dequan Yu, Alexander Karabutov, Elena Alshina, André Kaup. Overview of Variable Rate Coding in JPEG AI |
| 9475 | -- | 9486 | Yuxuan Wei, Zehan Wang, Tian Guo, Hao Liu 0044, Liquan Shen, Hui Yuan 0001. High Efficiency Wiener Filter-Based Point Cloud Quality Enhancement for MPEG G-PCC |
| 9487 | -- | 9501 | Yifei Xu, Zaiqiang Wu, Li Li, Siqi Li, Wenlong Li, Mingqi Li, Yuan Rao, ShuiGuang Deng. Hybrid Siamese Masked Autoencoders as Unsupervised Video Summarizer |
| 9502 | -- | 9516 | Pengzhe Wang, Lei Zhang 0119, Zhendong Mao 0001, Nenan Lyu, Yongdong Zhang 0001. Matryoshka Learning With Metric Transfer for Image-Text Matching |
| 9517 | -- | 9529 | Mingyang Lei, Jingfan Fan, Long Shao, Hong Song, Deqiang Xiao, Danni Ai, Tianyu Fu 0003, Yucong Lin, Ying Gu, Jian Yang 0009. Structured Light Image Planar-Topography Feature Decomposition for Generalizable 3D Shape Measurement |
| 9530 | -- | 9544 | Nishang Xie, Tao Zhang 0027, Lanyu Zhang, Jie Chen, Feiming Wei, Wenxian Yu. VLF-SAR: A Novel Vision-Language Framework for Few-Shot SAR Target Recognition |
| 9545 | -- | 9558 | Xiuxian Wang, Lanjun Wang, Yuting Su 0001, Hongshuo Tian, Guoqing Jin, An-An Liu. Few-Shot In-Context Learning for Implicit Semantic Multimodal Content Detection and Interpretation |