| 7803 | -- | 7819 | Kai Niu 0002, Yanyi Liu, Yuzhou Long, Yan Huang 0008, Liang Wang 0001, Yanning Zhang. An Overview of Text-Based Person Search: Recent Advances and Future Directions |
| 7820 | -- | 7829 | Guoyu Yang, Jie Lei 0002, Hao Tian, Zunlei Feng, Ronghua Liang. Asymptotic Feature Pyramid Network for Labeling Pixels and Regions |
| 7830 | -- | 7843 | Ke Song, Guoqiang Liang, Zhaojie Chen, Yanning Zhang. Non-Exemplar Class-Incremental Learning by Random Auxiliary Classes Augmentation and Mixed Features |
| 7844 | -- | 7855 | Wujie Zhou, Bitao Jian, Meixin Fang, Xiena Dong, Yuanyuan Liu 0004, Qiuping Jiang. DGPINet-KD: Deep Guided and Progressive Integration Network With Knowledge Distillation for RGB-D Indoor Scene Analysis |
| 7856 | -- | 7869 | Qiming Li, Jinghang Cheng, Yin Gao, Jun Li 0043. Learning Geometric Information via Transformer Network for Key-Points Based Motion Segmentation |
| 7870 | -- | 7881 | Yandong Bi, Huajie Jiang, Yongli Hu, Yanfeng Sun, Baocai Yin. Fair Attention Network for Robust Visual Question Answering |
| 7882 | -- | 7895 | Jian Sun, Hao Sun, Lin Lei, Kefeng Ji, Gangyao Kuang. TirSA: A Three Stage Approach for UAV-Satellite Cross-View Geo-Localization Based on Self-Supervised Feature Enhancement |
| 7896 | -- | 7911 | Yu Xue, Lai-Man Po, Wing Yin Yu, Haoxuan Wu, Xuyuan Xu, Kun Li, Yuyang Liu. Self-Calibration Flow Guided Denoising Diffusion Model for Human Pose Transfer |
| 7912 | -- | 7921 | Yi Zhang, Xiaotian Zhu. Attention-Based Layer Fusion and Token Masking for Weakly Supervised Semantic Segmentation |
| 7922 | -- | 7934 | Dongyue Li, Songlin Du. ContextMatcher: Detector-Free Feature Matching With Cross-Modality Context |
| 7935 | -- | 7946 | Daosong Hu, Kai Huang 0001. Semi-Supervised Multitask Learning Using Gaze Focus for Gaze Estimation |
| 7947 | -- | 7961 | Hao Feng, Keyi Zhou, Wengang Zhou, Yufei Yin, Jiajun Deng, Qi Sun, Houqiang Li. Recurrent Generic Contour-Based Instance Segmentation With Progressive Learning |
| 7962 | -- | 7974 | Yisheng Zhao, Huaiyu Zhu 0004, Ruohong Huan, Yaoqi Bao, Yun Pan. Heterogeneous Graph Network for Action Detection |
| 7975 | -- | 7985 | Sungjune Park, Hyunjun Kim, Yong Man Ro. Integrating Language-Derived Appearance Elements With Visual Cues in Pedestrian Detection |
| 7986 | -- | 7997 | Enhao Zhang, Chuanxing Geng, Chaohua Li, Songcan Chen. Dynamic Learnable Logit Adjustment for Long-Tailed Visual Recognition |
| 7998 | -- | 8012 | Fupeng Chu, Yang Cong, Ronghan Chen. OPEN: Occlusion-Invariant Perception Network for Single Image-Based 3D Shape Retrieval |
| 8013 | -- | 8025 | Xiao He, Mingrui Zhu, Nannan Wang 0001, Xinbo Gao 0001. Few-Shot Font Generation by Learning Style Difference and Similarity |
| 8026 | -- | 8040 | Tianhuan Huang, Xianye Ben, Chen Gong 0002, Wenzheng Xu, Qiang Wu 0001, Hongchao Zhou. GaitDAN: Cross-View Gait Recognition via Adversarial Domain Adaptation |
| 8041 | -- | 8052 | Yinan Wu 0001, Licheng Jiao, Xu Liu 0006, Fang Liu 0001, Shuyuan Yang, Lingling Li 0002. Domain Adaptation-Aware Transformer for Hyperspectral Object Tracking |
| 8053 | -- | 8066 | Shiyao Li, Zhenhua Zhu, Hanbo Sun, Xuefei Ning, Guohao Dai, Yiming Hu, Huazhong Yang, Yu Wang 0002. Toward High-Accuracy and Real-Time Two-Stage Small Object Detection on FPGA |
| 8067 | -- | 8079 | Fan Wan, Xingyu Miao, Haoran Duan, Jingjing Deng 0001, Rui Gao, Yang Long 0001. Sentinel-Guided Zero-Shot Learning: A Collaborative Paradigm Without Real Data Exposure |
| 8080 | -- | 8092 | Chenhao Wu, Qingbo Wu 0001, Rui Ma, King Ngi Ngan, Hongliang Li 0001, Fanman Meng, Heqian Qiu. Continual Cross-Domain Image Compression via Entropy Prior Guided Knowledge Distillation and Scalable Decoding |
| 8093 | -- | 8106 | Shuai Guo 0002, Qiuwen Wang, Yijie Gao, Rong Xie, Lin Li 0062, Fang Zhu, Li Song 0001. Depth-Guided Robust Point Cloud Fusion NeRF for Sparse Input Views |
| 8107 | -- | 8121 | Yufan Wang, Le Huang, Qunfei Zhao, Zeyang Xia, Ning Zhao. Hybrid Shape Deformation for Face Reconstruction in Aesthetic Orthodontics |
| 8122 | -- | 8134 | Shaojie Zhang, Jianqin Yin, Yonghao Dang, Jiajun Fu. SiT-MLP: A Simple MLP With Point-Wise Topology Feature Learning for Skeleton-Based Action Recognition |
| 8135 | -- | 8147 | Jiahe Zhu, Jinji Zheng, Xinyi Xia, Yifan Li, Zhiru Li, Xicai Li. IGM-MELv2: Infrared Guiding Modal Multiuser Eye Localization System on ARM CPU for Autostereoscopic Displays |
| 8148 | -- | 8160 | Anlei Zhu, YingHui Wang, Jinlong Yang 0002, Tao Yan, Haomiao Ma, Wei Li 0121. YOWOv3: A Lightweight Spatio-Temporal Joint Network for Video Action Detection |
| 8161 | -- | 8171 | Chang Liu 0071, Jie Zhao 0014, Chunjuan Bo, Shengming Li, Dong Wang 0004, Huchuan Lu. LGTrack: Exploiting Local and Global Properties for Robust Visual Tracking |
| 8172 | -- | 8187 | Zhaoqilin Yang, GaoYun An, ZhenXing Zheng, Shan Cao, Qiuqi Ruan. GBC: Guided Alignment and Adaptive Boosting CLIP Bridging Vision and Language for Robust Action Recognition |
| 8188 | -- | 8200 | Jie Wu, Leyuan Fang, Jun Yue. TAKD: Target-Aware Knowledge Distillation for Remote Sensing Scene Classification |
| 8201 | -- | 8214 | Wenlve Zhou, Zhiheng Zhou. Unsupervised Domain Adaption Harnessing Vision-Language Pre-Training |
| 8215 | -- | 8229 | Wenmin Huang, Weiqi Luo 0001, Xiaochun Cao, Jiwu Huang. Interactive Generative Adversarial Networks With High-Frequency Compensation for Facial Attribute Editing |
| 8230 | -- | 8241 | Yifei Qian, Xiaopeng Hong, Zhongliang Guo 0001, Ognjen Arandjelovic, Carl R. Donovan. Semi-Supervised Crowd Counting With Contextual Modeling: Facilitating Holistic Understanding of Crowd Scenes |
| 8242 | -- | 8252 | Zhimin Wei, Zhipeng Zhang, Peng Wu, Ji Wang, Peng Wang 0015, Yanning Zhang. Fine-Granularity Alignment for Text-Based Person Retrieval Via Semantics-Centric Visual Division |
| 8253 | -- | 8265 | Haoyuan Jin, Xuesong Nie, Yunfeng Yan, Xi Chen, Zhihang Zhu, Donglian Qi. AHOR: Online Multi-Object Tracking With Authenticity Hierarchizing and Occlusion Recovery |
| 8266 | -- | 8280 | Zhilin Zhang, Chengxiu Liu, Xiaoxu Wang, Ziyu Han, Guantai Yang, Cheng Wang, Panfeng Huang, Qianbo Lu. DLP-Fusion: Depth of Field, Light Source, and Polarization Fusion Toward Intelligent Optical Imaging for Complex Scenes |
| 8281 | -- | 8291 | Ye Huang, Di Kang, Shenghua Gao, Wen Li 0001, Lixin Duan. High-Level Feature Guided Decoding for Semantic Segmentation |
| 8292 | -- | 8309 | Tung Minh Tran, Doanh C. Bui, Tam V. Nguyen 0002, Khang Nguyen 0001. Transformer-Based Spatio-Temporal Unsupervised Traffic Anomaly Detection in Aerial Videos |
| 8310 | -- | 8326 | Xiaoqiang Zhu, Jiayu Zhou, Lihua You, Xiaosong Yang, Jian Chang, Jian-Jun Zhang 0001, Dan Zeng 0001. DFIE3D: 3D-Aware Disentangled Face Inversion and Editing via Facial-Contrastive Learning |
| 8327 | -- | 8342 | Zidong Liu, Jiasong Wu, Zeyu Shen, Xin Chen, Qianyu Wu, Zhiguo Gui, Lotfi Senhadji, Huazhong Shu. Improving End-to-End Sign Language Translation With Adaptive Video Representation Enhanced Transformer |
| 8343 | -- | 8354 | Yongzhe Yuan, Yue Wu 0004, Mingyu Yue, Maoguo Gong, Xiaolong Fan, Wenping Ma 0001, Qiguang Miao. Learning Discriminative Features via Multi-Hierarchical Mutual Information for Unsupervised Point Cloud Registration |
| 8355 | -- | 8367 | ATing Yin, Yaonan Wang 0001, Jianxu Mao, Hui Zhang 0023, Xiuyi Chen. Category-Contextual Relation Encoding Network for Few-Shot Object Detection |
| 8368 | -- | 8381 | Wen Wen, Mu Li 0005, Yiru Yao, Xiangjie Sui, Yabin Zhang 0002, Long Lan, Yuming Fang, Kede Ma. Perceptual Quality Assessment of Virtual Reality Videos in the Wild |
| 8382 | -- | 8397 | Guangning Xu, Michael K. Ng 0001, Yunming Ye, Xutao Li, Ge Song, Bowen Zhang 0005, Zhichao Huang. TLS-MWP: A Tensor-Based Long- and Short-Range Convolution for Multiple Weather Prediction |
| 8398 | -- | 8411 | Hao Liu, Lijun He, Miao Zhang, Fan Li 0003. VADiffusion: Compressed Domain Information Guided Conditional Diffusion for Video Anomaly Detection |
| 8412 | -- | 8426 | Guanyi Li, Junjie Zhang 0002, Enquan Yang, Haoran Jiang, Dan Zeng 0001. Multi-Level Information Fusion Network With Edge Information Injection for Single-Band Cloud Detection |
| 8427 | -- | 8441 | Yiwen Shan, Dong Hu, Zhi Wang 0015. A Novel Truncated Norm Regularization Method for Multi-Channel Color Image Denoising |
| 8442 | -- | 8455 | Tong Qiao, Hang Shao, Shichuang Xie, Ran Shi. Unsupervised Generative Fake Image Detector |
| 8456 | -- | 8468 | Sicheng Pan, Yingming Li. EBDNet: Integrating Optical Flow With Kernel Prediction for Burst Denoising |
| 8469 | -- | 8480 | Yong Wang, Pengbo Zhou, Guohua Geng, Li An, Kang Li, Ruoxue Li. Neighborhood Multi-Compound Transformer for Point Cloud Registration |
| 8481 | -- | 8493 | Yuting Yang 0008, Licheng Jiao, Xu Liu 0006, Lingling Li 0002, Fang Liu 0001, Shuyuan Yang, Xiangrong Zhang. Efficient LWPooling: Rethinking the Wavelet Pooling for Scene Parsing |
| 8494 | -- | 8508 | Xiaoxu Chen, Jingfan Tan, Tao Wang 0052, Kaihao Zhang, Wenhan Luo, Xiaochun Cao. Toward Real-World Blind Face Restoration With Generative Diffusion Prior |
| 8509 | -- | 8521 | Yu Wang 0073, Liquan Chen, Kunliang Yu, Tong Fu. A Secure Spatio-Temporal Chaotic Pseudorandom Generator for Image Encryption |
| 8522 | -- | 8535 | Xiao Wang, Yang Lu 0009, Wanchuan Yu, Yanwei Pang, Hanzi Wang. Few-Shot Action Recognition via Multi-View Representation Learning |
| 8536 | -- | 8546 | Zhidan Ran, Xuan Wei, Wei Liu, Xiaobo Lu. Multiscale Aligned Spatial-Temporal Interaction for Video-Based Person Re-Identification |
| 8547 | -- | 8561 | Lan Li 0005, Meiping Song, Qiang Zhang 0011, Yushuai Dong, Yulei Wang 0002, Qiangqiang Yuan. Local Extremum Constrained Total Variation Model for Natural and Hyperspectral Image Non-Blind Deblurring |
| 8562 | -- | 8575 | Yuzhen Niu, Rui Xu, Zhihua Lin, Wenxi Liu. STD-Net: Spatio-Temporal Decomposition Network for Video Demoiréing With Sparse Transformers |
| 8576 | -- | 8588 | Eunpil Park, Jaejun Yoo, Jae-Young Sim. Universal Dehazing via Haze Style Transfer |
| 8589 | -- | 8601 | Liangliang Song, Zhixi Feng, Shuyuan Yang, Xinyu Zhang, Licheng Jiao. Interactive Spectral-Spatial Transformer for Hyperspectral Image Classification |
| 8602 | -- | 8613 | Meiqi Wu, Kaiqi Huang, Yuanqiang Cai, Shiyu Hu, YuZhong Zhao, Weiqiang Wang. Finger in Camera Speaks Everything: Unconstrained Air-Writing for Real-World |
| 8614 | -- | 8628 | Haoran Wei, Qingbo Wu 0001, Chenhao Wu, King Ngi Ngan, Hongliang Li 0001, Fanman Meng, Heqian Qiu. Robust Unpaired Image Dehazing via Adversarial Deformation Constraint |
| 8629 | -- | 8643 | Yurong Chen 0003, Yaonan Wang 0001, Hui Zhang 0023. Prior Images Guided Generative Autoencoder Model for Dual-Camera Compressive Spectral Imaging |
| 8644 | -- | 8656 | Kaijie He, Jun Xie 0003, Xinguang Dai, Kenglun Chang, Feng Chen 0044, Zhepeng Wang 0002. STADet: Streaming Timing-Aware Video Lane Detection |
| 8657 | -- | 8671 | Sanaz Nami, Farhad Pakdaman, Mahmoud Reza Hashemi, Shervin Shirmohammadi, Moncef Gabbouj. Lightweight Multitask Learning for Robust JND Prediction Using Latent Space and Reconstructed Frames |
| 8672 | -- | 8683 | Wenyan Pan, Wentao Ma, Shan Zhao 0002, Lichuan Gu, Guolong Shi, Zhihua Xia, Meng Wang 0001. Image Manipulation Detection With Cascade Hierarchical Graph Representation |
| 8684 | -- | 8697 | Rongqin Liang, Yuanman Li, Jiantao Zhou 0001, Xia Li 0006. Text-Driven Traffic Anomaly Detection With Temporal High-Frequency Modeling in Driving Videos |
| 8698 | -- | 8709 | Aihua Zheng, Fan Yuan, Haichuan Zhang, Jiaxiang Wang 0001, Chao Tang, Chenglong Li 0002. Public-Private Attributes-Based Variational Adversarial Network for Audio-Visual Cross-Modal Matching |
| 8710 | -- | 8721 | Jiayuan Xie, Jiali Chen, Zhenghao Liu, Yi Cai 0001, Qingbao Huang, Qing Li 0001. Video Question Generation for Dynamic Changes |
| 8722 | -- | 8735 | Yabin Zhu, Chenglong Li 0002, Xiao Wang 0014, Jin Tang 0001, Zhixiang Huang. RGBT Tracking via Progressive Fusion Transformer With Dynamically Guided Learning |
| 8736 | -- | 8752 | Yihui Fan, Xin Jin 0002, Siyao Zhou 0003, Shun Zou. Light Fields Stitching for Windowed-6DoF VR Content |
| 8753 | -- | 8765 | Yingxue Xu, Guihua Wen, Yang Hu, Pei Yang 0001. Modeling Hierarchical Structural Distance for Unsupervised Domain Adaptation |
| 8766 | -- | 8778 | Huimin Ma, Siwei Wang 0001, Junpu Zhang, Shengju Yu, Suyuan Liu, Xinwang Liu 0002, Kunlun He. Symmetric Multi-View Subspace Clustering With Automatic Neighbor Discovery |
| 8779 | -- | 8793 | Wei Huang, Zhiliang Peng, Li Dong 0004, Furu Wei, Qixiang Ye, Jianbin Jiao. Generic-to-Specific Distillation of Masked Autoencoders |
| 8794 | -- | 8807 | Xinju Wu, Pingping Zhang, Meng Wang 0017, Peilin Chen, Shiqi Wang 0001, Sam Kwong. Geometric Prior Based Deep Human Point Cloud Geometry Compression |
| 8808 | -- | 8820 | Xiaohan Fang, Peilin Chen, Meng Wang 0017, Xi Xie, Shiqi Wang 0001, Shanshe Wang, Siwei Ma. Exploiting Bidirectional Quality Impulse for Reference Picture Resampled Gaming Video Coding |
| 8821 | -- | 8835 | Chen Zhu, Guo Lu, Huanbang Chen, Donghui Feng 0003, Shen Wang, Yan Zhao, Rong Xie, Li Song 0001. A Character Position-Aware Compression Framework for Screen Text Image |
| 8836 | -- | 8847 | Hadi Amirpour, Klaus Schoeffmann, Mohammad Ghanbari 0001, Christian Timmerer. DeepVCA: Deep Video Complexity Analyzer |
| 8848 | -- | 8861 | Chunhui Yang, Jiayu Yang, Yongqi Zhai, Ronggang Wang. FICNet: An End to End Network for Free-View Image Coding |
| 8862 | -- | 8880 | Liying Gao, Bingliang Jiao, Yuzhou Long, Kai Niu 0002, He Huang, Peng Wang 0015, Yanning Zhang. Contrastive Pedestrian Attentive and Correlation Learning Network for Occluded Person Re-Identification |
| 8881 | -- | 8895 | Kaixiang Chen, Pengfei Fang, Zi Ye, Liyan Zhang 0001. Multi-Scale Explicit Matching and Mutual Subject Teacher Learning for Generalizable Person Re-Identification |
| 8896 | -- | 8911 | Ruomei Wang 0001, Jiawei Feng, Fuwei Zhang, Xiaonan Luo, Yuanmao Luo. Modality-Aware Heterogeneous Graph for Joint Video Moment Retrieval and Highlight Detection |
| 8912 | -- | 8923 | Yuzhe Fu, Changchun Zhou, Tianling Huang, Eryi Han, Yifan He, Hailong Jiao. SoftAct: A High-Precision Softmax Architecture for Transformers Supporting Nonlinear Functions |
| 8924 | -- | 8938 | Mingyue Niu, Ya Li, Jianhua Tao 0001, Xiuzhuang Zhou, Björn W. Schuller. DepressionMLP: A Multi-Layer Perceptron Architecture for Automatic Depression Level Prediction via Facial Keypoints and Action Units |
| 8939 | -- | 8952 | Xianglong Wang, Eric Rigall, Xifeng An, Zhihao Li, Qing Cai, Shu Zhang 0002, Junyu Dong. A New Benchmark and Low Computational Cost Localization Method for Cephalometric Analysis |
| 8953 | -- | 8965 | Zhaojie Chu, Kailing Guo, Xiaofen Xing, Yilin Lan, Bolun Cai, Xiangmin Xu. CorrTalk: Correlation Between Hierarchical Speech and Facial Activity Variances for 3D Animation |
| 8966 | -- | 8971 | Qinghai Zheng. Flexible and Parameter-Free Graph Learning for Multi-View Spectral Clustering |
| 8972 | -- | 8977 | Dengyong Zhang, Jiahao Chen, Xin Liao, Feng Li, Jiaxin Chen, Gaobo Yang. Face Forgery Detection via Multi-Feature Fusion and Local Enhancement |
| 8978 | -- | 8982 | Binzhe Li, Bolin Chen, Zhao Wang 0004, Shiqi Wang 0001, Yan Ye. Semantic Face Compression for Metaverse: A Compact 3D Descriptor Based Approach |