| 10524 | -- | 10537 | Ancong Wu, Chengzhi Lin, Wei-Shi Zheng 0001. Asymmetric Mutual Learning for Unsupervised Transferable Visible-Infrared Re-Identification |
| 10538 | -- | 10550 | Kan Guo, Daxin Tian, Yongli Hu, Chunmian Lin, Yanfeng Sun, Jianshan Zhou, Xuting Duan, Junbin Gao, Baocai Yin. CFMMC-Align: Coarse-Fine Multi-Modal Contrastive Alignment Network for Traffic Event Video Question Answering |
| 10551 | -- | 10563 | Jiawen Zhu, Xin Chen 0032, Pengyu Zhang, Xinying Wang 0005, Dong Wang 0004, Wenda Zhao, Huchuan Lu. SRRT: Exploring Search Region Regulation for Visual Object Tracking |
| 10564 | -- | 10577 | Wei Yao, Hongwen Zhang 0001, Yunlian Sun, Jinhui Tang 0001. STAF: 3D Human Mesh Recovery From Video With Spatio-Temporal Alignment Fusion |
| 10578 | -- | 10589 | Jinhua Hu, Yonghong Hou, Zihui Guo, Jiajun Gao. Global and Local Contrastive Learning for Self-Supervised Skeleton-Based Action Recognition |
| 10590 | -- | 10603 | Qing Wang, Xulun Ye, Nongxiao Wang. Learning Low-Rank Representation Approximation for Few-Shot Deep Subspace Clustering |
| 10604 | -- | 10617 | Chengrui Wei, Meng Yang 0002, Lei He, Nanning Zheng 0001. FS-Depth: Focal-and-Scale Depth Estimation From a Single Image in Unseen Indoor Scene |
| 10618 | -- | 10631 | Qingqing Yan, Shu Li, Zongtao He, Mengxian Hu, Chengju Liu, Qijun Chen. DR-Block: Convolutional Dense Reparameterization for CNN Generalization Free Improvement |
| 10632 | -- | 10645 | Xuze Hao, Xuhao Jiang, Wenqian Ni, Weimin Tan, Bo Yan 0001. Prompt-Guided Semantic-Aware Distillation for Weakly Supervised Incremental Semantic Segmentation |
| 10646 | -- | 10661 | Yinsai Guo, Hang Yu 0006, Liyan Ma, Xiangfeng Luo, Shaorong Xie. DIE-CDK: A Discriminative Information Enhancement Method With Cross-Modal Domain Knowledge for Fine-Grained Ship Detection |
| 10662 | -- | 10677 | Yi Luo, Feng Shao 0001, Baoyang Mu, Hangwei Chen, Zhuo Li, Qiuping Jiang. Dynamic Weighted Fusion and Progressive Refinement Network for Visible-Depth-Thermal Salient Object Detection |
| 10678 | -- | 10691 | Wenkang Shan, Yuhuai Zhang, Xinfeng Zhang 0001, Shanshe Wang, Xilong Zhou, Siwei Ma, Wen Gao 0001. Diffusion-Based Hypotheses Generation and Joint-Level Hypotheses Aggregation for 3D Human Pose Estimation |
| 10692 | -- | 10703 | Yicheng Lin, Yunlong Jiang, Xujia Jiao, Bin Han 0010. Learned Good Features to Track |
| 10704 | -- | 10717 | Hanlin Guo, Guobao Xiao, Lumei Su, Tianyou Li, Da-Han Wang, Hanzi Wang. Second-Order Proximity Guided Sampling Consensus for Robust Model Fitting |
| 10718 | -- | 10731 | Xingyu Zhu, Xiangbo Shu, Jinhui Tang 0001. Motion-Aware Mask Feature Reconstruction for Skeleton-Based Action Recognition |
| 10732 | -- | 10742 | Jie Zhu, Bo Peng 0007, Bingzheng Liu, Qingming Huang, Jianjun Lei. Self-Constructing Stereo Correspondences for Unsupervised Multi-View Stereo |
| 10743 | -- | 10752 | Zhuoran Xie, Miao Yang, Mengjiao Shen, Yuquan Qiu, Xinyu Wang. FIOD-VUE: Focusing on Invariant Information in Object Detection of Varying Underwater Environment |
| 10753 | -- | 10763 | Chao Zheng, Li Liu, Yu Meng, Xiaorui Peng, Meijun Wang. Few-Shot Point Cloud Semantic Segmentation via Support-Query Feature Interaction |
| 10764 | -- | 10778 | Yalong Jiang, Changkang Li, Wenrui Ding, Jinzhi Xiang, Zheru Chi. Reasonable Anomaly Detection Based on Long-Term Sequence Modeling |
| 10779 | -- | 10792 | Ruiqiu Wang, Tao Su, Dan Xu 0007, Jianlai Chen, Yuan Liang. MIGA-Net: Multi-View Image Information Learning Based on Graph Attention Network for SAR Target Recognition |
| 10793 | -- | 10804 | Weichao Zhao, Hezhen Hu, Wengang Zhou, Yunyao Mao, Min Wang 0019, Houqiang Li. MASA: Motion-Aware Masked Autoencoder With Semantic Alignment for Sign Language Recognition |
| 10805 | -- | 10816 | Jin Liu 0018, Jialong Xie, Fengyu Zhou, Shengfeng He. Question Type-Aware Debiasing for Test-Time Visual Question Answering Model Adaptation |
| 10817 | -- | 10830 | Xuanyu Zhang, Bin Chen, Wenzhen Zou, Shuai Liu, Yongbing Zhang, Ruiqin Xiong, Jian Zhang 0018. Progressive Content-Aware Coded Hyperspectral Snapshot Compressive Imaging |
| 10831 | -- | 10844 | Xiaoqiang Zhou, Chaoyou Fu, Huaibo Huang, Ran He 0001. Dynamic Graph Memory Bank for Video Inpainting |
| 10845 | -- | 10859 | Yuanliang Xue, Guodong Jin, Tao Shen, Lining Tan, Nian Wang, Jing Gao, Lianfeng Wang. Consistent Representation Mining for Multi-Drone Single Object Tracking |
| 10860 | -- | 10873 | Tao Yan 0001, Xiangjie Zhu, Xianglong Chen, Weijiang He, Chenglong Wang, Yang Yang 0046, YingHui Wang, Xiaojun Chang. GLGFN: Global-Local Grafting Fusion Network for High-Resolution Image Deraining |
| 10874 | -- | 10887 | Mingjin He, Bingwen Feng, Yizhi Guo, Jian Weng 0001, Wei Lu 0001. Camera-Shooting Resilient Watermarking on Image Instance Level |
| 10888 | -- | 10902 | Lin He, Bingwen Feng, Zecheng Peng, Bing Chen 0004, Zhihua Xia, Wei Lu 0001. Removing Hidden Information by Geometrical Perturbation in Frequency Domain |
| 10903 | -- | 10916 | Dongjia Zhao, Lei Qi 0001, Xiao Shi, Yinghuan Shi, Xin Geng 0001. A Novel Cross-Perturbation for Single Domain Generalization |
| 10917 | -- | 10929 | Preeti Meena, Himanshu Kumar, Sandeep Kumar Yadav. A Volumetric Saliency Guided Image Summarization for RGB-D Indoor Scene Classification |
| 10930 | -- | 10943 | Weidong Zhang 0007, Qingmin Liu, Yikun Feng, Lei Cai, Peixian Zhuang. Underwater Image Enhancement via Principal Component Fusion of Foreground and Background |
| 10944 | -- | 10958 | Di Wang 0018, Jinyuan Liu 0001, Long Ma 0002, Risheng Liu, Xin Fan 0001. Improving Misaligned Multi-Modality Image Fusion With One-Stage Progressive Dense Registration |
| 10959 | -- | 10971 | Yong Wu, Guang Chen 0001, Linwei Ye, Yuanning Jia, Zhi Liu 0003, Yang Wang 0003. TTAGaze: Self-Supervised Test-Time Adaptation for Personalized Gaze Estimation |
| 10972 | -- | 10986 | Zhaobo Qi, Yibo Yuan, Xiaowen Ruan, Shuhui Wang, Weigang Zhang, Qingming Huang. Collaborative Debias Strategy for Temporal Sentence Grounding in Video |
| 10987 | -- | 10999 | Lizhi Xiong, Rui Ding, Ching-Nung Yang, Zhangjie Fu. Invertible Secret Image Sharing With Authentication for Embedding Color Palette Image Into True Color Image |
| 11000 | -- | 11012 | Asif Raza, Bang Yang, Yuexian Zou. Zero-Shot Temporal Action Detection by Learning Multimodal Prompts and Text-Enhanced Actionness |
| 11013 | -- | 11025 | Chunyan Wang, Dong Zhang, Rui Yan. Boosting Weakly-Supervised Image Segmentation via Representation, Transform, and Compensator |
| 11026 | -- | 11039 | Duo Qiu, Bei Yang, Xiongjun Zhang. Robust Tensor Completion via Dictionary Learning and Generalized Nonconvex Regularization for Visual Data Recovery |
| 11040 | -- | 11055 | Qingxin Sheng, Chong Fu, Ming Tie, Xingwei Wang 0001, Junxin Chen 0001, Chiu-Wing Sham. A Chaos-Based Tunable Selective Encryption Algorithm for H.265/HEVC With Semantic Understanding |
| 11056 | -- | 11069 | Meng Liu 0006, Da Li, Yongqiang Li, Xuemeng Song, Liqiang Nie. Audio-Semantic Enhanced Pose-Driven Talking Head Generation |
| 11070 | -- | 11085 | Yuxin Feng, Zhuo Su 0001, Long Ma 0002, Xin Li, Risheng Liu, Fan Zhou 0001. Bridging the Gap Between Haze Scenarios: A Unified Image Dehazing Model |
| 11086 | -- | 11100 | Linfei Wang, Yibing Zhan, Long Lan, Xu Lin, Dapeng Tao, Xinbo Gao 0001. DeIoU: Toward Distinguishable Box Prediction in Densely Packed Object Detection |
| 11101 | -- | 11114 | Rui Guo, Linbin Wang, Chencheng Zhang, Lian Gu, Dianyou Li, Xiaohua Qian. A Causality-Informed Graph Convolutional Network for Video Assessment of Parkinsonian Leg Agility |
| 11115 | -- | 11127 | Bokang Wang, Qian Ning, Fangfang Wu, Xin Li 0005, Weisheng Dong, Guangming Shi. Uncertainty Modeling of the Transmission Map for Single Image Dehazing |
| 11128 | -- | 11141 | Junhui Li, Xingsong Hou. The Design of an Adaptive Enhanced AMP-Based Image Block Compressed Sensing and Its Application to Image Encryption |
| 11142 | -- | 11155 | Chen Yang, Guorong Li, Shuhui Wang, Li Su 0003, Laiyun Qing, Qingming Huang. SpikeODE: Image Reconstruction for Spike Camera With Neural Ordinary Differential Equation |
| 11156 | -- | 11168 | Yi-Chen Chen, Wei-Ta Chu. Positive and Negative Set Designs in Contrastive Feature Learning for Temporal Action Segmentation |
| 11169 | -- | 11183 | Bolin Ni, Xing Nie, Chenghao Zhang, Shixiong Xu, Xin Zhang 0093, Gaofeng Meng, Shiming Xiang. MoBoo: Memory-Boosted Vision Transformer for Class-Incremental Learning |
| 11184 | -- | 11197 | Xiaogang Song 0001, Pengfei Zhang, Xiaofeng Lu, Xinhong Hei 0001, Rongrong Liu. A Universal Multi-View Guided Network for Salient Object and Camouflaged Object Detection |
| 11198 | -- | 11213 | Yuqiao Zeng, Tengfei Liang, Yi Jin 0001, Yidong Li. MMI-Det: Exploring Multi-Modal Integration for Visible and Infrared Object Detection |
| 11214 | -- | 11228 | Yike Liu, Haipeng Li 0001, Shuaicheng Liu, Bing Zeng. CodingHomo: Bootstrapping Deep Homography With Video Coding |
| 11229 | -- | 11240 | Qinghua Sheng, Hongzhao Chen, Changcai Lai, Xiaofang Huang, Yuanyuan Liu, Xiaofeng Huang, Haibing Yin. Fast Linear Equation Solving Algorithm and its Pipelined Hardware Architecture Design for VVC Affine Motion Estimation |
| 11241 | -- | 11255 | Jian Wang, Qiang Ling. FDNet: Frequency Decomposition Network for Learned Image Compression |
| 11256 | -- | 11270 | Guohao Xu, Leilei Huang, Zhijian Hao, Wei Li, Shiyan Yi, Xiaoyang Zeng, Yibo Fan. A High Compression Efficiency Hardware Encoder for Intra and Inter Coding With 4K@30fps Throughput |
| 11271 | -- | 11285 | Siyu Zhou, Fuwei Zhang, Ruomei Wang 0001, Fan Zhou 0001, Zhuo Su 0001. Subtask Prior-Driven Optimized Mechanism on Joint Video Moment Retrieval and Highlight Detection |
| 11286 | -- | 11298 | Ming Jin, Wenbo Hu 0001, Lei Zhu 0002, Xiang Wang 0010, Richang Hong. Based on Spatial and Temporal Implicit Semantic Relational Inference for Cross-Modal Retrieval |
| 11299 | -- | 11312 | Linshan Hou, Zhongyun Hua, Yuhong Li, Yifeng Zheng, Leo Yu Zhang. M-to-N Backdoor Paradigm: A Multi-Trigger and Multi-Target Attack to Deep Learning Models |
| 11316 | -- | 11339 | Wang Xia, Guodao Sun, Tong Li, Baofeng Chang, Jingwei Tang, Gefei Zhang, Ronghua Liang. Video Visualization and Visual Analytics: A Task-Based and Application-Driven Investigation |
| 11340 | -- | 11359 | Zehai Niu, Ke Lu 0002, Jian Xue, Xiaoyu Qin, Jinbao Wang, Ling Shao 0001. From Methods to Applications: A Review of Deep 3D Human Motion Capture |
| 11360 | -- | 11372 | Jianhan Qi, Yuheng Jia, Hui Liu 0032, Junhui Hou. Superpixel Graph Contrastive Clustering With Semantic-Invariant Augmentations for Hyperspectral Images |
| 11373 | -- | 11385 | Bo Miao, Mohammed Bennamoun, Yongsheng Gao 0001, Mubarak Shah, Ajmal Mian. Temporally Consistent Referring Video Object Segmentation With Hybrid Memory |
| 11386 | -- | 11399 | Yan Liu 0043, Qingyong Hu, Yulan Guo. BSTS: A Weakly-Supervised Method for Semantic Learning of 3D Point Clouds |
| 11400 | -- | 11412 | Yuxi Liu, Guibo Luo, Zhenyu Weng, Yuesheng Zhu. Adaptive Face Recognition for Multi-Type Occlusions |
| 11413 | -- | 11422 | Junran Ding, Yunxiang He, Binzhe Yuan, Zhechen Yuan, Pingqiang Zhou, Jingyi Yu, Xin Lou. Ray Reordering for Hardware-Accelerated Neural Volume Rendering |
| 11423 | -- | 11437 | Yichen Guo, Mai Xu, Lai Jiang, Xin Deng 0002, Jing Zhou, Gaoxing Chen, Leonid Sigal. Proposal With Alignment: A Bi-Directional Transformer for 360° Video Viewport Proposal |
| 11438 | -- | 11450 | Weijia Liu, Shaoming Zhang, Yan Tang, Zhong Wang, Jianmei Wang. Style Reconstruction-Driven Networks for Occlusion-Aware License Plate Recognition |
| 11451 | -- | 11463 | De Cheng, Yuxin Zhao, Nannan Wang 0001, Guozhang Li, Dingwen Zhang, Xinbo Gao 0001. Efficient Statistical Sampling Adaptation for Exemplar-Free Class Incremental Learning |
| 11464 | -- | 11477 | Xingyu Tong, Yang Xiao 0007, Bo Tan, Jianyu Yang 0002, Zhiguo Cao 0001, Joey Tianyi Zhou, Junsong Yuan 0001. You Will Never Walk Alone: One-Shot 3D Action Recognition With Point Cloud Sequence |
| 11478 | -- | 11492 | Yi Shi, Long Qin, Shixuan Zhao 0001, Kaifu Yang, Yuyong Cui, Hongmei Yan. Weakly Supervised Fixated Object Detection in Traffic Videos Based on Driver's Selective Attention Mechanism |
| 11493 | -- | 11506 | Peng Zhao, Xiaoming Xi, Qiangchang Wang, Yilong Yin. Characterizing Hierarchical Semantic-Aware Parts With Transformers for Generalized Zero-Shot Learning |
| 11507 | -- | 11520 | Ruilin Yao, Yi Rong, Qiangqiang Huang, Shengwu Xiong. CTOD: Cross-Attentive Task-Alignment for One-Stage Object Detection |
| 11521 | -- | 11534 | Yizhu Zhang, Jingang Shi, Jiayin Wang, Yuan Zong, Wenming Zheng, Guoying Zhao 0001. MaskFusionNet: A Dual-Stream Fusion Model With Masked Pre-Training Mechanism for rPPG Measurement |
| 11535 | -- | 11550 | Sheng Yan, Mengyuan Liu, Yong Wang, Yang Liu 0264, Hong Liu 0008. MLP: Motion Label Prior for Temporal Sentence Localization in Untrimmed 3D Human Motions |
| 11551 | -- | 11563 | Xi Yang 0011, Menghui Tian, Nannan Wang 0001, Xinbo Gao 0001. Unleashing the Feature Hierarchy Potential: An Efficient Tri-Hybrid Person Search Model |
| 11564 | -- | 11578 | Haihong Xiao, Ying He 0001, Hao Liu, Wenxiong Kang, Yuqiong Li. Point Cloud Completion via Self-Projected View Augmentation and Implicit Field Constraint |
| 11579 | -- | 11591 | Xin Liu, Jiamin Wu, Wenfei Yang, Xu Zhou, Tianzhu Zhang. Multi-Modal Attribute Prompting for Vision-Language Models |
| 11592 | -- | 11604 | Yue Wang 0038, Lu Zhang 0053, Pingping Zhang, Yunzhi Zhuge, Junfeng Wu, Hong Yu, Huchuan Lu. Learning Local-Global Representation for Scribble-Based RGB-D Salient Object Detection via Transformer |
| 11605 | -- | 11618 | Congqi Cao, Ze Sun, Qinyi Lv, Lingtong Min, Yanning Zhang 0001. VS-TransGRU: A Novel Transformer-GRU-Based Framework Enhanced by Visual-Semantic Fusion for Egocentric Action Anticipation |
| 11619 | -- | 11629 | Xuxiang Sun 0001, Gong Cheng 0003, Hongda Li, Hongyu Peng, Junwei Han. Task-Specific Importance-Awareness Matters: On Targeted Attacks Against Object Detection |
| 11630 | -- | 11643 | Haolin Du, Jingfei He, YuanQing Zhao. CCR: A Counterfactual Causal Reasoning-Based Method for Cross-View Geo-Localization |
| 11644 | -- | 11656 | Zhuming Wang, Zun Li 0001, Xianglong Lang, Yihao Zheng, Meng Tian, Lifang Wu, Liang Wang 0001, Changwen Chen. Knowledge Augmented Relation Inference for Group Activity Recognition |
| 11657 | -- | 11667 | Ning Xu 0003, Tingting Zhang, Hongshuo Tian, An-An Liu. Rule-Driven News Captioning |
| 11668 | -- | 11681 | Jian Wang, Tianhong Dai, Xinqiao Zhao, Ángel F. García-Fernández, Eng Gee Lim, Jimin Xiao. Class Activation Map Calibration for Weakly Supervised Semantic Segmentation |
| 11682 | -- | 11694 | Jian Zhu 0006, Hanli Wang, Miaojing Shi. Multi-Modal Large Language Model Enhanced Pseudo 3D Perception Framework for Visual Commonsense Reasoning |
| 11695 | -- | 11708 | Zongyi Li, Yuxuan Shi, Hefei Ling, Jiazhong Chen, Runsheng Wang, Chengxin Zhao, Qian Wang 0001, Shijuan Huang. Knowledge Consistency Distillation for Weakly Supervised One Step Person Search |
| 11709 | -- | 11723 | Ziheng Jia, Xiongkuo Min, Wei Sun 0029, Guangtao Zhai. Continuous and Overall Quality of Experience Evaluation for Streaming Video Based on Rich Features Exploration and Dual-Stage Attention |
| 11724 | -- | 11738 | Liqing Gao, Fan Lyu, Peng Shi, Lei Zhu 0003, Junfu Pu, Liang Wang 0001, Wei Feng 0005. Overcoming Modality Bias in Question-Driven Sign Language Video Translation |
| 11739 | -- | 11750 | Xu Yin, Woobin Im, Dongbo Min, Yuchi Huo, Fei Pan, Sung-Eui Yoon. Fine-Grained Background Representation for Weakly Supervised Semantic Segmentation |
| 11751 | -- | 11767 | Shuyuan Wang, Qi Li 0005, Huiyuan Luo, Chengkan Lv, Zhengtao Zhang. Produce Once, Utilize Twice for Anomaly Detection |
| 11768 | -- | 11782 | Ling Lin 0002, Tao Wang, Hao Liu 0019, Congcong Zhu, Jingrun Chen. Toward Quantifiable Face Age Transformation Under Attribute Unbias |
| 11783 | -- | 11797 | Shaocong Long, Qianyu Zhou 0001, Chenhao Ying, Lizhuang Ma, Yuan Luo 0003. Rethinking Domain Generalization: Discriminability and Generalizability |
| 11798 | -- | 11809 | Xie Yang, Yuke Wang, Fangjun Huang. CNN-Based Reversible Data Hiding for JPEG Images |
| 11810 | -- | 11824 | Quan Chen, Tingyu Wang, Zihao Yang, Haoran Li, Rongfeng Lu, Yaoqi Sun, Bolun Zheng, Chenggang Yan 0001. SDPL: Shifting-Dense Partition Learning for UAV-View Geo-Localization |
| 11825 | -- | 11837 | Qianzi Yu, Kai Zhu 0004, Yang Cao 0010, Feijie Xia, Yu Kang 0001. TF²: Few-Shot Text-Free Training-Free Defect Image Generation for Industrial Anomaly Inspection |
| 11838 | -- | 11850 | Yichen Chi, Junhao Gu, Jiamiao Zhang, Wenming Yang, Yapeng Tian. EgoVSR: Toward High-Quality Egocentric Video Super-Resolution |
| 11851 | -- | 11873 | Junge Peng, Bing Luo, Li Xu, Jun Yang, Chao Zhang 0072, Zheng Pei 0001. Blind Image Deblurring via Minimizing Similarity Between Fuzzy Sets on Image Pixels |
| 11874 | -- | 11885 | Yongxian Wei, Zixuan Hu, Li Shen 0008, Zhenyi Wang, Lei Li, Yu Li, Chun Yuan. Meta-Learning Without Data via Unconditional Diffusion Models |
| 11886 | -- | 11899 | Haoliang Zhou, Shucheng Huang, Feifei Zhang, Changsheng Xu. CEPrompt: Cross-Modal Emotion-Aware Prompting for Facial Expression Recognition |
| 11900 | -- | 11913 | Liya Wang, Haipeng Chen 0002, Yu Liu 0004, Yingda Lyu. Regular Constrained Multimodal Fusion for Image Captioning |
| 11914 | -- | 11928 | Chen Feng, Georgios Tzimiropoulos, Ioannis Patras. NoiseBox: Toward More Efficient and Effective Learning With Noisy Labels |
| 11929 | -- | 11941 | Hongjun Wu 0003, Chenxi Wang, Luwei Tu, Constantin Patsch, Zhi Jin. CSPN: A Category-Specific Processing Network for Low-Light Image Enhancement |
| 11942 | -- | 11953 | Mingfeng Zha, Feiyang Fu, Yunqiang Pei, Guoqing Wang 0001, Tianyu Li, Xiongxin Tang, Yang Yang 0002, Heng Tao Shen. Dual Domain Perception and Progressive Refinement for Mirror Detection |
| 11954 | -- | 11964 | Haonan Zhang, Pengpeng Zeng, Lianli Gao, Jingkuan Song, Heng Tao Shen. UMP: Unified Modality-Aware Prompt Tuning for Text-Video Retrieval |
| 11965 | -- | 11979 | Feifeng Wang, Liquan Shen, Qi Teng, Zhaoyi Tian. DSCIC: Deep Screen Content Image Compression |
| 11980 | -- | 11992 | Yi-Hsin Chen, Hong-Sheng Xie, Cheng-Wei Chen, Zong-Lin Gao, Martin Benjak, Wen-Hsiao Peng, Jörn Ostermann. MaskCRT: Masked Conditional Residual Transformer for Learned Video Compression |
| 11993 | -- | 12005 | Tianyi Sun, Yanze Wang, Zhijie Huang, Jun Sun 0012. STRANet: Soft-Target and Restriction-Aware Neural Network for Efficient VVC Intra Coding |
| 12006 | -- | 12018 | Chris Henry, Li Song 0001, Zhu Li 0001. Fast Video Deduplication and Localization With Temporal Consistence Re-Ranking |
| 12019 | -- | 12031 | Huakai Lai, Wenfei Yang, Tianzhu Zhang, Yongdong Zhang 0001. Reliable Phrase Feature Mining for Hierarchical Video-Text Retrieval |
| 12032 | -- | 12047 | Huadong Zhang, Shuli Cheng, Anyu Du. Multi-Stage Auxiliary Learning for Visible-Infrared Person Re-Identification |
| 12048 | -- | 12058 | Duc Quang Vu, Trang T. T. Phung, Jia-Ching Wang, Son T. Mai. LCSL: Long-Tailed Classification via Self-Labeling |
| 12059 | -- | 12072 | Dengdi Sun, Yajie Pan, Andong Lu, Chenglong Li 0002, Bin Luo 0001. Transformer RGBT Tracking With Spatio-Temporal Multimodal Tokens |
| 12073 | -- | 12085 | Xingjie Dai, Ziwen He, Xiang Zhang 0023, Zhangjie Fu. SCGM: Asymmetric Steganographic Embedding Cost Learning With Adaptive Modulation |
| 12086 | -- | 12091 | Xi Xie, Meng Wang 0017, Junru Li, Kai Zhang 0007, Li Zhang 0006, Shiqi Wang 0001. Enhanced Motion Compensated Temporal Filter for VVenC |
| 12092 | -- | 12096 | Linwei Zhu, Yun Zhang 0002, Na Li 0015, Wenhui Wu, Shiqi Wang 0001, Sam Kwong. Neural Network Based Multi-Level In-Loop Filtering for Versatile Video Coding |