Abstract is missing.
- A Multi-Modal Contrastive Diffusion Model for Therapeutic Peptide GenerationYongkang Wang, Xuan Liu, Feng Huang, Zhankun Xiong, Wen Zhang. 3-11 [doi]
- Towards Automated RISC-V Microarchitecture Design with Reinforcement LearningChen Bai, Jianwang Zhai, Yuzhe Ma, Bei Yu 0001, Martin D. F. Wong. 12-20 [doi]
- Generating Novel Leads for Drug Discovery Using LLMs with Logical FeedbackShreyas Bhat Brahmavar, Ashwin Srinivasan 0001, Tirtharaj Dash, Sowmya Ramaswamy Krishnan, Lovekesh Vig, Arijit Roy, Raviprasad Aduri. 21-29 [doi]
- SeGA: Preference-Aware Self-Contrastive Learning with Prompts for Anomalous User Detection on TwitterYing-Ying Chang, Wei-Yao Wang, Wen-Chih Peng. 30-37 [doi]
- Neural Embeddings for kNN Search in Biological SequenceZhihao Chang, Linzhu Yu, Yanchao Xu, Wentao Hu. 38-45 [doi]
- i-Rebalance: Personalized Vehicle Repositioning for Supply Demand BalanceHaoyang Chen, Peiyan Sun, Qiyuan Song, Wanyuan Wang, Weiwei Wu 0001, Wencan Zhang, Guanyu Gao, Yan Lyu. 46-54 [doi]
- GIN-SD: Source Detection in Graphs with Incomplete Nodes via Positional Encoding and Attentive FusionLe Cheng, Peican Zhu, Keke Tang, Chao Gao, Zhen Wang 0004. 55-63 [doi]
- Deep Quantum Error CorrectionYoni Choukroun, Lior Wolf. 64-72 [doi]
- Propagation Tree Is Not Deep: Adaptive Graph Contrastive Learning Approach for Rumor DetectionChaoqun Cui, Caiyan Jia. 73-81 [doi]
- Prompt to Transfer: Sim-to-Real Transfer for Traffic Signal Control with Prompt LearningLongchao Da, Minquan Gao, Hao Mei, Hua Wei 0001. 82-90 [doi]
- Multitarget Device-Free Localization via Cross-Domain Wi-Fi RSS Training Data and Attentional Prior FusionNa Fan 0002, Zeyue Tian, Amartansh Dubey, Samruddhi Deshmukh, Ross D. Murch, Qifeng Chen. 91-99 [doi]
- Heterogeneous Graph Reasoning for Fact Checking over Texts and TablesHaisong Gong, Weizhi Xu 0002, Shu Wu, Qiang Liu 0006, Liang Wang 0056. 100-108 [doi]
- Text-Guided Molecule Generation with Diffusion Language ModelHaisong Gong, Qiang Liu, Shu Wu, Liang Wang. 109-117 [doi]
- Adversarial Robust Safeguard for Evading Deep Facial ManipulationJiazhi Guan, Yi Zhao, Zhuoer Xu, Changhua Meng, Ke Xu 0002, Youjian Zhao. 118-126 [doi]
- FlightBERT++: A Non-autoregressive Multi-Horizon Flight Trajectory Prediction FrameworkDongyue Guo, Zheng Zhang, Zhen Yan, Jianwei Zhang, Yi Lin. 127-134 [doi]
- LogFormer: A Pre-train and Tuning Pipeline for Log Anomaly DetectionHongcheng Guo, Jian Yang, Jiaheng Liu, Jiaqi Bai, Boyang Wang, Zhoujun Li, Tieqiao Zheng, Bo Zhang, Junran Peng, Qi Tian. 135-143 [doi]
- ContraNovo: A Contrastive Learning Approach to Enhance De Novo Peptide SequencingZhi Jin, Sheng Xu, Xiang Zhang, Tianze Ling, Nanqing Dong, Wanli Ouyang, Zhiqiang Gao, Cheng Chang, Siqi Sun. 144-152 [doi]
- Inducing Point Operator Transformer: A Flexible and Scalable Architecture for Solving PDEsSeungjun Lee, Taeil Oh. 153-161 [doi]
- MASTER: Market-Guided Stock Transformer for Stock Price ForecastingTong Li, Zhaoyang Liu, Yanyan Shen, Xue Wang, Haokun Chen, Sen Huang. 162-170 [doi]
- Learning from Polar Representation: An Extreme-Adaptive Model for Long-Term Time Series ForecastingYanhong Li, Jack Xu, David C. Anastasiu. 171-179 [doi]
- The Causal Impact of Credit Lines on Spending DistributionsYijun Li, Cheuk Hang Leung, Xiangqian Sun, Chaoqun Wang, Yiyan Huang, Xing Yan, Qi Wu 0009, Dongdong Wang, Zhixiang Huang. 180-187 [doi]
- Improving PTM Site Prediction by Coupling of Multi-Granularity Structure and Multi-Scale Sequence RepresentationZhengyi Li, Menglu Li, Lida Zhu, Wen Zhang. 188-196 [doi]
- Joint Learning Neuronal Skeleton and Brain Circuit Topology with Permutation Invariant Encoders for Neuron ClassificationMinghui Liao, Guojia Wan, Bo Du 0001. 197-205 [doi]
- Root Cause Analysis in Microservice Using Neural Granger Causal DiscoveryCheng Ming Lin, Ching Chang, Wei-Yao Wang, Kuang-Da Wang, Wen-Chih Peng. 206-213 [doi]
- Model-Driven Deep Neural Network for Enhanced AoA Estimation Using 5G gNBShengheng Liu, Xingkang Li, Zihuan Mao, Peng Liu, Yongming Huang. 214-221 [doi]
- MID-FiLD: MIDI Dataset for Fine-Level DynamicsJesung Ryu, Seungyeon Rhyu, Hong-Gyu Yoon, Eunchong Kim, Ju Young Yang, Taehyun Kim. 222-230 [doi]
- PosDiffNet: Positional Neural Diffusion for Point Cloud Registration in a Large Field of View with PerturbationsRui She, Sijie Wang, Qiyu Kang, Kai Zhao, Yang Song, Wee-Peng Tay, Tianyu Geng, Xingchao Jian. 231-239 [doi]
- StegaStyleGAN: Towards Generic and Practical Generative Image SteganographyWenkang Su, Jiangqun Ni, Yiyan Sun. 240-248 [doi]
- Dual-Channel Learning Framework for Drug-Drug Interaction Prediction via Relation-Aware Heterogeneous Graph TransformerXiaorui Su, Pengwei Hu, Zhu-Hong You, Philip S. Yu, Lun Hu. 249-256 [doi]
- Molecular Optimization Model with Patentability ConstraintSally Turutov, Kira Radinsky. 257-264 [doi]
- Generalizable Sleep Staging via Multi-Level Domain AlignmentJiquan Wang, Sha Zhao, Haiteng Jiang, Shijian Li, Tao Li, Gang Pan 0001. 265-273 [doi]
- Inspecting Prediction Confidence for Detecting Black-Box Backdoor AttacksTong Wang, Yuan Yao 0001, Feng Xu, Miao Xu, Shengwei An, Ting Wang. 274-282 [doi]
- Conformal Crystal Graph Transformer with Robust Encoding of Periodic InvarianceYingheng Wang, Shufeng Kong, John M. Gregoire, Carla P. Gomes. 283-291 [doi]
- SuperJunction: Learning-Based Junction Detection for Retinal Image RegistrationYu Wang, Xiaoye Wang, Zaiwang Gu, Weide Liu, Wee Siong Ng, Weimin Huang, Jun Cheng 0024. 292-300 [doi]
- Explore 3D Dance Generation via Reward Model from Automatically-Ranked DemonstrationsZilin Wang, Haolin Zhuang, Lu Li, Yinmin Zhang, Junjie Zhong, Jun Chen, Yu Yang, Boshi Tang, Zhiyong Wu 0001. 301-309 [doi]
- PSC-CPI: Multi-Scale Protein Sequence-Structure Contrasting for Efficient and Generalizable Compound-Protein Interaction PredictionLirong Wu, Yufei Huang, Cheng Tan 0012, Zhangyang Gao, Bozhen Hu, Haitao Lin, Zicheng Liu 0006, Stan Z. Li. 310-319 [doi]
- Uncertainty Quantification for Forward and Inverse Problems of PDEs via Latent Global EvolutionTailin Wu, Willie Neiswanger, Hongtao Zheng, Stefano Ermon, Jure Leskovec. 320-328 [doi]
- Multilevel Attention Network with Semi-supervised Domain Adaptation for Drug-Target PredictionZhousan Xie, Shikui Tu, Lei Xu 0001. 329-337 [doi]
- Geometric-Facilitated Denoising Diffusion Model for 3D Molecule GenerationCan Xu, Haosen Wang, Weigang Wang, Pengfei Zheng, Hongyang Chen. 338-346 [doi]
- GAMC: An Unsupervised Method for Fake News Detection Using Graph Autoencoder with MaskingShu Yin, Peican Zhu, Lianwei Wu, Chao Gao, Zhen Wang. 347-355 [doi]
- Unsupervised Gene-Cell Collective Representation Learning with Optimal TransportJixiang Yu, Nanjun Chen, Ming Gao 0008, Xiangtao Li, Ka Chun Wong. 356-364 [doi]
- MCSSME: Multi-Task Contrastive Learning for Semi-supervised Singing Melody Extraction from Polyphonic MusicShuai Yu. 365-373 [doi]
- RetroOOD: Understanding Out-of-Distribution Generalization in Retrosynthesis PredictionYemin Yu, Luotian Yuan, Ying Wei 0001, Hanyu Gao, Fei Wu 0001, Zhihua Wang 0008, Xinhai Ye. 374-382 [doi]
- Designing Biological Sequences without Prior Knowledge Using Evolutionary Reinforcement LearningXi Zeng, Xiaotian Hao, Hongyao Tang, Zhentao Tang, Shaoqing Jiao, Dazhi Lu, Jiajie Peng. 383-391 [doi]
- Adversarial Socialbots Modeling Based on Structural Information PrinciplesXianghua Zeng, Hao Peng 0001, Angsheng Li. 392-400 [doi]
- NondBREM: Nondeterministic Offline Reinforcement Learning for Large-Scale Order DispatchingHongbo Zhang, Guang Wang 0001, Xu Wang, Zhengyang Zhou, Chen Zhang, Zheng Dong, Yang Wang. 401-409 [doi]
- Scale Optimization Using Evolutionary Reinforcement Learning for Object Detection on Drone ImageryJialu Zhang 0003, Xiaoying Yang, Wentao He, Jianfeng Ren, Qian Zhang, Yitian Zhao, Ruibin Bai, Xiangjian He, Jiang Liu. 410-418 [doi]
- Adversarial Attacks on Federated-Learned Adaptive Bitrate AlgorithmsRui-Xiao Zhang, Tianchi Huang. 419-427 [doi]
- Generalize for Future: Slow and Fast Trajectory Learning for CTR PredictionJian Zhu, Congcong Liu, Xue Jiang, Changping Peng, Zhangang Lin, JingPing Shao. 428-436 [doi]
- Hot or Cold? Adaptive Temperature Sampling for Code Generation with Large Language ModelsYuqi Zhu, Jia Li, Ge Li, Yunfei Zhao, Jia Li, Zhi Jin, Hong Mei 0001. 437-445 [doi]
- Operationalizing Essential Characteristics of Creativity in a Computational System for Music CompositionPaul M. Bodily, Dan Ventura. 447-455 [doi]
- Neural Reasoning about Agents' Goals, Preferences, and ActionsMatteo Bortoletto, Lei Shi, Andreas Bulling. 456-464 [doi]
- An Empirical Study of CLIP for Text-Based Person SearchMin Cao, Yang Bai, Ziyin Zeng, Mang Ye, Min Zhang. 465-473 [doi]
- Social Physics Informed Diffusion Model for Crowd SimulationHongyi Chen, Jingtao Ding, Yong Li, Yue Wang, Xiao-Ping Zhang. 474-482 [doi]
- Trend-Aware Supervision: On Learning Invariance for Semi-supervised Facial Action Unit Intensity EstimationYingjie Chen, Jiarui Zhang, Tao Wang, Yun Liang 0001. 483-491 [doi]
- Enhancing the Robustness of Spiking Neural Networks with Stochastic Gating MechanismsJianhao Ding, Zhaofei Yu, Tiejun Huang 0001, Jian K. Liu. 492-502 [doi]
- Imitation of Life: A Search Engine for Biologically Inspired DesignHen Emuna, Nadav Borenstein, Xin Qian, Hyeonsu B. Kang, Joel Chan, Aniket Kittur, Dafna Shahaf. 503-511 [doi]
- An Efficient Knowledge Transfer Strategy for Spiking Neural Networks from Static to Event DomainXiang He, Dongcheng Zhao, Yang Li, Guobin Shen, Qingqun Kong, Yi Zeng. 512-520 [doi]
- Responding to the Call: Exploring Automatic Music Composition Using a Knowledge-Enhanced ModelZhejing Hu, Yan Liu, Gong Chen 0006, Xiao Ma, Shenghua Zhong, Qianwen Luo. 521-529 [doi]
- Neural Amortized Inference for Nested Multi-Agent ReasoningKunal Jha, Tuan Anh Le 0001, Chuanyang Jin, Yen-ling Kuo, Joshua B. Tenenbaum, Tianmin Shu. 530-537 [doi]
- Hidden Follower Detection: How Is the Gaze-Spacing Pattern Embodied in Frequency Domain?Shu Li, Ruimin Hu, Suhui Li, Liang Liao. 538-546 [doi]
- Music Style Transfer with Time-Varying Inversion of Diffusion ModelsSifei Li, Yuxin Zhang, Fan Tang, Chongyang Ma, Weiming Dong, Changsheng Xu. 547-555 [doi]
- A Brain-Inspired Way of Reducing the Network Complexity via Concept-Regularized Coding for Emotion RecognitionHan Lu, Xiahai Zhuang, Qiang Luo. 556-564 [doi]
- Multi-Energy Guided Image Translation with Stochastic Differential Equations for Near-Infrared Facial Expression RecognitionBingjun Luo, Zewen Wang, Jinpeng Wang, JunJie Zhu, Xibin Zhao, Yue Gao 0002. 565-573 [doi]
- Successive POI Recommendation via Brain-Inspired Spatiotemporal Aware RepresentationGehua Ma, He Wang, Jingyuan Zhao, Rui Yan 0005, Huajin Tang. 574-582 [doi]
- BDIQA: A New Dataset for Video Question Answering to Explore Cognitive Reasoning through Theory of MindYuanyuan Mao, Xin Lin, Qin Ni, Liang He. 583-591 [doi]
- Unveiling the Significance of Toddler-Inspired Reward Transition in Goal-Oriented Reinforcement LearningJunseok Park, Yoonsung Kim, Hee bin Yoo, Min Whoo Lee, Kibeom Kim, Won-Seok Choi 0006, Minsu Lee, Byoung-Tak Zhang. 592-600 [doi]
- Gated Attention Coding for Training High-Performance and Efficient Spiking Neural NetworksXuerui Qiu, Rui-Jie Zhu, Yuhong Chou, Zhaorui Wang 0005, Liang-Jian Deng, Guoqi Li. 601-610 [doi]
- Efficient Spiking Neural Networks with Sparse Selective Activation for Continual LearningJiangrong Shen, Wenyao Ni, Qi Xu, Huajin Tang. 611-619 [doi]
- Boosting Neural Cognitive Diagnosis with Student's Affective State ModelingShanshan Wang, Zhen Zeng, Xun Yang, Ke Xu, Xingyi Zhang 0001. 620-627 [doi]
- DMMR: Cross-Subject Domain Generalization for EEG-Based Emotion Recognition via Denoising Mixed Mutual ReconstructionYiming Wang, Bin Zhang, Yujiao Tang. 628-636 [doi]
- Transient Glimpses: Unveiling Occluded Backgrounds through the Spike CameraJiyuan Zhang, Shiyan Chen, Yajing Zheng, Zhaofei Yu, Tiejun Huang 0001. 637-645 [doi]
- Open-Set Facial Expression RecognitionYuhang Zhang, Yue Yao, Xuannan Liu, Lixiong Qin, Wenjing Wang, Weihong Deng. 646-654 [doi]
- Bootstrapping Cognitive Agents with a Large Language ModelFeiyu Zhu, Reid G. Simmons. 655-663 [doi]
- Data Augmented Graph Neural Networks for Personality DetectionYangfu Zhu, Yue Xia, Meiling Li, Tingting Zhang, Bin Wu 0001. 664-672 [doi]
- DreamStyler: Paint by Style Inversion with Text-to-Image Diffusion ModelsNamhyuk Ahn, Junsoo Lee, Chunggi Lee, Kunhee Kim, DaeSik Kim, Seung-Hun Nam, Kibeom Hong. 674-681 [doi]
- Context Enhanced Transformer for Single Image Object Detection in Video DataSeungjun An, SeongHoon Park, Gyeongnyeon Kim, JeongYeol Baek, Byeongwon Lee, Seungryong Kim. 682-690 [doi]
- SHaRPose: Sparse High-Resolution Representation for Human Pose EstimationXiaoqi An, Lin Zhao, Chen Gong 0002, Nannan Wang, Di Wang, Jian Yang. 691-699 [doi]
- Comparing the Robustness of Modern No-Reference Image- and Video-Quality Metrics to Adversarial AttacksAnastasia Antsiferova, Khaled Abud, Aleksandr Gushchin, Ekaterina Shumitskaya, Sergey Lavrushkin, Dmitriy S. Vatolin. 700-708 [doi]
- DocFormerv2: Local Features for Document UnderstandingSrikar Appalaraju, Peng Tang, Qi Dong, Nishant Sankaran, Yichu Zhou, R. Manmatha. 709-718 [doi]
- Exposing the Deception: Uncovering More Forgery Clues for Deepfake DetectionZhongjie Ba, Qingyu Liu, Zhenguang Liu, Shuang Wu, Feng Lin, Li Lu, Kui Ren 0001. 719-728 [doi]
- Prompt-Based Distribution Alignment for Unsupervised Domain AdaptationShuanghao Bai, Min Zhang, Wanqi Zhou, Siteng Huang, Zhirong Luan, Donglin Wang, Badong Chen. 729-737 [doi]
- Local-Global Multi-Modal Distillation for Weakly-Supervised Temporal Video GroundingPeijun Bao, Yong Xia, Wenhan Yang, Boon Poh Ng, Meng Hwa Er, Alex C. Kot. 738-746 [doi]
- Omnipotent Distillation with LLMs for Weakly-Supervised Natural Language Video Localization: When Divergence Meets ConsistencyPeijun Bao, Zihao Shao, Wenhan Yang, Boon Poh Ng, Meng Hwa Er, Alex C. Kot. 747-755 [doi]
- Improving Diffusion-Based Image Restoration with Error Contraction and Error CorrectionQiqi Bao, Zheng Hui, Rui Zhu 0006, Peiran Ren, Xuansong Xie, Wenming Yang. 756-764 [doi]
- Relevant Intrinsic Feature Enhancement Network for Few-Shot Semantic SegmentationXiaoyi Bao, Jie Qin, Siyang Sun, Xingang Wang, Yun Zheng. 765-773 [doi]
- Image Safeguarding: Reasoning with Conditional Vision Language Model and Obfuscating Unsafe Content CounterfactuallyMazal Bethany, Brandon Wherry, Nishant Vishwamitra, Peyman Najafirad. 774-782 [doi]
- DanceAnyWay: Synthesizing Beat-Guided 3D Dances with Randomized Temporal Contrastive LearningAneesh Bhattacharya, Manas Paranjape, Uttaran Bhattacharya, Aniket Bera. 783-791 [doi]
- DiffSED: Sound Event Detection with Denoising DiffusionSwapnil Bhosale, Sauradip Nag, Diptesh Kanojia, Jiankang deng, Xiatian Zhu. 792-800 [doi]
- Learning Generalized Segmentation for Foggy-Scenes by Bi-directional Wavelet GuidanceQi Bi, Shaodi You, Theo Gevers. 801-809 [doi]
- Learning Generalized Medical Image Segmentation from Decoupled Feature QueriesQi Bi, Jingjun Yi, Hao Zheng, Wei Ji, Yawen Huang, Yuexiang Li, Yefeng Zheng 0001. 810-818 [doi]
- Learning Content-Enhanced Mask Transformer for Domain Generalized Urban-Scene SegmentationQi Bi, Shaodi You, Theo Gevers. 819-827 [doi]
- ShapeBoost: Boosting Human Shape Estimation with Part-Based Parameterization and Clothing-Preserving AugmentationSiyuan Bian, Jiefeng Li, Jiasheng Tang, Cewu Lu. 828-836 [doi]
- MICA: Towards Explainable Skin Lesion Diagnosis via Multi-Level Image-Concept AlignmentYequan Bie, Luyang Luo, Hao Chen. 837-845 [doi]
- VIXEN: Visual Text Comparison Network for Image Difference CaptioningAlexander Black 0001, Jing Shi, Yifei Fan, Tu Bui, John P. Collomosse. 846-854 [doi]
- SRFormer: Text Detection Transformer with Incorporated Segmentation and RegressionQingwen Bu, Sungrae Park, Minsoo Khang, Yichuan Cheng. 855-863 [doi]
- Orthogonal Dictionary Guided Shape Completion Network for Point CloudPingping Cai, Deja Scott, Xiaoguang Li, Song Wang. 864-872 [doi]
- Spherical Pseudo-Cylindrical Representation for Omnidirectional Image Super-resolutionQing Cai, Mu Li, Dongwei Ren, Jun Lyu, Haiyong Zheng, Junyu Dong, Yee-Hong Yang. 873-881 [doi]
- Disentangled Diffusion-Based 3D Human Pose Estimation with Hierarchical Spatial and Temporal DenoiserQingyuan Cai, Xuecai Hu, Saihui Hou, Li Yao, Yongzhen Huang. 882-890 [doi]
- Rethinking the Paradigm of Content Constraints in Unpaired Image-to-Image TranslationXiuding Cai, Yaoyao Zhu, Dong Miao, Linjie Fu, Yu Yao. 891-899 [doi]
- FusionFormer: A Concise Unified Feature Fusion Transformer for 3D Pose EstimationYanlu Cai, Weizhong Zhang, Yuan Wu 0004, Cheng Jin 0001. 900-908 [doi]
- Decoupled Textual Embeddings for Customized Image GenerationYufei Cai, Yuxiang Wei 0001, Zhilong Ji, Jinfeng Bai, Hu Han 0001, Wangmeng Zuo. 909-917 [doi]
- Disguise without Disruption: Utility-Preserving Face De-identificationZikui Cai, Zhongpai Gao, Benjamin Planche, Meng Zheng, Terrence Chen, M. Salman Asif, Ziyan Wu. 918-926 [doi]
- Bi-directional Adapter for Multimodal TrackingBing Cao, Junliang Guo, Pengfei Zhu 0001, Qinghua Hu. 927-935 [doi]
- Domain-Controlled Prompt LearningQinglong Cao, Zhengqin Xu, Yuntian Chen, Chao Ma 0004, Xiaokang Yang. 936-944 [doi]
- LogoStyleFool: Vitiating Video Recognition Systems via Logo Style TransferYuxin Cao, Ziyu Zhao, Xi Xiao, Derui Wang, Minhui Xue, Jin Lu. 945-953 [doi]
- Descanning: From Scanned to the Original Images with a Color Correction Diffusion ModelJunghun Cha, Ali Haider, Seoyun Yang, Hoeyeong Jin, Subin Yang, A. F. M. Shahab Uddin, Jaehyoung Kim, Soo Ye Kim, Sung-Ho Bae. 954-963 [doi]
- Fine Structure-Aware Sampling: A New Sampling Training Scheme for Pixel-Aligned Implicit Models in Single-View Human ReconstructionKennard Yanting Chan, Fayao Liu, Guosheng Lin, Chuan-Sheng Foo, Weisi Lin. 964-971 [doi]
- CMDA: Cross-Modal and Domain Adversarial Adaptation for LiDAR-Based 3D Object DetectionGyusam Chang, Wonseok Roh, Sujin Jang, Dongwook Lee, Daehyun Ji, Gyeongrok Oh, Jinsun Park, Jinkyu Kim, Sangpil Kim. 972-980 [doi]
- A Hybrid Global-Local Perception Network for Lane DetectionQing Chang, Yifei Tong. 981-989 [doi]
- Improving Robustness for Joint Optimization of Camera Pose and Decomposed Low-Rank Tensorial Radiance FieldsBo-Yu Chen, Wei-chen Chiu, Yu-Lun Liu 0001. 990-1000 [doi]
- Sketch and Refine: Towards Fast and Accurate Lane DetectionChao Chen, Jie Liu 0040, Chang Zhou, Jie Tang 0006, Gangshan Wu. 1001-1009 [doi]
- Iterative Token Evaluation and Refinement for Real-World Super-resolutionChaofeng Chen, Shangchen Zhou, Liang Liao, Haoning Wu, Wenxiu Sun, Qiong Yan, Weisi Lin. 1010-1018 [doi]
- FeatWalk: Enhancing Few-Shot Classification through Local View LeveragingDalong Chen, Jianjia Zhang, Wei-Shi Zheng 0001, Ruixuan Wang. 1019-1027 [doi]
- Real3D: The Curious Case of Neural Scene DegenerationDengsheng Chen, Jie Hu, Xiaoming Wei, Enhua Wu. 1028-1036 [doi]
- DDAE: Towards Deep Dynamic Vision BERT PretrainingHonghao Chen, Xiangwen Kong, Xiangyu Zhang 0005, Xin Zhao 0012, Kaiqi Huang. 1037-1045 [doi]
- Rethinking Multi-Scale Representations in Deep Deraining TransformerHongming Chen 0004, Xiang Chen, Jiyang Lu, Yufeng Li. 1046-1053 [doi]
- Unsupervised Group Re-identification via Adaptive Clustering-Driven Progressive LearningHongxu Chen, Quan Zhang, Jian-Huang Lai, Xiaohua Xie. 1054-1062 [doi]
- Guiding a Harsh-Environments Robust Detector via RAW Data Characteristic MiningHongyang Chen, Hung-Shuo Tai, Kaisheng Ma. 1063-1071 [doi]
- CutFreq: Cut-and-Swap Frequency Components for Low-Level Vision AugmentationHongyang Chen, Kaisheng Ma. 1072-1080 [doi]
- Null Space Matters: Range-Null Decomposition for Consistent Multi-Contrast MRI ReconstructionJiacheng Chen, Jiawei Jiang, Fei Wu, Jianwei Zheng 0001. 1081-1090 [doi]
- PNeSM: Arbitrary 3D Scene Stylization via Prompt-Based Neural Style MappingJiafu Chen, Wei Xing, Jiakai Sun, Tianyi Chu, Yiling Huang, Boyan Ji, Lei Zhao, Huaizhong Lin, Haibo Chen 0006, Zhizhong Wang. 1091-1099 [doi]
- TagFog: Textual Anchor Guidance and Fake Outlier Generation for Visual Out-of-Distribution DetectionJiankang Chen, Tong Zhang, Wei-Shi Zheng 0001, Ruixuan Wang. 1100-1109 [doi]
- EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoEJunyi Chen, Longteng Guo, Jia Sun, Shuai Shao 0005, Zehuan Yuan, Liang Lin, Dongyu Zhang. 1110-1119 [doi]
- CaMIL: Causal Multiple Instance Learning for Whole Slide Image ClassificationKaitao Chen, Shiliang Sun, Jing Zhao 0015. 1120-1128 [doi]
- Multi-Prototype Space Learning for Commonsense-Based Scene Graph GenerationLianggangxu Chen, Youqi Song, Yiqing Cai, Jiale Lu, Yang Li, Yuan Xie, Changbo Wang, Gaoqi He. 1129-1137 [doi]
- Kumaraswamy Wavelet for Heterophilic Scene Graph GenerationLianggangxu Chen, Youqi Song, Shaohui Lin, Changbo Wang, Gaoqi He. 1138-1146 [doi]
- ViT-Calibrator: Decision Stream Calibration for Vision TransformerLin Chen, Zhijie Jia, Lechao Cheng, Yang Gao, Jie Lei 0002, Yijun Bei, Zunlei Feng. 1147-1155 [doi]
- NeRF-VPT: Learning Novel View Representations with Neural Radiance Fields via View Prompt TuningLinsheng Chen, Guangrun Wang, Liuchun Yuan, Keze Wang, Ken Deng, Philip H. S. Torr. 1156-1164 [doi]
- WebVLN: Vision-and-Language Navigation on WebsitesQi Chen, Dileepa Pitawela, Chongyang Zhao 0003, Gengze Zhou, Hsiang-Ting Chen, Qi Wu 0001. 1165-1173 [doi]
- Learning Multimodal Volumetric Features for Large-Scale Neuron TracingQihua Chen, Xuejin Chen, Chenxuan Wang, Yixiong Liu, Zhiwei Xiong, Feng Wu. 1174-1182 [doi]
- M-BEV: Masked BEV Perception for Robust Autonomous DrivingSiran Chen, Yue Ma, Yu Qiao, Yali Wang. 1183-1191 [doi]
- VPDETR: End-to-End Vanishing Point DEtection TRansformersTaiyan Chen, Xianghua Ying, Jinfa Yang, Ruibin Wang, Ruohao Guo, Bowei Xing, Ji Shi. 1192-1200 [doi]
- TCI-Former: Thermal Conduction-Inspired Transformer for Infrared Small Target DetectionTianxiang Chen, Zhentao Tan, Qi Chu 0001, Yue Wu, Bin Liu, Nenghai Yu. 1201-1209 [doi]
- Intrinsic Phase-Preserving Networks for Depth Super ResolutionXuanhong Chen, Hang Wang, Jialiang Chen, Kairui Feng, Jinfan Liu, Xiaohang Wang 0004, Weimin Zhang, Bingbing Ni. 1210-1218 [doi]
- Box2Poly: Memory-Efficient Polygon Prediction of Arbitrarily Shaped and Rotated TextXuyang Chen, Dong Wang, Konrad Schindler, Mingwei Sun, Yongliang Wang, Nicoló Savioli, Liqiu Meng. 1219-1227 [doi]
- FashionERN: Enhance-and-Refine Network for Composed Fashion Image RetrievalYanzhe Chen, Huasong Zhong, Xiangteng He, Yuxin Peng, Jiahuan Zhou, Lele Cheng. 1228-1236 [doi]
- IT3D: Improved Text-to-3D Generation with Explicit View SynthesisYiwen Chen, Chi Zhang, Xiaofeng Yang, Zhongang Cai, Gang Yu, Lei Yang, Guosheng Lin. 1237-1244 [doi]
- Beyond the Label Itself: Latent Labels Enhance Semi-supervised Point Cloud Panoptic SegmentationYujun Chen, Xin Tan, Zhizhong Zhang, Yanyun Qu, Yuan Xie 0006. 1245-1253 [doi]
- Visual Chain-of-Thought Prompting for Knowledge-Based Visual ReasoningZhenfang Chen, Qinhong Zhou, Yikang Shen, Yining Hong, Zhiqing Sun, Dan Gutfreund, Chuang Gan. 1254-1262 [doi]
- Blind Face Restoration under Extreme Conditions: Leveraging 3D-2D Prior Fusion for Superior Structural and Texture RecoveryZhengrui Chen, Liying Lu, Ziyang Yuan, Yiming Zhu, Yu Li 0003, Chun Yuan, Weihong Deng. 1263-1271 [doi]
- CamoDiffusion: Camouflaged Object Detection via Conditional Diffusion ModelsZhongxi Chen, Ke Sun, Xianming Lin. 1272-1280 [doi]
- DreamIdentity: Enhanced Editability for Efficient Face-Identity Preserved Image GenerationZhuowei Chen, Shancheng Fang, Wei Liu, Qian He, Mengqi Huang, Zhendong Mao. 1281-1289 [doi]
- Deep Linear Array Pushbroom Image Restoration: A Degradation Pipeline and Jitter-Aware Restoration NetworkZida Chen, Ziran Zhang, Haoying Li, Menghao Li, Yueting Chen, Qi Li, Huajun Feng, Zhihai Xu, Shiqi Chen. 1290-1298 [doi]
- Context-Aware Iteration Policy Network for Efficient Optical Flow EstimationRi Cheng, Ruian He, Xuhao Jiang, Shili Zhou, Weimin Tan, Bo Yan 0001. 1299-1307 [doi]
- SparseGNV: Generating Novel Views of Indoor Scenes with Sparse RGB-D ImagesWeihao Cheng 0002, Yan-Pei Cao, Ying Shan. 1308-1316 [doi]
- Colorizing Monochromatic Radiance FieldsYean Cheng, Renjie Wan, Shuchen Weng, Chengxuan Zhu, Yakun Chang, Boxin Shi. 1317-1325 [doi]
- Parallel Vertex Diffusion for Unified Visual GroundingZesen Cheng, Kehan Li 0002, Peng Jin, Siheng Li, Xiangyang Ji, Li Yuan, Chang Liu, Jie Chen 0001. 1326-1334 [doi]
- iDet3D: Towards Efficient Interactive Object Detection for LiDAR Point CloudsDongmin Choi, Wonwoo Cho, Kangyeol Kim, Jaegul Choo. 1335-1343 [doi]
- Fusion-Vital: Video-RF Fusion Transformer for Advanced Remote Physiological MeasurementJae-Ho Choi, Ki-Bong Kang, Kyung Tae Kim. 1344-1352 [doi]
- MeDM: Mediating Image Diffusion Models for Video-to-Video Translation with Temporal Correspondence GuidanceErnie Chu, Tzuhsuan Huang, Shuo-Yen Lin, Jun-Cheng Chen. 1353-1361 [doi]
- Attack Deterministic Conditional Image Generative Models for Diverse and Controllable GenerationTianyi Chu, Wei Xing, Jiafu Chen, Zhizhong Wang, Jiakai Sun, Lei Zhao, Haibo Chen, Huaizhong Lin. 1362-1370 [doi]
- NILUT: Conditional Neural Implicit 3D Lookup Tables for Image EnhancementMarcos V. Conde, Javier Vazquez-Corral, Michael S. Brown, Radu Timofte. 1371-1379 [doi]
- Decoupled Optimisation for Long-Tailed Visual RecognitionCong Cong, Shiyu Xuan, Sidong Liu, Shiliang Zhang, Maurice Pagnucco, Yang Song 0001. 1380-1388 [doi]
- Underwater Organism Color Fine-Tuning via Decomposition and GuidanceXiaofeng Cong, Jie Gui, Junming Hou. 1389-1398 [doi]
- Color Event Enhanced Single-Exposure HDR ImagingMengyao Cui, Zhigang Wang, Dong Wang, Bin Zhao, Xuelong Li. 1399-1407 [doi]
- PHFormer: Multi-Fragment Assembly Using Proxy-Level Hybrid TransformerWenting Cui, Runzhao Yao, Shaoyi Du. 1408-1416 [doi]
- Trash to Treasure: Low-Light Object Detection via Decomposition-and-AggregationXiaohan Cui, Long Ma 0002, Tengyu Ma 0004, Jinyuan Liu 0001, Xin Fan 0001, Risheng Liu. 1417-1425 [doi]
- Omni-Kernel Network for Image RestorationYuning Cui 0001, Wenqi Ren, Alois Knoll. 1426-1434 [doi]
- Aleth-NeRF: Illumination Adaptive NeRF with Concealing Field AssumptionZiteng Cui, Lin Gu 0003, Xiao Sun, Xianzheng Ma, Yu Qiao, Tatsuya Harada. 1435-1444 [doi]
- Federated Modality-Specific Encoders and Multimodal Anchors for Personalized Brain Tumor SegmentationQian Dai, Dong Wei 0004, Hong Liu, Jinghan Sun, Liansheng Wang, Yefeng Zheng 0001. 1445-1453 [doi]
- Generating and Reweighting Dense Contrastive Patterns for Unsupervised Anomaly DetectionSongmin Dai, Yifan Wu, Xiaoqiang Li, Xiangyang Xue. 1454-1462 [doi]
- Noisy Correspondence Learning with Self-Reinforcing Errors MitigationZhuohang Dang, Minnan Luo, Chengyou Jia, Guang Dai, Xiaojun Chang, Jingdong Wang 0001. 1463-1471 [doi]
- LDMVFI: Video Frame Interpolation with Latent Diffusion ModelsDuolikun Danier, Fan Zhang 0017, David R. Bull. 1472-1480 [doi]
- No More Shortcuts: Realizing the Potential of Temporal Self-SupervisionIshan Rajendrakumar Dave, Simon Jenni, Mubarak Shah. 1481-1491 [doi]
- A Dynamic GCN with Cross-Representation Distillation for Event-Based LearningYongjian Deng, Hao Chen, Youfu Li. 1492-1500 [doi]
- ResMatch: Residual Attention Learning for Feature MatchingYuxin Deng, Kaining Zhang, Shihua Zhang, Yansheng Li, Jiayi Ma 0001. 1501-1509 [doi]
- SDGMNet: Statistic-Based Dynamic Gradient Modulation for Local Descriptor LearningYuxin Deng, Jiayi Ma 0001. 1510-1518 [doi]
- Stereo Vision Conversion from Planar Videos Based on Temporal Multiplane ImagesShanding Diao, Yuan Chen, Yang Zhao, Wei Jia, Zhao Zhang 0001, Ronggang Wang. 1519-1527 [doi]
- Weak Distribution Detectors Lead to Stronger Generalizability of Vision-Language Prompt TuningKun Ding, Haojian Zhang, Qiang Yu, Ying Wang, Shiming Xiang, Chunhong Pan. 1528-1536 [doi]
- Expressive Forecasting of 3D Whole-Body Human MotionsPengxiang Ding, Qiongjie Cui, Haofan Wang, Min Zhang, Mengyuan Liu, Donglin Wang. 1537-1545 [doi]
- Transferable Adversarial Attacks for Object Detection Using Object-Aware Significant Feature DistortionXinlong Ding, Jiansheng Chen, Hongwei Yu, Yu Shang, Yining Qin, Huimin Ma 0001. 1546-1554 [doi]
- Hyp-OW: Exploiting Hierarchical Structure Learning with Hyperbolic Distance Enhances Open World Object DetectionThang Doan, Xin Li, Sima Behpour, Wenbin He, Liang Gou, Liu Ren. 1555-1563 [doi]
- Exploiting Polarized Material Cues for Robust Car DetectionWen Dong 0008, Haiyang Mei, Ziqi Wei, Ao Jin, Sen Qiu, Qiang Zhang, Xin Yang. 1564-1572 [doi]
- Learning Multi-Modal Cross-Scale Deformable Transformer Network for Unregistered Hyperspectral Image Super-resolutionWenqian Dong, Yang Xu, Jiahui Qu, Shaoxiong Hou. 1573-1581 [doi]
- Joint Demosaicing and Denoising for Spike CameraYanchen Dong 0001, Ruiqin Xiong, Jing Zhao, Jian Zhang 0018, Xiaopeng Fan, Shuyuan Zhu, Tiejun Huang 0001. 1582-1590 [doi]
- ChromaFusionNet (CFNet): Natural Fusion of Fine-Grained Color EditingYi Dong, Yuxi Wang, Ruoxi Fan, Wenqi Ouyang, Zhiqi Shen 0001, Peiran Ren, Xuansong Xie. 1591-1599 [doi]
- HybridGait: A Benchmark for Spatial-Temporal Cloth-Changing Gait Recognition with Hybrid ExplorationsYilan Dong, Chunlin Yu, Ruiyang Ha, Ye Shi 0001, Yuexin Ma, Lan Xu, Yanwei Fu, Jingya Wang. 1600-1608 [doi]
- PPEA-Depth: Progressive Parameter-Efficient Adaptation for Self-Supervised Monocular Depth EstimationYue-Jiang Dong, Yuan-Chen Guo, Ying-Tian Liu, Fang-Lue Zhang, Song-Hai Zhang. 1609-1617 [doi]
- CycleVTON: A Cycle Mapping Framework for Parser-Free Virtual Try-OnChenghu Du, Junyin Wang, Yi Rong, Shuqing Liu, Kai Liu, Shengwu Xiong. 1618-1625 [doi]
- Arbitrary-Scale Point Cloud Upsampling by Voxel-Based Network with Latent Geometric-Consistent LearningHang Du, Xuejun Yan, Jingjing Wang, Di Xie, Shiliang Pu. 1626-1634 [doi]
- CDPNet: Cross-Modal Dual Phases Network for Point Cloud CompletionZhenjiang Du, Jiale Dou, Zhitao Liu, Jiwei Wei, Guan Wang, Ning Xie 0003, Yang Yang 0002. 1635-1643 [doi]
- Tuning-Free Inversion-Enhanced Control for Consistent Image EditingXiaoyue Duan, Shuhao Cui, Guoliang Kang, Baochang Zhang 0001, Zhengcong Fei, Mingyuan Fan, Junshi Huang. 1644-1652 [doi]
- WeditGAN: Few-Shot Image Generation via Latent Space RelocationYuxuan Duan, Li Niu 0002, Yan Hong, Liqing Zhang 0001. 1653-1661 [doi]
- SkeletonGait: Gait Recognition Using Skeleton MapsChao Fan 0001, Jingzhe Ma, Dongyang Jin, Chuanfu Shen, Shiqi Yu 0001. 1662-1669 [doi]
- TDeLTA: A Light-Weight and Robust Table Detection Method Based on Learning Text ArrangementYang Fan, Xiangping Wu 0001, Qingcai Chen, Heng Li, Yan Huang, Zhixiang Cai, Qitian Wu. 1670-1678 [doi]
- Collaborative Tooth Motion Diffusion Model in Digital OrthodonticsYeying Fan, Guangshun Wei, Chen Wang, Shaojie Zhuang, Wenping Wang, Yuanfeng Zhou. 1679-1687 [doi]
- Everything2Motion: Synchronizing Diverse Inputs via a Unified Framework for Human Motion SynthesisZhaoxin Fan, Longbin Ji, Pengxin Xu, Fan Shen, Kai Chen. 1688-1697 [doi]
- Variance-Insensitive and Target-Preserving Mask Refinement for Interactive Image SegmentationChaowei Fang, Ziyin Zhou, Junye Chen, Hanjing Su, Qingyao Wu, Guanbin Li. 1698-1706 [doi]
- Evaluate Geometry of Radiance Fields with Low-Frequency Color PriorQihang Fang, Yafei Song, Keqiang Li, Li Shen 0003, Huaiyu Wu, Gang Xiong 0001, Liefeng Bo. 1707-1715 [doi]
- Simple Image-Level Classification Improves Open-Vocabulary Object DetectionRuohuan Fang, Guansong Pang, Xiao Bai 0001. 1716-1725 [doi]
- Self-Supervised Bird's Eye View Motion Prediction with Cross-Modality SignalsShaoheng Fang, Zuhong Liu, Mingyu Wang, Chenxin Xu, Yiqi Zhong, Siheng Chen. 1726-1734 [doi]
- Fewer Steps, Better Performance: Efficient Cross-Modal Clip Trimming for Video Moment Retrieval Using LanguageXiang Fang, Daizong Liu, Wanlong Fang, Pan Zhou, Zichuan Xu, Wenzheng Xu, Junyang Chen, Renfu Li. 1735-1743 [doi]
- An Embedding-Unleashing Video Polyp Segmentation Framework via Region Linking and Scale AlignmentZhixue Fang, Xinrong Guo, Jingyin Lin, Huisi Wu, Jing Qin 0001. 1744-1752 [doi]
- Debiased Novel Category Discovering and LocalizationJuexiao Feng, Yuhong Yang 0008, Yanchun Xie, Yaqian Li, Yandong Guo, Yuchen Guo, Yuwei He, Liuyu Xiang, Guiguang Ding. 1753-1760 [doi]
- Interpretable3D: An Ad-Hoc Interpretable Classifier for 3D Point CloudsTuo Feng 0001, Ruijie Quan, Xiaohan Wang, Wenguan Wang, Yi Yang. 1761-1769 [doi]
- Mimic: Speaking Style Disentanglement for Speech-Driven 3D Facial AnimationHui Fu, Zeqing Wang, Ke Gong, Keze Wang, Tianshui Chen, Haojie Li, Haifeng Zeng, Wenxiong Kang. 1770-1777 [doi]
- Fine-Grained Multi-View Hand Reconstruction Using Inverse RenderingQijun Gan, Wentong Li, Jinwei Ren, Jianke Zhu. 1779-1787 [doi]
- Attacking Transformers with Feature Diversity Adversarial PerturbationChenxing Gao, Hang Zhou, Junqing Yu, Yuteng Ye, Jiale Cai, Junle Wang, Wei Yang 0034. 1788-1796 [doi]
- Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-supervised 3D Object DetectionHongzhi Gao, Zheng Chen, Zehui Chen, Lin Chen, Jiaming Liu, Shanghang Zhang, Feng Zhao. 1797-1805 [doi]
- Dual-Prior Augmented Decoding Network for Long Tail Distribution in HOI DetectionJiayi Gao, Kongming Liang, Tao Wei, Wei Chen 0071, Zhanyu Ma, Jun Guo 0002. 1806-1814 [doi]
- LAMM: Label Alignment for Multi-Modal Prompt LearningJingsheng Gao, Jiacheng Ruan, Suncheng Xiang, Zefang Yu, Ke-ji, Mingye Xie, Ting Liu, Yuzhuo Fu. 1815-1823 [doi]
- Frequency-Controlled Diffusion Model for Versatile Text-Guided Image-to-Image TranslationXiang Gao, Zhengbo Xu, Junhan Zhao, Jiaying Liu 0001. 1824-1832 [doi]
- A General Implicit Framework for Fast NeRF Composition and RenderingXinyu Gao, Ziyi Yang, Yunlu Zhao, Yuxiang Sun, Xiaogang Jin 0001, Changqing Zou. 1833-1841 [doi]
- Multi-Scene Generalized Trajectory Global Graph Solver with Composite Nodes for Multiple Object TrackingYan Gao, Haojun Xu, Jie Li, Nannan Wang 0001, Xinbo Gao 0001. 1842-1850 [doi]
- A Dual Stealthy Backdoor: From Both Spatial and Frequency PerspectivesYudong Gao, Honglong Chen, Peng Sun, Junjian Li, Anqing Zhang, Zhibo Wang 0001, Weifeng Liu 0001. 1851-1859 [doi]
- SoftCLIP: Softer Cross-Modal Alignment Makes CLIP StrongerYuting Gao, Jinfeng Liu 0007, Zihan Xu, Tong Wu, Enwei Zhang, Ke Li, Jie Yang, Wei Liu, Xing Sun. 1860-1868 [doi]
- Composite Sketch+Text Queries for Retrieving Objects with Elusive Names and Complex InteractionsPrajwal Gatti, Kshitij Parikh, Dhriti Prasanna Paul, Manish Gupta 0001, Anand Mishra 0001. 1869-1877 [doi]
- Neuromorphic Event Signal-Driven Network for Video De-rainingChengjie Ge, Xueyang Fu, Peng He, Kunyu Wang, Chengzhi Cao, Zheng-Jun Zha. 1878-1886 [doi]
- Beyond Prototypes: Semantic Anchor Regularization for Better Representation LearningYanqi Ge, Qiang Nie, Ye Huang, Yong Liu, Chengjie Wang, Feng Zheng, Wen Li, Lixin Duan. 1887-1895 [doi]
- Learning Multi-Scale Video-Text Correspondence for Weakly Supervised Temporal Article GrondingWenjia Geng, Yong Liu, Lei Chen, Sujia Wang, Jie Zhou 0001, Yansong Tang. 1896-1904 [doi]
- PoseGen: Learning to Generate 3D Human Pose Dataset with NeRFMohsen Gholami, Rabab Ward, Z. Jane Wang 0001. 1905-1913 [doi]
- SDAC: A Multimodal Synthetic Dataset for Anomaly and Corner Case Detection in Autonomous DrivingLei Gong, Yu Zhang 0086, Yingqing Xia, Yanyong Zhang, Jianmin Ji. 1914-1922 [doi]
- ContactGen: Contact-Guided Interactive 3D Human Generation for PartnersDongjun Gu, Jaehyeok Shim, Jaehoon Jang, Changwoo Kang, Kyungdon Joo. 1923-1931 [doi]
- AnomalyGPT: Detecting Industrial Anomalies Using Large Vision-Language ModelsZhaopeng Gu, Bingke Zhu, Guibo Zhu, Yingying Chen 0003, Ming Tang 0001, Jinqiao Wang. 1932-1940 [doi]
- SeqRank: Sequential Ranking of Salient ObjectsHuankang Guan, Rynson W. H. Lau. 1941-1949 [doi]
- Knowledge-Aware Neuron Interpretation for Scene ClassificationYong Guan, Freddy Lécué, Jiaoyan Chen, Ru Li 0001, Jeff Z. Pan. 1950-1958 [doi]
- Self-Supervised Representation Learning with Meta Comprehensive RegularizationHuijie Guo, Ying Ba, Jie Hu, Lingyu Si, Wenwen Qiang, Lei Shi. 1959-1967 [doi]
- Graph Context Transformation Learning for Progressive Correspondence PruningJunwen Guo, Guobao Xiao, Shiping Wang, Jun Yu. 1968-1975 [doi]
- Depth-Guided Robust and Fast Point Cloud Fusion NeRF for Sparse Input ViewsShuai Guo 0002, Qiuwen Wang, Yijie Gao, Rong Xie, Li Song 0001. 1976-1984 [doi]
- Improving Panoptic Narrative Grounding by Harnessing Semantic Relationships and Visual ConfirmationTianyu Guo 0005, Haowei Wang 0001, Yiwei Ma, Jiayi Ji, Xiaoshuai Sun. 1985-1993 [doi]
- Learning to Manipulate Artistic ImagesWei Guo, Yuqi Zhang, De Ma, Qian Zheng. 1994-2002 [doi]
- PICNN: A Pathway towards Interpretable Convolutional Neural NetworksWengang Guo, Jiayi Yang, Huilin Yin, Qijun Chen, Wei Ye 0001. 2003-2012 [doi]
- GSN: Generalisable Segmentation in Neural Radiance FieldVinayak Gupta, Rahul Goel, Dhawal Sirikonda, P. J. Narayanan. 2013-2021 [doi]
- AMD: Autoregressive Motion DiffusionBo Han, Hao Peng, Minjing Dong, Yi Ren, Yixuan Shen, Chang Xu. 2022-2030 [doi]
- HuTuMotion: Human-Tuned Navigation of Latent Motion Diffusion Models with Minimal FeedbackGaoge Han, Shaoli Huang, Mingming Gong, Jinglei Tang. 2031-2039 [doi]
- MA-Net: Rethinking Neural Unit in the Light of AstrocytesMengqiao Han, Liyuan Pan, Xiabi Liu. 2040-2048 [doi]
- Dual-Perspective Knowledge Enrichment for Semi-supervised 3D Object DetectionYucheng Han, Na Zhao 0004, Weiling Chen, Keng Teck Ma, Hanwang Zhang. 2049-2057 [doi]
- Exploiting the Social-Like Prior in Transformer for Visual ReasoningYudong Han, Yupeng Hu, Xuemeng Song, Haoyu Tang, Mingzhu Xu, Liqiang Nie. 2058-2066 [doi]
- Improving Audio-Visual Segmentation with Bidirectional GenerationDawei Hao, Yuxin Mao, Bowen He, Xiaodong Han, Yuchao Dai, Yiran Zhong. 2067-2075 [doi]
- Hand-Centric Motion Refinement for 3D Hand-Object Interaction via Hierarchical Spatial-Temporal ModelingYuze Hao, Jianrong Zhang, Tao Zhuo, Fuan Wen, Hehe Fan. 2076-2084 [doi]
- Progressive Feature Self-Reinforcement for Weakly Supervised Semantic SegmentationJingxuan He, Lechao Cheng, Chaowei Fang, Zunlei Feng, Tingting Mu, Mingli Song. 2085-2093 [doi]
- Prompting Multi-Modal Image Segmentation with Semantic GroupingQibin He. 2094-2102 [doi]
- Low-Latency Space-Time Supersampling for Real-Time RenderingRuian He, Shili Zhou, Yuqi Sun, Ri Cheng, Weimin Tan, Bo Yan. 2103-2111 [doi]
- Collaborative Weakly Supervised Video Correlation Learning for Procedure-Aware Instructional Video AnalysisTianyao He, Huabin Liu 0001, Yuxi Li, Xiao Ma, Cheng Zhong, Yang Zhang, Weiyao Lin. 2112-2120 [doi]
- Frequency-Adaptive Pan-Sharpening with Mixture of ExpertsXuanhua He, Keyu Yan, Rui Li, Chengjun Xie, Jie Zhang 0033, Man Zhou. 2121-2129 [doi]
- Enhancing RAW-to-sRGB with Decoupled Style Structure in Fourier DomainXuanhua He, Tao Hu, Guoli Wang, Zejin Wang, Run Wang, Qian Zhang, Keyu Yan, Ziyi Chen, Rui Li, Chengjun Xie, Jie Zhang, Man Zhou. 2130-2138 [doi]
- A User-Friendly Framework for Generating Model-Preferred Prompts in Text-to-Image SynthesisNailei Hei, Qianyu Guo, Zihao Wang, Yan Wang, Haofen Wang, Wenqiang Zhang. 2139-2147 [doi]
- Optimize & Reduce: A Top-Down Approach for Image VectorizationOr Hirschorn, Amir Jevnisek, Shai Avidan. 2148-2156 [doi]
- MotionMix: Weakly-Supervised Diffusion for Controllable Motion GenerationNhat M. Hoang, Kehong Gong, Chuan Guo, Michael Bi Mi. 2157-2165 [doi]
- Commonsense for Zero-Shot Natural Language Video LocalizationMeghana Holla, Ismini Lourentzou. 2166-2174 [doi]
- Learning Subject-Aware Cropping by Outpainting Professional PhotosJames Hong, Lu Yuan, Michaël Gharbi, Matthew Fisher, Kayvon Fatahalian. 2175-2183 [doi]
- High-Fidelity Diffusion-Based Image EditingChen Hou, Guoqiang Wei, Zhibo Chen 0001. 2184-2192 [doi]
- Domain-Hallucinated Updating for Multi-Domain Face Anti-spoofingChengyang Hu, Ke-Yue Zhang, Taiping Yao, Shice Liu, Shouhong Ding, Xin Tan, Lizhuang Ma. 2193-2201 [doi]
- QI-IRA: Quantum-Inspired Interactive Ranking Aggregation for Person Re-identificationChunyu Hu, Hong Zhang, Chao Liang, Hao Huang. 2202-2210 [doi]
- SpaceGTN: A Time-Agnostic Graph Transformer Network for Handwritten Diagram Recognition and SegmentationHaoxiang Hu, Cangjun Gao, Yaokun Li, Xiaoming Deng 0001, Yu-Kun Lai, CuiXia Ma, Yong-Jin Liu, Hongan Wang. 2211-2219 [doi]
- Learning Explicit Contact for Implicit Reconstruction of Hand-Held Objects from Monocular ImagesJunxing Hu, Hongwen Zhang 0001, Zerui Chen, Mengcheng Li, Yunlong Wang 0003, Yebin Liu, Zhenan Sun. 2220-2228 [doi]
- DALDet: Depth-Aware Learning Based Object Detection for Autonomous DrivingKe Hu, Tongbo Cao, Yuan Li, Song Chen, Yi Kang. 2229-2237 [doi]
- COMMA: Co-articulated Multi-Modal LearningLianyu Hu 0003, Liqing Gao, Zekang Liu, Chi-Man Pun, Wei Feng 0005. 2238-2246 [doi]
- Latent Space Editing in Transformer-Based Flow MatchingVincent Tao Hu, Wei Zhang, Meng Tang, Pascal Mettes, Deli Zhao, Cees Snoek. 2247-2255 [doi]
- BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual QuestionsWenbo Hu, Yifan Xu, Yi Li, Weiyue Li, Zeyuan Chen, Zhuowen Tu. 2256-2264 [doi]
- A Dynamic Learning Method towards Realistic Compositional Zero-Shot LearningXiaoming Hu, Zilei Wang. 2265-2273 [doi]
- LF-ViT: Reducing Spatial Redundancy in Vision Transformer for Efficient Image RecognitionYoubing Hu, Yun Cheng, Anqi Lu, Zhiqiang Cao, DaWei Wei, Jie Liu 0001, Zhijun Li 0002. 2274-2284 [doi]
- O^2-Recon: Completing 3D Reconstruction of Occluded Objects in the Scene with a Pre-trained 2D Diffusion ModelYubin Hu 0001, Sheng Ye, Wang Zhao, Matthieu Lin, Yuze He, Yu-Hui Wen, Ying He, Yong-Jin Liu. 2285-2293 [doi]
- Arbitrary-Scale Video Super-resolution Guided by Dynamic ContextCong Huang, Jiahao Li, Lei Chu, Dong Liu, Yan Lu. 2294-2302 [doi]
- Dynamic Weighted Combiner for Mixed-Modal Image RetrievalFuxiang Huang, Lei Zhang, Xiaowei Fu, Suqi Song. 2303-2311 [doi]
- NeuSurf: On-Surface Priors for Neural Surface Reconstruction from Sparse Input ViewsHan Huang, Yulun Wu, Junsheng Zhou, Ge Gao, Ming Gu 0001, Yu-Shen Liu. 2312-2320 [doi]
- Seeing Dark Videos via Self-Learned Bottleneck Neural RepresentationHaofeng Huang, Wenhan Yang, Lingyu Duan, Jiaying Liu 0001. 2321-2329 [doi]
- Combinatorial CNN-Transformer Learning with Manifold Constraints for Semi-supervised Medical Image SegmentationHuimin Huang, Yawen Huang, Shiao Xie, Lanfen Lin, Ruofeng Tong 0001, Yen-Wei Chen 0001, Yuexiang Li, Yefeng Zheng 0001. 2330-2338 [doi]
- Sparse Bayesian Deep Learning for Cross Domain Medical Image ReconstructionJiaxin Huang 0006, Qi Wu, Yazhou Ren 0001, Fan Yang, Aodi Yang, Qianqian Yang, Xiaorong Pu. 2339-2347 [doi]
- UniCell: Universal Cell Nucleus Classification via Prompt LearningJunjia Huang, Haofeng Li, Xiang Wan, Guanbin Li. 2348-2356 [doi]
- SC-NeuS: Consistent Neural Surface Reconstruction from Sparse and Noisy ViewsShi-Sheng Huang, Zi-Xin Zou, Yichi Zhang, Yan-Pei Cao, Ying Shan. 2357-2365 [doi]
- MFTN: Multi-Level Feature Transfer Network Based on MRI-Transformer for MR Image Super-resolutionShuying Huang, Ge Chen, Yong Yang, Xiaozheng Wang, Chenbin Liang. 2366-2373 [doi]
- SDGAN: Disentangling Semantic Manipulation for Facial Attribute EditingWenmin Huang, Weiqi Luo 0001, Jiwu Huang, Xiaochun Cao. 2374-2381 [doi]
- Frozen CLIP Transformer Is an Efficient Point Cloud EncoderXiaoshui Huang, Zhou Huang, Sheng Li, Wentao Qu, Tong He 0004, Yuenan Hou, Yifan Zuo, Wanli Ouyang. 2382-2390 [doi]
- G2L-CariGAN: Caricature Generation from Global Structure to Local FeaturesXin Huang, Yunfeng Bai, Dong Liang, Feng Tian, Jinyuan Jia. 2391-2399 [doi]
- 3D Visibility-Aware Generalizable Neural Radiance Fields for Interacting HandsXuan Huang, Hanhui Li, Zejun Yang, Zhisheng Wang, Xiaodan Liang. 2400-2408 [doi]
- Sunshine to Rainstorm: Cross-Weather Knowledge Distillation for Robust 3D Object DetectionXun Huang, Hai Wu, Xin Li, Xiaoliang Fan, Chenglu Wen, Cheng Wang. 2409-2416 [doi]
- Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-Modal Structured RepresentationsYufeng Huang, Jiji Tang, Zhuo Chen, Rongsheng Zhang, Xinfeng Zhang, Weijie Chen, Zeng Zhao, Zhou Zhao, Tangjie Lv, Zhipeng Hu, Wen Zhang 0015. 2417-2425 [doi]
- Voxel or Pillar: Exploring Efficient Point Cloud Representation for 3D Object DetectionYuhao Huang, Sanping Zhou, Junjie Zhang, Jinpeng Dong, Nanning Zheng 0001. 2426-2435 [doi]
- COMBAT: Alternated Training for Effective Clean-Label Backdoor AttacksTran Huynh, Dang Nguyen, Tung Pham 0001, Anh Tran. 2436-2444 [doi]
- MagiCapture: High-Resolution Multi-Concept Portrait CustomizationJunha Hyung, Jaeyo Shin, Jaegul Choo. 2445-2453 [doi]
- Rethinking Peculiar Images by Diffusion Models: Revealing Local Minima's RoleJinhyeok Jang, Chan-Hyun Youn, Minsu Jeon, Changha Lee. 2454-2461 [doi]
- ProxyDet: Synthesizing Proxy Novel Classes via Classwise Mixup for Open-Vocabulary Object DetectionJoonhyun Jeong, Geondo Park, Jayeon Yoo, Hyungsik Jung, Heesu Kim. 2462-2470 [doi]
- A Diffusion Model with State Estimation for Degradation-Blind Inverse ImagingLiya Ji, Zhefan Rao, Sinno Jialin Pan, Chenyang Lei, Qifeng Chen. 2471-2479 [doi]
- SSMG: Spatial-Semantic Map Guided Diffusion Model for Free-Form Layout-to-Image GenerationChengyou Jia, Minnan Luo, Zhuohang Dang, Guang Dai, Xiaojun Chang, Mengmeng Wang, Jingdong Wang 0001. 2480-2488 [doi]
- TiMix: Text-Aware Image Mixing for Effective Vision-Language Pre-trainingChaoya Jiang, Wei Ye, Haiyang Xu, Qinghao Ye, Ming Yan, Ji Zhang 0011, Shikun Zhang. 2489-2497 [doi]
- Revealing the Proximate Long-Tail Distribution in Compositional Zero-Shot LearningChenyi Jiang, Haofeng Zhang. 2498-2506 [doi]
- MWSIS: Multimodal Weakly Supervised Instance Segmentation with 2D Box Annotations for Autonomous DrivingGuangfeng Jiang, Jun Liu, Yuzhi Wu, Wenlong Liao, Tao He, Pai Peng. 2507-2515 [doi]
- Transferable Video Moment Localization by Moment-Guided Query PromptingHao Jiang, Yang Yizhang, Yadong Mu. 2516-2524 [doi]
- In-Hand 3D Object Reconstruction from a Monocular RGB VideoShijian Jiang, Qi Ye, Rengan Xie, Yuchi Huo, Xiang Li, Yang Zhou, Jiming Chen. 2525-2533 [doi]
- AACP: Aesthetics Assessment of Children's Paintings Based on Self-Supervised LearningShiqi Jiang, Ning Li, Chen Shi, Liping Guo, Changbo Wang, Chenhui Li. 2534-2542 [doi]
- Exploring Self- and Cross-Triplet Correlations for Human-Object Interaction DetectionWeibo Jiang, Weihong Ren, Jiandong Tian, Liangqiong Qu, Zhiyong Wang, Honghai Liu 0001. 2543-2551 [doi]
- Comprehensive Visual Grounding for Video DescriptionWenhui Jiang, Yibo Cheng, Linxin Liu, Yuming Fang, Yuxin Peng, Yang Liu. 2552-2560 [doi]
- Far3D: Expanding the Horizon for Surround-View 3D Object DetectionXiaohui Jiang, Shuailin Li, Yingfei Liu, Shihao Wang, Fan Jia, Tiancai Wang, Lijin Han, Xiangyu Zhang. 2561-2569 [doi]
- Delving into Multimodal Prompting for Fine-Grained Visual ClassificationXin Jiang, Hao Tang 0007, Junyao Gao, Xiaoyu Du, Shengfeng He, Zechao Li. 2570-2578 [doi]
- MCA: Moment Channel Attention NetworksYangbo Jiang, Zhiwei Jiang, Le Han, Zenan Huang, Nenggan Zheng. 2579-2588 [doi]
- Towards Robust Image Stitching: An Adaptive Resistance Learning against Compatible AttacksZhiying Jiang, Xingyuan Li 0005, Jinyuan Liu 0001, Xin Fan 0001, Risheng Liu. 2589-2597 [doi]
- Instance-Aware Multi-Camera 3D Object Detection with Structural Priors Mining and Self-Boosting LearningYang Jiao, Zequn Jie, Shaoxiang Chen 0001, Lechao Cheng, Jingjing Chen, Lin Ma 0002, Yu-Gang Jiang. 2598-2606 [doi]
- PromptMRG: Diagnosis-Driven Prompts for Medical Report GenerationHaibo Jin, Haoxuan Che, Yi Lin, Hao Chen. 2607-2615 [doi]
- PCE-Palm: Palm Crease Energy Based Two-Stage Realistic Pseudo-Palmprint GenerationJianlong Jin, Lei Shen, Ruixin Zhang, Chenglong Zhao, Ge Jin, Jingyun Zhang, Shouhong Ding, Yang Zhao, Wei Jia. 2616-2624 [doi]
- SwiftPillars: High-Efficiency Pillar Encoder for Lidar-Based 3D DetectionXin Jin, Kai Liu, Cong Ma, Ruining Yang, Fei Hui, Wei Wu 0021. 2625-2633 [doi]
- DeS3: Adaptive Attention-Driven Self and Soft Shadow Removal Using ViT SimilarityYeying Jin, Wei Ye 0005, Wenhan Yang, Yuan Yuan, Robby T. Tan. 2634-2642 [doi]
- AMD: Anatomical Motion Diffusion with Interpretable Motion Decomposition and FusionBeibei Jing, Youjia Zhang, Zikai Song, Junqing Yu, Wei Yang 0034. 2643-2651 [doi]
- Retrieval-Augmented Primitive Representations for Compositional Zero-Shot LearningChenchen Jing, Yukun Li, Hao Chen, Chunhua Shen. 2652-2660 [doi]
- CrossBind: Collaborative Cross-Modal Identification of Protein Nucleic-Acid-Binding ResiduesLinglin Jing, Sheng Xu, Yifan Wang 0008, Yuzhe Zhou, Tao Shen, Zhigang Ji, Hui Fang 0003, Zhen Li, Siqi Sun. 2661-2669 [doi]
- X4D-SceneFormer: Enhanced Scene Understanding on 4D Point Cloud Videos through Cross-Modal Knowledge TransferLinglin Jing, Ying Xue, Xu Yan 0014, Chaoda Zheng, Dong Wang, Ruimao Zhang, Zhigang Wang, Hui Fang, Bin Zhao, Zhen Li. 2670-2678 [doi]
- VVS: Video-to-Video Retrieval with Irrelevant Frame SuppressionWon Jo, Geuntaek Lim, Gwangjin Lee, Hyunwoo Kim, ByungSoo Ko, Yukyung Choi. 2679-2687 [doi]
- Rethinking Robustness of Model AttributionsSandesh Kamath, Sankalp Mittal, Amit Deshpande 0001, Vineeth N. Balasubramanian. 2688-2696 [doi]
- Cross-Constrained Progressive Inference for 3D Hand Pose Estimation with Dynamic Observer-Decision-Adjuster NetworksZhehan Kan, Xueting Hu, Zihan Liao, Ke Yu, Zhihai He. 2697-2704 [doi]
- Catch-Up Mix: Catch-Up Class for Struggling Filters in CNNMinsoo Kang, Minkoo Kang 0001, Suhyun Kim. 2705-2713 [doi]
- VLCounter: Text-Aware Visual Representation for Zero-Shot Object CountingSeunggu Kang, WonJun Moon, Euiyeon Kim, Jae-Pil Heo. 2714-2722 [doi]
- StegFormer: Rebuilding the Glory of Autoencoder-Based SteganographyXiao Ke, Huanqi Wu, Wenzhong Guo. 2723-2731 [doi]
- Expediting Contrastive Language-Image Pretraining via Self-Distilled EncodersBumsoo Kim, Jinhyung Kim, Yeonsik Jo, Seung Hwan Kim. 2732-2740 [doi]
- Weakly Supervised Semantic Segmentation for Driving ScenesDongseob Kim, Seungho Lee, Junsuk Choe, Hyunjung Shim. 2741-2749 [doi]
- FPRF: Feed-Forward Photorealistic Style Transfer of Large-Scale 3D Neural Radiance FieldsGeonu Kim, Kim Youwang, Tae Hyun Oh. 2750-2758 [doi]
- Let There Be Sound: Reconstructing High Quality Speech from Silent VideosJi-Hoon Kim, Jaehun Kim, Joon Son Chung. 2759-2767 [doi]
- Expand-and-Quantize: Unsupervised Semantic Segmentation Using High-Dimensional Space and Product QuantizationJiyoung Kim, Kyuhong Shim, Insu Lee, Byonghyo Shim. 2768-2776 [doi]
- Sync-NeRF: Generalizing Dynamic NeRFs to Unsynchronized VideosSeoha Kim, Jeongmin Bae, Youngsik Yun, Hahyun Lee, Gun Bang, Youngjung Uh. 2777-2785 [doi]
- Improving Open Set Recognition via Visual Prompts Distilled from Common-Sense KnowledgeSeongyeop Kim, Hyung-il Kim, Yong Man Ro. 2786-2794 [doi]
- Gaussian Mixture Proposals with Pull-Push Learning Scheme to Capture Diverse Events for Weakly Supervised Temporal Video GroundingSunoh Kim, Jungchan Cho, Joonsang Yu, Youngjoon Yoo, Jin Young Choi 0002. 2795-2803 [doi]
- PARSAC: Accelerating Robust Multi-Model Fitting with Parallel Sample ConsensusFlorian Kluger, Bodo Rosenhahn. 2804-2812 [doi]
- Distribution Matching for Multi-Task Learning of Classification Tasks: A Large-Scale Study on Faces & BeyondDimitrios Kollias, Viktoriia Sharmanska, Stefanos Zafeiriou. 2813-2821 [doi]
- Block Image Compressive Sensing with Local and Global Information InteractionXiaoyu Kong, Yongyong Chen, Feng Zheng, Zhenyu He 0001. 2822-2830 [doi]
- QDETRv: Query-Guided DETR for One-Shot Object Localization in VideosYogesh Kumar, Saswat Mallick, Anand Mishra 0001, Sowmya Rasipuram, Anutosh Maitra, Roshni R. Ramnani. 2831-2839 [doi]
- LaViP: Language-Grounded Visual PromptingNilakshan Kunananthaseelan, Jing Zhang 0052, Mehrtash Harandi. 2840-2848 [doi]
- Towards More Faithful Natural Language Explanation Using Multi-Level Contrastive Learning in VQAChengen Lai, Shengli Song, Shiqi Meng, Jingyang Li, Sitong Yan, Guangneng Hu. 2849-2857 [doi]
- MatchDet: A Collaborative Framework for Image Matching and Object DetectionJinxiang Lai, Wenlong Wu, Bin-Bin Gao, Jun Liu, Jiawei Zhan, Congchong Nie, Yi Zeng, Chengjie Wang. 2858-2865 [doi]
- ViTree: Single-Path Neural Tree for Step-Wise Interpretable Fine-Grained Visual CategorizationDanning Lao, Qi Liu, Jiazi Bu, Junchi Yan, Wei Shen. 2866-2873 [doi]
- MaskDiff: Modeling Mask Distribution with Diffusion Probabilistic Model for Few-Shot Instance SegmentationMinh-Quan Le, Tam V. Nguyen 0002, Trung-Nghia Le, Thanh-Toan Do, Minh N. Do, Minh-Triet Tran. 2874-2881 [doi]
- FRED: Towards a Full Rotation-Equivariance in Aerial Image Object DetectionChanho Lee, Jinsu Son, Hyounguk Shon, Yunho Jeon, Junmo Kim. 2883-2891 [doi]
- Domain Generalization with Vital Phase AugmentationIngyun Lee, Wooju Lee, Hyun Myung. 2892-2900 [doi]
- Modeling Stereo-Confidence out of the End-to-End Stereo-Matching Network via Disparity Plane SweepJae Young Lee, Woonghyun Ka, Jaehyun Choi, Junmo Kim. 2901-2910 [doi]
- MFOS: Model-Free & One-Shot Object Pose EstimationJongmin Lee, Yohann Cabon, Romain Brégier, Sungjoo Yoo, Jérôme Revaud. 2911-2919 [doi]
- Noise-Free Optimization in Early Training Steps for Image Super-resolutionMinkyu Lee, Jae-Pil Heo. 2920-2928 [doi]
- Spectrum Translation for Refinement of Image Generation (STIG) Based on Contrastive Learning and Spectral Filter ProfileSeokjun Lee, Seung-Won Jung, Hyunseok Seo. 2929-2937 [doi]
- Few-Shot Neural Radiance Fields under Unconstrained IlluminationSeokYeong Lee, Junyong Choi, Seungryong Kim, Ig-Jae Kim, Junghyun Cho. 2938-2946 [doi]
- Object-Aware Domain Generalization for Object DetectionWooju Lee, Dasol Hong, Hyungtae Lim, Hyun Myung. 2947-2955 [doi]
- Attention Guided CAM: Visual Explanations of Vision Transformer Guided by Self-AttentionSaebom Leem, Hyunseok Seo. 2956-2964 [doi]
- Contrastive Tuning: A Little Help to Make Masked Autoencoders ForgetJohannes Lehner, Benedikt Alkin, Andreas Fürst, Elisabeth Rumetshofer, Lukas Miklautz, Sepp Hochreiter. 2965-2973 [doi]
- Few-Shot Learning from Augmented Label-Uncertain Queries in Bongard-HOIQinqian Lei, Bo Wang, Robby T. Tan. 2974-2982 [doi]
- Removing Interference and Recovering Content Imaginatively for Visible Watermark RemovalYicheng Leng, Chaowei Fang, Gen Li, Yixiang Fang, Guanbin Li. 2983-2990 [doi]
- Data Roaming and Quality Assessment for Composed Image RetrievalMatan Levy, Rami Ben-Ari, Nir Darshan, Dani Lischinski. 2991-2999 [doi]
- Point Transformer with Federated Learning for Predicting Breast Cancer HER2 Status from Hematoxylin and Eosin-Stained Whole Slide ImagesBao Li, Zhenyu Liu, Lizhi Shao, Bensheng Qiu, Hong Bu, Jie Tian. 3000-3008 [doi]
- Unsupervised Cross-Domain Image Retrieval via Prototypical Optimal TransportBin Li, Ye Shi, Qian Yu, Jingya Wang. 3009-3017 [doi]
- Semantic-Guided Generative Image Augmentation Method with Diffusion Models for Image ClassificationBohan Li, Xiao Xu 0005, Xinghao Wang, Yutai Hou, Yunlong Feng, Feng Wang, Xuanliang Zhang, Qingfu Zhu, Wanxiang Che. 3018-3027 [doi]
- One at a Time: Progressive Multi-Step Volumetric Probability Learning for Reliable 3D Scene PerceptionBohan Li, Yasheng Sun, Jingxin Dong, Zheng Zhu, Jinming Liu, Xin Jin, Wenjun Zeng. 3028-3036 [doi]
- AE-NeRF: Audio Enhanced Neural Radiance Field for Few Shot Talking Head SynthesisDongze Li, Kang Zhao, Wei Wang 0025, Bo Peng 0002, Yingya Zhang, Jing Dong, Tieniu Tan. 3037-3045 [doi]
- Monocular 3D Hand Mesh Recovery via Dual Noise EstimationHanhui Li, Xiaojian Lin, Xuan Huang, Zejun Yang, Zhisheng Wang, Xiaodan Liang. 3046-3054 [doi]
- Point2Real: Bridging the Gap between Point Cloud and Realistic Image for Open-World 3D RecognitionHanxuan Li, Bin Fu, Ruiping Wang, Xilin Chen. 3055-3063 [doi]
- Gradual Residuals Alignment: A Dual-Stream Framework for GAN Inversion and Image Attribute EditingHao Li, Mengqi Huang, Lei Zhang, Bo Hu, Yi Liu, Zhendong Mao. 3064-3072 [doi]
- Towards Automated Chinese Ancient Character Restoration: A Diffusion-Based Method with a New DatasetHaolong Li, Chenghao Du, Ziheng Jiang, Yifan Zhang, Jiawei Ma, Chen Ye 0002. 3073-3081 [doi]
- Learning Deformable Hypothesis Sampling for Accurate PatchMatch Multi-View StereoHongjie Li, Yao Guo, Xianwei Zheng, Hanjiang Xiong. 3082-3090 [doi]
- Catalyst for Clustering-Based Unsupervised Object Re-identification: Feature CalibrationHuafeng Li, Qingsong Hu, Zhanxuan Hu. 3091-3099 [doi]
- EAN: An Efficient Attention Module Guided by Normalization for Deep Neural NetworksJiafeng Li, Zelin Li, Ying Wen 0003. 3100-3108 [doi]
- Label-Efficient Few-Shot Semantic Segmentation with Unsupervised Meta-TrainingJianwu Li, Kaiyue Shi, Guo-Sen Xie, Xiaofeng Liu, Jian Zhang 0002, Tianfei Zhou. 3109-3117 [doi]
- FedDiv: Collaborative Noise Filtering for Federated Learning with Noisy LabelsJichang Li, Guanbin Li, Hui Cheng, Zicheng Liao, Yizhou Yu. 3118-3126 [doi]
- Fully Data-Driven Pseudo Label Estimation for Pointly-Supervised Panoptic SegmentationJing Li, Junsong Fan, Yuran Yang, Shuqi Mei, Jun Xiao, Zhaoxiang Zhang. 3127-3135 [doi]
- FAVOR: Full-Body AR-Driven Virtual Object Rearrangement Guided by Instruction TextKailin Li 0001, Lixin Yang 0002, Zenan Lin, Jian Xu, Xinyu Zhan 0001, Yifei Zhao, Pengxiang Zhu, Wenxiong Kang, Kejian Wu, Cewu Lu. 3136-3144 [doi]
- Panoptic Scene Graph Generation with Semantics-Prototype LearningLi Li 0091, Wei Ji 0008, Yiming Wu, Mengze Li 0001, You Qin, Lina Wei, Roger Zimmermann. 3145-3153 [doi]
- SpectralNeRF: Physically Based Spectral Rendering with Neural Radiance FieldRu Li, Jia Liu, Guanghui Liu 0001, Shengping Zhang, Bing Zeng, Shuaicheng Liu. 3154-3162 [doi]
- GridFormer: Point-Grid Transformer for Surface ReconstructionShengtao Li, Ge Gao, Yudong Liu, Yu-Shen Liu, Ming Gu 0001. 3163-3171 [doi]
- Adaptive Uncertainty-Based Learning for Text-Based Person RetrievalShenshen Li, Chen He, Xing Xu 0001, Fumin Shen, Yang Yang, Heng Tao Shen. 3172-3180 [doi]
- Learning Continuous Implicit Field with Local Distance Indicator for Arbitrary-Scale Point Cloud UpsamplingShujuan Li, Junsheng Zhou, Baorui Ma, Yu-Shen Liu, Zhizhong Han. 3181-3189 [doi]
- Long-Tailed Learning as Multi-Objective OptimizationWeiqi Li, Fan Lyu, Fanhua Shang, Liang Wan, Wei Feng. 3190-3198 [doi]
- Temporal-Distributed Backdoor Attack against Video Based Action RecognitionXi Li, Songhe Wang, Ruiquan Huang, Mahanth Gowda, George Kesidis. 3199-3207 [doi]
- DI-V2X: Learning Domain-Invariant Representation for Vehicle-Infrastructure Collaborative 3D Object DetectionXiang Li, Junbo Yin, Wei Li, Chengzhong Xu 0001, Ruigang Yang, Jianbing Shen. 3208-3215 [doi]
- Multi-Modality Affinity Inference for Weakly Supervised 3D Semantic SegmentationXiawei Li, Qingyuan Xu, Jing Zhang, Tianyi Zhang, Qian Yu, Lu Sheng, Dong Xu 0001. 3216-3224 [doi]
- IINet: Implicit Intra-inter Information Fusion for Real-Time Stereo MatchingXimeng Li 0007, Chen Zhang, Wanjuan Su, Wenbing Tao. 3225-3233 [doi]
- Causal Representation Learning via Counterfactual InterventionXiutian Li, Siqi Sun, Rui Feng. 3234-3242 [doi]
- Bi-ViT: Pushing the Limit of Vision Transformer QuantizationYanjing Li, Sheng Xu, Mingbao Lin, Xianbin Cao 0001, Chuanjian Liu, Xiao Sun, Baochang Zhang 0001. 3243-3251 [doi]
- Harnessing Edge Information for Improved Robustness in Vision TransformersYanxi Li 0001, Chengbin Du, Chang Xu 0001. 3252-3260 [doi]
- Multi-Region Text-Driven Manipulation of Diffusion ImageryYiming Li, Peng Zhou, Jun Sun, Yi Xu. 3261-3269 [doi]
- Direct May Not Be the Best: An Incremental Evolution View of Pose GenerationYuelong Li, Tengfei Xiao, Lei Geng, Jianming Wang. 3270-3278 [doi]
- FocalDreamer: Text-Driven 3D Editing via Focal-Fusion AssemblyYuhan Li, Yishun Dou, Yue Shi, Yu Lei, Xuanhong Chen, Yi Zhang, Peng Zhou, Bingbing Ni. 3279-3287 [doi]
- SAVSR: Arbitrary-Scale Video Super-Resolution via a Learned Scale-Adaptive NetworkZekun Li 0001, Hongying Liu, Fanhua Shang, Yuanyuan Liu 0001, Liang Wan, Wei Feng. 3288-3296 [doi]
- Sampling-Resilient Multi-Object TrackingZepeng Li, Dongxiang Zhang, Sai Wu, Mingli Song, Gang Chen. 3297-3305 [doi]
- Object-Aware Adaptive-Positivity Learning for Audio-Visual Question AnsweringZhangbin Li, Dan Guo, Jinxing Zhou, Jing Zhang, Meng Wang 0001. 3306-3314 [doi]
- Hypercorrelation Evolution for Video Class-Incremental LearningSen Liang, Kai Zhu 0004, Wei Zhai, Zhiheng Liu, Yang Cao 0010. 3315-3323 [doi]
- CoSTA: End-to-End Comprehensive Space-Time Entanglement for Spatio-Temporal Video GroundingYaoyuan Liang, Xiao Liang, Yansong Tang, Zhao Yang, Ziran Li, Jingang Wang, Wenbo Ding, Shao-Lun Huang. 3324-3332 [doi]
- Any-Stereo: Arbitrary Scale Disparity Estimation for Iterative Stereo MatchingZhaohuai Liang, Changhe Li. 3333-3341 [doi]
- Impartial Adversarial Distillation: Addressing Biased Data-Free Knowledge Distillation via Adaptive Constrained OptimizationDongping Liao, Xitong Gao, Chengzhong Xu 0001. 3342-3350 [doi]
- VLM2Scene: Self-Supervised Image-Text-LiDAR Learning with Foundation Models for Autonomous Driving Scene UnderstandingGuibiao Liao, Jiankun Li, Xiaoqing Ye. 3351-3359 [doi]
- Text-to-Image Generation for Abstract ConceptsJiayi Liao, Xu Chen, Qiang Fu, Lun Du, Xiangnan He 0001, Xiang Wang, Shi Han, Dongmei Zhang 0001. 3360-3368 [doi]
- VSFormer: Visual-Spatial Fusion Transformer for Correspondence PruningTangfei Liao, Xiaoqin Zhang, Li Zhao, Tao Wang, Guobao Xiao. 3369-3377 [doi]
- NightRain: Nighttime Video Deraining via Adaptive-Rain-Removal and Adaptive-CorrectionBeibei Lin, Yeying Jin, Wending Yan, Wei Ye, Yuan Yuan, Shunli Zhang, Robby T. Tan. 3378-3385 [doi]
- Unsupervised Pan-Sharpening via Mutually Guided Detail RestorationHuangxing Lin, Yuhang Dong, Xinghao Ding, Tianpeng Liu, Yongxiang Liu. 3386-3394 [doi]
- Gramformer: Learning Crowd Counting via Graph-Modulated TransformerHui Lin, Zhiheng Ma, Xiaopeng Hong, Qinnan Shangguan, Deyu Meng. 3395-3403 [doi]
- Weakly Supervised Open-Vocabulary Object DetectionJianghang Lin, Yunhang Shen, Bingquan Wang, Shaohui Lin, Ke Li, Liujuan Cao. 3404-3412 [doi]
- Spot the Error: Non-autoregressive Graphic Layout Generation with Wireframe LocatorJieru Lin, Danqing Huang, Tiejun Zhao, Dechen Zhan, Chin-Yew Lin. 3413-3421 [doi]
- M2SD: Multiple Mixing Self-Distillation for Few-Shot Class-Incremental LearningJinhao Lin, Ziheng Wu, Weifeng Lin, Jun Huang 0004, Ronghua Luo. 3422-3431 [doi]
- EDA: Evolving and Distinct Anchors for Multimodal Motion PredictionLongzhong Lin, Xuewu Lin, Tianwei Lin, Lichao Huang, Rong Xiong, Yue Wang 0020. 3432-3440 [doi]
- PTUS: Photo-Realistic Talking Upper-Body Synthesis via 3D-Aware Motion Decomposition WarpingLuoyang Lin, Zutao Jiang, Xiaodan Liang, Liqian Ma, Michael C. Kampffmeyer, Xiaochun Cao. 3441-3449 [doi]
- Exploring Temporal Feature Correlation for Efficient and Stable Video Semantic SegmentationMatthieu Lin, Jenny Sheng, Yubin Hu 0001, Yangguang Li, Lu Qi, Andrew Zhao, Gao Huang, Yong-Jin Liu. 3450-3458 [doi]
- Boosting Adversarial Transferability across Model Genus by Deformation-Constrained WarpingQinliang Lin, Cheng Luo, Zenghao Niu, Xilin He, Weicheng Xie 0001, Yuanbo Hou, LinLin Shen, Siyang Song. 3459-3467 [doi]
- A Fixed-Point Approach to Unified Prompt-Based CountingWei Lin 0018, Antoni B. Chan. 3468-3476 [doi]
- Boosting Multiple Instance Learning Models for Whole Slide Image Classification: A Model-Agnostic Framework Based on Counterfactual InferenceWeiping Lin, Zhenfeng Zhuang, Lequan Yu, Liansheng Wang. 3477-3485 [doi]
- Relightable and Animatable Neural Avatars from VideosWenbin Lin, Chengwei Zheng, Jun-Hai Yong, Feng Xu 0005. 3486-3494 [doi]
- TD²-Net: Toward Denoising and Debiasing for Video Scene Graph GenerationXin Lin, Chong Shi, Yibing Zhan, Zuopeng Yang, Yaqi Wu, Dacheng Tao. 3495-3503 [doi]
- Ced-NeRF: A Compact and Efficient Method for Dynamic Neural Radiance FieldsYoutian Lin. 3504-3512 [doi]
- TagCLIP: A Local-to-Global Framework to Enhance Open-Vocabulary Multi-Label Classification of CLIP without TrainingYuqi Lin, Minghao Chen, Kaipeng Zhang, Hengjia Li, Mingming Li, Zheng Yang, Dongqin Lv, Binbin Lin, Haifeng Liu, Deng Cai 0001. 3513-3521 [doi]
- Independency Adversarial Learning for Cross-Modal Sound SeparationZhenkai Lin, Yanli Ji, Yang Yang 0002. 3522-3530 [doi]
- BEV-MAE: Bird's Eye View Masked Autoencoders for Point Cloud Pre-training in Autonomous Driving ScenariosZhiwei Lin, Yongtao Wang, Shengxiang Qi, Nan Dong, Ming-Hsuan Yang 0001. 3531-3539 [doi]
- Focus Stacking with High Fidelity and Superior Visual EffectsBo Liu, Bin Hu, Xiuli Bi, Weisheng Li, Bin Xiao. 3540-3547 [doi]
- DeepBranchTracer: A Generally-Applicable Approach to Curvilinear Structure Reconstruction Using Multi-Feature LearningChao Liu, Ting Zhao, Nenggan Zheng. 3548-3557 [doi]
- Decoupling Degradations with Recurrent Network for Video Restoration in Under-Display CameraChengxu Liu, Xuan Wang, Yuanting Fan, Shuai Li, Xueming Qian. 3558-3566 [doi]
- Unsupervised Domain Adaptative Temporal Sentence Localization with Mutual Information MaximizationDaizong Liu, Xiang Fang, Xiaoye Qu, Jianfeng Dong, He Yan, Yang Yang 0002, Pan Zhou, Yu Cheng. 3567-3575 [doi]
- Explicitly Perceiving and Preserving the Local Geometric Structures for 3D Point Cloud AttackDaizong Liu, Wei Hu 0003. 3576-3584 [doi]
- Adv-Diffusion: Imperceptible Adversarial Face Identity Attack via Latent Diffusion ModelDecheng Liu, Xijun Wang, Chunlei Peng, Nannan Wang 0001, Ruimin Hu, Xinbo Gao 0001. 3585-3593 [doi]
- Multi-View Dynamic Reflection Prior for Video Glass Surface DetectionFang Liu, Yuhao Liu, Jiaying Lin, Ke Xu 0010, Rynson W. H. Lau. 3594-3602 [doi]
- Grab What You Need: Rethinking Complex Table Structure Recognition with Flexible Components DeliberationHao Liu, Xin Li, Mingming Gong, Bing Liu, Yunfei Wu, Deqiang Jiang, Yinsong Liu, Xing Sun. 3603-3611 [doi]
- DiDA: Disambiguated Domain Alignment for Cross-Domain Retrieval with Partial LabelsHaoran Liu, Ying Ma, Ming Yan, Yingke Chen, Dezhong Peng, Xu Wang. 3612-3620 [doi]
- Test-Time Personalization with Meta Prompt for Gaze EstimationHuan Liu, Julia Qi, Zhenhao Li, Mohammad Hassanpour, Yang Wang, Konstantinos N. Plataniotis, Yuanhao Yu. 3621-3629 [doi]
- M3SOT: Multi-Frame, Multi-Field, Multi-Space 3D Single Object TrackingJiaming Liu, Yue Wu 0004, Maoguo Gong, Qiguang Miao, Wenping Ma 0001, Cai Xu, Can Qin. 3630-3638 [doi]
- Unsupervised Continual Anomaly Detection with Contrastively-Learned PromptJiaqi Liu 0004, Kai Wu, Qiang Nie, Ying Chen, Bin-Bin Gao, Yong Liu, Jinbao Wang, Chengjie Wang, Feng Zheng. 3639-3647 [doi]
- Region-Aware Exposure Consistency Network for Mixed Exposure CorrectionJin Liu, Huiyuan Fu, Chuanming Wang, Huadong Ma. 3648-3656 [doi]
- R3CD: Scene Graph to Image Generation with Relation-Aware Compositional Contrastive Control DiffusionJinxiu Liu, Qi Liu. 3657-3665 [doi]
- DifAttack: Query-Efficient Black-Box Adversarial Attack via Disentangled Feature SpaceJun Liu 0071, Jiantao Zhou 0001, Jiandian Zeng, Jinyu Tian. 3666-3674 [doi]
- Frequency Shuffling and Enhancement for Open Set RecognitionLijun Liu, Rui Wang, Yuan Wang, Lihua Jing, Chuan Wang. 3675-3683 [doi]
- KPA-Tracker: Towards Robust and Real-Time Category-Level Articulated Object 6D Pose TrackingLiu Liu, Anran Huang, Qi Wu, Dan Guo, Xun Yang, Meng Wang. 3684-3692 [doi]
- UVAGaze: Unsupervised 1-to-2 Views Adaptation for Gaze EstimationRuicong Liu, Feng Lu. 3693-3701 [doi]
- Compact HD Map Construction via Douglas-Peucker Point TransformerRuixin Liu, Zejian Yuan. 3702-3710 [doi]
- Primitive-Based 3D Human-Object Interaction Modelling and ProgrammingSiqi Liu, Yong-Lu Li 0001, Zhou Fang, Xinpeng Liu, Yang You, Cewu Lu. 3711-3719 [doi]
- Fast Inter-frame Motion Prediction for Compressed Dynamic Point Cloud Attribute EnhancementWang Liu, Wei Gao 0003, Xingming Mu. 3720-3728 [doi]
- RWMS: Reliable Weighted Multi-Phase for Semi-supervised SegmentationWensi Liu, Xiao-Yu Tang, Chong Yang, Chunjie Yang. 3729-3737 [doi]
- Learning Real-World Image De-weathering with Imperfect SupervisionXiaohui Liu, Zhilu Zhang, Xiaohe Wu, Chaoyu Feng, Xiaotao Wang, Lei Lei, Wangmeng Zuo. 3738-3746 [doi]
- Differentiable Auxiliary Learning for Sketch Re-IdentificationXingyu Liu, Xu Cheng, Haoyu Chen, Hao Yu, Guoying Zhao 0001. 3747-3755 [doi]
- Keypoint Fusion for RGB-D Based 3D Hand Pose EstimationXingyu Liu, Pengfei Ren, Yuanyuan Gao, Jingyu Wang, Haifeng Sun, Qi Qi, Zirui Zhuang, Jianxin Liao. 3756-3764 [doi]
- CAVEN: An Embodied Conversational Agent for Efficient Audio-Visual Navigation in Noisy EnvironmentsXiulong Liu, Sudipta Paul 0007, Moitreya Chatterjee, Anoop Cherian. 3765-3773 [doi]
- DeepCalliFont: Few-Shot Chinese Calligraphy Font Synthesis by Integrating Dual-Modality Generative ModelsYitian Liu, Zhouhui Lian. 3774-3782 [doi]
- Stable Unlearnable Example: Enhancing the Robustness of Unlearnable Examples via Stable Error-Minimizing NoiseYixin Liu, Kaidi Xu, Xun Chen, Lichao Sun 0001. 3783-3791 [doi]
- Scaling and Masking: A New Paradigm of Data Sampling for Image and Video Quality AssessmentYongxu Liu 0001, Yinghui Quan, Guoyao Xiao, Aobo Li, Jinjian Wu. 3792-3801 [doi]
- Implicit Modeling of Non-rigid Objects with Cross-Category SignalsYuchun Liu, Benjamin Planche, Meng Zheng, Zhongpai Gao, Pierre Sibut-Bourde, Fan Yang, Terrence Chen, Ziyan Wu. 3802-3809 [doi]
- Recasting Regional Lighting for Shadow RemovalYuhao Liu, Zhanghan Ke, Ke Xu, Fang Liu, Zhenwei Wang, Rynson W. H. Lau. 3810-3818 [doi]
- Rolling-Unet: Revitalizing MLP's Ability to Efficiently Extract Long-Distance Dependencies for Medical Image SegmentationYutong Liu, Haijiang Zhu, Mengting Liu, Huaiyuan Yu, Zihan Chen, Jie Gao. 3819-3827 [doi]
- Advancing Video Synchronization with Fractional Frame Analysis: Introducing a Novel Dataset and ModelYuxuan Liu, Haizhou Ai, Junliang Xing, Xuri Li, Xiaoyi Wang, Pin Tao. 3828-3836 [doi]
- FedCD: Federated Semi-Supervised Learning with Class Awareness Balance via Dual TeachersYuzhi Liu, Huisi Wu, Jing Qin. 3837-3845 [doi]
- BLADE: Box-Level Supervised Amodal Segmentation through Directed ExpansionZhaochen Liu, Zhixuan Li, Tingting Jiang 0001. 3846-3854 [doi]
- Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment RetrievalZhihang Liu, Jun Li, Hongtao Xie, Pandeng Li, Jiannan Ge, Sun'ao Liu, Guoqing Jin. 3855-3863 [doi]
- Improving Cross-Modal Alignment with Synthetic Pairs for Text-Only Image CaptioningZhiyue Liu, Jinyuan Liu, Fanrong Ma. 3864-3872 [doi]
- Cell Graph Transformer for Nuclei ClassificationWei Lou, Guanbin Li, Xiang Wan, Haofeng Li. 3873-3881 [doi]
- Detect Any Keypoints: An Efficient Light-Weight Few-Shot Keypoint DetectorChangsheng Lu, Piotr Koniusz. 3882-3890 [doi]
- TCNet: Continuous Sign Language Recognition from Trajectories and Correlated RegionsHui Lu, Albert Ali Salah, Ronald Poppe. 3891-3899 [doi]
- MLNet: Mutual Learning Network with Neighborhood Invariance for Universal Domain AdaptationYanzuo Lu, Meng Shen, Andy J. Ma, Xiaohua Xie, Jian-Huang Lai. 3900-3908 [doi]
- Set Prediction Guided by Semantic Concepts for Diverse Video CaptioningYifan Lu, Ziqi Zhang, Chunfeng Yuan, Peng Li, Yan Wang, Bing Li, Weiming Hu. 3909-3917 [doi]
- Entropy Induced Pruning Framework for Convolutional Neural NetworksYiheng Lu, Ziyu Guan, Yaming Yang 0002, Wei Zhao, Maoguo Gong, Cai Xu. 3918-3926 [doi]
- Pano-NeRF: Synthesizing High Dynamic Range Novel Views with Geometry from Sparse Low Dynamic Range Panoramic ImagesZhan Lu, Qian Zheng, Boxin Shi, Xudong Jiang 0001. 3927-3935 [doi]
- ScanERU: Interactive 3D Visual Grounding Based on Embodied Reference UnderstandingZiyang Lu, Yunqiang Pei, Guoqing Wang, Peiwei Li, Yang Yang, Yinjie Lei, Heng Tao Shen. 3936-3944 [doi]
- MGNet: Learning Correspondences via Multiple GraphsLuanyuan Dai, Xiaoyu Du, Hanwang Zhang, Jinhui Tang 0001. 3945-3953 [doi]
- SCP: Spherical-Coordinate-Based Learned Point Cloud CompressionAo Luo, Linxin Song, Keisuke Nonaka, Kyohei Unno, Heming Sun, Masayuki Goto, Jiro Katto. 3954-3962 [doi]
- DLCA-Recon: Dynamic Loose Clothing Avatar Reconstruction from Monocular VideosChunjie Luo, Fei Luo 0004, Yusen Wang, Enxu Zhao, Chunxia Xiao. 3963-3971 [doi]
- Dual-Window Multiscale Transformer for Hyperspectral Snapshot Compressive ImagingFulin Luo, Xi Chen, Xiuwen Gong, Weiwen Wu, Tan Guo. 3972-3980 [doi]
- Electron Microscopy Images as Set of Fragments for Mitochondrial SegmentationNaisong Luo, Rui Sun, Yuwen Pan, Tianzhu Zhang, Feng Wu. 3981-3989 [doi]
- DiffusionTrack: Diffusion Model for Multi-Object TrackingRun Luo, Zikai Song, Lintao Ma, Jinlin Wei, Wei Yang 0034, Min Yang. 3991-3999 [doi]
- Devignet: High-Resolution Vignetting Removal via a Dual Aggregated Fusion Transformer with Adaptive Channel ExpansionShenghong Luo, Xuhang Chen 0002, Weiwen Chen, Zinuo Li, Shuqiang Wang, Chi-Man Pun. 4000-4008 [doi]
- AdaFormer: Efficient Transformer with Adaptive Token Sparsification for Image Super-resolutionXiaotong Luo, Zekun Ai, Qiuyuan Liang, Ding Liu, Yuan Xie 0006, Yanyun Qu, Yun Fu 0001. 4009-4016 [doi]
- SkipDiff: Adaptive Skip Diffusion Model for High-Fidelity Perceptual Image Super-resolutionXiaotong Luo, Yuan Xie 0006, Yanyun Qu, Yun Fu 0001. 4017-4025 [doi]
- Modeling Continuous Motion for 3D Point Cloud Object TrackingZhipeng Luo, Gongjie Zhang, Changqing Zhou, Zhonghua Wu, Qingyi Tao, Lewei Lu, Shijian Lu. 4026-4034 [doi]
- SGFormer: Semantic Graph Transformer for Point Cloud-Based 3D Scene Graph GenerationChangsheng Lv, Mengshi Qi, Xia Li, Zhengyuan Yang, Huadong Ma. 4035-4043 [doi]
- Privileged Prior Information Distillation for Image MattingCheng Lyu, Jiake Xie, Bo Xu, Cheng Lu 0006, Han Huang 0005, Xin Huang, Ming Wu 0001, Chuang Zhang, Yong Tang. 4044-4052 [doi]
- FedST: Federated Style Transfer Learning for Non-IID Image SegmentationBoyuan Ma, Xiang Yin, Jing Tan, Yongfeng Chen, Haiyou Huang, Hao Wang, Weihua Xue, Xiaojuan Ban. 4053-4061 [doi]
- SlowTrack: Increasing the Latency of Camera-Based Perception in Autonomous Driving Using Adversarial ExamplesChen Ma, Ningfei Wang, Qi Alfred Chen, Chao Shen 0001. 4062-4070 [doi]
- Uncertainty-Aware GAN for Single Image Super ResolutionChenxi Ma. 4071-4079 [doi]
- Stitching Segments and Sentences towards Generalization in Video-Text Pre-trainingFan Ma, Xiaojie Jin, Heng Wang, Jingjia Huang, Linchao Zhu, Yi Yang 0001. 4080-4088 [doi]
- Image Captioning with Multi-Context Synthetic DataFeipeng Ma, Yizhou Zhou, Fengyun Rao, Yueyi Zhang, Xiaoyan Sun 0001. 4089-4097 [doi]
- Directed Diffusion: Direct Control of Object Placement through Attention GuidanceWan-Duo Kurt Ma, Avisek Lahiri, John P. Lewis, Thomas Leung, W. Bastiaan Kleijn. 4098-4106 [doi]
- Unifying Visual and Vision-Language Tracking via Contrastive LearningYinchao Ma, Yuyang Tang 0001, Wenfei Yang, Tianzhu Zhang, Jinpeng Zhang, Mengxue Kang. 4107-4116 [doi]
- Follow Your Pose: Pose-Guided Text-to-Video Generation Using Pose-Free VideosYue Ma, Yingqing He, Xiaodong Cun, Xintao Wang, Siran Chen, Xiu Li 0001, Qifeng Chen. 4117-4125 [doi]
- Let All Be Whitened: Multi-Teacher Distillation for Efficient Visual RetrievalZhe Ma, Jianfeng Dong, Shouling Ji, Zhenguang Liu, Xuhong Zhang 0002, Zonghui Wang, Sifeng He, Feng Qian, Xiaobo Zhang, Lei Yang. 4126-4135 [doi]
- Cross-Layer and Cross-Sample Feature Optimization Network for Few-Shot Fine-Grained Image ClassificationZhen-Xiang Ma, Zhen-Duo Chen, Li-jun Zhao, Zi-Chao Zhang 0002, Xin Luo 0006, Xin-Shun Xu. 4136-4144 [doi]
- LMD: Faster Image Reconstruction with Latent Masking DiffusionZhiyuan Ma, Zhihuan Yu, Jianjun Li, Bowen Zhou. 4145-4153 [doi]
- AdapEdit: Spatio-Temporal Guided Adaptive Editing Algorithm for Text-Based Continuity-Sensitive Image EditingZhiyuan Ma, Guoli Jia, Bowen Zhou. 4154-4161 [doi]
- Pay Attention to Target: Relation-Aware Temporal Consistency for Domain Adaptive Video Semantic SegmentationHuayu Mai, Rui Sun 0006, Yuan Wang, Tianzhu Zhang, Feng Wu 0001. 4162-4170 [doi]
- Improving Automatic VQA Evaluation Using Large Language ModelsOscar Mañas, Benno Krojer, Aishwarya Agrawal. 4171-4179 [doi]
- Inconsistency-Based Data-Centric Active Open-Set AnnotationRuiyu Mao, Ouyang Xu, Yunhui Guo. 4180-4188 [doi]
- Progressive High-Frequency Reconstruction for Pan-Sharpening with Implicit Neural RepresentationGe Meng, Jingjia Huang, Yingying Wang, Zhenqi Fu, Xinghao Ding, Yue Huang 0001. 4189-4197 [doi]
- NaMa: Neighbor-Aware Multi-Modal Adaptive Learning for Prostate Tumor Segmentation on Anisotropic MR ImagesRunqi Meng, Xiao Zhang, Shijie Huang, Yuning Gu, Guiqin Liu, Guangyu Wu, Nizhuan Wang 0002, Kaicong Sun, Dinggang Shen. 4198-4206 [doi]
- ConVQG: Contrastive Visual Question Generation with Multimodal GuidanceLi Mi, Syrielle Montariol, Javiera Castillo-Navarro, Xianjie Dai, Antoine Bosselut, Devis Tuia. 4207-4215 [doi]
- Out-of-Distribution Detection in Long-Tailed Recognition with Calibrated Outlier Class LearningWenjun Miao, Guansong Pang, Xiao Bai 0001, Tianqi Li, Jin Zheng. 4216-4224 [doi]
- BCLNet: Bilateral Consensus Learning for Two-View Correspondence PruningXiangyang Miao, Guobao Xiao, Shiping Wang, Jun Yu. 4225-4232 [doi]
- Understanding the Role of the Projector in Knowledge DistillationRoy Miles, Krystian Mikolajczyk. 4233-4241 [doi]
- Robust Blind Text Image Deblurring via Maximum Consensus FrameworkZijian Min, Gundu Mohamed Hassan, Geun Sik Jo. 4242-4250 [doi]
- Knowledge Guided Semi-supervised Learning for Quality Assessment of User Generated VideosShankhanil Mitra, Rajiv Soundararajan. 4251-4260 [doi]
- Bridging the Gap between 2D and 3D Visual Question Answering: A Fusion Approach for 3D VQAWentao Mo, Yang Liu. 4261-4268 [doi]
- Augmented Commonsense Knowledge for Remote Object GroundingBahram Mohammadi, Yicong Hong, Yuankai Qi, Qi Wu, Shirui Pan, Javen Qinfeng Shi. 4269-4277 [doi]
- Recurrent Partial Kernel Network for Efficient Optical Flow EstimationHenrique Morimitsu, Xiaobin Zhu 0001, Xiangyang Ji, Xu-Cheng Yin. 4278-4286 [doi]
- TETRIS: Towards Exploring the Robustness of Interactive SegmentationAndrey Moskalenko, Vlad Shakhuro, Anna Vorontsova, Anton Konushin 0001, Anton Antonov, Alexander Krapukhin, Denis Shepelev, Konstantin Soshin. 4287-4295 [doi]
- T2I-Adapter: Learning Adapters to Dig Out More Controllable Ability for Text-to-Image Diffusion ModelsChong Mou, Xintao Wang, Liangbin Xie, Yanze Wu, Jian Zhang 0018, Zhongang Qi, Ying Shan. 4296-4304 [doi]
- Semi-supervised Open-World Object DetectionSahal Shaji Mullappilly, Abhishek Singh Gehlot, Rao Muhammad Anwer, Fahad Shahbaz Khan, Hisham Cholakkal. 4305-4314 [doi]
- Adversarial Attacks on the Interpretation of Neuron Activation MaximizationGéraldin Nanfack, Alexander Fulleringer, Jonathan Marty, Michael Eickenberg, Eugene Belilovsky. 4315-4324 [doi]
- ColNeRF: Collaboration for Generalizable Sparse Input Neural Radiance FieldZhangkai Ni, Peiqi Yang, Wenhan Yang, Hanli Wang, Lin Ma 0002, Sam Kwong. 4325-4333 [doi]
- Wavelet-Driven Spatiotemporal Predictive Learning: Bridging Frequency and Time VariationsXuesong Nie, Yunfeng Yan, Siyuan Li, Cheng Tan 0012, Xi Chen, Haoyuan Jin, Zhihang Zhu, Stan Z. Li, Donglian Qi. 4334-4342 [doi]
- Painterly Image Harmonization by Learning from Painterly ObjectsLi Niu, Junyan Cao, Yan Hong, Liqing Zhang. 4343-4351 [doi]
- Progressive Painterly Image Harmonization from Low-Level Styles to High-Level StylesLi Niu, Yan Hong, Junyan Cao, Liqing Zhang. 4352-4360 [doi]
- Domain Generalizable Person Search Using Unreal DatasetMinyoung Oh, Duhyun Kim, Jae-Young Sim. 4361-4368 [doi]
- OctOcc: High-Resolution 3D Occupancy Prediction with OctreeWenzhe Ouyang, Xiaolin Song, Bailan Feng, Zenglin Xu. 4369-4377 [doi]
- NeSyFOLD: A Framework for Interpretable Image ClassificationParth Padalkar, Huaduo Wang, Gopal Gupta 0001. 4378-4387 [doi]
- Semi-Supervised Blind Image Quality Assessment through Knowledge Distillation and Incremental LearningWensheng Pan, Timin Gao, Yan Zhang, Xiawu Zheng, Yunhang Shen, Ke Li, Runze Hu, Yutao Liu, Pingyang Dai. 4388-4396 [doi]
- Less Is More: Label Recommendation for Weakly Supervised Point Cloud Semantic SegmentationZhiyi Pan, Nan Zhang, Wei Gao 0003, Shan Liu 0001, Ge Li 0002. 4397-4405 [doi]
- patchDPCC: A Patchwise Deep Compression Framework for Dynamic Point CloudsZirui Pan, Mengbai Xiao, Xu Han, Dongxiao Yu, Guanghui Zhang, Yao Liu 0001. 4406-4414 [doi]
- LISR: Learning Linear 3D Implicit Surface Representation Using Compactly Supported Radial Basis FunctionsAtharva Pandey, Vishal Yadav, Rajendra Nagar, Santanu Chaudhury. 4415-4423 [doi]
- RadarMOSEVE: A Spatial-Temporal Transformer Network for Radar-Only Moving Object Segmentation and Ego-Velocity EstimationChangsong Pang, Xieyuanli Chen, Yimin Liu, Huimin Lu 0002, Yuwei Cheng. 4424-4432 [doi]
- NeBLa: Neural Beer-Lambert for 3D Reconstruction of Oral Structures from Panoramic RadiographsSihwa Park, Seongjun Kim, Doeyoung Kwon, Yohan Jang, In-Seok Song, Seung Jun Baek. 4433-4441 [doi]
- Task-Disruptive Background Suppression for Few-Shot SegmentationSuho Park, Su Been Lee, Sangeek Hyun, Hyun Seok Seong, Jae-Pil Heo. 4442-4449 [doi]
- SA²VP: Spatially Aligned-and-Adapted Visual PromptWenjie Pei, Tongqi Xia, Fanglin Chen 0001, Jinsong Li, Jiandong Tian, Guangming Lu. 4450-4458 [doi]
- ConditionVideo: Training-Free Condition-Guided Video GenerationBo Peng, Xinyuan Chen, Yaohui Wang 0004, Chaochao Lu, Yu Qiao. 4459-4467 [doi]
- ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM PretrainingDezhi Peng, Chongyu Liu, Yuliang Liu, Lianwen Jin. 4468-4477 [doi]
- FRIH: Fine-Grained Region-Aware Image HarmonizationJinlong Peng, Zekun Luo, Liang Liu, Boshen Zhang. 4478-4486 [doi]
- Navigating Open Set Scenarios for Skeleton-Based Action RecognitionKunyu Peng, Cheng Yin, Junwei Zheng, Ruiping Liu, David Schneider, Jiaming Zhang 0001, Kailun Yang 0001, M. Saquib Sarfraz, Rainer Stiefelhagen, Alina Roitberg. 4487-4496 [doi]
- LaneGraph2Seq: Lane Topology Extraction with Language Model via Vertex-Edge Encoding and Connectivity EnhancementRenyuan Peng, Xinyue Cai, Hang Xu, Jiachen Lu, Feng Wen, Wei Zhang, Li Zhang. 4497-4505 [doi]