- Resource Efficient Sleep Staging via Multi-Level Masking and Prompt Learning. Lejun Ai, Yulong Li, Haodong Yi, Jixuan Xie, Yue Wang 0092, Jia Liu 0009, Min Chen 0003, Rui Wang 0077. 3-11
- AutoMalDesc: Large-Scale Script Analysis for Cyber Threat Research. Alexandru-Mihai Apostu, Andrei Preda, Alexandra Daniela Damir, Diana Bolocan, Radu-Tudor Ionescu, Ioana Croitoru, Mihaela Gaman. 12-20
- Beyond Content: A Comprehensive Speech Toxicity Dataset and Detection Framework Incorporating Paralinguistic Cues. Zhongjie Ba, Liang Yi, Peng Cheng 0007, Qingcao Li, Qinglong Wang 0003, Li Lu 0008. 21-29
- Modulation-Based Backdoors: Leveraging Amplitude and Frequency Patterns to Attack Speaker Recognition. Hanbo Cai, Pengcheng Zhang 0001, Yan Xiao 0002, De-li, Hanting Chu, Ying Luo. 30-38
- Learning Structurally Stabilized Representations for Lossless DNA Storage. Ben Cao, Xue Li 0019, Tiantian He 0001, Bin Wang 0005, Shihua Zhou, Xiaohu Wu, Qiang Zhang 0008. 39-47
- ViG-RAG: Video-aware Graph Retrieval-Augmented Generation via Temporal and Semantic Hybrid Reasoning. Zongsheng Cao, Anran Liu, Yangfan He, Jing Li 0114, Bo Zhang 0069, Zigan Wang. 48-56
- Transferable Backdoor Attacks for Code Models via Sharpness-Aware Adversarial Perturbation. Shuyu Chang, Haiping Huang, Yanjun Zhang, Yujin Huang, Fu Xiao, Leo Yu Zhang. 57-65
- Toward Multimodal Fake News Detection by Multi-perspective Rationale Generation and Verification. Junyang Chen 0001, Yueqian Li, Ka Chung Ng, Huan Wang 0005, Liang-Jie Zhang. 66-74
- RTMol: Rethinking Molecule-text Alignment in a Round-trip View. Letian Chen, Runhan Shi, Gufeng Yu, Yang Yang. 75-82
- Physical-regularized Hierarchical Generative Model for Metallic Glass Structural Generation and Energy Prediction. Qiyuan Chen, Ajay Annamareddy, Ying Fei Li, Dane Morgan, Bu Wang. 83-91
- Regressor-guided Diffusion Model for De Novo Peptide Sequencing with Explicit Mass Control. Shaorong Chen, Jingbo Zhou, Jun Xia 0001. 92-100
- RareAgents: Autonomous Multi-disciplinary Team for Rare Disease Diagnosis and Treatment. Xuanzhong Chen, Ye Jin, Xiaohao Mao, Lun Wang, Shuyang Zhang, Ting Chen 0006. 101-109
- Transferring Causal Driving Patterns for Generalizable Traffic Simulation with Diffusion-Based Distillation. Yuhang Chen, Jie Sun, Jialin Fan, Jian Sun 0010. 110-118
- TRACE: Transformation-Aware Graph Refinement for Reaction Condition Prediction. Yujie Chen 0002, Tengfei Ma 0002, Yuansheng Liu, Leyi Wei, Shu Wu, Dongsheng Cao 0001, Yiping Liu, Xiangxiang Zeng. 119-127
- SIDE: Surrogate Conditional Data Extraction from Diffusion Models. Yunhao Chen, Shujie Wang, Difan Zou, Xingjun Ma. 128-136
- DyC-STG: Dynamic Causal Spatio-Temporal Graph Network for Real-time Data Credibility Analysis in IoT. Guanjie Cheng, Boyi Li, Peihan Wu, Feiyi Chen, Xinkui Zhao, Mengying Zhu, ShuiGuang Deng. 137-146
- ProAR: Probabilistic Autoregressive Modeling for Molecular Dynamics. Kaiwen Cheng, Yutian Liu 0004, Zhiwei Nie, MuJie Lin, Yanzhen Hou, Yiheng Tao, Chang Liu, Jie Chen 0001, Youdong Mao, Yonghong Tian 0001. 147-155
- Light but Sharp: SlimSTAD for Real-Time Action Detection from Sensor Data. Wei Cui 0002, Lukai Fan, Zhenghua Chen, Min Wu 0008, Shili Xiang, Haixia Wang 0003, Bing Li 0002. 156-165
- VFCionX: Bridging Large and Small Models for Robust Vulnerability-Fixing Commit Identification. Xing Cui, JingZheng Wu, Wenxiang Ou, Tianyue Luo, Zhiyuan Li, Xiang Ling 0001. 166-174
- T2Agent: A Tool-augmented Multimodal Misinformation Detection Agent with Monte Carlo Tree Search. Xing Cui, Yueying Zou, Zekun Li 0001, Peipei Li 0002, Xinyuan Xu, Xuannan Liu, Huaibo Huang. 175-183
- Measuring What Matters: Scenario-Driven Evaluation for Trajectory Predictors in Autonomous Driving. Longchao Da, David Isele, Hua Wei 0001, Manish Saroya. 184-192
- DensiCrafter: Physically-Constrained Generation and Fabrication of Self-Supporting Hollow Structures. Shengqi Dang, Fu Chai, Jiaxin Li, Chao Yuan, Wei Ye, Nan Cao 0001. 193-201
- Topology-Enhanced and Label Correlation-Aware Model for Protein-Protein Interaction Prediction. Bin Deng, Huifang Ma, Ruijia Zhang, Meihuizi Jia, Rui Bing. 202-210
- InteChar: A Unified Oracle Bone Character List for Ancient Chinese Language Modeling. Xiaolei Diao, Zhihan Zhou 0003, Lida Shi, Ting Wang 0019, Ruihua Qi, Daqian Shi, Hao Xu 0012. 211-219
- NucEL: Single-Nucleotide ELECTRA-Style Genomic Pre-training for Efficient and Interpretable Representations. Ke Ding, Brian J. Parker, Jiayu Wen. 220-227
- OR-R1: Automating Modeling and Solving of Operations Research Optimization Problem via Test-Time Reinforcement Learning. Zezhen Ding, Zhen Tan 0001, Jiheng Zhang, Tianlong Chen 0001. 228-236
- Learning from Long-Term Engagement: Adaptive Tutoring Dialogue Planning for Personalized Education. Zhiang Dong, Zhenlong Dai, Xiangwei Lv, Jingyuan Chen. 237-245
- Toward Time-Continuous Data Inference in Sparse Urban CrowdSensing. Hao Du, Wenbin Liu, Ziyu Sun, Haoyang Su, En Wang, Yuanbo Xu. 246-254
- Multi-Horizon Time Series Forecasting of Non-Parametric CDFs with Deep Lattice Networks. Niklas Erdmann, Lars Ødegaard Bentsen, Roy Stenbro, Heine Nygard Riise, Narada Dilp Warakagoda, Paal E. Engelstad. 255-264
- 3DAlign-DAER: Dynamic Attention Policy and Efficient Retrieval Strategy for Fine-grained 3D-Text Alignment at Scale. Yijia Fan, Jusheng Zhang, Kaitong Cai, Jing Yang, Jian Wang 0100, Keze Wang. 265-273
- From Stimuli to Minds: Enhancing Psychological Reasoning in LLMs via Bilateral Reinforcement Learning. Yichao Feng, Haoran Luo 0001, Lang Feng 0007, Shuai Zhao 0007, Anh Tuan Luu. 274-282
- Unveiling the Attribute Misbinding Threat in Identity-Preserving Models. Junming Fu, Jishen Zeng, Yi Jiang, Peiyu Zhuang, Baoying Chen, Siyu Lu, Jianquan Yang. 283-291
- DeepSenseMoE: Harnessing Power of Time Series Foundation Models for Few-Shot Human Activity Recognition. Zenan Fu, Dongzhou Cheng, Lei Zhang 0130, Wenbo Huang 0001, Zhenghao Chen, Hao Wu 0010. 292-299
- RCAFlow: A Workflow-Informed Hierarchical Planning Multi-Agent System for Root Cause Analysis. Yufei Gao, Zhengong Cai, Bowei Yang. 300-308
- Energy-based Autoregressive Generation for Neural Population Dynamics. Ningling Ge, Sicheng Dai, Yu Zhu, Shan Yu. 309-317
- Failure Localization in Multi-Agent Code Generation via Knowledge-Guided and Transferable Reasoning. Mingyang Geng, Shanzhi Gu, ZhiPeng Liu, Chuanfu Xu, Zhaoyang Qu, Haotian Wang 0001. 318-326
- GARNET: GoT-Based Alert Reduction and Narrative Event Tracing. Yiru Gong, Song Liu, Changzhi Zhao, Junrong Liu, Tian Tian, Xiaobo Yang, Bo Jiang 0013, Zhigang Lu 0002. 327-335
- From Passive Perception to Active Memory: A Weakly Supervised Image Manipulation Localization Framework Driven by Coarse-Grained Annotations. Zhiqing Guo, Dongdong Xi, Songlin Li, Gaobo Yang. 336-344
- CHASE: Contextual History for Adaptive and Simple Exploitation in Large Language Model Jailbreaking. Zhiqiang Hao, Chuanyi Li, Ye Fan, Jun Cai, Xiao Fu 0005, Shangqi Wang, Hao Shen, Jiao Yin 0007, JiDong Ge, Bin Luo 0003, Vincent Ng 0001. 345-353
- S²Drug: Bridging Protein Sequence and 3D Structure in Contrastive Representation Learning for Virtual Screening. Bowei He, Bowen Gao, Yankai Chen 0001, Yanyan Lan, Chen Ma 0001, Philip S. Yu, Ya-Qin Zhang, Wei-Ying Ma. 354-362
- FACTGUARD: Event-Centric and Commonsense-Guided Fake News Detection. Jing He 0012, Han Zhang, Yuanhui Xiao, Wei Guo, Shaowen Yao 0001, Renyang Liu 0001. 363-371
- Transferable Hypergraph Attack via Injecting Nodes into Pivotal Hyperedges. Meixia He, Peican Zhu, Le Cheng, Yangming Guo, Manman Yuan, Keke Tang. 372-380
- Directing Uncertainty-Aware Information Flow for Robust Diffusion Prediction. Weikang He, Yunpeng Xiao 0001, Mengyang Huang, Xuemei Mou, Rong Wang 0003, Qian Li 0009. 381-389
- Learning Neural Operators from Partial Observations via Latent Autoregressive Modeling. Jingren Hou, Hong Wang, Pengyu Xu, Chang Gao 0007, Huafeng Liu 0001, Liping Jing. 390-398
- VoiceCloak: A Multi-Dimensional Defense Framework Against Unauthorized Diffusion-Based Voice Cloning. Qianyue Hu, Junyan Wu, Wei Lu 0001, Xiangyang Luo 0001. 399-407
- Transolver Is a Linear Transformer: Revisiting Physics-Attention Through the Lens of Linear Attention. Wenjie Hu, Sidun Liu, Peng Qiao, Zhenglun Sun, Yong Dou. 408-416
- Revisiting the Canonicalization for Fast and Accurate Crystal Tensor Property Prediction. Haowei Hua, Jingwen Yang, Wanyu Lin, Pan Zhou. 417-425
- Towards Distance-Invariant Radio Frequency Fingerprinting via Augmented Unsupervised Learning. Shiyue Huang, Yuchen Su 0001, Hongbo Liu 0002, Zikang Ding, Xuewan He, Yanzhi Ren, Haitao Jia. 426-434
- TIDE: Temporal-Aware Sparse Autoencoders for Interpretable Diffusion Transformers in Image Generation. Victor Shea-Jay Huang, Le Zhuo, Yi Xin 0003, Zhaokai Wang, Fu-Yun Wang, YuChi Wang, Renrui Zhang, Peng Gao 0007, Hongsheng Li 0001. 435-443
- Dynamic Geometric Equivariant Network for Full-Atom Antibody Design. Weihong Huang, Feng Yang, Qiang Zhang 0031, Juan Liu 0007. 444-452
- PanFoMa: A Lightweight Foundation Model and Benchmark for Pan-Cancer. Xiaoshui Huang, Tianlin Zhu, Yifan Zuo, Xue Xia 0005, Zonghan Wu, Jiebin Yan, Dingli Hua, Zongyi Xu, Yuming Fang, Jian Zhang 0002. 453-461
- Controllable Financial Market Generation with Diffusion Guided Meta Agent. Yu-Hao Huang 0002, Chang Xu 0008, Yang Liu 0278, Weiqing Liu, Wu-Jun Li, Jiang Bian 0002. 462-470
- Uncovering and Mitigating Destructive Multi-Embedding Attacks in Deepfake Proactive Forensics. Lixin Jia, Haiyang Sun, Zhiqing Guo, Yunfeng Diao, Dan Ma, Gaobo Yang. 471-479
- HyperLoad: A Cross-Modality Enhanced Large Language Model-Based Framework for Green Data Center Cooling Load Prediction. Haoyu Jiang, Boan Qu, JunJie Zhu, Fanjie Zeng, Xiaojie Lin, Wei Zhong. 480-488
- RMSAGen: Integrating Multiple Sequence Alignment for Function RNA Design. Jiyue Jiang, Yanyu Chen, Qingchuan Zhang, Jiayi Li, Xiangyu Shi, Chang Zhou, Ziqian Lin, Jiuming Wang, Dongchen He, Liang Hong, Qintong Li, Pengan Chen, Jiayang Chen, Xinrui Zhang, Jiao Yuan, Tianqing Zhang, Yu Li. 489-497
- Magnitude-Modulated Equivariant Adapter for Parameter-Efficient Fine-Tuning of Equivariant Graph Neural Networks. Dian Jin, Yancheng Yuan, Xiaoming Tao. 498-506
- FinRpt: Dataset, Evaluation System and LLM-based Multi-agent Framework for Equity Research Report Generation. Song Jin, Shuqi Li, Shukun Zhang, Rui Yan. 507-515
- Stochastic Universal Adversarial Perturbations with Fixed Optimization Constraint and Ensured High-probability Transferability. Yulin Jin, Xiaoyu Zhang 0010, Haoyu Tong, Jian Lou 0001, Kai Wu 0003, Haibo Hu 0001, Xiaofeng Chen 0001. 516-524
- SVD-NO: Learning PDE Solution Operators with SVD Integral Kernels. Noam Koren, Ralf J. J. Mackenbach, Ruud J. G. van Sloun, Kira Radinsky, Daniel Freedman. 525-533
- Targeted Pathway Inference for Biological Knowledge Bases via Graph Learning and Explanation. Rikuto Kotoge, Ziwei Yang 0002, Zheng Chen 0012, Yushun Dong, Yasuko Matsubara, Jimeng Sun 0001, Yasushi Sakurai. 534-542
- RLSLM: A Hybrid Framework Combining Reinforcement Learning and a Rule-based Social Locomotion Model for Socially-aware Navigation. Yitian Kou, Yihe Gu, Chen Zhou, Dandan Zhu 0001, Shu-Guang Kuai. 543-551
- OneFont: A Unified Agent for End-to-End Font Creation. Yingxin Lai, Yufei Liu, Guoqing Yang, Jiaxing Chai, Zhiming Luo, Shaozi Li. 552-560
- RAG-Enhanced Collaborative LLM Agents for Drug Discovery. Namkyeong Lee, Edward De Brouwer, Ehsan Hajiramezanali, Tommaso Biancalani, Chanyoung Park 0001, Gabriele Scalia. 561-569
- BugSweeper: Function-Level Detection of Smart Contract Vulnerabilities Using Graph Neural Networks. Uisang Lee, Changhoon Chung, Junmo Lee, Soo-Mook Moon. 570-578
- Do Large Language Models Think like the Brain? Sentence-Level Evidences from Layer-Wise Embeddings and fMRI. Yu Lei, Xingyang Ge, Yi Zhang, Yiming Yang, Bolei Ma. 579-587
- Drifting Away from Truth: GenAI-Driven News Diversity Challenges LVLM-Based Misinformation Detection. Fanxiao Li, Jiaying Wu, Tingchao Fu, Yunyun Dong, Bingbing Song, Wei Zhou 0011. 588-596
- anyECG-chat: A Generalist ECG-MLLM for Flexible ECG Input and Multi-Task Understanding. Haitao Li 0010, Ziyu Li, Yiheng Mao, Ziyi Liu, Zhoujian Sun, Zhengxing Huang. 597-605
- MetaDiT: Enabling Fine-grained Constraints in High-degree-of Freedom Metasurface Design. Hao Li, Andrey Bogdanov. 606-614
- Scalable Vision-Guided Crop Yield Estimation. Harrison H. Li, Medhanie Irgau, Nabil Janmohamed, Karen Solveig Rieckmann, David B. Lobell. 615-622
- Learning Cell-Aware Hierarchical Multi-Modal Representations for Robust Molecular Modeling. Mengran Li 0001, Zelin Zang, Wenbin Xing, Junzhou Chen 0001, Ronghui Zhang, Jiebo Luo 0001, Stan Z. Li. 623-631
- Detecting Fake News in Short Videos Through Multi-View Aggregation. Nuo Li, Yuan Xiong, Chengliang Liu 0003, Jie Wen 0001, Chao Huang 0008. 632-640
- WeightFlow: Learning Stochastic Dynamics via Evolving Weight of Neural Network. Ruikun Li 0002, Jiazhen Liu, Huandong Wang, Qingmin Liao, Yong Li 0008. 641-649
- Intention Chain-of-Thought Prompting with Dynamic Routing for Code Generation. Shen Li, Li Huang 0006, Shaoxiong Zhan, Weifeng Sun 0004, Tao Yin, Zhongxin Liu 0002, Meng Yan 0001. 650-658
- PLA-MGRA: Multi-Granularity and Relation-Aware Learning for Efficient and Generalizable Protein-Ligand Binding Affinity Prediction. Shunfan Li, Jiangkai Long, Xin Zou, Chang Tang, Yuanyuan Liu 0004, Xiao He 0010, Xuesong Yan. 659-667
- MergeDNA: Context-Aware Genome Modeling with Dynamic Tokenization Through Token Merging. Siyuan Li 0002, Kai Yu, Anna Wang, Zicheng Liu 0006, Chang Yu 0001, Jingbo Zhou, Qirong Yang, Yucheng Guo, Xiaoming Zhang, Stan Z. Li. 668-676
- Beyond Fully Supervised Pixel Annotations: Scribble-Driven Weakly-Supervised Framework for Image Manipulation Localization. Songlin Li, Guofeng Yu, Zhiqing Guo, Yunfeng Diao, Dan Ma, Gaobo Yang. 677-685
- RFF-TTA: Physical Information-Aware Prototype for Temporally Varying RF Fingerprinting Online Test-Time-Adaptation. Taotao Li, Yiyang Li, Zhenyu Wen, Jiahao Lin, Jinhao Wan, Jie Su 0001, Cong Wang, Zhen Hong. 686-694
- Adaptive Fidelity Estimation for Quantum Programs with Graph-Guided Noise Awareness. Tingting Li 0004, Ziming Zhao 0008, Jianwei Yin. 695-703
- Guided Distillation and Risk Adaptive Evolution for Multi-Robot Navigation. Xuyang Li, Jianwu Fang, Lin Li 0085, Boyuan Chen, Guangliang Li, Jianru Xue. 704-711
- DS-ProGen: A Dual-Structure Deep Language Model for Functional Protein Design. Yanting Li, Zikang Wang, Jiyue Jiang, Ziqian Lin, Dongchen He, Yuheng Shan, Yanruisheng Shao, Jiayi Li, Xiangyu Shi, Jiuming Wang, Yanyu Chen, Yimin Fan, Han Li, Yu Li 0006. 712-720
- Uncovering Pretraining Code in LLMs: A Syntax-Aware Attribution Approach. Yuanheng Li, Zhuoyang Chen, Xiaoyun Liu, Yuhao Wang, Mingwei Liu, Yang Shi 0002, Kaifeng Huang 0001, Shengjie Zhao 0001. 721-729
- Factorization-in-Loop: Proximal Fill-in Minimization for Sparse Matrix Reordering. Ziwei Li, Shuzi Niu, Tao Yuan, Huiyuan Li, Wenjia Wu. 730-737
- Benchmarking LLMs for Political Science: A United Nations Perspective. Yueqing Liang, Liangwei Yang, Chen Wang 0018, Congying Xia, Rui Meng, Xiongxiao Xu, Haoran Wang 0005, Ali Payani, Kai Shu. 738-745
- CellStream: Dynamical Optimal Transport Informed Embeddings for Reconstructing Cellular Trajectories from Snapshots Data. Yue Ling, Peiqi Zhang, Zhenyi Zhang, Peijie Zhou. 746-754
- Multimodal Table Understanding with Difficulty-aware Reinforcement Learning. Chaohu Liu, Haoyu Cao 0001, YongXiang Hua, Linli Xu. 755-763
- NOTAM-Evolve: A Knowledge-Guided Self-Evolving Optimization Framework with LLMs for NOTAM Interpretation. Maoqi Liu, Quan Fang, Yuhao Wu, Can Zhao, Yang Yang 0122, Kaiquan Cai. 764-772
- ProtSAE: Disentangling and Interpreting Protein Language Models via Semantically-Guided Sparse Autoencoders. Xiangyu Liu 0001, Haodi Lei, Yi Liu 0071, Yang Liu, Wei Hu 0007. 773-781
- Predict and Resist: Long-Term Accident Anticipation Under Sensor Noise. Xingcheng Liu, Bin Rao, Yanchen Guan, Chengyue Wang 0001, Haicheng Liao, Jiaxun Zhang, Chengyu Lin 0003, Meixin Zhu, Zhenning Li 0001. 782-790
- CLM-Access: A Specialized Foundation Model for High-Dimensional Single-Cell ATAC-Seq Analysis. Ziqiang Liu, Bowen Li, Zhenyu Xu, Yantao Li, Junwei Zhang, Chulin Sha, Xiaolin Li 0001. 791-799
- When Genes Speak: A Semantic-Guided Framework for Spatially Resolved Transcriptomics Data Clustering. Jiangkai Long, Yanran Zhu, Chang Tang, Kun Sun, Yuanyuan Liu 0004, Xuesong Yan. 800-808
- LLMs Unleashed: Generating Protocol Code from RFC Specifications. Junfeng Long, Jinshu Su, Biao Han 0003. 809-817
- From Blind Transfer to Wise Selection: Prototype-Driven Neighbor-Domain Adaptation for Fake News Detection. Wayne Lu, Yiheng Li. 818-826
- PHPFND: Detecting Fake News via Post-Hoc Processing of LLMs Hallucination. Jinke Ma, Jiachen Ma 0003, Wei Zhang 0106, Yong Liu 0029. 827-835
- DynamicRTL: RTL Representation Learning for Dynamic Circuit Behavior. Ruiyang Ma, Yunhao Zhou, Yipeng Wang, Yi Liu 0081, Zhengyuan Shi, Ziyang Zheng, Kexin Chen, Zhiqiang He, Lingwei Yan, Gang Chen 0023, Qiang Xu 0001, Guojie Luo. 836-843
- MedLA: A Logic-Driven Multi-Agent Framework for Complex Medical Reasoning with Large Language Models. Siqi Ma, Jiajie Huang, Fan Zhang 0010, Jinlin Wu, Yue Shen, Guohui Fan, Zhu Zhang, Zelin Zang. 845-853
- DRMD: Deep Reinforcement Learning for Malware Detection Under Concept Drift. Shae McFadden, Myles Foley, Mario D'Onghia, Chris Hicks, Vasilios Mavroudis, Nicola Paoletti, Fabio Pierazzi. 854-862
- BDD2Seq: Enabling Scalable Reversible-Circuit Synthesis via Graph-to-Sequence Learning. Mingkai Miao, Jianheng Tang, Guangyu Hu, Hongce Zhang. 863-872
- The GATTACA Framework: Graph Neural Network-Based Reinforcement Learning for Controlling Biological Networks. Andrzej Mizera, Jakub Zarzycki. 873-880
- Interpretable and Robust Behavior Abstraction via Environment-Disentangled Heterogeneous Graph. Zhibin Ni, Hai Wan, Xibin Zhao. 881-889
- Tracing the Heart's Pathways: ECG Representation Learning from a Cardiac Conduction Perspective. Tan Pan, Yixuan Sun, Chen Jiang 0006, Qiong Gao, Rui Sun, Xingmeng Zhang, Zhenqi Yang, Limei Han, Yixiu Liang, Yuan Cheng, Kaiyu Guo. 890-898
- Improving Large Molecular Language Model via Relation-aware Multimodal Collaboration. Jinyoung Park 0005, Minseong Bae, Jeehye Na, Hyunwoo J. Kim. 899-907
- Refinement Contrastive Learning of Cell-Gene Associations for Unsupervised Cell Type Identification. Liang Peng, Haopeng Liu, Yixuan Ye, Cheng Liu 0001, Wenjun Shen, Si Wu 0002, Hau-San Wong. 908-916
- Resilient UAV Swarm with Fast Connectivity Recovery and Extensive Coverage. Yabin Peng, Chenyu Zhou, Hainan Cui, Tong Duan, Haoyang Chen, Fan Zhang, Shaoxun Liu. 917-925
- FreeAskWorld: An Interactive and Closed-Loop Simulator for Human-Centric Embodied AI. Yuhang Peng, Yizhou Pan, Xinning He, Jihaoyu Yang, Xinyu Yin, Han Wang, Xiaoji Zheng, Chao Gao, Jiangtao Gong. 926-934
- Gene Incremental Learning for Single-Cell Transcriptomics. Jiaxin Qi, Yan Cui, Jianqiang Huang 0001, Gaogang Xie. 935-943
- Designed to Spread: A Generative Approach to Enhance Information Diffusion. Ziqing Qian, Jiaying Lei, Shengqi Dang, Nan Cao 0001. 944-952
- MSAnchor: De Novo Molecular Generation from Mass Spectrometry Data with Anchor-Extended Molecular Scaffolds. Xiaohan Qin, Chao Wang, Zhengyang Zhou, Linjiang Chen, Wenjie Du 0003, Yang Wang 0015. 953-961
- Differentiable Semantic Meta-Learning Framework for Long-Tail Motion Forecasting in Autonomous Driving. Bin Rao, Chengyue Wang 0001, Haicheng Liao, Qianfang Wang, Yanchen Guan, Jiaxun Zhang, Xingcheng Liu, Meixin Zhu, Kanye Ye Wang, Zhenning Li 0001. 962-970
- DeepRWCap: Neural-Guided Random-Walk Capacitance Solver for IC Design. Hector Rodriguez Rodriguez, Jiechen Huang, Wenjian Yu. 971-979
- HSA-Net: Hierarchical and Structure-Aware Framework for Efficient and Scalable Molecular Language Modeling. Zihang Shao, Wentao Lei, Lei Wang, Wencai Ye, Li Liu. 980-987
- Weakly-Supervised Image Forgery Localization via Vision-Language Collaborative Reasoning Framework. Ziqi Sheng, Junyan Wu, Wei Lu 0001, Jiantao Zhou 0001. 988-996
- Navigating the Alpha Jungle: An LLM-Powered MCTS Framework for Formulaic Alpha Factor Mining. Yu Shi, Yitong Duan, Jian Li 0015. 997-1005
- Structure-based RNA Design by Step-wise Optimization of Latent Diffusion Model. Qi Si, Xuyang Liu, Penglei Wang, Xin Guo, Yuan Qi, Yuan Cheng. 1006-1014
- Dual-Branch Asymmetric Discrepancy Learning Based on Fake Image Pattern-Coexistence for AI-Generated Image Detection. Chunli Song, Jie Liu, Peiyang Wang, Ying Huang, Guixuan Zhang, Zhi Zeng, Shuwu Zhang. 1015-1023
- Physically-Informed Flow Matching with Graph Neural Networks for Complex Fluid Dynamics. Xiaozhuang Song, Tianshu Yu 0001. 1024-1032
- GlyphShield: Document Watermarking for the Physical World via Vector Typeface Synthesis. Nan Sun, Yuxing Lu, Han Fang, Hefei Ling, Sijing Xie, Luyu Yuan, Chengxin Zhao. 1033-1041
- De Novo Molecular Generation from Mass Spectra via Many-Body Enhanced Diffusion. Xichen Sun, Wentao Wei, Jiahua Rao, Jiancong Xie, Yuedong Yang. 1042-1050
- TermGPT: Multi-Level Contrastive Fine-Tuning for Terminology Adaptation in Legal and Financial Domains. Yidan Sun, Mengying Zhu, Feiyue Chen, Yangyang Wu, Xiaolei Dan, Mengyuan Yang 0002, Xiaolin Zheng, Shenglin Ben. 1051-1059
- EPO: Diverse and Realistic Protein Ensemble Generation via Energy Preference Optimization. Yuancheng Sun, Yuxuan Ren, Zhaoming Chen, Xu Han, Kang Liu 0001, Qiwei Ye. 1060-1068
- VietCheckMed: Explainable Regulatory Compliance Checking for Medical Advertisements on Vietnamese Social Media. Nguyen Thanh Tam, Khanh Quoc Tran, Dat Thanh Pham, Truong Phu Le, Nguyen Hoang Gia Han, Binh T. Nguyen 0001. 1069-1077
- Estimating Online Influence Needs Causal Modeling! Counterfactual Analysis of Misinformation Engagement on Social Media. Lin Tian, Marian-Andrei Rizoiu. 1078-1086
- FIXME: Towards End-to-End Benchmarking of LLM-Aided Design Verification. Gwok-Waa Wan, SamZaak Wong, Shengchu Su, Chenxu Niu, Ning Wang, Xinlai Wan, Qixiang Chen, Mengnv Xing, Jingyi Zhang, JianMin Ye, Yubo Wang, Rongchang Song, Tao Ni, Qiang Xu 0001, Nan Guan, Zhe Jiang 0004, Xi Wang 0009, Yong Chen, Jun Yang. 1087-1095
- PIMRL: Physics-Informed Multi-Scale Recurrent Learning for Burst-Sampled Spatiotemporal Dynamics. Han Wan, Qi Wang 0123, Yuan Mi, Rui Zhang, Hao Sun 0002. 1096-1104
- Enhancing Multimodal Misinformation Detection by Replaying the Whole Story from Image Modality Perspective. Bing Wang 0018, Ximing Li 0002, Yanjun Wang, Changchun Li, Lin Yuanbo Wu, Buyu Wang, Shengsheng Wang 0001. 1105-1113
- CP-FREEZER: Latency Attacks Against Vehicular Cooperative Perception. Chenyi Wang 0005, Ruoyu Song 0001, Raymond Muller, Jean Philippe Monteuuis, Z. Berkay Celik, Jonathan Petit, Ryan M. Gerdes, Ming Li 0003. 1114-1122
- Unlocking Dynamic Inter-Client Spatial Dependencies: A Federated Spatio-temporal Graph Learning Method for Traffic Flow Forecasting. Feng Wang, Tianxiang Chen, Shuyue Wei 0001, Qian Chu, Yi Zhang, Yifan Sun, Zhiming Zheng 0001. 1123-1131
- Efficient Protein Optimization via Structure-aware Hamiltonian Dynamics. Jiahao Wang, Shuangjia Zheng. 1132-1140
- MUSE: Multimodal Uncertainty-Based Self-Driven Evolution for Robust Physiological-Signal-Based Driver Fatigue Detection. Jiaheng Wang, Yuan Si, Ang Li 0053, Zhenyu Wang 0007, Tianheng Xu, Honglin Hu. 1141-1149
- Learning Protein-Ligand Binding in Hyperbolic Space. Jianhui Wang, Wenyu Zhu, Bowen Gao, Xin Hong, Ya-Qin Zhang, Wei-Ying Ma, Yanyan Lan. 1150-1158
- CTX-Coder: Cross-Attention Architectures Empower LLMs for Long-Context Vulnerability Detection. Jujie Wang, Kangfeng Zheng, Bin Wu 0012, Chunhua Wu, Yulin Yao, Jiaqi Gao, Minjiao Yang. 1159-1167
- Driving with Advice: Large Model as Motion Advisor for Joint Planning. Junyin Wang, Jinlei Yu, Hao Lin, Huikai Liu, Wenqian Zhu, Shengwu Xiong 0001. 1168-1176
- PsyPARSE: Retrieval-Augmented Slow Thinking for Personalized Empathetic Counseling. Longxiang Wang, Pukun Zhao, Chen Chen, Jinhe Bi, Huacan Wang, Tong Zhang, Ronghao Chen. 1177-1185
- Explicit Intent-Enhanced Knowledge Distillation for Trip Recommendation. Shuliang Wang 0001, Xiaoting Leng, Sijie Ruan, Dingqi Yang, Yicheng Tang, Qianyu Yang, Qianxiong Xu, Jiabao Zhu, Hanning Yuan. 1186-1194
- S^2-KD: Semantic-Spectral Knowledge Distillation Spatiotemporal Forecasting. Wenshuo Wang, Yaomin Shen, Yingjie Tan, Yihao Chen. 1195-1203
- Rejoining Precious Artifacts: Efficiently Bone Stick Rejoining Based Massive Fragment Images by Contour, Script, and Texture. Xingyi Wang, Wen Huang 0002, Mengqiang Hu, Junhui Chen, Weixin Zhao, Wenzheng Xu, Jian Peng 0002. 1204-1212
- PASA: Progressive-Adaptive Spectral Augmentation for Automated Auscultation in Data-Scarce Environments. Ying Wang, Guoheng Huang, Xueyuan Gong, Xinxin Wang, Xiaochen Yuan. 1213-1221
- BDLF-Qwen3: Enhanced Cross-Architecture Binary Function Similarity Detection Through Binary Dynamic Layer Fusion. Yuanda Wang, Ji Zhou, Xinhui Han, Chao Zhang 0008. 1222-1230
- Modeling Trend Dynamics with Variational Neural ODEs for Information Popularity Prediction. Yuchen Wang, Dongpeng Hou, Weikai Jing, Chao Gao 0001, Xianghua Li, Yang Liu. 1231-1239
- MMPG: MoE-based Adaptive Multi-Perspective Graph Fusion for Protein Representation Learning. Yusong Wang, Jialun Shen, Zhihao Wu, Yicheng Xu, Shiyin Tan, Mingkun Xu, Changshuo Wang 0001, Zixing Song, Prayag Tiwari. 1240-1248
- Reasoning About the Unsaid: Misinformation Detection with Omission-Aware Graph Inference. Zhengjia Wang 0001, Danding Wang, Qiang Sheng 0001, Jiaying Wu, Juan Cao 0001. 1249-1257
- Dual-Channel Learning Framework for Zero-Shot CircRNA-miRNA Interaction Prediction via State Space Modeling. Mengmeng Wei, Lei Wang 0121, Zhu-Hong You, Pengwei Hu 0001, Bowei Zhao, Zhi-an Huang, Yu-An Huang, Haicheng Yi. 1258-1266
- Synergizing Multigrid Algorithms with Vision Transformer: A Novel Approach to Enhance the Seismic Foundation Model. Huiwen Wu, Shuo Zhang, Yi Liu, Hongbin Ye. 1267-1275
- Generalizable Drug-Target Interaction Prediction via ESM-2 Representations and Progressive Contrastive Curriculum Learning. Qianyang Wu, Jingwei Lv, Zilong Zhang, Feifei Cui. 1276-1284
- Exploring Selective Avoidance for Online User Behavior Analysis: A Forest of Thought Explanation. Xiaohua Wu, Lin Li 0001, Kaize Shi, Xiaohui Tao 0001, Jianwei Zhang, Yuefeng Li 0001. 1285-1293
- Investigating Data Pruning for Pretraining Biological Foundation Models at Scale. Yifan Wu, Jiyue Jiang, Xichen Ye, Yiqi Wang, Chang Zhou, Yitao Xu, Jiayang Chen, He Hu, Weizhong Zhang, Cheng Jin, Jiao Yuan, Yu Li. 1294-1302
- Sim-to-Real: An Unsupervised Noise Layer for Screen-Camera Watermarking Robustness. Yufeng Wu, Xin Liao 0001, Baowei Wang, Han Fang, Xiaoshuai Wu, Mingyue Chen, Guiling Wang 0001. 1303-1310
- GROVER: Graph-guided Representation of Omics and Vision with Expert Regulation for Adaptive Spatial Multi-omics Fusion. Yongjun Xiao, Dian Meng, Xinlei Huang, Yanran Liu, Shiwei Ruan, Ziyue Qiao, Xubin Zheng. 1311-1318
- Informative Subgraph Extraction with Deep Reinforcement Learning for Drug-Drug Interaction Prediction. Jiancong Xie, Wentao Wei, Chi Zhang, Jiahua Rao, Yuedong Yang. 1319-1327
- Where to Start Alignment? Diffusion Large Language Model May Demand a Distinct Position. Zhixin Xie, Xurui Song, Jun Luo 0001. 1328-1336
- ChipMind: Retrieval-Augmented Reasoning for Long-Context Circuit Design Specifications. Changwen Xing, SamZaak Wong, Xinlai Wan, Yanfeng Lu, Mengli Zhang, Zebin Ma, Lei Qi 0001, Zhengxiong Li, Nan Guan, Zhe Jiang 0004, Xi Wang 0009, Jun Yang 0006. 1337-1345
- SynWeather: Weather Observation Data Synthesis Across Multiple Regions and Variables via a General Diffusion Transformer. Kaiyi Xu, Junchao Gong, Zhiwang Zhou, Zhangrui Li, Yuandong Pu, Yihao Liu, Ben Fei, Fenghua Ling, Wenlong Zhang, Lei Bai 0001. 1346-1354
- KCLNet: Electrically Equivalence-Oriented Graph Representation Learning for Analog Circuits. Peng Xu 0052, Yapeng Li, Tinghuan Chen, Tsung-Yi Ho, Bei Yu 0001. 1355-1363
- scCluBench: Comprehensive Benchmarking of Clustering Algorithms for Single-Cell RNA Sequencing. Ping Xu 0003, Zaitian Wang, Zhirui Wang, Pengjiang Li 0001, Jiajia Wang, Ran Zhang 0008, Pengfei Wang 0008, Yuanchun Zhou. 1364-1372
- GeoGen: A Two-stage Coarse-to-Fine Framework for Fine-grained Synthetic Location-based Social Network Trajectory Generation. Rongchao Xu, Kunlin Cai, Lin Jiang 0007, Zhiqing Hong, Yuan Tian 0001, Guang Wang 0001. 1373-1381
- Learning to Curate Context: Jointly Optimizing Retrieval and Prediction for Multimodal Social Media Popularity. Xovee Xu, Shuojun Lin, Fan Zhou 0002, Jingkuan Song. 1382-1390
- SoMe: A Realistic Benchmark for LLM-based Social Media Agents. Dizhan Xue, Jing Cui, Shengsheng Qian, Chuanrui Hu, Changsheng Xu. 1391-1399
- GenePheno: Interpretable Gene Knockout-Induced Phenotype Abnormality Prediction from Gene Sequences. Jingquan Yan, Yuwei Miao, Lei Yu, Yuzhi Guo, Xue Xiao, Lin Xu, JunZhou Huang. 1400-1408
- Sentient: Detecting APTs via Capturing Indirect Dependencies and Behavioral Logic. Wenhao Yan, Ning An, Wei Qiao, Weiheng Wu, Zhigang Lu 0002, Bo Jiang 0013, Baoxu Liu, Junrong Liu. 1409-1417
- An LLM-based Quantitative Framework for Evaluating High-Stealthy Backdoor Risks in OSS Supply Chains. Zihe Yan, Kai Luo, Haoyu Yang, Yang Yu, Zhuosheng Zhang 0001, Guancheng Li. 1418-1425
- Authority Backdoor: A Certifiable Backdoor Mechanism for Authoring DNNs. Han Yang, Shaofeng Li 0001, Tian Dong, Xiangyu Xu 0001, Guangchi Liu, Zhen Ling 0001. 1426-1434
- HogVul: Black-box Adversarial Code Generation Framework Against LM-based Vulnerability Detectors. Jingxiao Yang, Ping He, Tianyu Du, Sun Bing, Xuhong Zhang 0002. 1435-1443
- Poisoned Distillation: Injecting Backdoors into Distilled Datasets Without Raw Data Access. Ziyuan Yang, Ming Yan 0007, Yi Zhang, Joey Tianyi Zhou. 1444-1452
- PDE-Driven Spatiotemporal Generative Modeling for Multilead ECG Synthesis. Yakir Yehuda, Kira Radinsky. 1453-1461
- CrystalDiT: Simple Diffusion Transformers for Crystal Generation. Xiaohan Yi, Guikun Xu, Zhong Zhang 0014, Liu Liu 0014, Yatao Bian, Xi Xiao, Peilin Zhao. 1462-1470
- Frequency Mining Empowered by Text Aggregation: A New Perspective on Document Image Tampering Detection. Ziqi Yi, Guitao Xu, Shihang Wu, Peirong Zhang 0001, Lianwen Jin. 1471-1479
- MicLog: Towards Accurate and Efficient LLM-based Log Parsing via Progressive Meta In-Context Learning. Jianbo Yu 0003, Yixuan Li, Hai Xu, Kang Xu, Junjielong Xu, Zhijing Li 0007, Pinjia He, Wanyuan Wang. 1480-1488
- Every Little Bit Helps: Exploring Better Utilization of Unlabeled Data for Semi-supervised Singing Melody Extraction Using Multi-bands Diffusion Model. Shuai Yu 0002, Xiaoliang He, Kangjie Dong, Yi Yu 0001. 1489-1497
- Towards Provably Secure and Highly Robust Generative Image Steganography Leveraging Latent Diffusion Model. Chengsheng Yuan 0001, Zhaonan Ji, Qi Cui, Zhili Zhou 0001, Xinting Li, Zhihua Xia. 1498-1506
- AirCopBench: A Benchmark for Multi-drone Collaborative Embodied Perception and Reasoning. Jirong Zha, Yuxuan Fan, Tianyu Zhang, Geng Chen, Yingfeng Chen, Chen Gao 0001, Xinlei Chen. 1507-1515
- RABot: Reinforcement-Guided Graph Augmentation for Imbalanced and Noisy Social Bot Detection. Longlong Zhang, Xi Wang, Haotong Du, Yangyi Xu, Zhuo Liu, Yang Liu. 1516-1524
- Domain-Aware Multi-View Contrastive Representation Learning for Protein Subcellular Localization Prediction. Qiang Zhang 0031, Feng Yang, Weihong Huang, Jing Feng 0005, Juan Liu 0007. 1525-1533
- HGATSolver: A Heterogeneous Graph Attention Solver for Fluid-Structure Interaction. Qin-Yi Zhang, Hong Wang, Siyao Liu, Haichuan Lin, Linying Cao, Xiao-Hu Zhou, Chen Chen, Shuang-Yi Wang, Zeng-Guang Hou. 1534-1542
- MicroEvoEval: A Systematic Evaluation Framework for Image-Based Microstructure Evolution Prediction. Qinyi Zhang, Duanyu Feng, Ronghui Han, Yangshuai Wang, Hao Wang. 1543-1551
- Wi-CBR: Salient-aware Adaptive WiFi Sensing for Cross-domain Behavior Recognition. Ruobei Zhang, Shengeng Tang, Huan Yan, Xiang Zhang 0011, Jiabao Guo. 1552-1560
- Breaking the Modality Barrier: Generative Modeling for Accurate Molecule Retrieval from Mass Spectra. Yiwen Zhang, Keyan Ding, Yihang Wu, Xiang Zhuang, Yi Yang, Qiang Zhang 0026, Huajun Chen. 1561-1569
- Spike Imaging Velocimetry: Dense Motion Estimation of Fluids Using Spike Streams. Yunzhong Zhang, You Zhou, Changqing Su, Zhen Cheng 0005, Zhaofei Yu, Bo Xiong, Tiejun Huang 0001, Xun Cao. 1570-1578
- AMS-IO-Bench and AMS-IO-Agent: Benchmarking and Structured Reasoning for Analog and Mixed-Signal Integrated Circuit Input/Output Design. Zhishuai Zhang, Xintian Li, Shilong Liu, Aodong Zhang, Lu Jie 0001, Nan Sun 0001. 1579-1586
- PriAgent: A Collaborative Multi-Agent Framework for Auditing Android Privacy Compliance. Ziwei Zhang, Zhao Li 0010, Zhuojun Jiang, Jiangyi Yin, Xuebin Wang, Jiangchao Chen, Qingyun Liu 0001. 1587-1595
- TinyChemVL: Advancing Chemical Vision-Language Models via Efficient Visual Token Reduction and Complex Reaction Tasks. Xuanle Zhao, Shuxin Zeng, Xinyuan Cai, Xiang Cheng, Duzhen Zhang, Xiuyi Chen, Bo Xu 0002. 1596-1604
- Bot Meets Shortcut: How Can LLMs Aid in Handling Unknown Invariance OOD Scenarios?Shiyan Zheng, Herun Wan, Minnan Luo, Junhang Huang. 1605-1613 [doi]
- Apo2Mol: 3D Molecule Generation via Dynamic Pocket-Aware Diffusion ModelsXinzhe Zheng 0003, Shiyu Jiang, Gustavo de M. Seabra, Chenglong Li, Yanjun Li 0005. 1614-1622 [doi]
- SculptDrug: A Spatial Condition-Aware Bayesian Flow Model for Structure-based Drug DesignQingsong Zhong, Haomin Yu, Yan Lin 0006, Wangmeng Shen, Long Zeng 0004, Jilin Hu. 1623-1631 [doi]
- IO-RAE: Information-Obfuscation Reversible Adversarial Example for Audio Privacy ProtectionJiajie Zhu, Xia Du, Xiaoyuan Liu, Ji-Zhe Zhou 0001, Qizhen Xu, Zheng Lin 0001, Chi-Man Pun. 1632-1640 [doi]
- DIFT: Protecting Contrastive Learning Against Data Poisoning Backdoor AttacksJiang Zhu, Yulin Jin, Qingqing Ye 0001, Zhibiao Guo, Kun Fang, Ruochen Du, Yingnan Zhao 0002, Haibo Hu 0001. 1641-1649 [doi]
- Advancing Protein Design via Multi-Agent Reinforcement Learning with Pareto-Based Collaborative OptimizationMingming Zhu, Jiahua Rao, Xiaoyu Chen, Qianmu Yuan, Yuedong Yang. 1650-1658 [doi]
- Physics-Informed Multi-Task Learning for Battery State of Health Prediction with Uncertainty QuantificationTianwen Zhu, Guangyu Wu, Zhiwei Cao, Ruihang Wang, Jimin Jia, Yong Luo 0002, Yonggang Wen 0001. 1659-1667 [doi]
- Tree-Based Stochastic Optimization for Solving Large-Scale Urban Network Security GamesShuxin Zhuang, Linjian Meng, Shuxin Li 0001, Minming Li, Youzhi Zhang 0001. 1668-1675 [doi]
- AdaField: Generalizable Surface Pressure Modeling with Physics-Informed Pre-training and Flow-Conditioned AdaptationJunhong Zou, Wei Qiu, Zhenxu Sun, Xiaomei Zhang, Zhaoxiang Zhang 0001, Xiangyu Zhu 0001. 1676-1684 [doi]
- Hypothesis-Driven Reasoning for Large Language ModelsAakash Kumar Agarwal, Moyuru Yamada. 1686-1693 [doi]
- Plot'n Polish: Zero-Shot Story Visualization and Disentangled Editing with Text-to-Image Diffusion ModelsKiymet Akdemir, Jing Shi 0005, Kushal Kafle, Brian L. Price, Pinar Yanardag. 1694-1702 [doi]
- Towards Training-Free and Accurate ANN-to-SNN Conversion via Activation-Aware RedistributionHonglin Cao, Shuai Wang 0058, Zijian Zhou 0005, Ammar Belatreche, Wenjie Wei, Yu Liang, Yu Yang, Rui Xi, Malu Zhang, Haizhou Li 0001. 1703-1711 [doi]
- Parallel Training Time-to-First-Spike Spiking Neural NetworksKaiwei Che, Wei Fang 0006, Peng Xue, Yifan Huang 0002, Zhengyu Ma, Yonghong Tian 0001. 1712-1720 [doi]
- Chain-of-Search: Parameter-Efficient Reasoning for Zero-Shot Object NavigationHanrui Chen, Liqi Yan, Qifan Wang 0001, Jianhui Zhang, Fangli Guan, Pan Li 0001. 1721-1729 [doi]
- LAS: Loss-less ANN-SNN Conversion for Fully Spike-Driven Large Language ModelsLong Chen, Xiaotian Song, Yanan Sun 0001. 1730-1738 [doi]
- VRAgent-R1: Boosting Video Recommendation with MLLM-based Agents via Reinforcement LearningSiran Chen, Boyu Chen, Yuxiao Luo 0001, Chenyun Yu, Yi Ouyang, Lei Cheng 0005, Chengxiang Zhuo, Zang Li, Yali Wang 0001. 1739-1747 [doi]
- Unveiling the Landscape of Clinical Depression Assessment: From Behavioral Signatures to Psychiatric ReasoningZhuang Chen 0002, Guanqun Bi, Wen Zhang, Jiawei Hu, Aoyun Wang, Xiyao Xiao, Kun Feng, Minlie Huang. 1748-1756 [doi]
- A Theory of Adaptive Scaffolding for LLM-Based Pedagogical AgentsClayton Cohn, Surya Rayala, Namrata Srivastava, Joyce Horn Fonteles, Shruti Jain, Xinying Luo, Divya Mereddy, Naveeduddin Mohammed, Gautam Biswas. 1757-1765 [doi]
- NL2CA: Auto-formalizing Cognitive Decision-Making from Natural Language Using an Unsupervised CriticNL2LTL FrameworkZihao Deng, Yijia Li, Renrui Zhang, Peijun Ye 0001. 1766-1773 [doi]
- SyncBrain: Exploring Brain Functional Dynamics Through Neural Oscillatory SynchronizationJiaqi Ding, Tingting Dan, Zhixuan Zhou, Guorong Wu 0001. 1774-1782 [doi]
- Advancing Multimodal Teacher Sentiment Analysis: The Large-Scale T-MED Dataset & the Effective AAM-TSA ModelZhiyi Duan, Xiangren Wang, Hongyu Yuan, Qianli Xing 0002. 1783-1791 [doi]
- U2UData+: A Scalable Swarm UAVs Autonomous Flight Dataset for Embodied Long-horizon TasksTongtong Feng, Xin Wang 0019, Feilin Han, Leping Zhang, Wenwu Zhu 0001. 1792-1800 [doi]
- Personality-guided Public-Private Domain Disentangled Hypergraph-Former Network for Multimodal Depression DetectionChangzeng Fu, Shiwen Zhao, Yunze Zhang, ZhongQuan Jian, Shiqi Zhao, Chaoran Liu. 1801-1809 [doi]
- CoEvo: Continual Evolution of Symbolic Solutions Using Large Language ModelsPing Guo 0007, Qingfu Zhang 0001, Xi Lin 0001. 1810-1818 [doi]
- MvP-ECR: Multi-Perspective Emotion-Cause Reasoning for Empathetic DialogueYuanyuan He, Guotai Huang, Wei Li 0308, Jiali You 0002, Jiawen Deng, Fuji Ren. 1819-1827 [doi]
- Reality vs Counterfactual: Multi-World Contrastive Reinforcement Learning for Enhancing MLLM's Theory of Mind in Egocentric VideosGuiyang Hou, Yihui Fu, Chen Wu, Xiang Huang, Zhe Zheng, Wenqi Zhang 0001, Yongliang Shen 0001, Weiming Lu 0001. 1828-1836 [doi]
- Is Symbolic Music a Specific Language? Exploring Inspiration-to-Structure Machine Composition via LLMsZhejing Hu, Yan Liu 0004, Zhi Zhang 0004, Aiwei Zhang, Sheng-hua Zhong, Bruce X. B. Yu, Gong Chen 0006. 1837-1845 [doi]
- Surgical AI Copilot: Energy-Based Fourier Gradient Low-Rank Adaptation for Surgical LLM Agent Reasoning and PlanningJiayuan Huang, Runlong He, Danyal Z. Khan, Evangelos B. Mazomenos, Danail Stoyanov, Hani J. Marcus, Linzhe Jiang, Matthew John Clarkson, Mobarak I. Hoque. 1846-1854 [doi]
- DS-ATGO: Dual-Stage Synergistic Learning via Forward Adaptive Threshold and Backward Gradient Optimization for Spiking Neural NetworksJiaqiang Jiang, Wenfeng Xu, Jing Fan, Rui Yan 0005. 1855-1863 [doi]
- MPD-SGR: Robust Spiking Neural Networks with Membrane Potential Distribution-Driven Surrogate Gradient RegularizationRunhao Jiang, Chengzhi Jiang, Rui Yan 0005, Huajin Tang. 1864-1872 [doi]
- Learning Personalised Human Internal Cognition from External Expressive Behaviours for Real Personality RecognitionXiangyu Kong 0001, Hengde Zhu, Haoqin Sun, Zhihao Guo, Jiayan Gu, Xinyi Ni, Wei Zhang 0243, Shizhe Liu, Siyang Song. 1873-1881 [doi]
- MASP: Multi-Aspect Guided Emotion Reasoning with Soft Prompt Tuning In Vision-Language ModelsSangeun Lee, Yubeen Lee, Eunil Park, Wonseok Chae. 1882-1890 [doi]
- AgentSense: Virtual Sensor Data Generation Using LLM Agents in Simulated Home EnvironmentsZikang Leng, Megha Thukral, Yaqi Liu, Hrudhai Rajasekhar, Shruthi K. Hiremath, Jiaman He, Thomas Plötz. 1891-1899 [doi]
- ARCHE: A Novel Task to Evaluate LLMs on Latent Reasoning Chain ExtractionPengze Li, Jiaqi Liu, Junchi Yu, Lihao Liu, Mingyu Ding, Wanli Ouyang, Shixiang Tang, Xi Chen 0004. 1900-1908 [doi]
- DCHO: A Decomposition-Composition Framework for Predicting Higher-Order Brain Connectivity to Enhance Diverse Downstream ApplicationsWeibin Li 0003, Wendu Li, Quanying Liu. 1909-1917 [doi]
- MAUGen: A Unified Diffusion Approach for Multi-Identity Facial Expression and AU Label GenerationXiangdong Li, Ye Lou, Ao Gao, Wei Zhang 0243, Siyang Song. 1918-1927 [doi]
- Explainable Depression Assessment from Face Videos by Weakly Supervised LearningRongfan Liao, Xiangyu Kong 0001, Shiqing Tang, Lang He, Changzeng Fu, Weicheng Xie 0001, Xiaofeng Liu, Lu Liu 0001, Siyang Song. 1928-1936 [doi]
- HardF-SNN: Hardware-Friendly Quantization for Spiking Neural Networks with Efficient Integer-Arithmetic-Only InferenceHanwen Liu, Kexin Shi, Jieyuan Zhang, Yimeng Shan, Jibin Wu, Wenyu Chen 0001, Malu Zhang. 1937-1945 [doi]
- A Closer Look at Knowledge Distillation in Spiking Neural Network TrainingXu Liu, Na Xia, Jinxing Zhou, Jingyuan Xu, Dan Guo 0001. 1946-1954 [doi]
- FINE: Factorized Multimodal Sentiment Analysis via Mutual INformation EstimationYadong Liu 0003, Shangfei Wang. 1955-1963 [doi]
- Mind the Gap: The Divergence Between Human and LLM-Generated TasksYi-Long Lu, Jiajun Song, Chunhui Zhang, Wei Wang. 1964-1972 [doi]
- Temporal Dynamics Enhancer for Directly Trained Spiking Object DetectorsFan Luo, Zeyu Gao, Xinhao Luo, Kai Zhao, Yanfeng Lu. 1973-1981 [doi]
- I2E: Real-Time Image-to-Event Conversion for High-Performance Spiking Neural NetworksRuichen Ma, Liwei Meng, Guanchao Qiao, Ning Ning 0002, Yang Liu 0062, Shaogang Hu. 1982-1990 [doi]
- Agentic Design Review SystemSayan Nag, K. J. Joseph, Koustava Goswami, Vlad I. Morariu, Balaji Vasan Srinivasan. 1991-1999 [doi]
- SteerMusic: Enhanced Musical Consistency for Zero-shot Text-Guided and Personalized Music EditingXinlei Niu, Kin Wai Cheuk, Jing Zhang 0052, Naoki Murata, Chieh-Hsin Lai, Michele Mancusi, Woosung Choi, Giorgio Fabbro, Wei-Hsiang Liao 0001, Charles Patrick Martin, Yuki Mitsufuji. 2000-2010 [doi]
- Modelling the Effects of Hearing Loss on Neural Coding in the Auditory Midbrain with Variational ConditioningLloyd Pellatt, Fotios Drakopoulos, Shievanie Sabesan, Nicholas A. Lesica. 2011-2018 [doi]
- A Network of Biologically Inspired Rectified Spectral Units (ReSUs) Learns Hierarchical Features Without Error BackpropagationShanshan Qin, Joshua Pughe-Sanford, Alexander Genkin, Pembe Gizem Özdil, Philip Greengard, Anirvan M. Sengupta, Dmitri B. Chklovskii. 2019-2028 [doi]
- Emotion-Coherent Reasoning for Multimodal LLMs via Emotional Rationale VerifierHyeongseop Rha, Jeong Hun Yeo, Yeonju Kim, Yong Man Ro. 2029-2037 [doi]
- SCORE: Semantic Collage by Optimizing Rendered ElementsZefan Shao, Jin Zhou, Hongliang Yang, Pengfei Xu 0002. 2038-2046 [doi]
- Invariant Representation Learning for Memory Behavior Modeling via Adaptive Environment SeparationXiaoxuan Shen, Zhihai Hu, Fuqing Li, Shengyingjie Liu, Jianwen Sun. 2047-2055 [doi]
- Activation-wise Propagation: A One-Timestep Strategy for Spiking Neural NetworksJian Song, Xiangfei Yang, Shangke Lyu, Donglin Wang. 2056-2064 [doi]
- IntentMotion: Learning Intent-Aware Human Motion from Language in 3D ScenesWenfeng Song, Shi Zheng, Xinyu Zhang, Xingliang Jin, Aimin Hao, Fei Hou 0001, Xia Hou, Shuai Li 0001. 2065-2073 [doi]
- Detecting Emotional Dynamic Trajectories: An Evaluation Framework for Emotional Support in Language ModelsZhouxing Tan, Ruochong Xiong, Yulong Wan, Jinlong Ma, Hanlin Xue, Qichun Deng, Haifeng Jing, Zhengtong Zhang, Depei Liu, Shiyuan Luo, Junfei Liu. 2074-2082 [doi]
- HiVA: Self-organized Hierarchical Variable Agent via Goal-driven Semantic-Topological EvolutionJinzhou Tang, Jusheng Zhang, Qinhan Lv, Sidi Liu, Jing Yang, Chengpei Tang, Keze Wang. 2083-2091 [doi]
- Multi-dimensional Neural Decoding with Orthogonal Representations for Brain-Computer InterfacesKaixi Tian, Shengjia Zhao, Yuhan Zhang, Shan Yu. 2092-2100 [doi]
- Dep-MAP: A Multi-level Alignment Framework with Semantic Prototypes for Video-based Automatic Depression AssessmentHao Wang, Jiayu Ye, Qingxiang Wang. 2101-2109 [doi]
- Hi-EF: Benchmarking Emotion Forecasting in Human-interactionHaoran Wang 0006, Xinji Mai, Zeng Tao, Junxiong Lin, Xuan Tong, Ivy Pan, Shaoqi Yan, Yan Wang 0068, Shuyong Gao. 2110-2118 [doi]
- SpikCommander: A High-performance Spiking Transformer with Multi-view Learning for Efficient Speech Command RecognitionJiaqi Wang 0003, Liutao Yu, Xiongri Shen, Sihang Guo, Chenlin Zhou, Leilei Zhao, Yi Zhong, Zhiguo Zhang 0001, Zhengyu Ma. 2119-2127 [doi]
- Training-Free ANN-to-SNN Conversion for High-Performance Spiking TransformersJingya Wang, Xin Deng, Wenjie Wei, Dehao Zhang, Shuai Wang 0058, Qian Sun 0014, Jieyuan Zhang, Hanwen Liu, Ning Xie, Malu Zhang. 2128-2136 [doi]
- InterMoE: Individual-Specific 3D Human Interaction Generation via Dynamic Temporal-Selective MoELipeng Wang 0005, Hongxing Fan, Haohua Chen, Zehuan Huang, Lu Sheng. 2137-2145 [doi]
- Computer Vision Modeling of the Development of Geometric and Numerical Concepts in HumansZekun Wang, Sashank Varma. 2146-2154 [doi]
- Large Connectome Model: An fMRI Foundation Model of Brain Connectomes Empowered by Brain-Environment Interaction in Multitask Learning LandscapeZiquan Wei, Tingting Dan, Guorong Wu 0001. 2155-2163 [doi]
- DHCM-CACL: Dynamic Hierarchical Cross-modal Mamba with Confidence-Adaptive Contrastive Learning for Multimodal Emotion RecognitionBaiqiang Wu, Yang Li. 2164-2172 [doi]
- SPP-SCL: Semi-Push-Pull Supervised Contrastive Learning for Image-Text Sentiment Analysis and BeyondJiesheng Wu, Shengrong Li. 2173-2181 [doi]
- Let the Model Learn to Feel: Mode-Guided Tonality Injection for Symbolic Music Emotion RecognitionHaiying Xia, Zhongyi Huang, Yumei Tan, Shuxiang Song 0001. 2182-2190 [doi]
- PSA-MF: Personality-Sentiment Aligned Multi-Level Fusion for Multimodal Sentiment AnalysisHeng Xie, Kang Zhu, Zhengqi Wen, Jianhua Tao 0001, Xuefei Liu, Ruibo Fu, Changsheng Li. 2191-2199 [doi]
- Exploiting Synergistic Cognitive Biases to Bypass Safety in LLMsXikang Yang, Biyu Zhou, Xuehai Tang, Jizhong Han, Songlin Hu 0001. 2200-2208 [doi]
- Melodia: Training-Free Music Editing Guided by Attention Probing in Diffusion ModelsYi Yang, Haowen Li, Tianxiang Li, Boyu Cao, Xiaohan Zhang, Liqun Chen, Qi Liu. 2209-2217 [doi]
- A Multimodal EEG-Eye Movement Model for Automatic Depression DetectionHao-Long Yin, Jian-ming Zhang, Ren-Jie Dai, Wei-Long Zheng, Qinyu Lv, Zhenghui Yi, Bao-Liang Lu. 2218-2226 [doi]
- Generalized Threshold Optimization with Harmony Multi-Threshold Neurons for Accurate ANN-to-SNN ConversionWenhan Zhang, Zihan Huang, Tong Bu, Tiejun Huang 0001, Zhaofei Yu. 2227-2235 [doi]
- Spikingformer: A Key Foundation Model for Spiking Neural NetworksChenlin Zhou, Liutao Yu, Zhaokun Zhou, Han Zhang 0035, Jiaqi Wang 0003, Huihui Zhou, Zhengyu Ma, Yonghong Tian 0001. 2236-2244 [doi]
- TDSNNs: Competitive Topographic Deep Spiking Neural Networks for Visual Cortex ModelingDeming Zhou, Yuetong Fang, Zhaorui Wang 0006, Renjing Xu. 2245-2253 [doi]
- Investigating Prosocial Behavior Theory in LLM Agents Under Policy-Induced InequitiesYujia Zhou 0002, Hexi Wang, Qingyao Ai, Zhen Wu, Yiqun Liu 0001. 2254-2262 [doi]
- Voices, Faces, and Feelings: Multi-modal Emotion-Cognition Captioning for Mental Health UnderstandingZhiyuan Zhou, Yanrong Guo, Shijie Hao. 2263-2271 [doi]
- Ψ-Arena: Interactive Assessment and Optimization of LLM-based Psychological Counselors with Tripartite FeedbackShijing Zhu, Zhuang Chen 0002, Guanqun Bi, Binghang Li, Yaxi Deng, Dazhen Wan, Libiao Peng, Xiyao Xiao, Rongsheng Zhang, Tangjie Lv, Zhipeng Hu, Fangfang Li, Minlie Huang. 2272-2280 [doi]
- TMDC: A Two-Stage Modality Denoising and Complementation Framework for Multimodal Sentiment Analysis with Missing and Noisy ModalitiesYan Zhuang 0002, Minhao Liu, Yanru Zhang, Jiawen Deng, Fuji Ren. 2281-2289 [doi]
- LiteGE: Lightweight Geodesic Embedding for Efficient Geodesics Computation and Non-Isometric Shape CorrespondenceYohanes Yudhi Adikusuma, Qixing Huang, Ying He 0001. 2291-2299 [doi]
- Open-World Object Counting in VideosNiki Amini-Naieni, Andrew Zisserman. 2300-2308 [doi]
- Spatio-Temporal Distortion Aware Omnidirectional Video Super-ResolutionHongyu An, Xinfeng Zhang 0001, Shijie Zhao 0001, Li Zhang 0006, Ruiqin Xiong. 2309-2317 [doi]
- Enhancing Retrieval-Augmented Large Vision Language Models via Knowledge Conflict MitigationWenbin An, Jiahao Nie 0002, Feng Tian 0002, Mingxiang Cai, Yaqiang Wu, Xiaoqin Zhang, Shijian Lu. 2318-2326 [doi]
- Cross-temporal 3D Gaussian Splatting for Sparse-view Guided Scene UpdateZeyuan An, Yanghang Xiao, Zhiying Leng, Frederick W. B. Li, Xiaohui Liang 0001. 2327-2335 [doi]
- Towards Temporal Fusion Beyond the Field of View for Camera-based Semantic Scene CompletionJongseong Bae, Junwoo Ha, Jinnyeong Heo, Yeongin Lee, Ha-Young Kim. 2336-2344 [doi]
- DogFit: Domain-guided Fine-tuning for Efficient Transfer Learning of Diffusion ModelsYara Bahram, Mohammadhadi Shateri, Eric Granger. 2345-2353 [doi]
- Learning Compact Latent Space for Representing Neural Signed Distance Functions with High-fidelity Geometry DetailsQiang Bai, Bojian Wu, Xi Yang 0017, Zhizhong Han. 2354-2362 [doi]
- HyperCOD: The First Challenging Benchmark and Baseline for Hyperspectral Camouflaged Object DetectionShuyan Bai, Tingfa Xu, Peifu Liu, Yuhao Qiu, Huiyan Bai, Huan Chen 0018, Yanyan Peng, Jianan Li 0001. 2363-2371 [doi]
- Plug-and-Play Optimization for 3D Gaussian Splatting Compression: Distribution Regularization, Probabilistic Pruning and Detail CompensationTian Bai 0005, Zheng Qiu, Haojie Chen, Ziyang Dai. 2372-2380 [doi]
- SDNet: LiDAR Semantic Scene Completion with Sparse-Dense Fusion and Input-Aware Label RefinementTingming Bai, Zhiyu Xiang, Peng Xu 0026, Tianyu Pu, Kai Wang, Eryun Liu. 2381-2389 [doi]
- Complex Mathematical Expression Recognition: Benchmark, Large-Scale Dataset and Strong BaselineWeikang Bai, Yongkun Du, Yuchen Su, Yazhen Xie, Zhineng Chen. 2390-2398 [doi]
- MCIE: Multimodal LLM-Driven Complex Instruction Image Editing with Spatial GuidanceXuehai Bai, Xiaoling Gu, Akide Liu, Hangjie Yuan, Yifan Zhang, Jack Ma. 2399-2407 [doi]
- Stop Mixing Things Up! BISCUIT Teaches Vision-Language Models to Learn New Concepts from Images on the SpotJiahua Bao, Siyao Cheng, Jiaxing Du, Yuhang Jia, Boyang Niu, Zeming Lang, Changjiang He, Hao Zhang 0016, Jie Liu 0001. 2408-2416 [doi]
- TripleFDS: Triple Feature Disentanglement and Synthesis for Scene Text EditingYuchen Bao, Yiting Wang, Wenjian Huang 0001, Haowei Wang 0001, Shen Chen 0004, Taiping Yao, Shouhong Ding, Jianguo Zhang 0001. 2417-2425 [doi]
- CPOStream: Collaborating Prediction and Observation for Flicker-Free Streamable Free-Viewpoint Video with 3DGSZhenyu Bao, Qing Li 0006, Jinhan Xie, Kanglin Liu. 2426-2434 [doi]
- Text-to-Scene with Large Reasoning ModelsFrédéric Berdoz, Luca A. Lanzendörfer, Nick Tuninga, Roger Wattenhofer. 2435-2443 [doi]
- SSR: Semantic and Spatial Rectification for CLIP-based Weakly Supervised SegmentationXiuli Bi, Die Xiao, Junchao Fan, Bin Xiao 0002. 2444-2452 [doi]
- Foundation-Adaptive Integrated Refinement for Generalized Category DiscoveryYuwei Bian, Shidong Wang, Yazhou Yao, Haofeng Zhang 0001. 2453-2461 [doi]
- Robust Pedestrian Detection with Uncertain ModalityQian Bie, Xiao Wang 0029, Bin Yang 0026, Zhixi Yu, Jun Chen 0001, Xin Xu 0007. 2462-2470 [doi]
- Knowledge-Enhanced Explainable Prompting for Vision-Language ModelsYequan Bie, Andong Tan, Zhixuan Chen, Zhiyuan Cai, Luyang Luo, Hao Chen 0011. 2471-2479 [doi]
- MoBGS: Motion Deblurring Dynamic 3D Gaussian Splatting for Blurry Monocular VideoMinh-Quan Viet Bui, Jongmin Park, Juan Luis Gonzalez 0001, Jaeho Moon, Jihyong Oh, Munchurl Kim. 2480-2489 [doi]
- DOS: Directional Object Separation in Text Embeddings for Multi-Object Image GenerationDongnam Byun, Jungwon Park, Jungmin Ko, Changin Choi, Wonjong Rhee. 2490-2497 [doi]
- Symmetrical Flow Matching: Unified Image Generation, Segmentation, and Classification with Score-Based Generative ModelsFrancisco Caetano, Christiaan G. A. Viviers, Peter H. N. de With, Fons van der Sommen. 2498-2506 [doi]
- FastFLUX: Pruning FLUX with Block-wise Replacement and Sandwich TrainingFuhan Cai, Yong Guo, Jie Li, Wenbo Li 0002, Jian Chen 0011, Xiangzhong Fang. 2507-2515 [doi]
- Seeing in Double: Dual-Granularity BEV Segmentation via Mamba-Driven Alignment and Polar-Decoupled ExpertsJiaxin Cai, Rui Lin, Jingze Su, Qi Li 0038, Wenjie Yang 0005, Yuanlong Yu 0001, Wenxi Liu. 2516-2524 [doi]
- DriveLiDAR4D: Sequential and Controllable LiDAR Scene Generation for Autonomous DrivingKaiwen Cai, Xinze Liu, Xia Zhou, Hengtong Hu, Jie Xiang, Luyao Zhang, Xueyang Zhang, Kun Zhan, Yifei Zhan, Xianpeng Lang. 2525-2533 [doi]
- Zooming In on Fakes: A Novel Dataset for Localized AI-Generated Image Detection with Forgery Amplification ApproachLvpan Cai, Haowei Wang 0001, Jiayi Ji, YanShu ZhouMen, Shen Chen 0004, Taiping Yao, Xiaoshuai Sun. 2534-2542 [doi]
- SEMC: Structure-Enhanced Mixture-of-Experts Contrastive Learning for Ultrasound Standard Plane RecognitionQing Cai, Guihao Yan, Fan Zhang 0070, Cheng Zhang, Zhi Liu. 2543-2551 [doi]
- Mem4D: Decoupling Static and Dynamic Memory for Dynamic Scene ReconstructionXudong Cai, Shuo Wang 0015, Peng Wang 0106, Yongcai Wang, Zhaoxin Fan, Wanting Li, Tianbao Zhang, Jianrong Tao, Yeying Jin, Deying Li 0001. 2552-2560 [doi]
- Split-Layer: Enhancing Implicit Neural Representation by Maximizing the Dimensionality of Feature SpaceZhicheng Cai, Hao Zhu 0004, Linsen Chen, Qiu Shen, Xun Cao. 2561-2570 [doi]
- FastDriveVLA: Efficient End-to-End Driving via Plug-and-Play Reconstruction-based Token PruningJiajun Cao, Qizhe Zhang, Peidong Jia, Xuhui Zhao, Bo Lan, Xiaoan Zhang, Lizhuo, Xiaobao Wei, Sixiang Chen, Liyun Li, Xianming Liu, Ming Lu 0002, Yang Wang, Shanghang Zhang. 2571-2579 [doi]
- Learning Depth from Past Selves: Self-Evolution Contrast for Robust Depth EstimationJing Cao, Kui Jiang, Shenyi Li, Xiaocheng Feng, Yong Huang. 2580-2588 [doi]
- UQ-Bench: A Benchmark for Evaluating Multimodal LLMs on Underwater Image Quality AssessmentJingchao Cao, Guo An, Feng Gao 0005, Ke Gu 0001, Yutao Liu 0002. 2589-2597 [doi]
- RelaCtrl: Relevance-Guided Efficient Control for Diffusion TransformersKe Cao, Jing Wang 0021, Ao Ma 0005, Jiasong Feng, Xuanhua He, Run Ling, Haowei Liu, Jian Lu, Wei Feng, Haozhe Wang 0002, Hongjuan Pei, Yihua Shao, Zhanjie Zhang, Jie Zhang 0033. 2598-2606 [doi]
- VQAThinker: Exploring Generalizable and Explainable Video Quality Assessment via Reinforcement LearningLinhan Cao, Wei Sun 0029, Weixia Zhang, Xiangyang Zhu, Jun Jia, Kaiwei Zhang, Dandan Zhu 0001, Guangtao Zhai, Xiongkuo Min. 2607-2615 [doi]
- Video SimpleQA: Towards Factuality Evaluation in Large Video Language ModelsMeng Cao 0002, Pengfei Hu, Yingyao Wang, Jihao Gu, Haoran Tang, Haoze Zhao, Chen Wang, Jiahua Dong 0001, Wangbo Yu, Ge Zhang 0009, Xiang Li 0117, Ian Reid 0003, Xiaodan Liang. 2616-2624 [doi]
- Latent Knowledge-Guided Video Diffusion for Scientific Phenomena Generation from a Single Initial FrameQinglong Cao, Xirui Li, Ding Wang, Chao Ma 0004, Yuntian Chen, Xiaokang Yang 0001. 2625-2633 [doi]
- EAGLE: Episodic Appearance- and Geometry-aware Memory for Unified 2D-3D Visual Query Localization in Egocentric VisionYifei Cao, Yu Liu 0035, Guolong Wang 0001, Zhu Liu, Kai Wang 0057, Xianjie Zhang, Jizhe Yu, Xun Tu 0001. 2634-2642 [doi]
- Robust Noise Modeling for Spike Camera via Time-Interval Quantification and Spike-DSLR Multimodal Dataset in Low-Light ImagingYue Cao, Sizhao Li, Liguo Zhang 0002. 2643-2651 [doi]
- Automatic Translational Correction of Multi-View Coronary Angiography Based on Auto-Annotation Data GenerationYue Cao, Zhuo Zhang 0025, Shuai Xiao 0001, Jialin Li, Guipeng Lan, Jiabao Wen, Jiachen Yang. 2652-2660 [doi]
- Audio-Assisted Face Video Restoration with Temporal and Identity Complementary LearningYuqin Cao, Yixuan Gao, Wei Sun 0029, Xiaohong Liu 0001, Yulun Zhang 0001, Xiongkuo Min. 2661-2669 [doi]
- Multivariate Diffusion Transformer with Decoupled Attention for High-Fidelity Mask-Text Collaborative Facial GenerationYuShe Cao, Dianxi Shi, Xing Fu, Xuechao Zou, Haikuo Peng, Xueqi Li, Chun Yu, Junliang Xing. 2670-2679 [doi]
- Maniflat3D: Learning 3D Geometry Through Planar Representations from Multi-Layer UnwrappingZijian Cao 0007, Dayou Zhang, Zeyuan Liu, Zhicheng Liang, Fangxin Wang 0001. 2680-2688 [doi]
- QuantVSR: Low-Bit Post-Training Quantization for Real-World Video Super-ResolutionBowen Chai, Zheng Chen 0014, Libo Zhu, Wenbo Li, Yong Guo, Yulun Zhang 0001. 2689-2697 [doi]
- AbductiveMLLM: Boosting Visual Abductive Reasoning Within MLLMsBoyu Chang, Qi Wang 0009, Xi Guo, Zhixiong Nan, Yazhou Yao, Tianfei Zhou. 2698-2706 [doi]
- Reconstruction Using the Invisible: Intuition from NIR and Metadata for Enhanced 3D Gaussian SplattingGyusam Chang, Tuan Anh Vu, Vivek Alumootil, Harris Song, Deanna Pham, Sangpil Kim, M. Khalid Jawed. 2707-2715 [doi]
- MeshSplat: Generalizable Sparse-View Surface Reconstruction via Gaussian SplattingHanzhi Chang, Ruijie Zhu 0002, Wenjie Chang, Mulin Yu, Yanzhe Liang, Jiahao Lu 0001, Zhuoyuan Li, Tianzhu Zhang 0001. 2716-2724 [doi]
- MambaOVSR: Multiscale Fusion with Global Motion Modeling for Chinese Opera Video Super-ResolutionHua Chang, Xin Xu 0007, Wei Liu 0183, Wei Wang 0170, Xin Yuan 0009, Kui Jiang. 2725-2733 [doi]
- Escaping the CAM Shadow: Uncertainty-Guided Reliable Learning for Weakly Supervised Semantic SegmentationLuyao Chang, Leiting Chen, Chen Yang, Chuan Zhou 0004. 2734-2742 [doi]
- BulletTime4D: Towards High Spatio-Temporal Resolution Dynamic Scene Rendering via Spike-Guided Stereo VisionYiqian Chang, Haoran Xu, Qinghong Ye, Jianing Li 0001, Xuan Wang 0002, Wei Zhang 0161, Peixi Peng. 2743-2751 [doi]
- PerTouch: VLM-Driven Agent for Personalized and Semantic Image RetouchingZewei Chang, Zheng-Peng Duan, Jianxing Zhang, Chun-Le Guo, Siyu Liu, Hyungju Chun, Hyunhee Park, Zikun Liu 0001, Chongyi Li. 2752-2759 [doi]
- Mitigating Perception Bias: A Training-Free Approach to Enhance LMM for Image Quality AssessmentBaoliang Chen, Siyi Pan, Dongxu Wu, Liang Xie 0013, Xiangjie Sui, Lingyu Zhu 0006, Hanwei Zhu. 2760-2768 [doi]
- Style4D-Bench: A Benchmark Suite for 4D StylizationBeiqi Chen, Shuai Shao, Haitang Feng, Jianhuang Lai, Jianlou Si, Guangcong Wang. 2769-2777 [doi]
- Towards Ultrasound-based Reliable Disease Diagnosis Using Causal InferenceBolei Chen, Jiaxu Kang, Haonan Yang 0001, Ping Zhong 0002, Yixiong Liang, Rui Fan 0001, Jianxin Wang 0001. 2778-2786 [doi]
- Perspective from a Broader Context: Can Room Style Knowledge Help Visual Floorplan Localization?Bolei Chen, Shengsheng Yan, Yongzheng Cui, Jiaxu Kang, Ping Zhong 0002, Jianxin Wang 0001. 2787-2795 [doi]
- Vision-Only Gaussian Splatting for Collaborative Semantic Occupancy PredictionCheng Chen 0078, Hao Huang 0003, Saurabh Bagchi. 2796-2804 [doi]
- AerialMind: Towards Referring Multi-Object Tracking in UAV ScenariosChenglizhao Chen, Shaofeng Liang, Runwei Guan, Xiaolou Sun, Haocheng Zhao, Haiyun Jiang, Tao Huang 0008, Henghui Ding, Qing-Long Han. 2805-2813 [doi]
- Action-and-object Aware Alignment for Partially Relevant Video RetrievalChuanshen Chen, Kai Zhou, Zhiquan Wen, Zeng You, Yirui Li, Tianhang Xiang, Mingkui Tan. 2814-2822 [doi]
- UniABG: Unified Adversarial View Bridging and Graph Correspondence for Unsupervised Cross-View Geo-LocalizationCuiqun Chen, Qi Chen, Bin Yang, Xingyi Zhang. 2823-2831 [doi]
- Vision-language Incremental Learning with Dual Class-individual MemoryFuhai Chen, Feng Zhang, Xiaoguang Ma, Yiyi Zhou, Jiarong Liu, Xuri Ge. 2832-2840 [doi]
- CoRA: A Collaborative Robust Architecture with Hybrid Fusion for Efficient PerceptionGong Chen, Chaokun Zhang, Pengcheng Lv, Xiaohui Xie. 2841-2849 [doi]
- VPSentry: Semi-supervised Video Polyp Segmentation via Sentry-guided Long-term Prototype Fusion with Correlation Dynamic PropagationGuilian Chen, Xiaoling Luo 0001, Huisi Wu, Jing Qin 0001. 2850-2858 [doi]
- Sortblock: Similarity-Aware Feature Reuse for Diffusion ModelHanqi Chen, Xu Zhang, Xiaoliu Guan, Lielin Jiang, Guanzhong Wang, Zeyu Chen, Yi Liu. 2859-2867 [doi]
- HiFi-Mamba: Dual-Stream W-Laplacian Enhanced Mamba for High-Fidelity MRI ReconstructionHongli Chen, Pengcheng Fang, Yuxia Chen, Yingxuan Ren, Jing Hao, Fangfang Tang, Xiaohao Cai, Shanshan Shan, Feng Liu 0005. 2868-2876 [doi]
- CareCom: Generative Image Composition with Calibrated Reference FeaturesJiaxuan Chen, Bo Zhang 0075, Qingdong He, Jinlong Peng, Li Niu 0002. 2877-2885 [doi]
- LoGoSeg: Integrating Local and Global Features for Open-Vocabulary Semantic SegmentationJunyang Chen, Xiangbo Lv, Zhiqiang Kou, Xingdong Sheng, Ning Xu, Yiguo Qiao. 2886-2894 [doi]
- Intra-Image Mining and Symmetric Maximum Concept Matching for Few Shot Out-of-Distribution DetectionKaixiang Chen, Pengfei Fang, Hui Xue 0002. 2895-2903 [doi]
- EvDiff3D: Event-Aware Diffusion Repair for High-Fidelity Event-Based 3D ReconstructionKanghao Chen, Zixin Zhang, Hangyu Li, Lin Wang 0025, Zeyu Wang 0003. 2904-2912 [doi]
- MosaicDoc: A Large-Scale Bilingual Benchmark for Visually Rich Document UnderstandingKetong Chen, Yuhao Chen, Yang Xue. 2913-2921 [doi]
- CoMA-SLAM: Collaborative Multi-Agent Gaussian SLAM with Geometric ConsistencyLin Chen 0042, Yongxin Su, Jvboxi Wang, Pengcheng Han, Zhenyu Xia, Shuhui Bu, Kun Li, Boni Hu, Shengqi Meng, Guangming Wang 0001. 2922-2929 [doi]
- StegaVAR: Privacy-Preserving Video Action Recognition via Steganographic Domain AnalysisLixin Chen, Chaomeng Chen, Jiale Zhou, Zhijian Wu, Xun Lin. 2930-2938 [doi]
- Human-Centric Video Generation via Collaborative Multi-Modal ConditioningLiyang Chen, Tianxiang Ma, Jiawei Liu, Bingchuan Li, Zhuowei Chen, Lijie Liu, Xu He, Gen Li, Qian He, Zhiyong Wu 0001. 2939-2947 [doi]
- Fast Multi-view Consistent 3D Editing with Video PriorsLiyi Chen 0002, Ruihuang Li, Guowen Zhang, Pengfei Wang, Lei Zhang 0006. 2948-2956 [doi]
- Sparse-vDiT: Unleashing the Power of Sparse Attention to Accelerate Video Diffusion TransformersPengtao Chen, Xianfang Zeng, Maosen Zhao, Mingzhu Shen, Wei Cheng, Gang Yu 0002, Tao Chen 0003. 2957-2965 [doi]
- 3D-DRES: Detailed 3D Referring Expression SegmentationQi Chen, Changli Wu, Jiayi Ji, Yiwei Ma, Liujuan Cao. 2966-2974 [doi]
- LiDAR-GS++: Improving LiDAR Gaussian Reconstruction via Diffusion PriorsQifeng Chen, Jiarun Liu, Rengan Xie, Tao Tang, Sicong Du, Yiru Zhao, Yuchi Huo, Sheng Yang 0007. 2975-2983 [doi]
- VILTA: A VLM-in-the-Loop Adversary for Enhancing Driving Policy RobustnessQimao Chen, Fang Li, Shaoqing Xu, Zhiyi Lai, Zixun Xie, Yuechen Luo, Shengyin Jiang, Hanbing Li, Long Chen 0005, Bing Wang, Yi Zhang, Zhi-Xin Yang. 2984-2992 [doi]
- FreeGaussian: Annotation-free Control of Articulated Objects via 3D Gaussian Splats with Flow DerivativesQizhi Chen, Delin Qu, Junli Liu, Yiwen Tang, Haoming Song, Dong Wang 0028, Yuan Yuan 0001, Bin Zhao 0001. 2993-3001 [doi]
- DeNAS-ViT: Data Efficient NAS-Optimized Vision Transformer for Ultrasound Image SegmentationRenqi Chen, Xinzhe Zheng 0001, Haoyang Su, Kehan Wu. 3002-3010 [doi]
- Domain-Auxiliary Infrared Moving Small Target Detection by Learning to Overlook Domain DiscrepancyShengjia Chen, Luping Ji, Shuang Peng, Sicheng Zhu, Mao Ye 0001. 3011-3019 [doi]
- Unleashing Semantic and Geometric Priors for 3D Scene CompletionShiyuan Chen, Wei Sui, Bohao Zhang, Zeyd Boukhers, John See, Cong Yang. 3020-3028 [doi]
- Exploring the Potentials of Spiking Neural Networks for Image DerainingShuang Chen 0010, Tomás Krajník, Farshad Arvin, Amir Atapour Abarghouei. 3029-3037 [doi]
- Spectral Property-Driven Data Augmentation for Hyperspectral Single-Source Domain GeneralizationTaiqin Chen, Yifeng Wang 0001, Xiaochen Feng, Zhilin Zhu, Hao Sha 0001, Yingjian Li, Yongbing Zhang 0002. 3038-3046 [doi]
- EndoIR: Degradation-Agnostic All-in-One Endoscopic Image Restoration via Noise-Aware Routing DiffusionTong Chen, Xinyu Ma, Long Bai 0008, WenYang Wang, Yue Sun, Luping Zhou. 3047-3055 [doi]
- IGIANet: Illumination Guided Implicit Alignment Network for Infrared-Visible UAV DetectionXiangqi Chen, Dawei Zhang 0002, Li Zhao 0005, Chengzhuan Yang, Zhongyu Chen, Jungang Lou, Zhonglong Zheng, Sang-Woon Jeon, Hua Wang 0002. 3056-3064 [doi]
- Unsupervised Multi-View Visual Anomaly Detection via Progressive Homography-Guided AlignmentXintao Chen, Xiaohao Xu, Bozhong Zheng, Yun Liu, Yingna Wu. 3065-3073 [doi]
- Flowing Backwards: Improving Normalizing Flows via Reverse Representation AlignmentYang Chen, Xiaowei Xu, Shuai Wang, Chenhui Zhu, Ruxue Wen, Xubin Li, Tiezheng Ge, Limin Wang 0002. 3074-3082 [doi]
- SAM2-OV: A Novel Detection-Only Tuning Paradigm for Open-Vocabulary Multi-Object TrackingYangkai Chen, Qiangqiang Wu, Guangyao Li, Junlong Gao, Guanglin Niu, Hanzi Wang. 3083-3091 [doi]
- Multimodal Gaussian Mixture Variational Autoencoder with Consistency RegularizationsYarui Chen, Lehan Hong, Jianlin Shao, Jianning Yang, Tingting Zhao 0001, Yun Liao, Yancui Shi. 3092-3100 [doi]
- ProPL: Universal Semi-Supervised Ultrasound Image Segmentation via Prompt-Guided Pseudo-LabelingYaxiong Chen, Qicong Wang, Chunlei Li, Jingliang Hu, Yilei Shi, Shengwu Xiong 0001, Xiao Xiang Zhu 0001, Lichao Mou. 3101-3110 [doi]
- PCGS: Progressive Compression of 3D Gaussian SplattingYihang Chen 0002, Mengyao Li, Qianyi Wu, Weiyao Lin, Mehrtash Harandi, Jianfei Cai 0001. 3111-3119 [doi]
- StreamKV: Streaming Video Question-Answering with Segment-based KV Cache Retrieval and CompressionYilong Chen, Xiang Bai, Zhibin Wang, Chengyu Bai, Yuhan Dai, Ming Lu 0002. 3120-3128 [doi]
- ContextFlow: Training-Free Video Object Editing via Adaptive Context EnrichmentYiyang Chen, Xuanhua He, Xiujun Ma, Jack Ma. 3129-3137 [doi]
- RefAdGen: High-Fidelity Advertising Image GenerationYiyun Chen, Weikai Yang. 3138-3146 [doi]
- HybriDLA: Hybrid Generation for Document Layout AnalysisYufan Chen 0001, Omar Moured, Ruiping Liu 0001, Junwei Zheng, Kunyu Peng, Jiaming Zhang 0001, Rainer Stiefelhagen. 3147-3155 [doi]
- Revisiting Network Inertia: Dynamic Inertia Inhibition Coupled Multidimensional Periodicity for Infrared and Visible Image FusionYufeng Chen 0006, Yuan Sun 0016, Hao Pan, Xujian Zhao, Jian Dai 0002, Zhenwen Ren, Xingfeng Li 0004. 3156-3164 [doi]
- O-DisCo-Edit: Object Distortion Control for Unified Realistic Video EditingYuqing Chen, Junjie Wang 0012, Lin Liu 0016, Ruihang Chu, Xiaopeng Zhang 0008, Qi Tian 0001, Yujiu Yang 0001. 3165-3173 [doi]
- LAMIC: Layout-Aware Multi-Image Composition via Scalability of Multimodal Diffusion TransformerYuzhuo Chen, Zehua Ma, Jianhua Wang, Kai Kang, Shunyu Yao, Weiming Zhang 0001. 3174-3182 [doi]
- RAPTOR: Real-Time High-Resolution UAV Video Prediction with Efficient Video AttentionZhan Chen, Zile Guo, Enze Zhu, Peirong Zhang, Xiaoxuan Liu, Lei Wang, Yidan Zhang. 3183-3190 [doi]
- Steering One-Step Diffusion Model with Fidelity-Rich Decoder for Fast Image CompressionZheng Chen 0014, Mingde Zhou, Jinpei Guo, Jiale Yuan, Yifei Ji, Yulun Zhang 0001. 3192-3200 [doi]
- Empowering DINO Representations for Underwater Instance Segmentation via Aligner and PrompterZhiyang Chen, Chen Zhang 0013, Hao Fang 0010, Runmin Cong. 3201-3209 [doi]
- CaPro: Curvilinear-aware Prompt Learning with Single Unlabeled Image for Cost-effective Curvilinear Structure SegmentationZhuangzhuang Chen, Qiangyu Chen, Chubin Ou, Xiaomeng Li 0001. 3210-3218 [doi]
- Force-Aware 3D Contact Modeling for Stable Grasp GenerationZhuo Chen 0028, Zhongqun Zhang, Yihua Cheng, Ales Leonardis, Hyung Jin Chang. 3219-3227 [doi]
- DCMM-Transformer: Degree-Corrected Mixed-Membership Attention for Medical ImagingHuimin Cheng, Xiaowei Yu 0001, Shushan Wu, Luyang Fang, Chao Cao, Jing Zhang 0010, Tianming Liu 0001, Dajiang Zhu, Wenxuan Zhong, Ping Ma 0001. 3228-3236 [doi]
- Phased One-Step Adversarial Equilibrium for Video Diffusion ModelsJiaxiang Cheng, Bing Ma, Xuhua Ren, Hongyi Henry Jin, Kai Yu, Peng Zhang, Wenyue Li, Yuan Zhou, Tianxiang Zheng 0001, Qinglin Lu. 3237-3245 [doi]
- Thinking Aesthetics Assessment of Image Color Temperature: Models, Datasets and BenchmarksJinguang Cheng, Chunxiao Li, Shuai He, Taiyu Chen, Anlong Ming. 3246-3254 [doi]
- RFI: Rectified Flow Intervention for Mitigating Object Hallucination in Large Vision-Language ModelsJunyu Cheng, Zhibiao Liang, Yidong Chen 0001, Shuangyin Li. 3255-3263 [doi]
- First Learn, Then Review: Human-Like Continual Learning for Cross-View Geo-Localization with Limited Field of ViewLei Cheng, Daikun Liu, Zhikun Chen, Teng Wang. 3264-3272 [doi]
- Multi-Semantic Modeling for Glass Surface Detection in the WildQianyu Cheng, Huankang Guan, Rynson W. H. Lau. 3273-3281 [doi]
- RadarMP: Motion Perception for 4D mmWave Radar in Autonomous DrivingRuiqi Cheng, Huijun Di, Jian Li, Feng Liu, Wei Liang 0008. 3282-3290 [doi]
- HandMCM: Multi-modal Point Cloud-based Correspondence State Space Model for 3D Hand Pose EstimationWencan Cheng, Gim Hee Lee. 3291-3299 [doi]
- 360Explorer: Exploring 4D Controllable World in Panoramic VideosXinhua Cheng, Haiyang Zhou, Wangbo Yu, Tanghui Jia, Bin Lin 0014, Yunyang Ge, Weiqi Li, Li Yuan 0007. 3300-3308 [doi]
- Decomposing Prompts, Composing Actions: A Multi-Granularity Prompting Approach for Incremental Action LearningXinyi Cheng, Chenghao Xu, Xi Wang, Jiexi Yan, Yanhua Yang. 3309-3317 [doi]
- RFNNS: Robust Fixed Neural Network Steganography with Universal Text-to-Image ModelsYu Cheng 0013, Jiuan Zhou, Jiawei Chen, Zhaoxia Yin, Xinpeng Zhang 0001. 3318-3326 [doi]
- Towards High-Resolution 3D Anomaly Detection: A Scalable Dataset and Real-Time Framework for Subtle Industrial DefectsYuqi Cheng, Yihan Sun 0007, Hui Zhang, Weiming Shen 0001, Yunkang Cao. 3327-3334 [doi]
- Adaptive Agent Selection and Interaction Network for Image-to-Point Cloud RegistrationZhixin Cheng, Xiaotian Yin, Jiacheng Deng 0002, Bohao Liao, Yujia Chen, Xu Zhou, Baoqun Yin, Tianzhu Zhang 0001. 3335-3343 [doi]
- Learning 3D Texture-Aware Representations for Parsing Diverse Human Clothing and Body PartsKiran Chhatre, Christopher E. Peters, Srikrishna Karanam. 3344-3352 [doi]
- VMChill: A Dataset for Fine-Grained Visual-Musical SynergyXiaowei Chi, Zeyue Tian, Jialiang Chen, Wei Xue 0002. 3353-3362 [doi]
- 4D Scaffold Gaussian Splatting with Dynamic-Aware Anchor Growing for Efficient and High-Fidelity Dynamic Scene ReconstructionWoong Oh Cho, In Cho, Seoha Kim, Jeongmin Bae 0001, Youngjung Uh, Seon Joo Kim. 3363-3371 [doi]
- UAV4D: Dynamic Neural Rendering of Human-Centric UAV Imagery Using Gaussian SplattingJaehoon Choi, Dongki Jung, Chris Maxey, Sungmin Eum, Yonghan Lee 0001, Dinesh Manocha, Heesung Kwon. 3372-3380 [doi]
- Improving Deepfake Detection with Reinforcement Learning-Based Adaptive Data AugmentationYuxuan Chou, Tao Yu, Wen Huang, Yuheng Zhang, Tao Dai 0001, Shu-Tao Xia. 3381-3389 [doi]
- TraveLLaMA: A Multimodal Travel Assistant with Large-Scale Dataset and Structured ReasoningMeng Chu, Yukang Chen, Haokun Gui, Shaozuo Yu, Yi Wang 0074, Jiaya Jia. 3390-3398 [doi]
- uCLIP: Parameter-Efficient Multilingual Extension of Vision-Language Models with Unpaired DataDahyun Chung, DongHyun Shin, Yujin Sung, Seunggi Moon, Jinwoo Jeon, Byung Jun Lee. 3399-3406 [doi]
- Rethinking Progression of Memory State in Robotic Manipulation: An Object-Centric PerspectiveNhat Chung, Taisei Hanyu, Toan Nguyen 0004, Huy Le 0001, Frederick Bumgarner, Duy Minh Ho Nguyen, Khoa Vo 0001, Kashu Yamazaki, Chase Rainwater, Tung Kieu, Anh Nguyen 0003, Ngan Le. 3407-3415 [doi]
- Divide-and-Conquer Decoupled Network for Cross-Domain Few-Shot SegmentationRunmin Cong, Anpeng Wang, Bin Wan, Cong Zhang, Xiaofei Zhou 0003, Wei Zhang. 3416-3424 [doi]
- Towards Efficient and Effective Interactive 3D SegmentationWei Cong, Yang Cong, Jiahua Dong 0001, Gan Sun. 3425-3433 [doi]
- On Model and Data Scaling for Skeleton-based Self-Supervised Gait RecognitionAdrian Cosma, Andy-Eduard Catruna, Emilian Radoi. 3434-3442 [doi]
- Rethinking Long-tailed Dataset Distillation: A Uni-Level Framework with Unbiased Recovery and RelabelingXiao Cui, Yulei Qin, Xinyue Li, Wengang Zhou 0001, Hongsheng Li, Houqiang Li. 3443-3451 [doi]
- CKDA: Cross-modality Knowledge Disentanglement and Alignment for Visible-Infrared Lifelong Person Re-identificationZhenyu Cui, Jiahuan Zhou, Yuxin Peng 0001. 3452-3460 [doi]
- DentalGS: Pose-Free 3D Gaussian Splatting from Five Intraoral Images for Novel View SynthesisHonghao Dai, Yuanfeng Zhou, Guangshun Wei, Zhihao Li, Wenping Wang 0001. 3461-3469 [doi]
- GuideGen: A Text-Guided Framework for Paired Full-torso Anatomy and CT Volume GenerationLinrui Dai, Rongzhao Zhang, Yongrui Yu, Xiaofan Zhang 0002. 3470-3478 [doi]
- CLIP-FTI: Fine-Grained Face Template Inversion via CLIP-Driven Attribute ConditioningLongchen Dai, Zixuan Shen, Zhiheng Zhou, Peipeng Yu, Zhihua Xia. 3479-3487 [doi]
- AnchorHOI: Zero-shot Generation of 4D Human-Object Interaction via Anchor-based Prior DistillationSisi Dai, Kai Xu 0004. 3488-3496 [doi]
- SparseWorld: A Flexible, Adaptive, and Efficient 4D Occupancy World Model Powered by Sparse and Dynamic QueriesChenxu Dang, Haiyan Liu, Jason Bao, Pei-an, Xinyue Tang, An Pan, Jie Ma 0003, Bingchuan Sun, Yan Wang. 3497-3505 [doi]
- Primary Visual Cortex Inspired Point Cloud Analysis FrameworkJisheng Dang, Delin Deng, Bimei Wang, Jingze Wu, Hui Zhang, Haijiang Li, Jingmei Jiao, Dengyue Pan, Mangang Xie, Jizhao Liu. 3506-3514 [doi]
- Clean-Label Physical Backdoor Attacks with Data DistillationThinh Dao, Khoa D. Doan, Kok Seng Wong. 3515-3523 [doi]
- EigenShield: Inference-Time, Model-Agnostic Jailbreaking Defense via Causal Subspace FilteringNastaran Darabi, Devashri Naik, Sina Tayebati, Dinithi Jayasuriya, Ranganath Krishnan, Amit Ranjan Trivedi. 3524-3532 [doi]
- Human Motion UnlearningEdoardo De Matteis, Matteo Migliarini, Alessio Sampieri, Indro Spinelli, Fabio Galasso. 3533-3541 [doi]
- Class-Partitioned VQ-VAE and Latent Flow Matching for Point Cloud Scene GenerationDasith de Silva Edirimuni, Ajmal Saeed Mian. 3542-3550 [doi]
- Panda: Test-Time Adaptation with Negative Data AugmentationRuxi Deng, Wenxuan Bao, Tianxin Wei, Jingrui He. 3551-3559 [doi]
- Pano-GS: Perception-Aware Gaussian Optimization with Gradient Consistency and Multi-Criteria Densification for High-Quality RenderingYang Deng, Zhanke Wang, Jiahao Wu, Jie Liang, Jingui Ma, Yang Hu, Ronggang Wang. 3560-3568 [doi]
- SGPFeat: Semantic and Geometric Priors for Multi-modal Image MatchingYuxin Deng 0002, Botian Wang, Kaining Zhang, Hao Zhang 0073, Jiayi Ma 0001. 3569-3577 [doi]
- Uncertainty-Propelled Physics-MAE Fusion for Self-Supervised Diffusion-Weighted Image DenoisingZeyu Deng, Lihui Wang 0002, Xi Tao, Qijian Chen, Ying Cao, Xulin Hu, Yingfeng Ou. 3578-3586 [doi]
- UniC-Lift: Unified 3D Instance Segmentation via Contrastive LearningAnkit Dhiman, R. Srinath, Jaswanth Reddy, Lokesh R. Boregowda, Venkatesh Babu Radhakrishnan. 3587-3595 [doi]
- S2C: A Noise-Resistant Difference Learning Framework for Unsupervised Change Detection in VHR Remote Sensing ImagesLei Ding 0008, Xibing Zuo, Haitao Guo, Jun Lu 0005, Zhihui Gong, Xuanguang Liu, Jicang Lu. 3596-3604 [doi]
- RayD3D: Distilling Depth Knowledge Along the Ray for Robust Multi-View 3D Object DetectionRui Ding, Zhaonian Kuang, Zongwei Zhou, Meng Yang 0002, Xinhu Zheng, Gang Hua 0001. 3605-3613 [doi]
- ReACT: Reward-informed Autoregressive Decision CAD TransformerYijie Ding, Yang Liu, Haobo Jiang, Jianmin Zheng. 3614-3622 [doi]
- Bring Your Dreams to Life: Continual Text-to-Video CustomizationJiahua Dong 0001, Xudong Wang, Wenqi Liang, Zongyan Han, Meng Cao 0002, Duzhen Zhang, Hanbin Zhao, Zhi Han, Salman Khan 0001, Fahad Shahbaz Khan. 3623-3631 [doi]
- RacketVision: A Multiple Racket Sports Benchmark for Unified Ball and Racket AnalysisLinfeng Dong, Yuchen Yang, Hao Wu, Wei Wang, Yuenan Hou, Zhihang Zhong, Xiao Sun. 3632-3640 [doi]
- Zero-Reference Joint Low-Light Enhancement and Deblurring via Visual Autoregressive Modeling with VLM-Derived ModulationWei Dong 0011, Han Zhou 0003, Junwei Lin, Jun Chen 0005. 3641-3649 [doi]
- Spike Stream Memory Transfer for Dynamic Scene ReconstructionYanchen Dong 0001, Ruiqin Xiong, Rui Zhao 0010, Xinfeng Zhang 0001, Tiejun Huang 0001. 3650-3658 [doi]
- One-Shot Refiner: Boosting Feed-forward Novel View Synthesis via One-Step DiffusionYitong Dong, Qi Zhang, Minchao Jiang, Zhiqiang Wu, Qingnan Fan, Ying Feng, Huaqi Zhang, Hujun Bao, Guofeng Zhang 0001. 3659-3667 [doi]
- AIR-DR: Adaptive Image Retargeting with Instance Relocation and Dual-guidance RepaintingZhitong Dong, Chao Li, Yongjian Deng, Hao Chen 0034. 3668-3676 [doi]
- DEIG: Detail-Enhanced Instance Generation with Fine-Grained Semantic ControlShiyan Du, Conghan Yue, Xinyu Cheng, Dongyu Zhang 0002. 3677-3685 [doi]
- DEFANet: Dual-Path Edge-Target Collaboration with Frequency-Aware Enhancement for Infrared Small Target DetectionShuaiyuan Du, Yang Xiao 0007, Zhiguo Cao 0001. 3686-3695 [doi]
- Pansharpening for Thin-Cloud Contaminated Remote Sensing Images: A Unified Framework and Benchmark DatasetSongcheng Du, Yang Zou 0004, Jiaxin Li, Mingxuan Liu, Ying Li 0017, Changjing Shang, Qiang Shen 0001. 3696-3704 [doi]
- MDiff4STR: Mask Diffusion Model for Scene Text RecognitionYongkun Du, Miaomiao Zhao, Songlin Fan, Zhineng Chen, Caiyan Jia, Yu-Gang Jiang 0001. 3705-3713 [doi]
- Learning 3D Occupancy from Beam Overlap in 2D Rotating mmWave RadarYu Du, Ruifeng Nie, Long Ma 0002, Chengpei Xu, Yu Liu 0012, Weimin Wang 0007. 3714-3722 [doi]
- SeViL: Semi-supervised Vision-Language Learning with Text Prompt Guiding for Moving Infrared Small Target DetectionWeiwei Duan, Luping Ji, Jianghong Huang, Sicheng Zhu. 3723-3731 [doi]
- Cross-domain Joint Learning with Prototype-guided Mixture-of-Experts for Infrared Moving Small Target DetectionWeiwei Duan, Luping Ji, Jianghong Huang, Sicheng Zhu, Mao Ye 0001. 3732-3740 [doi]
- Mix-QSAM2: Mixed-Precision Quantization for High Fidelity Segmentation in Resource Constrained ScenariosYuzhe Duan, Xuanxuan Ren, Guizhe Dong, Xu Yang 0019, Yanhua Yang. 3741-3749 [doi]
- Rethinking Saliency Maps: A Cognitive Human Aligned Taxonomy and Evaluation Framework for ExplanationsYehonatan Elisha, Seffi Cohen, Oren Barkan, Noam Koenigstein. 3750-3758 [doi]
- Empowering Semantic-Sensitive Underwater Image Enhancement with VLMGuodong Fan, Shengning Zhou, Genji Yuan, Huiyu Li, Jingchun Zhou, Jinjiang Li 0001. 3759-3767 [doi]
- AutoPP: Towards Automated Product Poster Generation and OptimizationJiahao Fan, Yuxin Qin, Wei Feng, Yanyin Chen, Yaoyu Li, Ao Ma, Yixiu Li, Li Zhuang, Haoyi Bian, Zheng Zhang, Jingjing Lv, Junjie Shen, Ching Law. 3768-3776 [doi]
- mmPred: Radar-based Human Motion Prediction in the DarkJunqiao Fan, Haocong Rao, Jiarui Zhang, Jianfei Yang, Lihua Xie 0001. 3777-3785 [doi]
- Enhancing Interpretability for Vision Models via Shapley Value OptimizationKanglong Fan, Yunqiao Yang, Chen Ma. 3786-3794 [doi]
- BCE3S: Binary Cross-Entropy Based Tripartite Synergistic Learning for Long-Tailed RecognitionWeijia Fan, Qiufu Li, Jiajun Wen 0001, Xiaoyang Peng. 3795-3803 [doi]
- Leveraging Dissimilarity Invariance as a Robust Anchor for Learning with Noisy LabelsWenxiao Fan, Kan Li 0001. 3804-3812 [doi]
- Segment and Matte Anything in a Unified ModelZezhong Fan, Xiaohan Li 0001, Topojoy Biswas, Kaushiki Nag, Kannan Achan. 3813-3821 [doi]
- Point Cloud Semantic Scene Completion with Prototype-Guided TransformerChenghao Fang 0001, Jianqing Liang, Jiye Liang, Zijin Du, Feilong Cao. 3822-3830 [doi]
- MotionCharacter: Fine-Grained Motion Controllable Human Video GenerationHaopeng Fang, Di Qiu, Binjie Mao, He Tang 0002. 3831-3839 [doi]
- Spatio-Temporal Context Learning with Temporal Difference Convolution for Moving Infrared Small Target DetectionHouzhang Fang, Shukai Guo, Qiuhuan Chen, Yi Chang 0002, Luxin Yan. 3840-3848 [doi]
- Depth-Synergized Mamba Meets Memory Experts for All-Day Image Reflection SeparationSiyan Fang, Long Peng, Yuntao Wang, Ruonan Wei, Yuehuan Wang. 3849-3857 [doi]
- Towards Unified Vision-Language Models with Incomplete Multi-Modal InputsXiang Fang, Wanlong Fang, Changshuo Wang 0001, Keke Tang, Daizong Liu, Siyi Wang, Wei Ji 0008. 3858-3866 [doi]
- Unveiling the Fragility of Vision-Language Models: Multi-Modal Adversarial Synergy via Texture-Constrained Perturbations and Cross-Modal OptimizationXiang Fang, Wanlong Fang, Changshuo Wang 0001. 3867-3875 [doi]
- Disentangling Adversarial Prompts: A Semantic-Graph Defense for Robust LLM SecurityXiang Fang, Wanlong Fang. 3876-3884 [doi]
- Rethinking Video-Language Model from the Language Input PerspectiveXiang Fang, Wanlong Fang, Changshuo Wang 0001, Xiaoye Qu, Daizong Liu. 3885-3893 [doi]
- Open-World 3D Scene Graph Generation for Retrieval-Augmented ReasoningFei Yu 0012, Quan Deng, Shengeng Tang, Yuehua Li, Lechao Cheng. 3894-3902 [doi]
- Scene-Aware Spatiotemporal Generalization: Towards Robust Temporal Action Detection Across DomainsFangming Feng, Sihang Cai, Zequn Xie, Yangyang Wu, Tao Jin 0004. 3903-3911 [doi]
- Sparse3DPR: Training-Free 3D Hierarchical Scene Parsing and Task-Adaptive Subgraph Reasoning from Sparse RGB ViewsHaida Feng, Hao Wei 0008, Zewen Xu, Haolin Wang 0005, Chade Li, Yihong Wu 0002. 3912-3920 [doi]
- Personalize Anything for Free with Diffusion TransformerHaoran Feng, Zehuan Huang, Lin Li, Lu Sheng. 3921-3929 [doi]
- Zero-shot Implicit Neural Manifold Representation (INMR) for Ultra-high Temporal Resolution Dynamic MRIJie Feng 0013, Rui Luo, Tian Zeng, Xin Shen, Haikun Qi, Yuyao Zhang 0005, Dong Liang, Hongjiang Wei. 3930-3938 [doi]
- Stabilizing Cross-Modal Bidirectional Attribution: Few-Shot Adversarial Prompt Tuning for Robust Vision-Language ModelsJun Feng 0007, Shuhong Wu, Hong Sun, Pengfei Zhang 0010, Bocheng Ren, Shunli Zhang 0003. 3939-3947 [doi]
- UV-RGS: Relightable 3D Gaussian Splatting from Unposed Views Under Varied IlluminationsWei Feng 0005, Chi Huang, Qi Zhang 0071, Nan Li 0048. 3948-3956 [doi]
- IE-SRGS: An Internal-External Knowledge Fusion Framework for High-Fidelity 3D Gaussian Splatting Super-ResolutionXiang Feng, Tieshi Zhong, Shuo Chang, Weiliu Wang, Chengkai Wang, Yifei Chen, Tongyu Hu, Yuhe Wang, Zhenzhong Kuang, Xuefei Yin, Yanming Zhu 0001. 3957-3965 [doi]
- ElastoGen: 4D Generative ElastodynamicsYutao Feng, Yintong Shang, Xiang Feng, Lei Lan, Shandian Zhe, Tianjia Shao, Hongzhi Wu, Kun Zhou 0001, Chenfanfu Jiang, Yin Yang 0002. 3966-3975 [doi]
- SCAN: Self-Calibrated AutoregressioN for High-Quality Visual GenerationZhanzhou Feng, Qingpei Guo, Jingdong Chen, Feng Gao, Ming Yang 0007, Shiliang Zhang. 3976-3984 [doi]
- DR.Experts: Differential Refinement of Distortion-Aware Experts for Blind Image Quality AssessmentBohan Fu, Guanyi Qin, Fazhan Zhang, Zihao Huang, Mingxuan Li, Runze Hu. 3985-3993 [doi]
- MOGO: Residual Quantized Hierarchical Causal Transformer for Real-Time and Infinite-Length 3D Human Motion GenerationDongjie Fu, Tengjiao Sun, Pengcheng Fang, Xiaohao Cai, Hansung Kim 0001. 3994-4002 [doi]
- LayerEdit: Disentangled Multi-Object Editing via Conflict-Aware Multi-Layer LearningFengyi Fu, Mengqi Huang, Lei Zhang 0119, Zhendong Mao 0001. 4003-4011 [doi]
- RaLiFlow: Scene Flow Estimation with 4D Radar and LiDAR Point CloudsJingyun Fu, Zhiyu Xiang, Na Zhao 0004. 4012-4021 [doi]
- OscuFit: Learning to Fit Osculating Implicit Quadrics for Point CloudsRao Fu, Qian Li, Liang Yu, Jianmin Zheng. 4022-4030 [doi]
- OmniPT: Unleashing the Potential of Large Vision Language Models for Pedestrian Tracking and UnderstandingTeng Fu 0001, Mengyang Zhao, Ke Niu 0004, Kaixin Peng, Bin Li 0015. 4031-4039 [doi]
- Scene Experts: Specializing in 3D Gaussian Splatting with Adaptive DecompositionXiaowen Fu, Yang Zhang, Yuhan Tang, Huazhong Zhang, Tianxing Zhao, Yuhang Guo, Yu Huang, Jinbao Wang. 4040-4048 [doi]
- Unleashing the Power of Image-Tabular Self-Supervised Learning via Breaking Cross-Tabular BarriersYibing Fu, Yunpeng Zhao, Zhitao Zeng, Cheng Chen 0013, Yueming Jin. 4049-4057 [doi]
- Semi-Supervised Semantic Segmentation via Derivative Label PropagationYuanbin Fu, Xiaojie Guo 0001. 4058-4066 [doi]
- SAME: Spatial-Aware Multimodal Egocentric Human Pose EstimationYurong Fu, Peng Dai, Yu Zhang, Yiqiang Feng, Yang Zhang, Haoqian Wang. 4067-4075 [doi]
- DeFB: Decomposed Feature Learning for Real-Time Multi-Person Eyeblink Detection in Untrimmed In-the-Wild VideosJinfang Gan, Wenzheng Zeng, Yang Xiao 0007, Xintao Zhang, Chaoyang Zheng, Ran Zhao, Ran Wang 0005, Min Du, Zhiguo Cao 0001. 4076-4084 [doi]
- GenPTW: Latent Image Watermarking for Provenance Tracing and Tamper LocalizationZhenliang Gan, Chunya Liu, Yichao Tang, Binghao Wang, Shiwen Cui, Weiqiang Wang, Xinpeng Zhang. 4085-4093 [doi]
- AdaptCLIP: Adapting CLIP for Universal Visual Anomaly DetectionBin-Bin Gao, Yue Zhou, Jiangtao Yan, Yuezhi Cai, Weixi Zhang, Meng Wang, Jun Liu 0071, Yong Liu 0032, Lei Wang, Chengjie Wang 0001. 4095-4103 [doi]
- OrgaCast: A Trustworthy Spatiotemporal Diffusion Model for Fluorescence Organoid ForecastingDawei Gao, Angello Huerta Gomez, Mingchen Li, Marcel El-Mokahal, Huaxiao Yang, Yunhe Feng. 4104-4112 [doi]
- APVR: Hour-Level Long Video Understanding with Adaptive Pivot Visual Information RetrievalHong Gao, Yiming Bao, Xuezhen Tu, Bin Zhong, Linan Yue, Min-Ling Zhang. 4113-4121 [doi]
- The Structure-Equivalent Prior: Unifying Temporal Dynamics and 3D Evolution in 4D Latent SpaceJingyuan Gao, Tianyu Shen, Ruosen Hao, Te Guo 0003, Zhiwei Li, Kunfeng Wang. 4122-4130 [doi]
- ViType: High-Fidelity Visual Text Rendering via Glyph-Aware Multimodal DiffusionLishuai Gao, Jun-Yan He, Yingsen Zeng, Yujie Zhong, Xiaopeng Sun, Jie Hu, Zan Gao, Xiaoming Wei. 4131-4139 [doi]
- Remember Me: Bridging the Long-Range Gap in LVLMs with Three-Step Inference-Only Decay Resilience StrategiesPeng Gao, Yujian Lee, Xiaofeng Zhang, Zailong Chen, Hui Zhang 0062. 4140-4148 [doi]
- VisAssist: A Visually Impaired-Captured Video Question Answering Benchmark for Assistive SystemsQi Gao, Heng Li 0006, Yixin Zhou, Meixuan Zhou, Jieqiong Chen, Xinyu Chai. 4149-4157 [doi]
- T-APT: Text-Guided Modality-Aware Prompt Tuning for Arbitrary Multimodal Remote Sensing Data Joint ClassificationQinghao Gao, Jiahui Qu, Wenqian Dong. 4158-4166 [doi]
- VAGU & GtS: LLM-Based Benchmark and Framework for Joint Video Anomaly Grounding and UnderstandingShibo Gao, Peipei Yang, Yangyang Liu, Yi Chen 0027, Han Zhu, Xu-Yao Zhang, LinLin Huang. 4167-4175 [doi]
- BrainLMM: A Label-Free Framework for Mapping Multi-Semantic Representation in the Human Visual CortexTan Gao, Mufan Xue, Haofang Zheng, Shuo Lv, Jia Xu, Dabin Sheng, Ziming Mao, Xinyu Wu, Andrew Luo, Guoyuan Yang. 4176-4184 [doi]
- AdaDepth: Exploiting Inherent Scene Information for Self-Supervised Depth Estimation in Dynamic ScenesXuanang Gao, Xiongbin Wu, Zhiwei Ning, Runze Yang, Zhonglong Zheng, Jie Yang 0002, Wei Liu 0044. 4185-4193 [doi]
- IQGS: Instance Query-based Gaussian SegmentationYichao Gao, Xinyuan Liu 0003, Yike Ma, Yucheng Zhang, Feng Dai. 4194-4202 [doi]
- HKAFER: Achieve Visual Parameter-Efficient Fine-Tuning via Heterogeneous Kronecker Adaptation for Facial Expression RecognitionYu Gao 0010, Haoyu Ji 0001, Zhiyong Wang 0009, Wenze Huang, Qian Dong, Zhihao Yang, Xueting Liu 0009, Weihong Ren, Honghai Liu 0001. 4203-4211 [doi]
- High-Quality Full-Head 3D Avatar Generation from Any Single Portrait ImageYujie Gao 0001, Chencheng Wang, Xianbing Sun, Jiahui Zhan, Wentao Wang 0009, Yiyi Zhang, Haohua Zhao 0001, Liqing Zhang 0001, Jianfu Zhang 0003. 4212-4220 [doi]
- Evolving Semantic Propagation for Aerial Semantic 3D Gaussian SplattingZihan Gao, Lingling Li 0002, Xu Liu 0006, Fang Liu 0001, Licheng Jiao, Puhua Chen, Wenping Ma 0001, Shuyuan Yang 0001. 4221-4229 [doi]
- HDGS: Hierarchical Dynamic Gaussian Splatting for Urban Driving ScenesFudong Ge, Jin Gao, Hanshi Wang, YiWei Zhang, Ke Wang, Weiming Hu 0004, Zhipeng Zhang. 4230-4238 [doi]
- FreeInpaint: Tuning-free Prompt Alignment and Visual Rationality Enhancement in Image InpaintingChao Gong, Dong Li 0019, Yingwei Pan, Jingjing Chen 0001, Ting Yao 0003, Tao Mei 0001. 4239-4247 [doi]
- Human Motion Synthesis in 3D Scenes via Unified Scene Semantic OccupancyJingyu Gong, Kunkun Tong, Zhuoran Chen, Chuanhan Yuan, Mingang Chen, Zhizhong Zhang 0001, Xin Tan 0002, Yuan Xie 0006. 4248-4256 [doi]
- Diffusion Implicit Policy for Unpaired Scene-aware Motion SynthesisJingyu Gong, Chong Zhang, Fengqi Liu, Ke-fan, Qianyu Zhou 0001, Xin Tan 0002, Zhizhong Zhang 0001, Yuan Xie 0006. 4257-4265 [doi]
- From Discriminative to Generative: A Diffusion-Based Paradigm for Multi-Agent Collaborative PerceptionKexin Gong, Puyi Yao, Guiyang Luo, Quan Yuan 0004, Tiange Fu, Hui Zhang 0091, Jinglin Li. 4266-4274 [doi]
- Concepts from Representations: Post-hoc Concept Bottleneck Models via Sparse Decomposition of Visual RepresentationsShizhan Gong, Xiaofan Zhang 0002, Qi Dou 0001. 4275-4283 [doi]
- A Theory-Inspired Framework for Few-Shot Cross-Modal Sketch Person Re-IdentificationYunpeng Gong, Yongjie Hou, Jiangming Shi, Kim Long Diep, Min Jiang 0005. 4284-4292 [doi]
- SAGA: Learning Signal-Aligned Distributions for Improved Text-to-Image GenerationPaul Grimal, Michaël Soumm, Hervé Le Borgne, Olivier Ferret, Akihiro Sugimoto. 4293-4301 [doi]
- MambaSeg: Harnessing Mamba for Accurate and Efficient Image-Event Semantic SegmentationFuqiang Gu, Yuanke Li, Xianlei Long, Kangping Ji, Chao Chen 0004, Qingyi Gu, Zhenliang Ni. 4302-4310 [doi]
- SparseSurf: Sparse-View 3D Gaussian Splatting for Surface ReconstructionMeiying Gu, Jiawei Zhang, Jiahe Li 0007, Xiaohan Yu 0001, Haonan Luo 0002, Jin Zheng, Xiao Bai 0001. 4311-4319 [doi]
- Multimodal Robust Prompt Distillation for 3D Point Cloud ModelsXiang Gu, Liming Lu, Xu Zheng, AnAn Du, Yongbin Zhou, Shuchao Pang. 4320-4328 [doi]
- CD-DPE: Dual-Prompt Expert Network Based on Convolutional Dictionary Feature Decoupling for Multi-Contrast MRI Super-ResolutionXianming Gu, Lihui Wang 0002, Ying Cao, Zeyu Deng, Yingfeng Ou, Guodong Hu, Yi Chen. 4329-4338 [doi]
- CogniTrust: Cognitive Memory-Driven Verifiable Supervision for Robust HashingYiyang Gu, Bohan Wu, Yifang Qin, Jiaru Tang, Rong-Cheng Tu, Zhiping Xiao 0001, Taian Guo, Junyu Luo 0002, Wei Ju 0001, Xiao Luo 0001, Dacheng Tao, Ming Zhang 0004. 4339-4347 [doi]
- AnomalyMoE: Towards a Language-free Generalist Model for Unified Visual Anomaly DetectionZhaopeng Gu, Bingke Zhu, Guibo Zhu, Yingying Chen 0003, Wei Ge, Ming Tang 0001, Jinqiao Wang. 4348-4356 [doi]
- Rectified Noise: A Generative Model Using Positive-incentive NoiseZhenyu Gu, Yanchen Xu, Sida Huang, Yubin Guo, Hongyuan Zhang 0001. 4357-4365 [doi]
- RoadSceneVQA: Benchmarking Visual Question Answering in Roadside Perception Systems for Intelligent Transportation SystemRunwei Guan, Rongsheng Hu, Shangshu Chen, Ningyuan Xiao, Xue Xia, Jiayang Liu, Beibei Chen, Ziren Tang, Ningwei Ouyang, Shaofeng Liang, Yuxuan Fan, Wanjie Sun, Yutao Yue. 4366-4375 [doi]
- Domain Adaptation Guided Infrared and Visible Image FusionTianwei Guan, Haozhen Wei, Yuhan Zhou, Jun Ma, Zecheng Xu, Zhiying Jiang, Jinyuan Liu 0001, Xingyuan Li 0005. 4376-4384 [doi]
- Point Cloud Segmentation of Integrated Circuits Package Substrates Surface Defects Using Causal Inference: Dataset Construction and MethodologyBingyang Guo, Qiang Zuo, Ruiyun Yu. 4385-4394 [doi]
- Guiding Point Cloud Denoising with Learned Structural PriorsChuchen Guo, Zheng Liu 0004, Ying He 0001. 4395-4402 [doi]
- Enhancing Rotation-Invariant 3D Learning with Global Pose Awareness and Attention MechanismsJiaxun Guo, Manar Amayri, Nizar Bouguila, Xin Liu 0011, Wentao Fan 0001. 4403-4411 [doi]
- PhysPatch: A Physically Realizable and Transferable Adversarial Patch Attack for Multimodal Large Language Models-based Autonomous Driving SystemsQi Guo 0008, Xiaojun Jia, Shanmin Pang, Simeng Qin, Lin Wang, Ju Jia, Yang Liu 0003, Qing Guo 0005. 4412-4420 [doi]
- HyperSign: Hierarchical Hypergraph-based Co-occurrence Modeling for Sign Language Recognition and TranslationQianren Guo, Yuehang Wang, Yongji Zhang, Qi Chu 0010, Sen Liu, Yu Jiang 0006. 4421-4429 [doi]
- SPSC: Sparse and Scalable Multi-Modal 3D Occupancy Prediction for Autonomous DrivingQingju Guo, Shuang Li 0008, Binhui Xie, Jing Geng, Wei Li. 4430-4438 [doi]
- Physics-Aware Accelerated Unrolling Model for Sparse-View CT ReconstructionShaojie Guo, Yingying Fang, Junkang Zhang, Yan Wang 0033. 4439-4447 [doi]
- Revisiting Attention in the Dark for Low-Light Person Re-IdentificationXiang Guo, Ruimin Hu, Dongliang Zhu 0001, Mei Wang. 4448-4457 [doi]
- Ev-iCRF: Self-supervised Event-guided iCRF Estimation for HDR Image ReconstructionXucheng Guo, Bing Li, Lin Wang 0025, Yiran Shen 0001. 4458-4466 [doi]
- OWL: Unsupervised 3D Object Detection by Occupancy Guided Warm-up and Large Model Priors ReasoningXusheng Guo, Wanfa Zhang, Shijia Zhao, Qiming Xia, Xiaolong Xie, Mingming Wang, Hai Wu, Chenglu Wen. 4467-4475 [doi]
- Decoupling Continual Semantic SegmentationYifu Guo, Yuquan Lu, Wentao Zhang 0005, Zishan Xu, Dexia Chen, Siyu Zhang, Yizhe Zhang 0001, Ruixuan Wang. 4476-4484 [doi]
- Splats in Splats: Robust and Effective 3D Steganography Towards Gaussian SplattingYijia Guo, Wenkai Huang 0003, Yang Li, Gaolei Li, Hang Zhang 0010, Liwen Hu 0002, Jianhua Li 0001, Tiejun Huang 0001, Lei Ma 0008. 4485-4493 [doi]
- RouterNet: Hierarchical Point Routing Network for Robust Vertebral Landmark Localization on AP X-ray ImagesYingjie Guo, Jinxin Lv, Wei Fang, Qiang Li 0018, Zhiwei Wang 0002. 4494-4502 [doi]
- SymGS: Leveraging Reflective Symmetries for 3DGS CompressionKeshav Gupta, Akshat Sanghvi, Shreyas Reddy Palley, Astitva Srivastava, Charu Sharma, Avinash Sharma. 4503-4510 [doi]
- O3SLM: Open Weight, Open Data, and Open Vocabulary Sketch-Language ModelRishi Gupta, Mukilan Karuppasamy, Shyam Marjit, Aditay Tripathi, Anirban Chakraborty 0001. 4511-4519 [doi]
- I-INR: Iterative Implicit Neural RepresentationsAli Haider, Muhammad Salman Ali, Maryam Qamar, Tahir Khalil, Soo Ye Kim, Jihyong Oh, Enzo Tartaglione, Sung-Ho Bae. 4520-4528 [doi]
- Universal Compressed Image Restoration via Codec-Aware Conditioning with Reinforcement LearningChangwoo Han 0001, Hongil Kim 0001, Donghyun Kim 0017, Sung-Chang Lim, Seung-Won Jung. 4529-4537 [doi]
- Realistic Face Reconstruction from Facial Embeddings via Diffusion ModelsDong Han, Yong Li 0021, Joachim Denzler. 4538-4546 [doi]
- E-MaT: Event-oriented Mamba for Egocentric Point TrackingHan Han, Wei Zhai, Baocai Yin, Yang Cao 0010, Bin Li, Zhengjun Zha. 4547-4555 [doi]
- GS-Checker: Tampering Localization for 3D Gaussian SplattingHaoliang Han, Ziyuan Luo, Jun Qi, Anderson Rocha 0001, Renjie Wan. 4556-4564 [doi]
- GOAL: Geometrically Optimal Alignment for Continual Generalized Category DiscoveryJizhou Han, Chenhao Ding, Songlin Dong, Yuhang He 0001, Shaokun Wang, Qiang Wang, Yihong Gong. 4565-4573 [doi]
- MODA: The First Challenging Benchmark for Multispectral Object Detection in Aerial ImagesShuaihao Han, Tingfa Xu, Peifu Liu, Jianan Li 0001. 4574-4582 [doi]
- EvalMuse-40K: A Fine-Grained Benchmark with Comprehensive Human Annotations for Text-to-Image Generation Model Alignment EvaluationShuhao Han, Haotian Fan, Jiachen Fu, Liang Li, Tao Li, Junhui Cui, Yunqiu Wang, Yang Tai, Jingwei Sun, Chun-Le Guo, Chongyi Li. 4583-4591 [doi]
- Polysemic Semantic Instance Network for Cross-Modal HashingShuo Han, Qibing Qin, Kezhen Xie, Wenfeng Zhang, Lei Huang 0010. 4592-4600 [doi]
- Filter, Correlate, Compress: Training-Free Token Reduction for MLLM AccelerationYuhang Han, Xuyang Liu, Zihan Zhang, Pengxiang Ding, Junjie Chen, Honggang Chen, Donglin Wang, Qingsen Yan, Siteng Huang. 4601-4609 [doi]
- Dual-Geometry Graph Network: Unifying Local and Global Priors for Few-Shot LearningZheng Han, Xiaobin Zhu 0001, Chun Yang, Jingyan Qin, Xu-Cheng Yin. 4610-4618 [doi]
- A Geometric Perspective on Optimizing Vector Quantized Latent Diffusion Model for Image RestorationChen Hang, Haoming Chen, Xuwei Fang, Weisheng Xie, Xiangxiang Gao, Faming Fang, Guixu Zhang, Haichuan Song. 4619-4626 [doi]
- StyleDrive: Towards Driving-Style Aware Benchmarking of End-To-End Autonomous DrivingRuiyang Hao, Bowen Jing 0001, Haibao Yu, Zaiqing Nie. 4627-4635 [doi]
- Semantic Document Derendering: SVG Reconstruction via Vision-Language ModelingAdam Hazimeh, Ke Wang, Mark Collier, Gilles Baechler, Efi Kokiopoulou, Pascal Frossard. 4636-4644 [doi]
- Pre-Trained Video Generative Models as World SimulatorsHaoran He, Yang Zhang, Liang Lin, Zhongwen Xu, Ling Pan. 4645-4653 [doi]
- Task-Aware 3D Affordance Segmentation via 2D Guidance and Geometric RefinementLian He, Meng Liu, Qilang Ye, Yu Zhou 0015, Xiang Deng, Gangyi Ding. 4654-4662 [doi]
- S3Net: Spatiotemporally Separated Sparse Network for Neuromorphic Vision ProcessingPing He, Rong Xiao 0001, Wanying Xu, Chenwei Tang, Shudong Huang, Huajin Tang. 4663-4671 [doi]
- LiNeXt: Revisiting LiDAR Completion with Efficient Non-Diffusion ArchitecturesWenzhe He, Xiaojun Chen, Ruiqi Wang, Ruihui Li, Huilong Pi, Jiapeng Zhang, Zhuo Tang, Kenli Li 0001. 4672-4680 [doi]
- Mixture of Ranks with Degradation-Aware Routing for One-Step Real-World Image Super-ResolutionXiao He 0014, Zhijun Tu, Kun Cheng, Mingrui Zhu, Jie Hu 0021, Nannan Wang 0001, Xinbo Gao 0001. 4681-4689 [doi]
- Attention to Threat-Relevant Objects: Reasoning Detection in Autonomous Driving via Multimodal Large Language ModelsYulin He, Wei Chen 0009, Xinbiao Gan, Siqi Wang 0001, Haotian Wang 0001, Yusong Tan. 4690-4698 [doi]
- Vision-MoR: Scaling Vision Transformer via Patch-Level Mixture-of-RecursionsYunhong He, Zhengqing Yuan, Weixiang Sun, Yiyang Li, Yixin Liu 0002, Yanfang Ye 0001, Lichao Sun 0001. 4699-4707 [doi]
- LLM-Free Image Captioning Evaluation in Reference-Flexible SettingsShinnosuke Hirano, Yuiga Wada, Kazuki Matsuda, Seitaro Otsuki, Komei Sugiura. 4708-4716 [doi]
- DEGRE: Dynamic Gating Ensembles for Trust-Aware Rejection in Medical Image DiagnosticsHong Hai Nguyen, Duong Bach, Nam Phan, Cuong V. Nguyen, Cuong Do 0001. 4717-4724 [doi]
- Physics-Informed Deformable Gaussian Splatting: Towards Unified Constitutive Laws for Time-Evolving Material FieldHaoqin Hong, Ding Fan, Fubin Dou, Zhi-li Zhou, Haoran Sun, Congcong Zhu, Jingrun Chen. 4725-4733 [doi]
- MaskAnyNet: Rethinking Masked Image Regions as Valuable Information in Supervised LearningJingshan Hong, Haigen Hu, Huihuang Zhang, Qianwei Zhou, Li Zhao. 4734-4743 [doi]
- Margin-Aware Preference Optimization for Aligning Diffusion Models Without ReferenceJiwoo Hong, Sayak Paul, Noah Lee, Kashif Rasul, James Thorne, Jongheon Jeong. 4744-4752 [doi]
- NODiff: Neural Operator Diffusion for Multispectral Image FusionJunming Hou, Ran Ran 0001, Sixing Chen, Zihao Chen, Xiaofeng Cong, Junling Li, Liang-Jian Deng. 4753-4761 [doi]
- Boosting Resolution Generalization of Diffusion Transformers with Randomized Positional EncodingsLiang Hou, Cong Liu, Mingwu Zheng, Xin Tao 0001, Pengfei Wan 0001, Di Zhang 0026, Kun Gai. 4762-4770 [doi]
- Gait Transformer: End-to-End Transformer Backbone for Gait RecognitionSaihui Hou, Wenpeng Lang, Jilong Wang 0010, Yan Huang 0008, Liang Wang 0001, Yongzhen Huang. 4771-4779 [doi]
- DAPE: Harmonizing Content-Position Encoding for Versatile Dense Visual PredictionXiuquan Hou, Meiqin Liu 0001, Senlin Zhang, Shaoyi Du. 4780-4788 [doi]
- FVNet: Harnessing Liquid Neural Dynamics for Lightweight Visual RepresentationZhenzhe Hou, Xiaohui Chu, Runze Hu, Yang Li, Yutao Liu 0002. 4789-4797 [doi]
- Semi-supervised Latent Disentangled Diffusion Model for Textile Pattern GenerationChenggong Hu, Yi Wang, Mengqi Xue, Haofei Zhang, Jie Song 0011, Li Sun. 4798-4806 [doi]
- Hierarchical Direction Perception via Atomic Dot-Product Operators for Rotation-Invariant Point Clouds LearningChenyu Hu, Xiaotong Li, Hao Zhu 0009, Biao Hou. 4807-4815 [doi]
- GUIDE: Gaussian Unified Instance Detection for Enhanced Obstacle Perception in Autonomous DrivingChunyong Hu, Qi Luo, Jianyun Xu, Song Wang 0019, Qiang Li, Sheng Yang 0007. 4816-4824 [doi]
- Segment Anything Across Shots: A Method and BenchmarkHengrui Hu, Kaining Ying, Henghui Ding. 4825-4833 [doi]
- Multi-Step Deformable Gaussian Splatting for Dynamic Scene RenderingJiaheng Hu, Zhizhong Zhang 0001, Jingyu Gong, Lizhuang Ma, Xin Tan 0002, Yuan Xie 0006. 4834-4842 [doi]
- AquaSplatting: A Hybrid 3D Representation for Robust Underwater Scene Reconstruction via Dual-Branch RenderingJiangbei Hu, Haobo Wang, Baixin Xu, Nan Ding, Zhimao Lu, Na Lei, Ying He 0001. 4843-4850 [doi]
- DLDA: Unified Dual-Level Domain Adaptation for Low-Light Object DetectionJiayi Hu, Qian Zhao, Gang Li. 4851-4859 [doi]
- Dereflection Any Image with Diffusion Priors and Diversified DataJichen Hu, Chen Yang 0023, Zanwei Zhou, Jiemin Fang, Qi Tian 0001, Wei Shen 0002. 4860-4868 [doi]
- Medverse: A Universal Model for Full-Resolution 3D Medical Image Segmentation, Transformation and EnhancementJiesi Hu, Jianfeng Cao, Yanwu Yang, Chenfei Ye, Yixuan Zhang, Hanyang Peng, Ting Ma 0001. 4869-4877 [doi]
- Breaking the Passive Learning Trap: An Active Perception Strategy for Human Motion PredictionJuncheng Hu, Zijian Zhang, Zeyu Wang, Guoyu Wang, Yingji Li, Kedi Lyu. 4878-4886 [doi]
- Breaking Alignment Barriers: TPS-Driven Semantic Correlation Learning for Alignment-Free RGB-T Salient Object DetectionLupiao Hu, Fasheng Wang, Fangmei Chen, Fuming Sun, Haojie Li. 4887-4895 [doi]
- Fast Guaranteed Robust Local-Smooth Principal Component SeparationMingdi Hu, Hailin Wang 0001, Shuaijiang Li, Kexin Shi, Jiangjun Peng. 4896-4904 [doi]
- Pairing-free Group-level Knowledge Distillation for Robust Gastrointestinal Lesion Classification in White-Light EndoscopyQiang Hu, Qimei Wang, Yingjie Guo, Qiang Li 0018, Zhiwei Wang 0002. 4905-4913 [doi]
- Learning Topology-Aware Dynamic Associations for Robust Multi-Person Pose EstimationShengnan Hu, Yandong Liu, Jiangnan Liu, Yahong Chen. 4914-4922 [doi]
- UltraGen: High-Resolution Video Generation with Hierarchical AttentionTeng Hu, Jiangning Zhang, Zihan Su, Ran Yi 0002. 4923-4931 [doi]
- IGFuse: Interactive 3D Gaussian Scene Reconstruction via Multi-Scans FusionWenhao Hu, Zesheng Li, Haonan Zhou, Liu Liu, Xuexiang Wen, Zhizhong Su, Xi Li, Gaoang Wang. 4932-4940 [doi]
- Earth-Adapter: Bridge the Geospatial Domain Gaps with a Frequency-Guided Mixture of AdaptersXiaoxing Hu, Ziyang Gong, Yupei Wang, Yuru Jia, Fei Lin 0005, Dexiang Gao, Ke An, Jianhong Han, Zhuoran Sun, Gen Luo, Xue Yang 0005. 4941-4949 [doi]
- SRSplat: Feed-Forward Super-Resolution Gaussian Splatting from Sparse Multi-View ImagesXinyuan Hu, Changyue Shi, Chuxiao Yang, Minghao Chen, Jiajun Ding, Tao Wei, Chen Wei, Zhou Yu 0001, Min Tan 0005. 4950-4958 [doi]
- GLoMOT: Efficient Online GNN-based Low-Frame-Rate Multi-Object TrackerYaxuan Hu 0001, Jie Hua 0005, Gang Wu 0010, Yuhong Yang 0001, Atsushi Suzuki 0002, Zhongyuan Wang 0001. 4959-4967 [doi]
- MSPCaps: A Multi-Scale Patchify Capsule Network with Cross-Agreement Routing for Visual RecognitionYudong Hu, Yueju Han, Rui Sun, Jinke Ren. 4968-4975 [doi]
- Geometric Correspondence Constrained Pseudo-Label Alignment for Source-Free Domain Adaptive Fundus Image SegmentationZhouhongyuan Hu, Lei Zhang 0005, Lituan Wang, Zhenwei Zhang, Minjuan Zhu, Zhenbin Wang. 4976-4984 [doi]
- VasoMIM: Vascular Anatomy-Aware Masked Image Modeling for Vessel SegmentationDe-Xing Huang, Xiao-Hu Zhou, Mei-Jiang Gui, Xiao-Liang Xie, Shi-Qi Liu, Shuang-Yi Wang, Tian-Yu Xiang, Rui-Ze Ma, Nu-Fang Xiao, Zeng-Guang Hou. 4985-4993 [doi]
- LongSplat: Online Generalizable 3D Gaussian Splatting from Long Sequence ImagesGuichen Huang, Ruoyu Wang 0014, Xiangjun Gao, Che Sun, Yuwei Wu 0001, Shenghua Gao, Yunde Jia. 4994-5002 [doi]
- Bayesian Neural Networks for One-to-Many Mapping in Image EnhancementGuoxi Huang, Qirui Yang, Ruirui Lin, Zipeng Qi, David Bull 0001, Nantheera Anantrasirichai. 5004-5012 [doi]
- Adaptive Evidential Learning for Temporal-Semantic Robustness in Moment RetrievalHaojian Huang, Kaijing Ma, Jin Chen, Haodong Chen, Zhou Wu, Xianghao Zang, Han Fang, Chao Ban, Hao Sun 0038, Mulin Chen, Zhongjiang He. 5013-5021 [doi]
- Fair Facial Attribute Recognition via Group-Decoupled Vision Transformer with Mask-Guided Correlation SuppressionHuichang Huang, Kunchi Li, Si Chen 0002, Da-Han Wang. 5022-5030 [doi]
- Understanding Dynamic Scenes in Ego Centric 4D Point CloudsJunsheng Huang, Shengyu Hao, Bocheng Hu, Hongwei Wang 0001, Gaoang Wang. 5031-5039 [doi]
- Bidirectional Channel-selective Semantic Interaction for Semi-Supervised Medical SegmentationKaiwen Huang 0002, Yizhe Zhang 0001, Yi Zhou 0007, Tianyang Xu 0001, Tao Zhou 0002. 5040-5048 [doi]
- GENMAC: Compositional Text-to-Video Generation with Multi-Agent CollaborationKaiyi Huang, Yukun Huang, Xuefei Ning, Zinan Lin 0001, Yu Wang 0002, Xihui Liu. 5049-5057 [doi]
- From Pixels to Logic: A Perception-Reasoning Decomposition Framework for Open-World Referring Expression ComprehensionLihong Huang, Sheng-hua Zhong, Zhi Zhang 0004, Yan Liu 0004. 5058-5066 [doi]
- Transferability of Adversarial Attacks in Video-based MLLMs: A Cross-modal Image-to-Video ApproachLinhao Huang, Xue Jiang, Zhiqiang Wang, Wentao Mo, Xi Xiao, Yongjie Yin, Bo Han 0003, Feng Zheng 0001. 5067-5075 [doi]
- MIRAGE: Towards AI-Generated Image Detection in the WildOucheng Huang, Manxi Lin, Jiexiang Tan, Xiaoxiong Du, Yang Qiu, Junjun Zheng, Xiangheng Kong, Yuning Jiang 0001, Bo Zheng 0007. 5076-5084 [doi]
- BuildingWorld: A Structured 3D Building Dataset for Urban Foundation ModelsShangfeng Huang, Ruisheng Wang 0001, Xin Wang. 5085-5094 [doi]
- Self-Supervised One-Step Diffusion Refinement for Snapshot Compressive ImagingShaoguang Huang, Yunzhen Wang, Haijin Zeng, Hongyu Chen 0003, Hongyan Zhang 0001. 5095-5103 [doi]
- Dual-stream Relation-modeling Disentanglement for Cloth-Changing Person Re-IdentificationShijuan Huang, Hefei Ling, Zongyi Li, Xu Li, Zhao Lv. 5104-5112 [doi]
- Laytrol: Preserving Pretrained Knowledge in Layout Control for Multimodal Diffusion TransformersSida Huang, Siqi Huang, Ping Luo 0002, Hongyuan Zhang 0001. 5113-5121 [doi]
- ROVER: Robust Generative Continual Identity Unlearning Against Relearning AttacksTairan Huang, Qiang Chen 0016, Beibei Hu, Yunlong Zhao 0003, Hongyan Xu 0002, Zhiyuan Chen, Yi Chen, Xiu Su. 5122-5130 [doi]
- LLM2CLIP: Powerful Language Model Unlocks Richer Cross-Modality RepresentationWeiquan Huang, Aoqi Wu, Yifan Yang 0004, Xufang Luo, Yuqing Yang 0001, Usman Naseem, Chunyu Wang 0001, Qi Dai 0001, Xiyang Dai, Dongdong Chen 0001, Chong Luo 0001, Lili Qiu, Liang Hu. 5131-5139 [doi]
- Otter: Mitigating Background Distractions of Wide-Angle Few-Shot Action Recognition with Enhanced RWKVWenbo Huang 0001, Jinghui Zhang 0001, Zhenghao Chen, Guang Li 0008, Lei Zhang 0130, Yang Cao, Fang Dong 0001, Takahiro Ogawa 0001, Miki Haseyama. 5140-5148 [doi]
- Can Protective Watermarking Safeguard the Copyright of 3D Gaussian Splatting?Wenkai Huang 0003, Yijia Guo, Gaolei Li, Lei Ma 0008, Hang Zhang 0010, Liwen Hu 0002, Jiazheng Wang, Jianhua Li 0001, Tiejun Huang 0001. 5149-5157 [doi]
- SkySplat: Generalizable 3D Gaussian Splatting from Multi-Temporal Sparse Satellite ImagesXuejun Huang, Xinyi Liu 0002, Yi Wan 0001, Zhi Zheng, Bin Zhang 0046, Mingtao Xiong, Yingying Pei, Yongjun Zhang 0002. 5158-5166 [doi]
- BokehFlow: Depth-Free Controllable Bokeh Rendering via Flow MatchingYachuan Huang, Xianrui Luo, Qiwen Wang, Liao Shen, Jiaqi Li 0007, Huiqiang Sun, Zihao Huang 0001, Wei Jiang, Zhiguo Cao 0001. 5167-5175 [doi]
- Less Is Better: Sparse Instance Learning for Cross-Domain Few-Shot Object DetectionYali Huang, Jie Mei, Ziyi Wu, Yiming Yang, Hongru Zhao, Mingyuan Jiu, Hichem Sahbi. 5176-5184 [doi]
- Enhancing Generalization of Depth Estimation Foundation Model via Weakly-Supervised Adaptation with RegularizationYan Huang, Yongyi Su, Xin Lin, Le Zhang 0001, Xun Xu 0002. 5185-5193 [doi]
- 3D-ANC: Adaptive Neural Collapse for Robust 3D Point Cloud RecognitionYuanmin Huang 0001, Wenxuan Li, Mi Zhang 0001, Xiaohan Zhang 0001, Xiaoyu You, Min Yang 0002. 5194-5202 [doi]
- Multi-view Invariance Learning for 3D Scene Graph Pre-training via Collaborative Cross-Modal RegularizationYucheng Huang, Luping Ji, Ruijie Xiao, Jiayuan Sun. 5203-5211 [doi]
- Text-Guided Gradient Refinement: Resolving Multimodal Gradient Conflicts to Boost Adversarial Attacks on Vision-Language ModelsYuyang Huang, Tianzuo Luo, Hengyuan Guo, Yuren Zhang. 5212-5220 [doi]
- BayesVQA: Energy-Guided Bayesian Debiasing for Language-Bias-Robust Visual Question AnsweringZhiqi Huang, Huanjia Zhu, Xiangwen Deng, Qinghao Zhong, Bingzhi Chen. 5221-5229 [doi]
- Opt3DGS: Optimizing 3D Gaussian Splatting with Adaptive Exploration and Curvature-Aware ExploitationZiyang Huang, Jiagang Chen, Jin Liu, Shunping Ji. 5230-5238 [doi]
- TSBOW - Traffic Surveillance Benchmark for Occluded Vehicles Under Various Weather ConditionsNgoc Doan-Minh Huynh, Duong Nguyen-Ngoc Tran, Long Hoang Pham, Tai Huu-Phuong Tran, Hyung Joon Jeon, Huy Hung Nguyen, Duong Khac Vu, Hyung-Min Jeon, Son Hong Phan, Quoc Pham-Nam Ho, Chi Dai Tran, Trinh Le Ba Khanh, Jae Wook Jeon. 5239-5247 [doi]
- DISCODE: Distribution-Aware Score Decoder for Robust Automatic Evaluation of Image CaptioningNakamasa Inoue, Kanoko Goto, Masanari Oi, Martyna Gruszka, Mahiro Ukai, Takumi Hirose, Yusuke Sekikawa. 5248-5256 [doi]
- STRIDE-QA: Visual Question Answering Dataset for Spatiotemporal Reasoning in Urban Driving ScenesKeishi Ishihara, Kento Sasaki, Tsubasa Takahashi 0001, Daiki Shiono, Yu Yamaguchi. 5257-5266 [doi]
- LAMP: Learning Universal Adversarial Perturbations for Multi-Image Tasks via Pre-trained ModelsAlvi Md. Ishmam, Najibul Haque Sarker, Zaber Ibn Abdul Hakim, Chris Thomas 0004. 5267-5275 [doi]
- JRDB-Reasoning: A Difficulty-Graded Benchmark for Visual Reasoning in RoboticsSimindokht Jahangard, Mehrzad Mohammadi, Yi Shen, Zhixi Cai, Hamid Rezatofighi. 5276-5286 [doi]
- GranAlign: Granularity-Aware Alignment Framework for Zero-shot Video Moment RetrievalMingyu Jeon, Sunjae Yoon, Jonghee Kim, Junyeong Kim. 5287-5295 [doi]
- A²LC: Active and Automated Label Correction for Semantic SegmentationYoujin Jeon, Kyusik Cho, Suhan Woo, Euntai Kim. 5296-5304 [doi]
- COSMOS: Coherent SuperGaussian Modeling with Spatial Priors for Sparse-View 3D SplattingChaeyoung Jeong, Kwangsu Kim. 5305-5313 [doi]
- An Adaptive Sampling Framework for Diffusion-based Dataset Distillation with High Fidelity and DiversitySunbeom Jeong, Sehwan Kim, Hyeonggeun Han, Hyungjun Joo, Sangwoo Hong, Jungwoo Lee 0001. 5314-5322 [doi]
- Towards Robust Event-Based Depth Estimation: Bridging Synthetic and Real Domains with Motion AdaptationYuzhe Ji, Haotian Wang, Yijie Chen, Xiang Cheng 0001, Liuqing Yang 0001, Xinhu Zheng. 5323-5331 [doi]
- LidarPainter: One-Step Away from Any Lidar View to Novel GuidanceYuzhou Ji, Ke Ma, Hong Cai, Anchun Zhang, Lizhuang Ma, Xin Tan 0002. 5332-5340 [doi]
- Explicit Temporal-Semantic Modeling for Dense Video Captioning via Context-Aware Cross-Modal InteractionMingda Jia, Weiliang Meng, Zenghuang Fu, Yiheng Li, Qi Zeng, Yifan Zhang, Ju Xin, Rongtao Xu, Jiguang Zhang, Xiaopeng Zhang 0001. 5341-5349 [doi]
- CLIPPan: Adapting CLIP as a Supervisor for Unsupervised PansharpeningLihua Jian, Jiabo Liu, Shaowu Wu, Lihui Chen. 5350-5358 [doi]
- Diversifying Counterattacks: Orthogonal Exploration for Robust CLIP InferenceChengze Jiang, Minjing Dong, Xinli Shi, Jie Gui. 5359-5368 [doi]
- CoherenDream: Boosting Holistic Text Coherence in 3D Generation via Multimodal Large Language Models FeedbackChenhan Jiang, Yihan Zeng, Dit-Yan Yeung. 5369-5377 [doi]
- ExpertAD: Enhancing Autonomous Driving Systems with Mixture of ExpertsHaowen Jiang, Xinyu Huang, You Lu 0005, Dingji Wang, Yuheng Cao, Chaofeng Sha, Bihuan Chen 0001, Keyu Chen, Xin Peng 0001. 5378-5387 [doi]
- SAM2MOT: A Novel Paradigm of Multi-Object Tracking by SegmentationJunjie Jiang, Zelin Wang, Manqi Zhao, Yin Li, Dongsheng Jiang. 5388-5396 [doi]
- Token Painter: Training-Free Text-Guided Image Inpainting via Mask Autoregressive ModelsLongtao Jiang, Jie Huang, Mingfei Han 0002, Lei Chen, Yongqiang Yu, Feng Zhao, Xiaojun Chang, Zhihui Li 0001. 5397-5405 [doi]
- MFmamba: A Multi-function Network for Panchromatic Image Resolution Restoration Based on State-Space ModelQian Jiang, Qianqian Wang 0013, Xin Jin 0005, Michal Wozniak 0001, Shaowen Yao 0001, Wei Zhou 0011. 5406-5414 [doi]
- UniScene-MoTion: Unified Scene & Motion-aware Diffusion Transition FrameworkRui Jiang, Chongmian Wang, Xinghe Fu, Yehao Lu, Teng Li, Xi Li. 5415-5423 [doi]
- MPJudge: Towards Perceptual Assessment of Music-Induced PaintingsShiqi Jiang 0001, Tianyi Liang 0002, Huayuan Ye, Changbo Wang, Chenhui Li 0001. 5424-5431 [doi]
- Less Is More: Rethinking Parameter-Efficient Fine-Tuning from a Subtractive PerspectiveTianqi Jiang, Liu Yang 0010, Xi-Le Zhao, Zixuan Qin, Qinghua Hu. 5432-5440 [doi]
- Semi-Supervised High Dynamic Range Image Reconstructing via Bi-Level Uncertain Area MaskingWei Jiang, Jiahao Cui 0002, Yizheng Wu, Zhan Peng, Zhiyu Pan, Zhiguo Cao 0001. 5441-5449 [doi]
- Fine-Grained Image Retrieval via Dual-Vision AdaptationXin Jiang 0010, Meiqi Cao, Hao Tang 0007, Fei Shen, Zechao Li. 5450-5458 [doi]
- DECON: Reconstruction of Clothed-Geometric Multiple Humans from a Single Image via Geometry-Guided DecouplingYiming Jiang 0018, Wenfeng Song, Shuai Li 0001, Aimin Hao. 5459-5467 [doi]
- SatireDecoder: Visual Cascaded Decoupling for Enhancing Satirical Image ComprehensionYue Jiang, Haiwei Xue, Minghao Han, Mingcheng Li, Xiaolu Hou, Dingkang Yang, Lihua Zhang 0002, Xu Zheng 0002. 5468-5476 [doi]
- Circuit-Think: A Multimodal Reasoning Framework for Automated Circuit-to-Netlist Translation with Trajectory-Guided Reinforcement LearningYuqi Jiang, Yupeng Hu, Jinyuan Deng, Xiaotian Qiu, Yucheng Cui, Xuyang He, Ruidong Li, Qi Sun 0002, Cheng Zhuo. 5477-5484 [doi]
- Anomagic: Crossmodal Prompt-driven Zero-shot Anomaly GenerationYuxin Jiang, Wei Luo, Hui Zhang, Qiyu Chen 0002, Haiming Yao, Weiming Shen 0001, Yunkang Cao. 5485-5493 [doi]
- AccKV: Towards Efficient Audio-Video LLMs Inference via Adaptive-Focusing and Cross-Calibration KV Cache OptimizationZhonghua Jiang 0007, Kui Chen, Kunxi Li, Keting Yin, Yiyun Zhou, Zhaode Wang, Chengfei Lv, Shengyu Zhang 0001. 5494-5502 [doi]
- MonoCloth: Reconstruction and Animation of Cloth-Decoupled Human Avatars from Monocular VideosDaisheng Jin, Ying He 0001. 5503-5511 [doi]
- FocusDPO: Dynamic Preference Optimization for Multi-Subject Personalized Image Generation via Adaptive FocusQiaoqiao Jin, Siming Fu, Dong She, Weinan Jia, Hualiang Wang, Mu Liu, Jidong Jiang. 5512-5520 [doi]
- Debiasing Diffusion Priors via 3D Attention for Consistent Gaussian SplattingShilong Jin, Haoran Duan 0001, Litao Hua, Wentao Huang, Yuan Zhou 0023. 5521-5529 [doi]
- Reliable-View 2D-3D Key-Part Aligned Transformer with Reinforced Masking for 3D Point Cloud UnderstandingXianglong Jin, Zheng Wang 0037, Rong Wang 0001, Feiping Nie 0001. 5530-5538 [doi]
- EVOKE: Efficient and High-Fidelity EEG-to-Video Reconstruction via Decoupling Implicit Neural RepresentationHaodong Jing, Panqi Yang, Dongyao Jiang, ZhiPeng Liu, Nanning Zheng 0001, Yongqiang Ma. 5539-5547 [doi]
- ResProto-FD: Visual-Language Residual Prototype Sets for Generalized Face Forgery DetectionJiuyao Jing, Yu Zheng 0006, Chunlei Peng. 5548-5556 [doi]
- False Positives Matter: Multidimensional Localization Evaluation and Training-Free Explainable Adversarial Patch DefenseLihua Jing, Rui Wang 0032, Jinwen Zhong, Runbo Li, Zixuan Zhu 0002. 5557-5565 [doi]
- Angular Gradient Sign Method: Uncovering Vulnerabilities in Hyperbolic NetworksMinsoo Jo, Dongyoon Yang, Taesup Kim. 5566-5574 [doi]
- HyFI: Hyperbolic Feature Interpolation for Brain-Vision AlignmentSangmin Jo, Wootaek Jeong, Da-Woon Heo, Yoohwan Hwang, Heung-Il Suk. 5575-5583 [doi]
- PINGS-X: Physics-Informed Normalized Gaussian Splatting with Axes Alignment for Efficient Super-Resolution of 4D Flow MRISun Jo, Seok Young Hong, Jinhyun Kim, Seungmin Kang, Ahjin Choi, Don-Gwan An, Simon Song, Je Hyeong Hong. 5584-5592 [doi]
- Seeing the Unseen: Zooming in the Dark with Event CamerasDachun Kai, Zeyu Xiao 0002, Huyue Zhu, Jiaxiao Wang, Yueyi Zhang, Xiaoyan Sun 0001. 5593-5601 [doi]
- Timestep-Compressed Attack on Spiking Neural Networks Through Timestep-Level BackpropagationDonghwa Kang, Doohyun Kim, Sang-Ki Ko, Jinkyu Lee 0001, Hyeongboo Baek, Brent ByungHoon Kang. 5602-5610 [doi]
- Rethinking Direct Preference Optimization in Diffusion ModelsJunyong Kang, Seohyun Lim, Kyungjune Baek, Hyunjung Shim. 5611-5619 [doi]
- Forget Less by Learning from Parents Through Hierarchical RelationshipsArjun Ramesh Kaushik, Naresh Kumar Devulapally, Vishnu Suresh Lokhande, Nalini K. Ratha, Venu Govindaraju. 5620-5628 [doi]
- StreamSTGS: Streaming Spatial and Temporal Gaussian Grids for Real-Time Free-Viewpoint VideoZhihui Ke, Yvyang Liu, Xiaobo Zhou 0003, Tie Qiu 0001. 5629-5638 [doi]
- GuidNoise: Single-Pair Guided Diffusion for Generalized Noise SynthesisChangjin Kim, HyeokJun Lee, Youngjoon Yoo. 5639-5647 [doi]
- DANCE: Density-agnostic and Class-aware Network for Point Cloud CompletionDa Yeong Kim, Yeong-Jun Cho. 5648-5655 [doi]
- State-Space Hierarchical Compression with Gated Attention and Learnable Sampling for Hour-Long Video Understanding in Large Multimodal ModelsGeewook Kim, Minjoon Seo. 5656-5664 [doi]
- Continuous Degradation Modeling via Latent Flow Matching for Real-World Super-ResolutionHyeonjae Kim, Dongjin Kim, Eugene Jin, Tae-Hyun Kim. 5665-5672 [doi]
- Towards Test-time Efficient Visual Place Recognition via Asymmetric Query ProcessingJaeyoon Kim, Yoonki Cho, Sung-Eui Yoon. 5673-5681 [doi]
- Toward the Frontiers of Reliable Diffusion Sampling via Adversarial Sinkhorn Attention GuidanceKwanyoung Kim. 5682-5690 [doi]
- LampQ: Towards Accurate Layer-wise Mixed Precision Quantization for Vision TransformersMinjun Kim 0010, Jaeri Lee, Jongjin Kim 0001, Jeongin Yun, Yongmo Kwon, U Kang. 5691-5699 [doi]
- Improving Target Presence and Plurality Recognition for Generalized Referring Image SegmentationNamyup Kim, Jinsung Lee, Suha Kwak. 5700-5708 [doi]
- PlugTrack: Multi-Perceptive Motion Analysis for Adaptive Fusion in Multi-Object TrackingSeungjae Kim, Seungjoon Lee, MyeongAh Cho. 5709-5716 [doi]
- Do We Need Perfect Data? Leveraging Noise for Domain Generalized SegmentationTaeyeong Kim, Seungjoon Lee, Jung-Uk Kim, MyeongAh Cho. 5717-5725 [doi]
- BridgeShape: Latent Diffusion Schrödinger Bridge for 3D Shape CompletionDequan Kong, Honghua Chen, Zhe Zhu, Mingqiang Wei. 5726-5734 [doi]
- SalDiff-DTM: A Novel Dual-Temporal Modulated Diffusion Model for Omnidirectional Images Scanpath PredictionXiaohui Kong, Qian Liu, Dandan Zhu 0001, Kaiwei Zhang, Xiongkuo Min. 5735-5743 [doi]
- Gaussian Blending: Rethinking Alpha Blending in 3D Gaussian SplattingJunseo Koo, Jinseo Jeong, Gunhee Kim. 5744-5754 [doi]
- Generating-Filtering-Ranking: A Three-Stage MultiModal Data Augmentation Framework Under Partial Modality MissingZhirui Kuai, Huan Zhang, Yang Yang, Yiping Ma, Mingjing Huang, Ning Gui, Li Kuang. 5755-5763 [doi]
- Temporal Object-Aware Vision Transformer for Few-Shot Video Object DetectionYogesh Kumar 0004, Anand Mishra 0001. 5764-5772 [doi]
- OceanSplat: Object-aware Gaussian Splatting with Trinocular View Consistency for Underwater Scene ReconstructionMinseong Kweon, Jinsun Park. 5773-5781 [doi]
- Easy to Learn, Yet Hard to Forget: Towards Robust Unlearning Under BiasJunehyoung Kwon, Mihyeon Kim, Eunju Lee, Yoonji Lee, Seunghoon Lee, Youngbin Kim. 5782-5790 [doi]
- RadarLLM: Empowering Large Language Models to Understand Human Motion from Millimeter-wave Point Cloud SequenceZengyuan Lai, Jiarui Yang, Songpengcheng Xia, Lizhou Lin, Lan Sun, Renwen Wang, Jianran Liu, Qi Wu 0007, Ling Pei. 5791-5799 [doi]
- AnomalyPainter: Vision-Language-Diffusion Synergy for Realistic and Diverse Unseen Industrial Anomaly SynthesisZhangyu Lai, Yilin Lu, Xinyang Li, Jianghang Lin, Yansong Qu, Ming Li 0010, Liujuan Cao. 5800-5808 [doi]
- OFL-SAM2: Prompt SAM2 with Online Few-shot Learner for Efficient Medical Image SegmentationMeng Lan, Lefei Zhang, Xiaomeng Li. 5809-5817 [doi]
- GeoCoBox: Box-supervised 3D Tumor Segmentation via Geometric Co-embeddingTianzhong Lan, Zhang Yi 0001, Xiuyuan Xu, Min Zhu 0005. 5818-5826 [doi]
- Lightweight Optimal-Transport Harmonization on Edge DevicesMaria A. Larchenko, Dmitry Guskov, Alexander Lobashev, Georgy Derevyanko. 5827-5835 [doi]
- Marginalized Generalized IoU (MGIoU): A Unified Objective Function for Optimizing Convex Parametric ShapesDuy-Tho Le, Trung Pham, Jianfei Cai 0001, Hamid Rezatofighi. 5836-5844 [doi]
- GeoMoE: Divide-and-Conquer Motion Field Modeling with Mixture-of-Experts for Two-View GeometryJiajun Le, Jiayi Ma 0001. 5845-5853 [doi]
- Targeted Data Protection for Diffusion Model by Matching Training TrajectoryHojun Lee 0002, Mijin Koo, Yeji Song, Nojun Kwak. 5854-5862 [doi]
- Towards Spatially Consistent Image Generation: On Incorporating Intrinsic Scene Properties into Diffusion ModelsHyundo Lee, Suhyung Choi, Inwoo Hwang, Byoung-Tak Zhang. 5863-5871 [doi]
- Tuning-Free Amodal Segmentation via the Occlusion-Free Bias of Inpainting ModelsJae Joong Lee, Bedrich Benes, Raymond A. Yeh. 5872-5880 [doi]
- DipGuava: Disentangling Personalized Gaussian Features for 3D Head Avatars from Monocular VideoJeonghaeng Lee, Seokkeun Choi, Zhixuan Li, Weisi Lin, Sanghoon Lee 0001. 5881-5889 [doi]
- CHIMERA: Controllable High-quality Image-Mask Extraction for Reliable Diffusion-based Anomaly SynthesisJoungbin Lee, Hyunkoo Lee, Jini Yang, Chaehyun Kim, Jung Yi, Seok Hwangbo, Hyeoncheol Lee, Minho Chun, Eunjo Jeong, Seungryong Kim. 5890-5898 [doi]
- RefineVAD: Semantic-Guided Feature Recalibration for Weakly Supervised Video Anomaly DetectionJunhee Lee, ChaeBeen Bang, MyoungChul Kim, MyeongAh Cho. 5899-5907 [doi]
- A Paradigm Shift in High-Resolution Depth Estimation Using SPAD-Based LiDAR Histograms: From Signal Filtering to Lightweight Similarity LearningMinsung Lee, Seo Hyun Kim, Yeonsu Park, Hyeongseok Seo, Jongmin Lee. 5909-5917 [doi]
- Difficulty-Aware Label-Guided Denoising for Monocular 3D Object DetectionSoyul Lee, Seungmin Baek, Dongbo Min. 5918-5926 [doi]
- GPGS: Consistent 3D Object Removal via Geometry-Aware 3D Inpainting and Projected Image Refinement in 3D Gaussian SplattingYongjoon Lee, Donghyeon Cho. 5927-5935 [doi]
- See, Rank, and Filter: Important Word-Aware Clip Filtering via Scene Understanding for Moment Retrieval and Highlight DetectionYueun Lee, Jung-Uk Kim. 5936-5944 [doi]
- Versatile Vision-Language Model for 3D Computed TomographyJiayu Lei, Ziqing Fan, Yanyong Zhang, Weidi Xie, Ya Zhang 0002, Yanfeng Wang 0001. 5945-5954 [doi]
- Dynamic-Static Collaboration for Unsupervised Domain Adaptive Video-Based Visible-Infrared Person Re-IdentificationJiaxu Leng, Zhengjie Wang, Shuang Li, Xinbo Gao 0001. 5955-5963 [doi]
- PIF-Net: Ill-Posed Prior Guided Multispectral and Hyperspectral Image Fusion via Invertible Mamba and Fusion-Aware LoRABaisong Li, Xingwang Wang, Haixiao Xu. 5964-5972 [doi]
- Test-Time Preference Optimization for Image RestorationBingchen Li 0001, Xin Li 0082, Jiaqi Xu, Jiaming Guo, Wenbo Li, Renjing Pei, Zhibo Chen 0001. 5973-5981 [doi]
- Exploring Efficient Open-Vocabulary Segmentation in the Remote SensingBingyu Li, Haocheng Dong, Da Zhang 0010, Zhiyuan Zhao 0005, Hao Sun 0038, Junyu Gao 0001. 5982-5991 [doi]
- FreLay: Frequency-aware Energy Function for Training-free Layout-to-Image GenerationBonan Li, Yinhan Hu, Songhua Liu, Zeyu Xiao, Xinchao Wang. 5992-6000 [doi]
- DigimonGPT: An Evolvable Agent with Hierarchical Human-like Memory for Video Question AnsweringBorui Li 0001, Xingcai Zhang, Tianen Liu, Shuai Wang, Yun Cheng, Shuai Wang. 6001-6009 [doi]
- MR-COSMO: Visual-Text Memory Recall and Direct CrOSs-MOdal Alignment Method for Query-Driven 3D SegmentationChade Li, Pengju Zhang, Yihong Wu 0002. 6010-6018 [doi]
- Exploring Surround-View Fisheye Camera 3D Object DetectionChangcai Li, Wenwei Lin, Zuoxun Hou, Gang Chen 0023, Wei Zhang 0009, Huihui Zhou, Weishi Zheng 0001. 6019-6027 [doi]
- Polarization Uncertainty-Guided Diffusion Model for Color Polarization Image DemosaickingChenggong Li, Yidong Luo, Junchao Zhang 0001, Degui Yang. 6028-6036 [doi]
- Dual-Teacher Interactive Knowledge Distillation Network for Text-to-Visible & Infrared Person RetrievalChenglong Li 0002, Zhengyu Chen, Yifei Deng, Aihua Zheng. 6037-6045 [doi]
- MotivDance: Fine-Grained Text-Guided Motivation Choreography with Music SynchronizationChenguang Li, Yu-Hui Wen, Liping Jing. 6046-6054 [doi]
- RSOD: Reliability-Guided Sonar Image Object Detection with Extremely Limited LabelsChengzhou Li, Ping Guo, Guanchen Meng, Qi Jia 0001, Jinyuan Liu 0001, Zhu Liu 0004, Xiaokang Liu, Yu Liu 0012, Zhongxuan Luo, Xin Fan 0001. 6055-6063 [doi]
- Learning a Fix and Explore Framework for Continuous Generalized Category DiscoveryChunming Li, Shidong Wang, Haofeng Zhang 0001. 6064-6072 [doi]
- Refine3D: Scene-Adaptive Reference Point Refinement for Sparse 3D Object DetectionFan Li, Jing Lu 0004, Yunlu Xu, Changhong Wu, Tao Xu, Zhaoyi Xiang, Yi Niu. 6073-6081 [doi]
- Ambiguity-aware Truncated Flow Matching for Ambiguous Medical Image SegmentationFanding Li, Xiangyu Li 0004, Xianghe Su, Xingyu Qiu, Suyu Dong, Wei Wang 0169, Kuanquan Wang, Gongning Luo, Shuo Li 0001. 6082-6090 [doi]
- Mask2IV: Interaction-Centric Video Generation via Mask TrajectoriesGen Li 0008, Bo Zhao, Jianfei Yang, Laura Sevilla-Lara. 6091-6099 [doi]
- Modality and Task Adaptation for Enhanced Zero-shot Composed Image RetrievalHaiwen Li, Delong Liu, Zhaohui Hou, Zeliang Ma, Fei Su, Zhicheng Zhao 0001. 6100-6108 [doi]
- CADTrack: Learning Contextual Aggregation with Deformable Alignment for Robust RGBT TrackingHao Li, Yuhao Wang, Xiantao Hu, Wenning Hao, Pingping Zhang, Dong Wang 0004, Huchuan Lu. 6109-6117 [doi]
- FDP: A Frequency-Decomposition Preprocessing Pipeline for Unsupervised Anomaly Detection in Brain MRIHao Li, Zhenfeng Zhuang, Jingyu Lin, Yu Liu, Yifei Chen, Qiong Peng, Lequan Yu, Liansheng Wang 0002. 6118-6126 [doi]
- PEFT-BoA: Parameter-Efficient Fine-Tuning with Bag-of-Adapters for Multi-Modal Object Re-identificationHongchao Li, Guangxing Liu, Xixi Wang, Baihe Liang, Yonglong Luo. 6127-6135 [doi]
- Context-aware Dynamic Contrastive Learning Network and E-Bike Rider Benchmark for Person SearchHongchao Li, Chengcheng Li, Xixi Wang, Yonglong Luo. 6136-6144 [doi]
- Point Cloud Quantization Through Multimodal Prompting for 3D UnderstandingHongxuan Li, Wencheng Zhu, Huiying Xu, Xinzhong Zhu, Pengfei Zhu 0001. 6145-6153 [doi]
- Image Restoration via Primal Dual Hybrid Gradient and Flow Generative ModelJi Li, Chao Wang. 6154-6162 [doi]
- Integrating Reweighted Least Squares with Plug-and-Play Diffusion Priors for Noisy Image RestorationJi Li, Chao Wang. 6163-6171 [doi]
- Do Audio-Visual Segmentation Models Truly Segment Sounding Objects?Jia Li, WenJie Zhao, Ziru Huang, Yunhui Guo, Yapeng Tian. 6172-6180 [doi]
- Target Refocusing via Attention Redistribution for Open-Vocabulary Semantic Segmentation: An Explainability PerspectiveJiahao Li 0003, Yang Lu 0009, Yachao Zhang 0001, Yong Xie, Fangyong Wang, Yuan Xie 0006, Yanyun Qu. 6181-6189 [doi]
- ReCAD: Reinforcement Learning Enhanced Parametric CAD Model Generation with Vision-Language ModelsJiahao Li, Yusheng Luo, Yunzhong Lou, Xiangdong Zhou. 6190-6198 [doi]
- FANoise: Singular Value-Adaptive Noise Modulation for Robust Multimodal Representation LearningJiaoyang Li, Jun Fang, Tianhao Gao, Xiaohui Zhang, Zhiyuan Liu 0001, Chao Liu, Pengzhang Liu, Qixia Jiang. 6199-6207 [doi]
- CLIP2Pose: Frozen CLIP as Semantic Guide for Domain Adaptive Pose EstimationJiawen Li, Fei Jiang 0006, Dandan Zhu 0001, Jinxin Shi, Aimin Zhou. 6208-6216 [doi]
- FIND: A Simple Yet Effective Baseline for Diffusion-Generated Image DetectionJie Li, Yingying Feng, Chi Xie, Jie Hu, Lei Tan, Jiayi Ji. 6217-6225 [doi]
- MdaIF: Robust One-Stop Multi-Degradation-Aware Image Fusion with Language-Driven SemanticsJing Li, Yifan Wang, Jiafeng Yan, RenLong Zhang, Bin Yang. 6226-6234 [doi]
- MIRA: Evaluating Multimodal AI on Complex Clinical Reasoning in Interventional RadiologyJingxiong Li, Chenglu Zhu, Sunyi Zheng, Yuxuan Sun 0002, Yifei Wang, He Liu, YunLong Zhang, Yixuan Si, Lin Yang 0002, Liang Xiao 0001. 6235-6243 [doi]
- CrossVid: A Comprehensive Benchmark for Evaluating Cross-Video Reasoning in Multimodal Large Language ModelsJingyao Li, Jingyun Wang, Molin Tan, Haochen Wang, Cilin Yan, Likun Shi, Jiayin Cai, Xiaolong Jiang, Yao Hu 0002. 6244-6252 [doi]
- TubeRMC: Tube-conditioned Reconstruction with Mutual Constraints for Weakly-supervised Spatio-Temporal Video GroundingJinxuan Li, Yi Zhang, Jian-Fang Hu, Chaolei Tan, Tianming Liang, Beihao Xia. 6253-6261 [doi]
- RMLer: Synthesizing Novel Objects Across Diverse Categories via Reinforcement Mixing LearningJun Li 0027, Zikun Chen, Haibo Chen 0006, Shuo Chen 0003, Jian Yang 0003. 6262-6270 [doi]
- Decision-Driven Orthogonal Learning with Complementary Feature Mining for Robust Synthetic Image DetectionKai Li, Wei Wang 0335, Linchao Zhang, Siying Zhu, Wenqi Ren. 6271-6278 [doi]
- DynamicEarth: How Far Are We from Open-Vocabulary Change Detection?Kaiyu Li, Xiangyong Cao, Yupeng Deng 0001, Chao Pang 0001, Zepeng Xin, Hui Qiao, Tieliang Gong, Deyu Meng, Zhi Wang 0002. 6279-6287 [doi]
- RSVG-ZeroOV: Exploring a Training-Free Framework for Zero-Shot Open-Vocabulary Visual Grounding in Remote Sensing ImagesKe Li 0024, Di Wang 0011, Ting Wang, Fuyu Dong, Yiming Zhang, Luyao Zhang, Xiangyu Wang, Shaofeng Li, Quan Wang 0006. 6288-6296 [doi]
- Multiple Human Motion UnderstandingLei Li 0050, Sen Jia 0003, Jenq-Neng Hwang. 6297-6305 [doi]
- Relation-R1: Progressively Cognitive Chain-of-Thought Guided Reinforcement Learning for Unified Relation ComprehensionLin Li 0065, Wei Chen 0070, Jiahui Li 0003, Kwang-Ting Cheng, Long Chen 0016. 6306-6314 [doi]
- SmartSplat: Feature-Smart Gaussians for Scalable Compression of Ultra-High-Resolution ImagesLinfei Li, Lin Zhang 0014, Zhong Wang 0009, Ying Shen 0005. 6315-6323 [doi]
- DFMN: A Dual-feet Matching Network with Hybrid Transformer-based Feature Extractor for Unsupervised Deformable Medical Image RegistrationLiwen Li, Xinrui Guo, Wentao Guo, Shunqi Yang, Fumin Guo. 6324-6332 [doi]
- DSP-PCQA: Integrating Multiple Perception Preferences for Point Cloud Quality AssessmentMingxuan Li, Fazhan Zhang, Zhenzhe Hou, Zihao Huang, Bohan Fu, Runze Hu, Xiaohui Chu. 6333-6341 [doi]
- Points Meet Pixels: Bridging 2D Vision-Language Model and 3D Perception Gaps for Point Cloud Quality AssessmentMingxuan Li, Zihao Huang, Xiaohui Chu, Fazhan Zhang, Bohan Fu, Runze Hu. 6342-6350 [doi]
- MDND: Unsupervised Learning Guided by Non-Differentiable Refinement for Shape CorrespondenceQinsong Li, Jing Meng 0004, Haibo Wang 0009, Shengjun Liu 0002. 6351-6359 [doi]
- Adaptive Piecewise Distillation for Efficient LiDAR Data GenerationRuibo Li, Xiaofeng Yang, Ze Yang 0002, Jiacheng Wei, Chunyan Miao, Guosheng Lin. 6360-6368 [doi]
- An Efficient and Harmonized Framework for Balanced Cross-Domain Feature IntegrationShaoxu Li, Ye Pan. 6369-6377 [doi]
- TEMPLE: Incentivizing Temporal Understanding of Video Large Language Models via Progressive Pre-SFT AlignmentShicheng Li, Lei Li 0039, Kun Ouyang, Shuhuai Ren, Yuanxin Liu, Yuanxing Zhang, Fuzheng Zhang, Lingpeng Kong, Qi Liu 0049, Xu Sun 0001. 6378-6386 [doi]
- SSR-SAM: Retrieval-Style Segment Anything Model for Semi-Supervised Ultra-High-Resolution Image SegmentationShijie Li, Yiming Chen, Zhineng Chen, Kai Hu 0002, Xieping Gao. 6387-6395 [doi]
- VividListener: Expressive and Controllable Listener Dynamics Modeling for Multi-Modal Responsive InteractionShiying Li, Xingqun Qi, Bingkun Yang, Weile Chen, Zezhao Tian, Muyi Sun, Qifeng Liu, Man Zhang 0005, Zhenan Sun. 6396-6404 [doi]
- SynerDetect: Hierarchical Synergistic Learning for Generalizable AI-Generated Image DetectionShuaibo Li, Yijun Yang, Zhaohu Xing, Hongqiu Wang, Pengfei Hao, Xingyu Li, Zekai Liu, Qing Zhang 0006, Lei Zhu 0003. 6405-6414 [doi]
- From Dataset to Real-world: General 3D Object Detection via Generalized Cross-domain Few-shot LearningShuangzhi Li 0003, Junlong Shen, Lei Ma 0003, Xingyu Li. 6415-6423 [doi]
- Bridging Day and Night: Target-Class Hallucination Suppression in Unpaired Image TranslationShuwei Li, Lei Tan, Robby T. Tan. 6424-6432 [doi]
- WorldGrow: Generating Infinite 3D WorldSikuang Li, Chen Yang 0023, Jiemin Fang, Taoran Yi, Jia Lu, Jiazhong Cen, Lingxi Xie, Wei Shen 0002, Qi Tian 0001. 6433-6441 [doi]
- GaussianImage++: Boosted Image Representation and Compression with 2D Gaussian SplattingTiantian Li, Xinjie Zhang, Xingtong Ge, Tongda Xu, Dailan He, Jun Zhang 0004, Yan Wang 0105. 6442-6449 [doi]
- Monocular Vehicle Pose and Shape Reconstruction via Dynamic Context Adaptation and Progressive Geometry RefinementWei Li 0110, Long Ji, Ying Wang, Xiao Wu 0001, Zhaoquan Yuan, Penglin Dai. 6450-6458 [doi]
- When Trackers Date Fish: A Benchmark and Framework for Underwater Multiple Fish TrackingWeiran Li, Yeqiang Liu 0001, Qiannan Guo, Yijie Wei, Hwa Liang Leo, Zhenbo Li. 6459-6467 [doi]
- Seeing Through the Rain: Resolving High-Frequency Conflicts in Deraining and Super-Resolution via Diffusion GuidanceWenjie Li, Jinglei Shi, Jin Han, Heng Guo 0003, Zhanyu Ma. 6468-6476 [doi]
- Hyperbolic Hierarchical Alignment Reasoning Network for Text-3D RetrievalWenrui Li 0001, Yidan Lu, Yeyu Chai, Rui Zhao 0010, Hengyu Man, Xiaopeng Fan 0001. 6477-6485 [doi]
- TCoT: Trajectory Chain-of-Thoughts for Robotic Manipulation with Failure Recovery in Vision-Language-Action ModelXiang Li, Ya-Li Li, Yuan Wang, Huaqiang Wang, Shengjin Wang. 6486-6494 [doi]
- HDRMovieformer: A Transformer Framework and Benchmark for Cinematic SDR-to-HDR ConversionXianwei Li, Huiyuan Fu, Chuanming Wang, Huadong Ma. 6495-6503 [doi]
- Multi-Level Blur-Aware Stable Diffusion for Region-Adaptive Defocus DeblurringXiaopan Li, Yi Jiang, Shiqian Wu, Shoulie Xie, Sos S. Agaian. 6504-6512 [doi]
- Rethinking the Spatio-Temporal Alignment of End-to-End 3D PerceptionXiaoyu Li, Peidong Li, Xian Wu, Long Shi, Dedong Liu, Yitao Wu, Jiajia Fu, Dixiao Cui, Lijun Zhao 0003, Lining Sun. 6513-6520 [doi]
- Text-Guided Channel Perturbation and Pre-Trained Knowledge Integration for Unified Multi-Modality Image FusionXilai Li, Xiaosong Li, Weijun Jiang. 6521-6529 [doi]
- CausalStep: A Benchmark for Explicit Stepwise Causal Reasoning in VideosXuchen Li 0001, Xuzhao Li, Shiyu Hu, Kaiqi Huang, Wentao Zhang. 6530-6538 [doi]
- Any-Optical-Model: A Universal Foundation Model for Optical Remote SensingXuyang Li, Chenyu Li 0002, Danfeng Hong. 6539-6547 [doi]
- RiemanLine: Riemannian Manifold Representation of 3D Lines for Factor Graph OptimizationYan Li, Ze Yang, Keisuke Tateno, Federico Tombari, Liang Zhao 0003, Gim Hee Lee. 6548-6556 [doi]
- Active3D: Active High-Fidelity 3D Reconstruction via Multi-Level Uncertainty QuantificationYan Li, Yingzhao Li, Gim Hee Lee. 6557-6565 [doi]
- HiFi-Mesh: High-Fidelity Efficient 3D Mesh Generation via Compact Autoregressive DependenceYanfeng Li, Tao Tan 0002, Qinquan Gao, Zhiwen Cao, Xiaohong Liu 0001, Yue Sun 0001. 6566-6574 [doi]
- Not All Distortions Are Created Equal: Distortion-Selective Domain Adaptation for Point Cloud Quality AssessmentYangwei Li, Xiaochuan Wang, Xin Shang, Haisheng Li 0002. 6575-6582 [doi]
- IAD-R1: Reinforcing Consistent Reasoning in Industrial Anomaly DetectionYanhui Li, Yunkang Cao, Chengliang Liu 0003, Yuan Xiong, Xinghui Dong, Chao Huang 0008. 6583-6591 [doi]
- EgoCross: Benchmarking Multimodal Large Language Models for Cross-Domain Egocentric Video Question AnsweringYanjun Li, Yuqian Fu, Tianwen Qian, Qi'ao Xu, Silong Dai, Danda Pani Paudel, Luc Van Gool, Xiaoling Wang. 6592-6600 [doi]
- From Scene to Object: Enhancing Open-Vocabulary Object Detection via Foreground-Background Context ReasoningYanqi Li, Jianwei Niu 0002, Ningbo Gu, Tao Ren 0001. 6601-6609 [doi]
- Make LVLMs Focus: Context-Aware Attention Modulation for Better Multimodal In-Context LearningYanshu Li, Jianjiang Yang, Ziteng Yang, Bozheng Li, Ligong Han, Hongyang He, Zhengtao Yao, Yingjie Victor Chen, Songlin Fei, Dongfang Liu, Ruixiang Tang. 6610-6618 [doi]
- CATP: Contextually Adaptive Token Pruning for Efficient and Enhanced Multimodal In-Context LearningYanshu Li, Jianjiang Yang, Zhennan Shen, Ligong Han, Haoyan Xu, Ruixiang Tang. 6619-6627 [doi]
- SurgPub-Video: A Comprehensive Surgical Video Framework for Enhanced Surgical Intelligence in Vision-Language ModelYaoqian Li, Xikai Yang, Dunyuan Xu, Yang Yu, Litao Zhao, Xiaowei Hu 0001, Jinpeng Li 0004, Pheng-Ann Heng. 6628-6635 [doi]
- Analyzing and Mitigating Object Hallucination: A Training Bias PerspectiveYifan Li 0009, Kun Zhou 0002, Xin Zhao 0018, Lei Fang, Jirong Wen. 6636-6643 [doi]
- ManipDreamer3D: Synthesizing Plausible Robotic Manipulation Video with Occupancy-aware 3D TrajectoryYing Li 0128, Xiaobao Wei, Xiaowei Chi, Yuming Li, Zhongyu Zhao, Hao Wang 0073, Ningning Ma, Ming Lu 0002, Sirui Han. 6644-6652 [doi]
- DAPointMamba: Domain Adaptive Point Mamba for Point Cloud CompletionYinghui Li, Qianyu Zhou 0001, Di Shao, Hao Yang, Ye Zhu 0002, Richard Dazeley, Xuequan Lu. 6653-6661 [doi]
- RegionRAG: Region-level Retrieval-Augmented Generation for Visual Document UnderstandingYinglu Li, Zhiying Lu, Zhihang Liu, Yiwei Sun, Chuanbin Liu 0001, Hongtao Xie. 6662-6670 [doi]
- TIM++: Transductive Information Maximization for Few-Shot CLIPYingping Li, Yutong Zou, Yunshi Huang, Changzhe Jiao, Xinlin Wang, Shen Peng, Zhang Guo 0001, Shuiping Gou. 6671-6680 [doi]
- Temporal Inconsistency Guidance for Super-resolution Video Quality AssessmentYixiao Li, Xiaoyuan Yang 0003, Weide Liu, Xin Jin 0014, Xu Jia 0012, Yu-Kun Lai, Paul L. Rosin, Hantao Liu, Wei Zhou 0021. 6681-6689 [doi]
- Dual-Phase Visual-Language Pretraining and Adaptation for Long-Tailed Multi-Label RecognitionYongcheng Li, Xuekuan Wang, Zhifei Zhang, Cairong Zhao. 6690-6698 [doi]
- TechCoach: Towards Technical-Point-Aware Descriptive Action CoachingYuan-Ming Li, An-Lan Wang, Ling-An Zeng, Kun-Yu Lin, Yu-Ming Tang, Weishi Zheng 0001. 6699-6707 [doi]
- Drive-R1: Bridging Reasoning and Planning in VLMs for Autonomous Driving with Reinforcement LearningYue Li, Meng Tian, Dechang Zhu, Jiangtong Zhu, ZhenYu Lin, Zhiwei Xiong, Xinhai Zhao. 6708-6716 [doi]
- SM3Det: A Unified Model for Multi-Modal Remote Sensing Object DetectionYuxuan Li 0004, Xiang Li 0041, Yunheng Li, Yicheng Zhang, Yimian Dai, Qibin Hou, Ming-Ming Cheng, Jian Yang 0003. 6717-6725 [doi]
- Mono3DVG-EnSD: Enhanced Spatial-aware and Dimension-decoupled Text Encoding for Monocular 3D Visual GroundingYuzhen Li, Min Liu 0008, Zhaoyang Li, Yuan Bian 0002, Xueping Wang, Erbo Zhai, Yaonan Wang 0001. 6726-6734 [doi]
- Composition-Incremental Learning for Compositional GeneralizationZhen Li 0026, Yuwei Wu 0001, Chenchen Jing, Che Sun, Chuanhao Li 0001, Yunde Jia. 6735-6743 [doi]
- FGNet: Leveraging Feature-Guided Attention to Refine SAM2 for 3D EM Neuron SegmentationZhenghua Li, Hang Chen, Zihao Sun, Kai Li, Xiaolin Hu. 6744-6752 [doi]
- DTTNet: Improving Video Shadow Detection via Dark-Aware Guidance and Tokenized Temporal ModelingZhicheng Li, Kunyang Sun, Rui Yao 0006, Hancheng Zhu, Fuyuan Hu, Jiaqi Zhao 0001, Zhiwen Shao, Yong Zhou 0003. 6753-6761 [doi]
- HABIT: Chrono-Synergia Robust Progressive Learning Framework for Composed Image RetrievalZixu Li 0001, Yupeng Hu 0003, Zhiwei Chen 0003, ShiQi Zhang, Qinlei Huang, Zhiheng Fu, Yinwei Wei. 6762-6770 [doi]
- Object-Centric Framework for Video Moment RetrievalZongyao Li, Yongkang Wong, Satoshi Yamazaki, Jianquan Liu, Mohan Kankanhalli. 6771-6779 [doi]
- Diffusion-Based Contextual Reconstruction for Point Cloud Segmentation with Limited AnnotationsJiawei Lian, Zhengxue Wang, Wentao Qu, Haobo Jiang, Le Hui, Jian Yang 0003. 6780-6788 [doi]
- MoSs: Mixture of Scales for Efficient High-Resolution Autoregressive Image GenerationYaoxiu Lian, Hao Liang 0003, Zhihong Gou, Yijia Zhang, Jiaming Xu, Guohao Dai 0001, Ningyi Xu. 6789-6797 [doi]
- DW-DGAT: Dynamically Weighted Dual Graph Attention Network for Neurodegenerative Disease DiagnosisChengjia Liang, Zhenjiong Wang, Chao Chen, Ruizhi Zhang, Songxi Liang, Hai Xie, Haijun Lei, Zhongwei Huang. 6798-6806 [doi]
- Multi-Agent Undercover Gaming: Hallucination Removal Through Counterfactual Test for Multimodal ReasoningDayong Liang, Xiao-Yong Wei, Changmeng Zheng. 6807-6815 [doi]
- Improved Masked Image Generation with Knowledge-Augmented Token RepresentationsGuotao Liang, Baoquan Zhang, Zhiyuan Wen, Zihao Han, Yunming Ye. 6817-6825 [doi]
- OTI: A Model-free and Visually Interpretable Measure of Image AttackabilityJiaming Liang, Haowei Liu, Chi-Man Pun. 6826-6834 [doi]
- MM-R1: Unleashing the Power of Unified Multimodal Large Language Models for Personalized Image GenerationQian Liang, Yujia Wu, Kuncheng Li, Jiwei Wei, Shiyuan He, Jinyu Guo, Ning Xie 0003. 6835-6843 [doi]
- Rethinking Surgical Smoke: A Smoke-Type-Aware Laparoscopic Video Desmoking Method and DatasetQifan Liang, Junlin Li, Zhen Han 0002, Xihao Wang, Zhongyuan Wang 0001, Bin Mei. 6844-6852 [doi]
- Tensor Decomposition and Language Description for Open-Vocabulary Object DetectionQiuyu Liang, Yongqiang Zhang. 6853-6861 [doi]
- Persistent Autoregressive Mapping with Traffic Rules for Autonomous DrivingShiyi Liang, Xinyuan Chang, Changjie Wu, Huiyuan Yan, Yifan Bai 0001, Xinran Liu, Hang Zhang, Yujian Yuan, Shuang Zeng, Mu Xu, Xing Wei 0001. 6862-6870 [doi]
- Anatomical Region-Guided Contrastive Decoding: A Plug-and-Play Strategy for Mitigating Hallucinations in Medical VLMsXiao Liang, Chenxi Liu, Zhi Ma, Di Wang 0011, Bin Jing, Quan Wang 0006, Yuanyuan Shi. 6871-6879 [doi]
- Burst Image Quality Assessment: A New Benchmark and Unified Framework for Multiple Downstream TasksXiaoye Liang, Lai Jiang 0004, Minglang Qiao, Yichen Guo, Yue Zhang 0082, Xin Deng 0002, Shengxi Li, Yufan Liu 0001, Mai Xu. 6880-6888 [doi]
- AlignCVC: Aligning Cross-View Consistency for Single-Image-to-3D GenerationXinyue Liang, Zhiyuan Ma 0002, Lingchen Sun, Yanjun Guo, Lei Zhang 0006. 6889-6897 [doi]
- FAMDR: Feature-Aligned Multimodal Denoising for Reliable Diagnostic Reconciliation in Medical ImagingXun Liang 0001, Zhiying Li 0004, Hongxun Jiang. 6898-6906 [doi]
- IPFormer: Instance Prompt-guided Transformer for Multi-modal Multi-shot Video UnderstandingYujia Liang, Jile Jiao, Xuetao Feng, Xinchen Liu, Kun Liu, Yuan Wang, Zixuan Ye, Hao Lu, Zhicheng Wang. 6907-6915 [doi]
- FloorPlanFormer: Multi-Task Transformer Network for Floor Plan Recognition with Outer-to-Inner Feature RefinementYun Liang, Zihao Wu, Run Zheng, Shuai Xie, Bo Hong, Yishen Lin. 6916-6924 [doi]
- Improving Batch Normalization with Test-Time Adaptation for Robust Object Detection in Self-DrivingDacheng Liao, Mengshi Qi, Liang Liu 0001, Huadong Ma. 6925-6933 [doi]
- Beyond Cosine Similarity: Magnitude-Aware CLIP for No-Reference Image Quality AssessmentZhicheng Liao, Dongxu Wu, Zhenshan Shi, Sijie Mai, Hanwei Zhu, Lingyu Zhu 0006, Yuncheng Jiang 0004, Baoliang Chen. 6934-6942 [doi]
- DriveFlow: Rectified Flow Adaptation for Robust 3D Object Detection in Autonomous DrivingHongbin Lin, Yiming Yang 0001, Chaoda Zheng, Yifan Zhang 0004, Shuaicheng Niu, Zilu Guo, Yafeng Li, Gui Gui, Shuguang Cui, Zhen Li 0026. 6943-6951 [doi]
- SAM-DAQ: Segment Anything Model with Depth-guided Adaptive Queries for RGB-D Video Salient Object DetectionJia Lin, Xiaofei Zhou 0003, Jiyuan Liu, Runmin Cong, Guodao Zhang, Zhi Liu 0003, Jiyong Zhang 0001. 6952-6960 [doi]
- Robust Pseudo-Labeling via Decoupled Class-Aware Filtering and Dynamic Category CorrectionJianghang Lin, Yilin Lu, Chaoyang Zhu, Yunhang Shen, Shengchuan Zhang, Liujuan Cao. 6961-6969 [doi]
- Skeletons Speak Louder than Text: A Motion-Aware Pretraining Paradigm for Video-Based Person Re-IdentificationRifen Lin, Alex Jinpeng Wang, Jiawei Mo, Min Li 0007. 6970-6978 [doi]
- SC-Net: Robust Correspondence Learning via Spatial and Cross-Channel ContextShuyuan Lin, Hailiang Liao, Qiang Qi, Junjie Huang, Taotao Lai, Jian Weng 0001. 6979-6987 [doi]
- V2VLoc: Robust GNSS-Free Collaborative Perception via LiDAR LocalizationWenkai Lin, Qiming Xia, Wen Li 0005, Xun Huang 0003, Chenglu Wen. 6988-6996 [doi]
- S²Teacher: Step-by-step Teacher for Sparsely Annotated Oriented Object DetectionYu Lin, Jianghang Lin, Kai Ye, You Shen, Shengchuan Zhang, Liujuan Cao. 6997-7005 [doi]
- Frequency-Aligned Cross-Modal Learning with Top-K Wavelet Fusion and Dynamic Expert Routing for Enhanced Retinal Disease DiagnosisYuxin Lin, Haoran Li 0024, Haoyu Cao 0002, Yongting Hu, QiHao Xu, Chengliang Liu 0003, Xiaoling Luo 0001, Zhihao Wu 0002, Yong Xu 0001, Wei Wang 0169. 7006-7014 [doi]
- Commonality in Few: Few-Shot Multimodal Anomaly Detection via Hypergraph-Enhanced MemoryYuxuan Lin 0001, Hanjing Yan, Xuan Tong, Yang Chang, Huanzhen Wang, Ziheng Zhou 0005, Shuyong Gao, Yan Wang 0068, Wenqiang Zhang. 7015-7023 [doi]
- CrossCut: Cross-Patch Aware Interactive Segmentation for Remote Sensing ImagesZheng Lin, Nan Zhou, Yuhan Wang, Bojian Zhang. 7024-7032 [doi]
- MoFu: Scale-Aware Modulation and Fourier Fusion for Multi-Subject Video GenerationRun Ling, Ke Cao, Jian Lu, Ao Ma 0005, Haowei Liu, Runze He, Changwei Wang, Rongtao Xu, Yihua Shao, Zhanjie Zhang, Peng Wu, Guibing Guo, Wei Feng, Zheng Zhang, Jingjing Lv, Junjie Shen 0008, Ching Law, Xingwei Wang 0001. 7033-7041 [doi]
- CIA: Cluster-Instance Alignment for Unsupervised Day-Night Vehicle Re-IdentificationYongguo Ling, Chen Zhang, Yiming Liu, Wenhao Shao. 7042-7050 [doi]
- DenoiseGS: Delta-Based 3D Gaussian Splatting with B-spline Trajectory Optimization for Dynamic Driving Scene ReconstructionJunjie Linghu, Qiang Ling 0001. 7051-7059 [doi]
- Pb4U-GNet: Resolution-Adaptive Garment Simulation via Propagation-before-Update Graph NetworkAoran Liu, Kun Hu, Clinton Ansun Mo, Qiuxia Wu, Wenxiong Kang, Zhiyong Wang 0001. 7060-7068 [doi]
- CausalCLIP: Causally-Informed Feature Disentanglement and Filtering for Generalizable Detection of Generated ImagesBo Liu, Qiao Qin, Qinghui He. 7069-7077 [doi]
- RCP-LO: A Relative Coordinate Prediction Framework for Generalizable Deep LiDAR OdometryChen Liu, Wen Li 0005, Yongshu Huang, Minghang Zhu, Yuyang Yang, Dunqiang Liu, Sheng Ao, Cheng Wang 0003. 7078-7086 [doi]
- Rethinking Bias in Generative Data Augmentation for Medical AI: A Frequency Recalibration MethodChi Liu 0002, Jincheng Liu, Congcong Zhu, Minghao Wang, Sheng Shen 0005, Jia Gu, Tianqing Zhu, Wanlei Zhou 0001. 7087-7095 [doi]
- De-biased Natural Language Egocentric Task Verification via Prototypical Evidence LearningChong Liu, Xun Jiang 0001, Fumin Shen, Lei Zhu 0002, Jingkuan Song, Heng Tao Shen, Xing Xu 0001. 7096-7104 [doi]
- Learning to Cluster Rare Cell Types: Implicit Semantic Data Augmentation for Spatial Multi-modal Omics AnalysisDaixian Liu, Hau-Sing So, Haoran Chen, Jiao Li, Shanshan Wang 0008, Mengzhu Wang, Jingcai Guo. 7105-7113 [doi]
- Spatial-Spectral Homogeneous Attacks on Physical-World Large Vision-Language ModelsDaizong Liu, Baoquan Chen, Wei Hu 0003. 7114-7122 [doi]
- RAA: Achieving Interactive Remove/Add Anything via Fully Synthetic DataDelong Liu, Haotian Hou, Zhaohui Hou, Shihao Han, Zhiyuan Huang, Mingjie Zhan, Fei Su, Zhicheng Zhao 0001. 7123-7131 [doi]
- PathFLIP: Fine-grained Language-Image Pretraining for Versatile Computational PathologyFengchun Liu, Songhan Jiang, Linghan Cai, Ziyue Wang 0005, Yongbing Zhang 0002. 7132-7140 [doi]
- MRT: Learning Compact Representations with Mixed RWKV-Transformer for Extreme Image CompressionHan Liu, Hengyu Man, Xingtao Wang, Wenrui Li 0001, Debin Zhao. 7141-7149 [doi]
- Rethinking Target Label Conditioning in Adversarial Attacks: A 2D Tensor-Guided Generative ApproachHangyu Liu 0001, Bo Peng 0012, Pengxiang Ding, Donglin Wang. 7150-7158 [doi]
- SCo-Cloud: Satellite Constellation Collaboration for Cloud-Aware Onboard-Computed Imaging and TransmissionJia Liu 0008, Qian Li 0033, Yongqi Li, Cheng Ji 0001, Shangguang Wang. 7159-7167 [doi]
- SkyMoE: A Vision-Language Foundation Model for Enhancing Geospatial Interpretation with Mixture of ExpertsJiaqi Liu, Ronghao Fu, Lang Sun, Haoran Liu, Xiao Yang, Weipeng Zhang, Xu Na, Zhuoran Duan, Bo Yang. 7168-7178 [doi]
- Generalized Geometry Encoding Volume for Real-time Stereo MatchingJiaxin Liu, Gangwei Xu, Xianqi Wang 0001, Chengliang Zhang, Xin Yang 0008. 7179-7187 [doi]
- ReasonAct: Progressive Training for Fine-Grained Video Reasoning in Small ModelsJiaxin Liu, Zhaolu Kang. 7188-7196 [doi]
- PosterVerse: A Full-Workflow Framework for Commercial-Grade Poster Generation with HTML-Based Scalable TypographyJunle Liu, Peirong Zhang 0001, Yuyi Zhang 0002, Pengyu Yan, Hui Zhou, Xinyue Zhou, Fengjun Guo, Lianwen Jin. 7197-7205 [doi]
- PriorRG: Prior-Guided Contrastive Pre-training and Coarse-to-Fine Decoding for Chest X-ray Report GenerationKang Liu 0025, Zhuoqi Ma, Zikang Fang, Yunan Li 0001, Kun Xie 0011, Qiguang Miao. 7206-7214 [doi]
- Accelerating Controllable Generation via Hybrid-grained CacheLin Liu, Huixia Ben, Shuo Wang 0008, Jinda Lu, Junxiang Qiu, Shengeng Tang, Yanbin Hao. 7215-7223 [doi]
- 4DSTR: Advancing Generative 4D Gaussians with Spatial-Temporal Rectification for High-Quality and Consistent 4D GenerationMengmeng Liu, Jiuming Liu, Yunpeng Zhang, Jiangtao Li, Michael Ying Yang, Francesco Nex, Hao Cheng 0008. 7224-7232 [doi]
- MultiMotion: Multi Subject Video Motion Transfer via Video Diffusion TransformerPenghui Liu, Jiangshan Wang, Yutong Shen, Shanhui Mo, Chenyang Qi, Jack Ma. 7233-7241 [doi]
- Channel-masked Asymmetric Distribution Matching for Cross-Domain Generalized Dataset DistillationQi Liu, Chenghao Xu, Jiexi Yan, Guangtao Lyu, Erkun Yang, Guihai Chen, Yanhua Yang. 7242-7250 [doi]
- DMGINE: Day-Memory Guided Nighttime Image Enhancement for Dynamic Traffic ScenesRuizhou Liu, Zhe Wu 0006, Zimo Liu, Qingfang Zheng, Qingming Huang. 7251-7259 [doi]
- EarthCrafter: Scalable 3D Earth Generation via Dual-Sparse Latent DiffusionShang Liu 0002, Chenjie Cao, Chaohui Yu, Wen Qian, Jing Wang 0224, Fan Wang 0019. 7260-7268 [doi]
- Improving Sustainability of Adversarial Examples in Class-Incremental LearningTaiFeng Liu, Xinjing Liu, Liangqiu Dong, Yang Liu 0118, Yilong Yang 0004, Zhuo Ma 0001. 7269-7277 [doi]
- DGKAN: Dual-branch Graph Kolmogorov-Arnold Network for Unsupervised Multimodal Change DetectionTongfei Liu, Jianjian Xu, Tao Lei 0003, Yingbo Wang, Xiaogang Du, Zhiyong Lv. 7278-7286 [doi]
- Orthogonal Spatial-temporal Distributional Transfer for 4D GenerationWei Liu, Shengqiong Wu, Bobo Li 0001, Haoyu Zhao, Hao Fei 0001, Mong-Li Lee, Wynne Hsu. 7287-7295 [doi]
- GT2-GS: Geometry-aware Texture Transfer for Gaussian SplattingWenjie Liu, Zhongliang Liu, Junwei Shu, Changbo Wang, Yang Li. 7296-7304 [doi]
- Beyond the Horizon: Decoupling Multi-View UAV Action Recognition via Partial Order TransferWenxuan Liu 0008, Zhuo Zhou, Xuemei Jia, Siyuan Yang 0001, Wenxin Huang, Xian Zhong, Chia-Wen Lin. 7305-7313 [doi]
- Collaborative Feature Matching with Progressive Correspondence LearningXin Liu 0091, Yanbing Han, Rong Qin 0001, Bing Wang, Jufeng Yang. 7314-7322 [doi]
- Discretization Is Not Always Better: Rethinking Deep Quantization for Asymmetric Image RetrievalXinze Liu, Dayan Wu, Hengjie Zhu, Chenming Wu, Pengwen Dai. 7323-7331 [doi]
- SOAR: Semi-Supervised Open-Vocabulary Aerial Object Detection via Dual-Aware Enhanced Prior DenoisingXu Liu 0006, Yihong Huang, Dan Zhang, Lingling Li 0002, Long Sun, Licheng Jiao. 7332-7340 [doi]
- DBGroup: Dual-Branch Point Grouping for Weakly Supervised 3D Semantic Instance SegmentationXuexun Liu, Xiaoxu Xu, Qiudan Zhang, Lin Ma 0002, Xu Wang 0006. 7341-7349 [doi]
- Global Compression Commander: Plug-and-Play Inference Acceleration for High-Resolution Large Vision-Language ModelsXuyang Liu, Ziming Wang, Junjie Chen, Yuhang Han, Yingyao Wang, Jiale Yuan, Jun Song, Siteng Huang, Honggang Chen. 7350-7358 [doi]
- Signal: Selective Interaction and Global-local Alignment for Multi-Modal Object Re-IdentificationYangyang Liu, Yuhao Wang, Pingping Zhang. 7359-7367 [doi]
- FracSegmentator: Fracture Instance Segmentation with Trauma-Prior-Guided Contrastive LearningYanzhen Liu, Sutuke Yibulayimu, Yang Zhou, Yudi Sang, Yu Wang. 7368-7376 [doi]
- La La LiDAR: Large-Scale Layout Generation from LiDAR DataYouquan Liu, Lingdong Kong, Weidong Yang 0001, Xin Li 0110, Alan Liang, Runnan Chen, Ben Fei, Tongliang Liu. 7377-7385 [doi]
- View-on-Graph: Zero-Shot 3D Visual Grounding via Vision-Language Reasoning on Scene GraphsYuanyuan Liu, Haiyang Mei, Dongyang Zhan, Jiayue Zhao, Dongsheng Zhou, Bo Dong 0004, Xin Yang 0011. 7386-7394 [doi]
- Clear Nights Ahead: Towards Multi-Weather Nighttime Image RestorationYuetong Liu, Yunqiu Xu, Yang Wei 0002, Xiuli Bi, Bin Xiao 0002. 7395-7403 [doi]
- Nighttime Flare Removal via Wavelet-Guided and Gated-Enhanced Spatial-Frequency Fusion NetworkYun Liu 0002, Guang Yang, Tao Li, Weisi Lin. 7404-7412 [doi]
- ReFINE: A Reward-Based Framework for Interpretable and Nuanced Evaluation of Radiology Report GenerationYunyi Liu, Yingshu Li 0001, Zhanyu Wang, Xinyu Liang, Lingqiao Liu, Lei Wang 0001, Luping Zhou. 7413-7421 [doi]
- Few-Shot Precise Event Spotting via Unified Multi-Entity Graph and DistillationZhaoyu Liu, Kan Jiang, Murong Ma, Zhe Hou, Yun Lin 0001, Jin Song Dong 0001. 7422-7430 [doi]
- RAW-Flow: Advancing RGB-to-RAW Image Reconstruction with Deterministic Latent Flow MatchingZhen Liu 0022, Diedong Feng, Hai Jiang 0006, Liaoyuan Zeng, Hao Wang 0073, Chaoyu Feng, Lei Lei, Bing Zeng 0001, Shuaicheng Liu. 7431-7439 [doi]
- MoEGaze: A Mixture of Experts Approach for Generalizable Gaze EstimationZheng Liu, Feng Lu. 7440-7448 [doi]
- ConsistTalk: Intensity Controllable Temporally Consistent Talking Head Generation with Diffusion Noise SearchZhenjie Liu, Jianzhang Lu, Renjie Lu, Cong Liang 0002, Shangfei Wang. 7449-7457 [doi]
- PUFM: Efficient Point Cloud Upsampling via Flow MatchingZhi-Song Liu, Chenhang He, Yakun Ju, Lei Li. 7458-7466 [doi]
- LandCraft: Designing the Structured 3D Landscapes via Text GuidanceZhihao Liu, Fang Liu, Weihao Xuan, Naoto Yokoya. 7467-7475 [doi]
- RMFAT: Recurrent Multi-scale Feature Atmospheric Turbulence MitigatorZhiming Liu, Nantheera Anantrasirichai. 7476-7484 [doi]
- PatientVLM Meets DocVLM: Pre-Consultation Dialogue Between Vision-Language Models for Efficient DiagnosisK. Lokesh, Abhirama Subramanyam Penamakuri, Uday Agarwal, Apoorva Challa, Shreya K. Gowda, Somesh Gupta, Anand Mishra 0001. 7485-7493 [doi]
- Task-Specific Distance Correlation Matching for Few-Shot Action RecognitionFei Long 0001, Yao Zhang, Jiaming Lv, Jiangtao Xie, Peihua Li. 7494-7502 [doi]
- Lightning Fast Caching-based Parallel Denoising Prediction for Accelerating Talking Head GenerationJianzhi Long, Wenhao Sun, Rong-Cheng Tu, Dacheng Tao. 7503-7511 [doi]
- Multitasks-based Deep Evidential Fusion Network for Blind Image Quality AssessmentYiwei Lou, Yuanpeng He, Rongchao Zhang, Yongzhi Cao, Hanpin Wang, Yu Huang 0004. 7512-7520 [doi]
- Infrared-Privileged UAV Detection via Cross-Modal Vector-QuantizationZhibo Lou, Ruijie Zhang, Zeyu Luo, Qianxi Cao, Feng Qian, Junjie Chen, Yuming Fang. 7521-7529 [doi]
- LUMIN: A Longitudinal Multi-modal Knowledge Decomposition Network for Predicting Breast Cancer RecurrenceChunyao Lu, Tianyu Zhang 0006, Xinglong Liang, Yuan Gao, Luyi Han, Xin Wang 0121, Nika Rasoolzadeh, Tao Tan 0002, Ritse Mann. 7530-7538 [doi]
- Vista: Scene-Aware Optimization for Streaming Video Question Answering Under Post-Hoc QueriesHaocheng Lu, Nan Zhang, Wei Tao, Xiaoyang Qu, Guokuan Li, Jiguang Wan 0001, Jianzong Wang. 7539-7547 [doi]
- From Pretrain to Pain: Adversarial Vulnerability of Video Foundation Models Without Task KnowledgeHui Lu, Yi Yu 0011, Song Xia, Yiming Yang, Deepu Rajan, Boon Poh Ng, Alex C. Kot, Xudong Jiang 0001. 7548-7556 [doi]
- DCA-LUT: Deep Chromatic Alignment with 5D LUT for Purple Fringing RemovalJialang Lu, Shuning Sun, Pu Wang 0008, Chen Wu 0006, Feng Gao 0005, Lina Gong, Dianjie Lu, Guijuan Zhang, Zhuoran Zheng. 7557-7564 [doi]
- EasyText: Controllable Diffusion Transformer for Multilingual Text RenderingRunnan Lu, Yuxuan Zhang, Jiaming Liu, Haofan Wang, Yiren Song. 7565-7573 [doi]
- LWGANet: Addressing Spatial and Channel Redundancy in Remote Sensing Visual Tasks with Light-Weight Grouped AttentionWei Lu, Xue Yang, Si-Bao Chen 0001. 7574-7582 [doi]
- Unlearning in Cross-Modal Retrieval via Prior-Prototype Guided Partitioned DampeningYi Lu, Shu Li, Yurong Qian. 7583-7590 [doi]
- MVGD-Net: A Novel Motion-aware Video Glass Surface Detection MethodYiwei Lu, Hao Huang, Tao Yan 0001. 7591-7599 [doi]
- HiGFA: Hierarchical Guidance for Fine-grained Data Augmentation with Diffusion ModelsZhiguang Lu, Qianqian Xu 0001, Peisong Wen, Siran Dai, Qingming Huang. 7600-7608 [doi]
- 3DTeethSAM: Taming SAM2 for 3D Teeth SegmentationZhiguo Lu, Jianwen Lou, Mingjun Ma, Hairong Jin, Youyi Zheng, Kun Zhou 0001. 7609-7617 [doi]
- Walking Further: Semantic-Aware Multimodal Gait Recognition Under Long-Range ConditionsZhiyang Lu, Wen Jiang, Tianren Wu, Zhichao Wang, Changwang Zhang, Siqi Shen, Ming Cheng 0002. 7618-7626 [doi]
- R-AVST: Empowering Video-LLMs with Fine-Grained Spatio-Temporal Reasoning in Complex Audio-Visual ScenariosLu Zhu, Tiantian Geng, Yangye Chen, Teng Wang, Ping Luo 0002, Feng Zheng 0001. 7627-7635 [doi]
- Negative Entity Suppression for Zero-Shot Captioning with Synthetic ImagesZimao Lu, Hui Xu, Bing Liu, Ke Wang. 7636-7643 [doi]
- Inpaint-Anywhere: Zero-Shot Multi-Identity Inpainting with Efficient Diffusion TransformerJunsheng Luan, Lei Zhao 0011, Wei Xing 0001. 7644-7652 [doi]
- Textured Geometry Evaluation: Perceptual 3D Textured Shape Metric via 3D Latent-Geometry NetworkTianyu Luan, Xuelu Feng, Zixin Zhu, Phani Nuney, Sheng Liu 0017, Xuan Gong, David S. Doermann, Chunming Qiao, Junsong Yuan 0001. 7653-7661 [doi]
- Unsupervised Contrastive Learning for Efficient and Robust Spectral Shape MatchingFeifan Luo, Hongyang Chen 0001. 7662-7670 [doi]
- Connecting the Dots: Training-Free Visual Grounding via Agentic ReasoningLiqin Luo, Guangyao Chen, Xiawu Zheng, Yongxing Dai, Yixiong Zou, Yonghong Tian 0001. 7671-7679 [doi]
- Rethinking Multimodal Point Cloud Completion: A Completion-by-Correction PerspectiveWang Luo, Di Wu 0001, Hengyuan Na, Yinlin Zhu, Miao Hu 0001, Guocong Quan. 7680-7688 [doi]
- FilmWeaver: Weaving Consistent Multi-Shot Videos with Cache-Guided Autoregressive DiffusionXiangyang Luo, Qingyu Li, Xiaokun Liu, Wenyu Qin, Miao Yang, Meng Wang 0001, Pengfei Wan 0001, Di Zhang 0026, Kun Gai, Shao-Lun Huang. 7689-7697 [doi]
- TraceTrans: Translation and Spatial Tracing for Surgical PredictionXiyu Luo, Haodong Li, Xinxing Cheng, He Zhao, Yang Hu, Xuan Song, Tianyang Zhang. 7698-7706 [doi]
- Revisiting Downsampling in Semantic Segmentation: Fighting Aliasing with Dynamic Gaussian and Gabor Frequency FiltersYuBing Luo, Nian Shi, Jia Qin, Zekai Ji, Pinle Qin, Jianchao Zeng 0001, Jianghui Cai. 7707-7715 [doi]
- AURORA: Augmented Understanding via Structured Reasoning and Reinforcement Learning for Reference Audio-Visual SegmentationZiyang Luo, Nian Liu 0002, Fahad Shahbaz Khan, Junwei Han 0001. 7716-7724 [doi]
- S5: Scalable Semi-Supervised Semantic Segmentation in Remote SensingLiang Lv, Di Wang 0023, Jing Zhang 0037, Lefei Zhang. 7726-7734 [doi]
- CertMask: Certifiable Defense Against Adversarial Patches via Theoretically Optimal Mask CoverageXuntao Lyu, Ching-Chi Lin, Abdullah Al Arafat, Georg von der Brüggen, Jian-Jia Chen, Zhishan Guo. 7735-7743 [doi]
- CAD-VAE: Leveraging Correlation-Aware Latents for Comprehensive Fair DisentanglementChenrui Ma, Xi Xiao, Tianyang Wang 0004, Xiao Wang 0004, Yanning Shen. 7744-7754 [doi]
- CorrectAD: A Self-Correcting Agentic System to Improve End-to-end Planning in Autonomous DrivingEnhui Ma, Lijun Zhou, Tao Tang, Jiahuan Zhang, Junpeng Jiang, Zhan Zhang, Dong Han, Kun Zhan, Xueyang Zhang, Xianpeng Lang, Haiyang Sun, Xia Zhou, Di Lin 0002, Kaicheng Yu. 7755-7763 [doi]
- X2Edit: Revisiting Arbitrary-Instruction Image Editing Through Self-Constructed Data and Task-Aware Representation LearningJian Ma 0010, Xujie Zhu, Zihao Pan, Qirong Peng, Xu Guo, Chen Chen 0015, Haonan Lu. 7764-7772 [doi]
- Tracking the Unstable: Appearance-Guided Motion Modeling for Robust Multi-Object Tracking in UAV-Captured VideosJianbo Ma 0001, Hui Luo 0002, Qi Chen 0014, Yuankai Qi, Yumei Sun, Amin Beheshti, Jianlin Zhang 0001, Ming-Hsuan Yang 0001. 7773-7781 [doi]
- Phys-Liquid: A Physics-Informed Dataset for Estimating 3D Geometry and Volume of Transparent Deformable LiquidsKe Ma 0012, Yizhou Fang, Jean-Baptiste Weibel, Shuai Tan, Xinggang Wang, Yang Xiao 0007, Yi Fang 0006, Tian Xia. 7782-7790 [doi]
- UM-Text: A Unified Multimodal Model for Image Understanding and Visual Text EditingLichen Ma, Xiaolong Fu, Gaojing Zhou, Zipeng Guo, Ting Zhu, Yichun Liu, Yu Shi, Jason Li, Junshi Huang. 7791-7799 [doi]
- MFINet: Multi-view Fusion and 2D-3D Interaction Enhancement for Real-Time LiDAR Semantic SegmentationNan Ma 0012, Zhijie Liu 0002, Yiheng Han. 7800-7808 [doi]
- Landsat30-AU: A Vision-Language Dataset for Australian Landsat ImagerySai Ma, Zhuang Li, John A. Taylor. 7809-7817 [doi]
- Training-Free Multi-Character Audio-Driven Animation via Diffusion Transformer with Reward FeedbackXingpei Ma, Shenneng Huang, Jiaran Cai, Yuansheng Guan, Shen Zheng, Hanfeng Zhao, Qiang Zhang, Shunsi Zhang. 7818-7826 [doi]
- Regression over Classification: Assessing Image Aesthetics via Multimodal Large Language ModelsXingyuan Ma, Shuai He, Anlong Ming, Haobin Zhong, Huadong Ma. 7827-7835 [doi]
- Compositional Attribute Imbalance in Vision DatasetsYanbiao Ma, Jiayi Chen, Wei Dai 0015, Dong Zhao, Zeyu Zhang, Yuting Yang 0008, Bowei Liu, Jiaxuan Zhao, Andi Zhang 0004. 7836-7846 [doi]
- Edge-Centric Relational Reasoning for 3D Scene Graph PredictionYanni Ma, Hao Liu 0061, Yulan Guo, Theo Gevers, Martin R. Oswald. 7847-7855 [doi]
- PA-FAS: Towards Interpretable and Generalizable Multimodal Face Anti-Spoofing via Path-Augmented Reinforcement LearningYingjie Ma, Xun Lin, Yong Xu 0001, Weicheng Xie 0001, Zitong Yu. 7856-7864 [doi]
- StyleFM: Frequency Manipulation Empowered by Recursive Attention on Diffusion Models for Arbitrary Style TransferYingnan Ma, Zhenye Liu, Siying Liu, Anup Basu. 7865-7873 [doi]
- HyperDiag: Temporal-Regional Hypergraph Learning via Topology-Enhanced State Propagation for Brain Disease DiagnosisYulan Ma, Fangkun Li, Wenchao Yang, Qian Si, Chenglong Yu, Yang Li. 7874-7882 [doi]
- One2Seq: One-Token Wise Decoder for Efficient Scene Text RecognitionZhibin Ma, Pengwen Dai, Wei Zhuo, Xugong Qin. 7883-7891 [doi]
- Where and What Matters: Sensitivity-Aware Task Vectors for Many-Shot Multimodal In-Context LearningZiyu Ma, Chenhui Gou, Yiming Hu, Yong Wang, Bohan Zhuang, Jianfei Cai 0001. 7892-7900 [doi]
- VALIANT: Prompt Instability for Active Learning in Black-Box Medical ImagingDwarikanath Mahapatra, Behzad Bozorgtabar, Sudipta Roy 0002, Imran Razzak, Mauricio Reyes 0001. 7901-7909 [doi]
- Copyright Infringement Detection in Text-to-Image Diffusion Models via Differential PrivacyXiafeng Man, Zhipeng Wei 0001, Jingjing Chen 0001. 7910-7917 [doi]
- TextGround4M: A Prompt-Aligned Dataset for Layout-Aware Text RenderingDongxing Mao, Yilin Wang, Linjie Li, Zhengyuan Yang, Alex Jinpeng Wang. 7918-7926 [doi]
- Omni-Effects: Unified and Spatially-Controllable Visual Effects GenerationFangyuan Mao, Aiming Hao, Jintao Chen, Dongxia Liu, Xiaokun Feng, Jiashu Zhu, Meiqi Wu, Chubin Chen, Jiahong Wu 0005, Xiangxiang Chu. 7927-7935 [doi]
- TweezeEdit: Consistent and Efficient Image Editing with Path RegularizationJianda Mao, Kaibo Wang, Yang Xiang, Kani Chen. 7936-7944 [doi]
- Learning Spatial Decay for Vision TransformersYuxin Mao, Zhen Qin 0003, Jinxing Zhou, Bin Fan 0002, Jing Zhang 0052, Yiran Zhong, Yuchao Dai. 7945-7953 [doi]
- RefleXNet: Targeted Self-Reflection for Accurate Chest X-ray ReportingXin Mei, Rui Mao 0010, Xiaoyan Cai, Libin Yang, Erik Cambria. 7954-7962 [doi]
- Suit the Remedy to the Retriever: Interpretable Query Optimization with Retriever Preference Alignment for Vision-Language RetrievalGuanghao Meng, Jinpeng Wang 0002, Jieming Zhu, Letian Zhang, Yong Jiang 0001, Dan Zhao 0003, Qing Li 0006. 7963-7971 [doi]
- Imagine with Layout and Sketch: Enhancing Vision-Language Retrieval with Dual-Stream Multi-Modal Query RefinementGuanghao Meng, Jinpeng Wang 0002, Qian-Wei Wang, Xudong Ren, Dan Zhao 0003. 7972-7980 [doi]
- Enhanced Privacy Leakage from Noise-Perturbed Gradients via Gradient-Guided Conditional Diffusion ModelsJiayang Meng, Tao Huang, Hong Chen 0001, Chen Hou, Guolong Zheng. 7981-7989 [doi]
- Anti-Avatar: Protect Against Unauthorized 3D Head Avatar Generation via Dual-Space DivergenceLingzhuang Meng, Mingwen Shao, Xiang Lv, Mengyao Wu, Yuanjian Qiao 0001, Jie Zhang 0133. 7990-7998 [doi]
- Appearance-Motion Decomposed Alignment for Text-Video RetrievalMeng Meng, Zichang Tan, Yong Zhang, Xu Zhou. 7999-8007 [doi]
- EchoMimicV3: 1.3B Parameters Are All You Need for Unified Multi-Modal and Multi-Task Human AnimationRang Meng, Yan Wang, Weipeng Wu, Ruobing Zheng, Yuming Li, Chenguang Ma. 8008-8015 [doi]
- Exploring Category-level Articulated Object Pose Tracking on SE(3) ManifoldsXianHui Meng, Yukang Huo, Li Zhang 0104, Liu Liu 0012, Haonan Jiang, Yan Zhong 0001, Pingrui Zhang, Cewu Lu, Jun Liu. 8016-8024 [doi]
- Image Content Matters: An Image Content Aware State Space Model for Accelerated MRI ReconstructionYucong Meng, Zhiwei Yang, Kexue Fu 0001, Zhijian Song, Yonghong Shi. 8025-8033 [doi]
- Improving Sparse IMU-based Motion Capture with Motion Label SmoothingZhaorui Meng, Lu Yin, Yangqing Hou, Anjun Chen, Shihui Guo, Yipeng Qin. 8034-8042 [doi]
- MotionFlow: Attention-Driven Motion Transfer in Video Diffusion ModelsTuna Han Salih Meral, Hidir Yesiltepe, Connor Dunlop, Pinar Yanardag. 8043-8051 [doi]
- BioDPP: Dynamic Prompt Policy Learning for Biomedical Vision-Language ModelsPingyi Miao, Xianlai Chen, Kai Sun, Yunbo Wang, Shuang Zhao, Ying An. 8052-8060 [doi]
- Augmentation-invariant Learning Strategy via Data Augmentation for Improving Model GeneralizationYu Miao, Juanjuan Zhao 0002, Sijie Song, Ran Gong, Yuanqian Zhu, Lusha Qi, Yan Qiang 0001. 8061-8070 [doi]
- Hilbert Curve-Encoded Rotation-Equivariant Oriented Object Detector with Locality-Preserving Spatial MappingQi Ming, Liuqian Wang, Juan Fang, Xudong Zhao, Yucheng Xu, Ziyi Teng, Yue Zhou, Xiaoxi Hu, Xiaohan Zhang, Yufei Guo. 8071-8079 [doi]
- VGGTFace: Topologically Consistent Facial Geometry Reconstruction in the WildXin Ming, Yuxuan Han, Tianyu Huang, Feng Xu 0005. 8080-8088 [doi]
- Neural Outline Cache for Real-time Anti-aliasing Font RenderingJiashuaizi Mo, Sang-Woon Jeon, Hua Wang 0002, Xiangqi Chen, Yanchao Wang, Minglu Li 0001, Zhonglong Zheng. 8089-8097 [doi]
- QRShield: Exploiting Vulnerabilities of Latent Diffusion Models for Preventing AI Art PlagiarismXunyue Mo, Weibin Wu 0002, Qingrui Tu, Hang Wang, Junxi He, Zibin Zheng. 8098-8106 [doi]
- BREPS: Bounding-Box Robustness Evaluation of Promptable SegmentationAndrey Moskalenko, Danil Kuznetsov, Irina Dudko, Anastasiia Iasakova, Nikita Boldyrev, Denis Shepelev, Andrei Spiridonov, Andrey Kuznetsov, Vlad Shakhuro. 8107-8115 [doi]
- FantasyHSI: Video-Generation-Centric 4D Human Synthesis in Any Scene Through a Graph-Based Multi-Agent FrameworkLingzhou Mu, Qiang Wang, Fan Jiang, Mengchao Wang, Mu Xu, Kai Zhang. 8116-8124 [doi]
- Benchmarking Visual LLMs Resilience to Unanswerable Questions on Visually Rich DocumentsDavide Napolitano, Luca Cagliero, Fabrizio Battiloro. 8125-8133 [doi]
- AHAN: Asymmetric Hierarchical Attention Network for Identical Twin Face VerificationHoang Nhat Nguyen. 8134-8141 [doi]
- DenoDet V2: Phase-Amplitude Cross Denoising for SAR Object DetectionKang Ni, Minrui Zou, Yuxuan Li 0004, Xiang Li 0041, Kehua Guo, Ming-Ming Cheng, Yimian Dai. 8142-8150 [doi]
- Beyond Wide-Angle Images: Structure-to-Detail Video Portrait Correction via Unsupervised Spatiotemporal AdaptationWenbo Nie, Lang Nie, Chunyu Lin, Jingwen Chen, Ke Xing, Jiyuan Wang 0001, Kang Liao. 8151-8159 [doi]
- From Intent to Execution: Multimodal Chain-of-Thought Reinforcement Learning for Precise CAD Code GenerationKe Niu 0004, Haiyang Yu 0004, Zhuofan Chen, Mengyang Zhao, Teng Fu 0001, Bin Li 0015, Xiangyang Xue 0001. 8160-8167 [doi]
- Virtual Multiplex Staining for Histological Images Using a Marker-Wise Conditioned Diffusion ModelHyun-Jic Oh, Junsik Kim 0001, Zhiyi Shi, Yichen Wu, Yu-An Chen, Peter K. Sorger, Hanspeter Pfister, Won-Ki Jeong. 8168-8176 [doi]
- LSD-3D: Large-Scale 3D Driving Scene Generation with Geometry GroundingJulian Ost, Andrea Ramazzina, Amogh Joshi 0004, Maximilian Bömer, Mario Bijelic, Felix Heide. 8177-8187 [doi]
- DeLightMono: Enhancing Self-Supervised Monocular Depth Estimation in Endoscopy by Decoupling Uneven IlluminationMingyang Ou, Haojin Li 0003, Yifeng Zhang, Ke Niu 0002, Zhongxi Qiu, Heng Li 0010, Jiang Liu 0001. 8188-8196 [doi]
- PMPGuard: Catching Pseudo-Matched Pairs in Remote Sensing Image-Text RetrievalPengxiang Ouyang, Qing Ma, Zheng Wang 0059, Cong Bai. 8197-8205 [doi]
- Taming the Phantom: Token-Asymmetric Filtering for Hallucination Mitigation in Large Vision-Language ModelsShuyi Ouyang, Hongyi Wang 0002, Gongfan Fang, Xinyin Ma, Lanfen Lin, Xinchao Wang. 8206-8214 [doi]
- SpikingIR: A Novel Converted Spiking Neural Network for Efficient Image RestorationYang Ouyang, Zihan Cheng, Xiaotong Luo, Guoqi Li, Yanyun Qu. 8215-8223 [doi]
- LORETTA: A Low Resource Framework to Poison Continuous Time Dynamic GraphsHimanshu Pal, Venkata Sai Pranav Bachina, Ankit Gangwal, Charu Sharma. 8224-8232 [doi]
- SpecDiff: Accelerating Diffusion Model Inference with Self-SpeculationJiayi Pan, Jiaming Xu, Yongkang Zhou, Guohao Dai 0001. 8233-8241 [doi]
- Ground What You See: Hallucination-Resistant MLLMs via Caption Feedback, Diversity-Aware Sampling, and Conflict RegularizationMiao Pan, Wangjie Gan, Jintao Chen 0001, Wenqi Zhang, Bing Sun, Jianwei Yin, Xuhong Zhang 0002. 8242-8250 [doi]
- OneLIP: Unlocking and Improving Long-Text Representations of CLIP via One-Stage AdaptationRenjie Pan 0001, Jiayan Song, Hua Yang 0001. 8251-8259 [doi]
- Next Patch Prediction for AutoRegressive Visual GenerationYatian Pang, Peng Jin 0001, Shuo Yang, Bin Zhu, Bin Lin 0014, Chaoran Feng 0001, Zhenyu Tang 0004, Liuhan Chen, Francis E. H. Tay, Ser-Nam Lim, Harry Yang, Li Yuan 0007. 8260-8268 [doi]
- Image-Text Knowledge Modeling for Unsupervised Multi-Scenario Person Re-IdentificationZhiqi Pang, Lingling Zhao, Yang Liu 0006, Chunyu Wang 0002, Gaurav Sharma 0001. 8269-8277 [doi]
- Infinite-Story: A Training-Free Consistent Text-to-Image GenerationJihun Park, Kyoungmin Lee 0001, Jongmin Gim, Hyeonseo Jo, Minseok Oh, Wonhyeok Choi, Kyumin Hwang, Jaeyeul Kim, Minwoo Choi, Sunghoon Im. 8278-8286 [doi]
- Neural Collapse-Informed Initialization with Perturbation Injection in Classification-based Metric LearningJinhee Park 0001, Hee bin Yoo, MinJun Kim, Byoung-Tak Zhang, Junseok Kwon. 8287-8295 [doi]
- Leveraging Textual Compositional Reasoning for Robust Change CaptioningKyu Ri Park, Jiyoung Park, Seong Tae Kim 0001, Hong Joo Lee 0001, Jung-Uk Kim. 8296-8304 [doi]
- SphereDiff: Tuning-free 360° Static and Dynamic Panorama Generation via Spherical Latent RepresentationMinho Park 0003, Taewoong Kang, Jooyeol Yun, Sungwon Hwang, Jaegul Choo. 8305-8313 [doi]
- Generalized-Scale Object Counting with Gradual Query AggregationJer Pelhan, Alan Lukezic, Matej Kristan. 8314-8321 [doi]
- How Foundational Skills Influence VLM-based Embodied Agents: A Native PerspectiveBo Peng, Pi Bu, Keyu Pan, Xinrun Xu, Yingxiu Zhao, Miao Chen, Yang Du, Lin Li, Jun Song, Tong Xu 0001. 8322-8330 [doi]
- Correcting Quantization-Induced Gradient Mismatch in Neural Image CompressionChanghao Peng, Yuqi Ye, Wei Gao 0003. 8331-8339 [doi]
- FreeMem: Enhancing Consistency in Long Video Generation via Tuning-Free MemoryJibin Peng, Di Lin 0002, Zhecheng Xu, Haoran Lu, Ruonan Liu, Wuyuan Xie, Miaohui Wang, Lingyu Liang, Yi Wang, Qing Guo 0005. 8340-8348 [doi]
- Hierarchical Frequency-Guided Alignment Transformer for Compressed Video Quality EnhancementLiuhan Peng, Shuai Li 0005, Yanbo Gao, Mao Ye 0001, Chong Lv. 8349-8357 [doi]
- Lifelong Domain Adaptive 3D Human Pose EstimationQucheng Peng, Hongfei Xue, Pu Wang 0001, Chen Chen 0001. 8358-8366 [doi]
- Revisiting Cross-Architecture Distillation: Adaptive Dual-Teacher Transfer for Lightweight Video ModelsYing Peng, Hongsen Ye, Changxin Huang, Xiping Hu, Jian Chen 0011, Runhao Zeng. 8367-8375 [doi]
- GAGS: Granularity-Aware Feature Distillation for Language Gaussian SplattingYuning Peng, Haiping Wang 0004, Yuan Liu 0025, Chenglu Wen, Zhen Dong 0005, Bisheng Yang. 8376-8384 [doi]
- TARA: Token-Aware LoRA for Composable Personalization in Diffusion ModelsYuqi Peng, Lingtao Zheng, Yufeng Yang, Yi Huang 0035, Mingfu Yan, Jianzhuang Liu, Shifeng Chen. 8385-8393 [doi]
- CP-CLIP: Customized Parameter Generation for Open-vocabulary Semantic SegmentationZelin Peng, Zhengqin Xu, Feilong Tang, Wei Shen 0002. 8394-8402 [doi]
- FineVAU: A Novel Human-Aligned Benchmark for Fine-Grained Video Anomaly UnderstandingJoão Alexandre Cardeira Pereira, Vasco Lopes, João Neves 0006, David Semedo. 8403-8411 [doi]
- LLaVA³: Representing 3D Scenes Like a Cubist Painter to Boost 3D Scene Understanding of VLMsDoriand Petit, Steve Bourgeois, Vincent Gay-Bellile, Florian Chabot, Loïc Barthe. 8412-8420 [doi]
- Fractured Glass, Failing Cameras: Simulating Physics-Based Adversarial Samples for Autonomous Driving SystemsManav Prabhakar, Jwalandhar Girnar, Arpan Kusari. 8421-8429 [doi]
- Topology-Inspired Backward-Free Framework for Test-Time Adaptation in Medical DetectionBin Pu, Xingguo Lv, Jiewen Yang, Kai Xu, Lei Zhao 0013, Zuozhu Liu, Kenli Li 0001. 8430-8438 [doi]
- Unified Mixture-of-Experts Framework for Joint Cardiac and Vascular Ultrasound Analysis and Report GenerationBin Pu, Jiewen Yang, Xingguo Lv, Kai Xu, Kenli Li 0001. 8439-8447 [doi]
- Organ-Aware Routing Mixture-of-Retrieval Augmented Generation for Fetal Ultrasound ReportingBin Pu, Siyu Wang, Rongbin Li, Xinpeng Ding, Lei Zhao 0013, Chaoqi Chen, Shengli Li 0001, Kenli Li 0001. 8448-8456 [doi]
- Instance-Guided Scene Adaptation for Unsupervised Person SearchLinfeng Qi, Huibing Wang, Jinjia Peng, XianPing Fu, Jiqing Zhang. 8457-8465 [doi]
- Localization-Anchored Instance Discrimination for Domain Adaptive Person SearchLinfeng Qi 0001, Huibing Wang, Jinjia Peng, Jiqing Zhang. 8466-8474 [doi]
- MSTDiff: Multiscale-Aware Transformer Diffusion Network for Video Object DetectionQiang Qi, Wenqi Shang, Xiao Wang, Yanjie Liang, Shuyuan Lin. 8475-8483 [doi]
- What to Trust? A Trust-aware Knowledge-guided Method for Zero-shot Object State Understanding in VideosYayun Qi, Xinxiao Wu. 8484-8492 [doi]
- Game Ground Bench: Probing the Limits of LVLMs in Complex Semantic Grounding Across Game UniversesZhangyang Qi, Jinsong Li 0001, Hongjian Wu, Jiaqi Wang 0003, Hengshuang Zhao. 8493-8501 [doi]
- Event-Guided Scene Text Image Super-ResolutionZihan Qi, Zeyu Xiao 0002, Haoyi Zhao, Yang Zhao 0002, Feng Xue 0002, Wei Jia 0001. 8502-8510 [doi]
- WeatherEdit: Controllable Weather Editing with 4D Gaussian FieldChenghao Qian, Wenjing Li 0005, Yuhu Guo, Gustav Markkula. 8511-8519 [doi]
- SplatSSC: Decoupled Depth-Guided Gaussian Splatting for Semantic Scene CompletionRui Qian 0005, Haozhi Cao, Tianchen Deng, Shenghai Yuan 0001, Lihua Xie 0001. 8520-8528 [doi]
- Text-guided Controllable Diffusion for Realistic Camouflage Images GenerationYuhang Qian, Haiyan Chen 0001, Wentong Li 0001, Ningzhong Liu, Jie Qin 0004. 8529-8537 [doi]
- Sharp Eyes and Memory for VideoLLMs: Information-Aware Visual Token Pruning for Efficient and Reliable VideoLLM ReasoningJialong Qin, Xin Zou 0001, Di Lu, Yibo Yan, Xuming Hu. 8538-8546 [doi]
- MAPI-GNN: Multi-Activation Plane Interaction Graph Neural Network for Multimodal Medical DiagnosisZiwei Qin, Xuhui Song, Deqing Huang, Na Qin 0001, Jun Li. 8547-8555 [doi]
- Graph-Semantic Guided Learning for Virtual Immunohistochemistry Staining on Consecutive Histology SectionsFanhao Qiu, Yangyang Zhang, Zhengxia Wang. 8556-8564 [doi]
- Ego-PMOVE: Prompt-aware Mixture of View Experts Network for Egocentric Gaze PredictionHeqian Qiu, Lanxiao Wang, Taijin Zhao, Zhaofeng Shi, Xiang Li, Linfeng Xu 0001, Hongliang Li 0001. 8565-8573 [doi]
- Beyond Euclidean Assumptions: Geometry-Aware Adaptive Routing for Remote Sensing SegmentationJie Qiu, Dizuo Cao, Linwei Dai, Xin Li, Fan Yang, Dong Yu, Changying Wang, Zongheng Wen, Youqin Chen, Jianzhang Chen. 8574-8582 [doi]
- AerialFusion: Co-Motion-Driven Unified Registration and Fusion on Multi-modal Data Streams from Aerial ViewJunhui Qiu, Xiang Xiang 0001, Hongyun Wang, Jiaqi Gui. 8583-8591 [doi]
- Rectification Reimagined: A Unified Mamba Model for Image Correction and Rectangling with PromptsLinwei Qiu, Gongzhe Li, Xiaozhe Zhang, Qi Sun 0001, Fengying Xie. 8592-8601 [doi]
- DualScope: Capturing Critical Spatial and Temporal Cues for Distracted Driving Activity RecognitionZhijie Qiu, Shuaibo Li, Laixin Zhang, Xuming Hu, Wei Ma 0008. 8602-8611 [doi]
- EmoVid: A Multimodal Emotion Video Dataset for Emotion-Centric Video Understanding and GenerationZongyang Qiu, Bingyuan Wang, Xingbei Chen, Yingqing He, Zeyu Wang 0003. 8612-8620 [doi]
- TextShield-R1: Reinforced Reasoning for Tampered Text DetectionChenfan Qu, Yiwu Zhong, Jian Liu, Xuekang Zhu, Bohan Yu, Lianwen Jin. 8621-8629 [doi]
- Breaking Measurement Barriers: From Compressed Sensing to Deep ReconstructionGang Qu 0005, Ping Wang 0029, Siming Zheng, Xin Yuan 0002. 8631-8639 [doi]
- RL-U2Net: A Dual-Branch UNet with Reinforcement Learning-Assisted Multimodal Feature Fusion for Accurate 3D Whole-Heart SegmentationJierui Qu, Jianchun Zhao. 8640-8648 [doi]
- SegDINO3D: 3D Instance Segmentation Empowered by Both Image-Level and Object-Level 2D FeaturesJinyuan Qu, Hongyang Li 0003, Xingyu Chen, Shilong Liu 0004, Yukai Shi, Tianhe Ren, Ruitao Jing, Lei Zhang 0001. 8649-8658 [doi]
- CloudMamba: Grouped Selective State Spaces for Point Cloud AnalysisKanglin Qu, Pan Gao 0001, Qun Dai, Zhanzhi Ye, Rui Ye 0003, Yuanhao Sun. 8659-8667 [doi]
- Robust Single-Stage Fully Sparse 3D Object Detection via Detachable Latent DiffusionWentao Qu, Guofeng Mei, Jing Wang 0201, Yujiao Wu, Xiaoshui Huang, Liang Xiao 0001. 8668-8676 [doi]
- DeOcc-1-to-3: 3D De-Occlusion from a Single Image via Self-Supervised Multi-View DiffusionYansong Qu, Shaohui Dai, Xinyang Li, Yuze Wang 0006, You Shen, Shengchuan Zhang, Liujuan Cao. 8677-8685 [doi]
- Transformer with Controlled Attention for Synchronous Motion CaptioningKarim Radouane, Sylvie Ranwez, Julien Lagarde, Andon Tchechmedjiev. 8686-8693 [doi]
- FlashKAT: Understanding and Addressing Performance Bottlenecks in the Kolmogorov-Arnold TransformerMatthew Raffel, Lizhong Chen. 8694-8702 [doi]
- Learning Latent Imaging Biomarkers for Interpretable Microvascular Invasion Prediction in Hepatocellular CarcinomaJi Rao, Xinyu Liu, Yong Yi, Ying Xiao, Ye Luo. 8703-8711 [doi]
- Masked Clustering Prediction for Unsupervised Point Cloud Pre-trainingBin Ren 0005, Xiaoshui Huang, Mengyuan Liu 0001, Hong Liu 0008, Fabio Poiesi, Nicu Sebe, Guofeng Mei. 8712-8720 [doi]
- Mitigating Negative Flips via Margin Preserving TrainingSimone Ricci, Niccolò Biondi, Federico Pernici, Alberto Del Bimbo. 8721-8730 [doi]
- ImageSet2Text: Describing Sets of Images Through TextPiera Riccio, Francesco Galati, Kajetan Schweighofer, Noa Garcia, Nuria Oliver. 8731-8739 [doi]
- CART: Compositional AutoRegressive Transformer for Image GenerationSiddharth Roheda, Rohit Chowdhury, Aniruddha Bala, Rohan Jaiswal. 8740-8750 [doi]
- RS2-SAM2: Customized SAM2 for Referring Remote Sensing Image SegmentationFu Rong, Meng Lan, Qian Zhang 0009, Lefei Zhang. 8751-8759 [doi]
- MME-SCI: A Comprehensive and Challenging Science Benchmark for Multimodal Large Language ModelsJiacheng Ruan, Dan Jiang, Xian Gao, Ting Liu 0016, Yuzhuo Fu, Yangyang Kang. 8760-8768 [doi]
- GIIM: Graph-based Learning of Inter- and Intra-view Dependencies for Multi-view Medical Image DiagnosisTran Bao Sam, Hung Vu, Trung-Kien Dao, Tran Dat Dang, Van Ha Tang, Steven Q. H. Truong. 8769-8777 [doi]
- CoCoLIT: ControlNet-Conditioned Latent Image Translation for MRI to Amyloid PET SynthesisAlec Sargood, Lemuel Puglisi, James H. Cole, Neil P. Oxtoby, Daniele Ravì, Daniel C. Alexander. 8778-8786 [doi]
- Video Camera Trajectory Editing with Generative Rendering from Estimated GeometryJunyoung Seo, Jisang Han, Jaewoo Jung, Siyoon Jin, Joungbin Lee, Takuya Narihira, Kazumi Fukuda, Takashi Shibuya 0001, Donghoon Ahn, Shoukang Hu, Seungryong Kim, Yuki Mitsufuji. 8787-8795 [doi]
- Walk Before You Dance: High-fidelity and Editable Dance Synthesis via Generative Masked Motion PriorForam Niravbhai Shah, Parshwa Shah, Muhammad Usama Saleem, Ekkasit Pinyoanuntapong, Pu Wang 0001, Hongfei Xue, Ahmed Helmy. 8796-8804 [doi]
- NeuS-QA: Grounding Long-Form Video Understanding in Temporal Logic and Neuro-Symbolic ReasoningSahil Shah, S. P. Sharan, Harsh Goel, Minkyu Choi 0001, Mustafa Munir, Manvik Pasula, Radu Marculescu, Sandeep Chinchali. 8805-8813 [doi]
- DiA-gnostic VLVAE: Disentangled Alignment-Constrained Vision Language Variational AutoEncoder for Robust Radiology Reporting with Missing ModalitiesNagur Shareef Shaik, Teja Krishna Cherukuri, Adnan Masood, Dong Hye Ye. 8814-8823 [doi]
- 2D Gaussians Spatial Transport for Point-supervised Density RegressionMiao Shang, Xiaopeng Hong. 8824-8832 [doi]
- LSAP-PV: High-Fidelity Palm Vein Image Synthesis via Layered Spectral Absorption Projection-Guided Diffusion ModelSheng Shang, Chenglong Zhao, Ruixin Zhang, Jianlong Jin, Jingyun Zhang, Jun Wang, Yang Zhao 0002, Shouhong Ding, Wei Jia 0001. 8833-8841 [doi]
- FineTec: Fine-Grained Action Recognition Under Temporal Corruption via Skeleton Decomposition and Sequence CompletionDian Shao, Mingfei Shi, Like Liu. 8842-8850 [doi]
- vMFCoOp: Towards Equilibrium on a Unified Hyperspherical Manifold for Prompting Biomedical VLMsMinye Shao, Sihan Guo, Xinrun Li, Xingyu Miao, Haoran Duan 0001, Yang Long 0001. 8851-8859 [doi]
- ICM-Fusion: In-Context Meta-Optimized LoRA Fusion for Multi-Task AdaptationYihua Shao, Xiaofeng Lin, Xinwei Long, Siyu Chen 0021, Minxi Yan, Yang Liu 0360, Ziyang Yan, Ao Ma, Hao Tang 0005, Jingcai Guo. 8860-8868 [doi]
- TR-DQ: Time-Rotation Diffusion QuantizationYihua Shao, Deyang Lin, Minxi Yan, Siyu Chen 0021, Fanhu Zeng, Minwen Liao, Ao Ma, Ziyang Yan, Haozhe Wang 0002, Yan Wang 0068, Zhi Chen 0010, Xiaofeng Cao 0002, Haotong Qin, Hao Tang 0005, Jingcai Guo. 8869-8877 [doi]
- PromptMoE: Generalizable Zero-Shot Anomaly Detection via Visually-Guided Prompt MixturesYuheng Shao, Lizhang Wang, Changhao Li, Peixian Chen, Qinyuan Liu. 8878-8886 [doi]
- Group Orthogonal Low-Rank Adaptation for RGB-T TrackingZekai Shao 0002, Yufan Hu, Jingyuan Liu, Bin Fan 0001, Hongmin Liu 0001. 8887-8895 [doi]
- TarPro: Targeted Protection Against Malicious Image EditingKaixin Shen, Ruijie Quan, Jiaxu Miao, Jun Xiao 0001. 8896-8904 [doi]
- FineXtrol: Controllable Motion Generation via Fine-Grained TextKeming Shen, Bizhu Wu, Junliang Chen 0002, Xiaoqin Wang, LinLin Shen. 8905-8913 [doi]
- Fine-grained Image Quality Assessment for Perceptual Image RestorationXiangfei Sheng, Xiaofeng Pan, Zhichao Yang 0013, Pengfei Chen 0003, Leida Li. 8914-8922 [doi]
- Edge Consistency for 4D Gaussian Splatting in Dynamic Scene RenderingBoya Shi, Thomas N. Guan, Xiaodong Yi 0002. 8923-8932 [doi]
- Sparse4DGS: 4D Gaussian Splatting for Sparse-Frame Dynamic Scene ReconstructionChangyue Shi, Chuxiao Yang, Xinyuan Hu, Minghao Chen, Wenwen Pan 0003, Yan Yang, Jiajun Ding, Zhou Yu 0001, Jun Yu 0002. 8933-8941 [doi]
- SGMHand: Structure-Guided Modulation for Structure-Aware Hand InpaintingChuancheng Shi, Shiming Guo, Ke Shui, Yixiang Chen, Fei Shen. 8942-8950 [doi]
- WaveC2R: Wavelet-Driven Coarse-to-Refined Hierarchical Learning for Radar RetrievalChunlei Shi, Han Xu, Yinghao Li, Yi-Lin Wei, Yongchao Feng, YeCheng Zhang, Dan Niu. 8951-8959 [doi]
- TrackGS: Optimizing COLMAP-Free 3D Gaussian Splatting with Global Track ConstraintsDongbo Shi, Shen Cao, Lubin Fan, Bojian Wu, Jinhui Guo, Ligang Liu 0001, Renjie Chen 0001. 8960-8968 [doi]
- SpatialActor: Exploring Disentangled Spatial Representations for Robust Robotic ManipulationHao Shi, Bin Xie, Yingfei Liu, Yang Yue, Tiancai Wang, Haoqiang Fan, Xiangyu Zhang 0005, Gao Huang 0001. 8969-8977 [doi]
- Exploring Reliable Spatiotemporal Dependencies for Efficient Visual TrackingJunze Shi, Yang Yu, Jian Shi, Haibo Luo. 8978-8987 [doi]
- DP-GenG: Differentially Private Dataset Distillation Guided by DP-Generated DataShuo Shi, Jinghuai Zhang, Shijie Jiang, Chunyi Zhou 0001, Yuyuan Li 0001, Mengying Zhu, Yangyang Wu, Tianyu Du. 8988-8996 [doi]
- GeoBayes: Probabilistic Image Geo-Localization Inference via Sequential Bayesian UpdatingWeimin Shi, Xiang Li, Kaige Li, Junhao Fang, Qiang Zhou, Qichuan Geng, Zhong Zhou. 8997-9005 [doi]
- Causality Matters: How Temporal Information Emerges in Video Language ModelsYumeng Shi, Quanyu Long, Yin Wu 0001, Wenya Wang 0001. 9006-9014 [doi]
- Auxiliary Gene Learning: Spatial Gene Expression Estimation by Auxiliary Gene SelectionKaito Shiku, Kazuya Nishimura, Shinnosuke Matsuo, Yasuhiro Kojima, Ryoma Bise. 9015-9023 [doi]
- FastAnimate: Towards Learnable Template Construction and Pose Deformation for Fast 3D Human Avatar AnimationJian Shu 0001, Nanjie Yao, Gangjian Zhang, Junlong Ren, Yu Feng, Hao Wang 0094. 9024-9032 [doi]
- Free-Form Scene Editor: Enabling Multi-Round Object Manipulation Like in a 3D EngineXincheng Shuai, Zhenyuan Qin, Henghui Ding, Dacheng Tao. 9033-9041 [doi]
- Towards Effective and Efficient Context-aware Nucleus Detection in Histopathology Whole Slide ImagesZhongyi Shui, Honglin Li 0001, YunLong Zhang, Yuxuan Sun 0002, Yiwen Ye, Pingyi Chen, Ruizhe Guo, Lei Cui 0004, Chenglu Zhu, Lin Yang 0002. 9042-9050 [doi]
- T-LoRA: Single Image Diffusion Model Customization Without OverfittingVera Soboleva, Aibek Alanov, Andrey Kuznetsov, Konstantin Sobolev. 9051-9059 [doi]
- KTV: Keyframes and Key Tokens Selection for Efficient Training-Free Video LLMsBaiyang Song, Jun Peng 0007, Yuxin Zhang 0002, Guangyao Chen, Feidiao Yang, Jianyuan Guo. 9060-9068 [doi]
- Doubly Debiased Test-Time Prompt Tuning for Vision-Language ModelsFei Song, Yi Li, Rui Wang 0079, Jiahuan Zhou, Changwen Zheng, Jiangmeng Li. 9069-9078 [doi]
- Sim4Seg: Boosting Multimodal Multi-disease Medical Diagnosis Segmentation with Region-Aware Vision-Language Similarity MasksLingran Song, Yucheng Zhou 0001, Jianbing Shen. 9079-9087 [doi]
- Creating Blank Canvas Against AI-enabled Image ForgeryQi Song 0003, Ziyuan Luo, Renjie Wan. 9088-9096 [doi]
- Insert Anything: Image Insertion via In-Context Editing in DiTWensong Song, Hong Jiang, Zongxing Yang, Zheqiao Cheng, Ruijie Quan, Yi Yang 0001. 9097-9105 [doi]
- CLUENet: Cluster Attention Makes Neural Networks Have EyesXiangshuai Song, Jun-Jie Huang, Tianrui Liu 0001, Ke Liang, Chang Tang. 9106-9115 [doi]
- UniAlignment: Semantic Alignment for Unified Image Generation, Understanding, Manipulation and PerceptionXinyang Song, Libin Wang, Weining Wang 0001, Shaozhen Liu, Dandan Zheng, Jingdong Chen, Qi Li 0005, Zhenan Sun. 9116-9126 [doi]
- Object Fusion via Diffusion Time-step for Customized Image Editing with Single ExampleXue Song, Zhongqi Yue, Jiequan Cui, Hanwang Zhang, Jingjing Chen 0001. 9127-9134 [doi]
- UniMM-V2X: MoE-Enhanced Multi-Level Fusion for End-to-End Cooperative Autonomous DrivingZiyi Song, Chen Xia, Chenbing Wang, Haibao Yu, Sheng Zhou 0001, Zhisheng Niu. 9135-9143 [doi]
- DiffNR: Diffusion-Enhanced Neural Representation Optimization for Sparse-View 3D Tomographic ReconstructionShiyan Su, Ruyi Zha, Danli Shi, Hongdong Li, Xuelian Cheng. 9144-9152 [doi]
- Zero-to-Hero: Empowering Video Appearance Transfer with Zero-Shot Initialization and Holistic RestorationTongtong Su, Chengyu Wang 0001, Haipeng Liao, Jun Huang 0007, Dongming Lu. 9153-9161 [doi]
- Weather-Robust LiDAR Perception: Point Cloud Restoration from Adverse WeatherChenghao Sun, Pengpeng Sun, Xiangmo Zhao. 9162-9170 [doi]
- AlignTrack: Top-Down Spatiotemporal Resolution Alignment for RGB-Event Visual TrackingChuanyu Sun, Jiqing Zhang, Yang Wang 0106, Yuanchen Wang, Yutong Jiang, Baocai Yin, Xin Yang 0011. 9171-9179 [doi]
- HumanPro: Single-view 3D Clothed Human Reconstruction with Progressive Normal GuidanceJianchi Sun, Fei Luo 0004, Wenzhuo Fan, Yu Jiang 0007, Chunxia Xiao. 9180-9188 [doi]
- VaccineRAG: Boosting Multimodal Large Language Models' Immunity to Harmful RAG SamplesQixin Sun, Ziqin Wang, Hengyuan Zhao, Yilin Li, Kaiyou Song, Si Liu 0001, Xiaolin Hu 0001, Qingpei Guo, Linjiang Huang. 9189-9197 [doi]
- SNN-Driven Event-Based Flow and Rotation Estimation with SO(3) RefinementRuimin Sun, Haoran Xu, De Ma. 9198-9205 [doi]
- CoMA: Compositional Human Motion Generation with Multi-modal AgentsShanlin Sun, Jiaqi Xu, Gabriel de Araujo, Shenghan Zhou, Hanwen Zhang, Ziheng Huang 0001, Chenyu You, Xiaohui Xie. 9206-9214 [doi]
- Bridging Granularity Gaps: Hierarchical Semantic Learning for Cross-domain Few-shot SegmentationSujun Sun, Haowen Gu, Cheng Xie, Yanxu Ren, Mingwu Ren, Haofeng Zhang 0001. 9215-9223 [doi]
- Small but Mighty: Dynamic Wavelet Expert-Guided Fine-Tuning of Large-Scale Models for Optical Remote Sensing Object SegmentationYanguang Sun, Chao Wang, Jian Yang 0003, Lei Luo 0001. 9224-9232 [doi]
- SwiftVideo: A Unified Framework for Few-Step Video Generation Through Trajectory-Distribution AlignmentYanxiao Sun, Jiafu Wu, Yun Cao 0002, Chengming Xu 0001, Yabiao Wang, Weijian Cao, Donghao Luo 0001, Chengjie Wang 0001, Yanwei Fu 0001. 9233-9241 [doi]
- WDT-MD: Wavelet Diffusion Transformers for Microaneurysm Detection in Fundus ImagesYifei Sun 0005, Yuzhi He, Junhao Jia, Jinhong Wang, Ruiquan Ge, Changmiao Wang, Hongxia Xu. 9242-9250 [doi]
- SmartSight: Mitigating Hallucination in Video-LLMs Without Compromising Video Understanding via Temporal Attention CollapseYiming Sun 0006, Mi Zhang 0001, Feifei Li, Geng Hong, Min Yang 0002. 9251-9259 [doi]
- CtrlFuse: Mask-Prompt Guided Controllable Infrared and Visible Image FusionYiming Sun 0003, Yuan Ruan, Qinghua Hu, Pengfei Zhu 0001. 9260-9268 [doi]
- EmbryoDiff: A Conditional Diffusion Framework with Multi-Focal Feature Fusion for Fine-Grained Embryo Developmental Stage RecognitionYong Sun, Zhengjie Zhang, Junyu Shi, Zhiyuan Zhang, Lijiang Liu, Qiang Nie. 9269-9277 [doi]
- Difference Vector Equalization for Robust Fine-tuning of Vision-Language ModelsSatoshi Suzuki, Shin'ya Yamaguchi, Shoichiro Takeda, Taiga Yamane, Naoki Makishima, Naotaka Kawata, Mana Ihori, Tomohiro Tanaka, Shota Orihashi, Ryo Masumura. 9278-9286 [doi]
- Geometry Meets Light: Leveraging Geometric Priors for Universal Photometric Stereo Under Limited Multi-Illumination CuesKing-Man Tam, Satoshi Ikehata, Yuta Asano, Zhaoyi An, Rei Kawakami. 9287-9295 [doi]
- Adapt-As-You-Walk Through the Clouds: Training-Free Online Test-Time Adaptation of 3D Vision-Language Foundation ModelsMehran Tamjidi, Hamidreza Dastmalchi, Mohammadreza Alimoradijazi, Ali Cheraghian, Aijun An, Morteza Saberi. 9296-9304 [doi]
- FLAG-4D: Flow-Guided Local-Global Dual-Deformation Model for 4D ReconstructionGuan Yuan Tan, Ngoc-Tuan Vu, Arghya Pal, Sailaja Rajanala, Raphael C.-W. Phan, Mettu Srinivas, Chee-Ming Ting. 9305-9313 [doi]
- Aggregating Diverse Cue Experts for AI-Generated Image DetectionLei Tan, Shuwei Li, Mohan Kankanhalli, Robby T. Tan. 9314-9322 [doi]
- Hybrid-Domain Adaptative Representation Learning for Gaze EstimationQida Tan, Hongyu Yang, Wenchao Du. 9323-9331 [doi]
- PC-CrossDiff: Point-Cluster Dual-Level Cross-Modal Differential Attention for Unified 3D Referring and SegmentationWenbin Tan 0001, Jiawen Lin, Fangyong Wang, Yuan Xie 0006, Yong Xie, Yachao Zhang 0001, Yanyun Qu. 9332-9340 [doi]
- Dual-Seed Evolutionary Algorithm for Noise Optimization in Diffusion ModelsYuzheng Tan, Yuan He 0011, Yao Zhu 0003, Tianlin Huo, Huanqian Yan, Hang Su 0006, Shuxin Zhang, Guangneng Hu. 9341-9349 [doi]
- Meta-Guided Sample Reweighting for Robust Cross-Modal Hashing Retrieval with Noisy LabelsZiang Tan, Weitao An, Erkun Yang. 9350-9358 [doi]
- PGMamba: A Physical Model-Guided Global Mamba for Underwater Image EnhancementZijun Tan, Chuan Fu, Tan Guo, Zhixiong Nan, Pengzhan Zhou, Xinggan Peng, Fulin Luo. 9359-9367 [doi]
- TSPO: Temporal Sampling Policy Optimization for Long-form Video Language UnderstandingCanhui Tang, Zifan Han, Hongbo Sun, Sanping Zhou, Xuchong Zhang, Xin Wei, Ye Yuan, Huayu Zhang, Jinglin Xu, Hao Sun. 9368-9376 [doi]
- Neural Video Compression with Reference HierarchyChuanbo Tang, Zhuoyuan Li 0001, Li Li 0040, Dong Liu 0002, Feng Wu 0001. 9377-9385 [doi]
- Heterogeneous Uncertainty-Guided Composed Image Retrieval with Fine-Grained Probabilistic LearningHaomiao Tang, Jinpeng Wang 0002, Minyi Zhao, Guanghao Meng, Ruisheng Luo, Long Chen 0016, Shu-Tao Xia. 9386-9394 [doi]
- Video Spatial Reasoning with Object-Centric 3D RolloutHaoran Tang, Meng Cao 0002, Ruyang Liu, Xiaoxi Liang, Linglong Li, Ge Li 0002, Xiaodan Liang. 9395-9403 [doi]
- Decompose and Conquer: Compositional Reasoning for Zero-Shot Temporal Action LocalizationHaoyu Tang 0002, Tianyuan Liang, Han Jiang 0012, Xuesong Liu, Qinghai Zheng, Yupeng Hu 0003. 9404-9412 [doi]
- Decoupling Scene Perception and Ego Status: A Multi-Context Fusion Approach for Enhanced Generalization in End-to-End Autonomous DrivingJiacheng Tang, Mingyue Feng, Jiachao Liu, YaoNong Wang, Jian Pu. 9413-9420 [doi]
- Robust-R1: Degradation-Aware Reasoning for Robust Visual UnderstandingJiaqi Tang 0005, Jianmin Chen, Wei Wei, Xiaogang Xu 0002, Runtao Liu, Xiangyu Wu, Qipeng Xie, Jiafei Wu, Lei Zhang 0001, Qifeng Chen 0001. 9421-9429 [doi]
- Less Is More: Sparse and Cooperative Perturbation for Point Cloud AttacksKeke Tang, Tianyu Hao, Xiaofei Wang, Weilong Peng, Denghui Zhang 0001, Peican Zhu, Zhihong Tian. 9430-9438 [doi]
- Not All Tokens and Heads Are Equally Important: Dual-Level Attention Intervention for Hallucination MitigationLexiang Tang, Xianwei Zhuang, Bang Yang, Zhiyuan Hu, Hongxiang Li, Lu Ma, Jinghan Ru, Yuexian Zou. 9439-9447 [doi]
- Diffusion Once and Done: Degradation-Aware LoRA for All-in-One Image RestorationNi Tang, Xiaotong Luo, Zihan Cheng, Liangtai Zhou, Dongxiao Zhang, Yanyun Qu. 9448-9456 [doi]
- Learning Underwater Image Enhancement Iteratively Without Reference ImagesYi Tang 0008, Hiroshi Kawasaki, Takafumi Iwaguchi, Yuhang Zhang, Hiroshi Masui. 9457-9465 [doi]
- Manipulation Intention Understanding for Zero-Shot Composed Image RetrievalYuanmin Tang, Jing Yu 0007, Keke Gai, Gang Xiong 0001, Gaopeng Gou, Meikang Qiu, Qi Wu 0001. 9466-9474 [doi]
- Revisiting MLLM Based Image Quality Assessment: Errors and RemedyZhenchen Tang, Songlin Yang, Bo Peng, Zichuan Wang, Jing Dong. 9475-9483 [doi]
- SNS-Grasp: Semantic-guided Noise Scaling for Grasp GenerationZhenhua Tang, Yudian Zheng, Yuzhang Zhong, Haolun Li 0001, Yanbin Hao, Chi-Man Pun. 9484-9492 [doi]
- NeuralGS: Bridging Neural Fields and 3D Gaussian Splatting for Compact 3D RepresentationsZhenyu Tang 0004, Chaoran Feng 0001, Xinhua Cheng, Wangbo Yu, Junwu Zhang, Yuan Liu 0025, Xiaoxiao Long, Wenping Wang 0001, Li Yuan 0007. 9493-9501 [doi]
- Mimic-X: A Large-Scale Motion Dataset via Fast Physics-Based Controller AdaptationHongyu Tao, Shuaiying Hou, Junheng Fang, Mingyao Shi, Weiwei Xu 0003. 9502-9510 [doi]
- Prompting Adversarial Transferability via Path Flatness AttackZeze Tao, Jinjia Peng, Huibing Wang. 9511-9519 [doi]
- Distillation Dynamics: Towards Understanding Feature-Based Distillation in Vision TransformersHuiyuan Tian, Bonan Xu, Shijian Li. 9520-9528 [doi]
- Unsupervised Motion-Compensated Decomposition for Cardiac MRI Reconstruction via Neural RepresentationXuanyu Tian, Lixuan Chen, Qing Wu 0001, Xiao Wang, Jie Feng 0013, Yuyao Zhang, Hongjiang Wei. 9529-9537 [doi]
- CoGrad3D: Spatially-Coupled Timestep Optimization with Orthogonal Gradient Fusion for 3D GenerationHaoyang Tong, Hongbo Wang, Jin Liu 0040, Qi Wang, Jie Cao 0002, Ran He 0001. 9539-9547 [doi]
- Towards Accurate 3D Object Detection in Adverse Weather by Leveraging 4D Radar for LiDAR Geometry EnhancementTianxu Tong, Xinrun Liu, Hongmin Liu 0001, Bin Fan 0001. 9548-9556 [doi]
- CADiff: Context-Aware Diffusion for Controllable Anomaly Generation in Anomaly DetectionXuan Tong, Yuxuan Lin 0001, Junxiong Lin, Xinji Mai, Haoran Wang 0006, Zeng Tao, Yang Yao, Ruofan Wang, Wenqiang Zhang. 9557-9565 [doi]
- FedSDA: Federated Stain Distribution Alignment for Non-IID Histopathological Image ClassificationCheng-Chang Tsai, Kai-Wen Cheng, Chun-Shien Lu. 9566-9575 [doi]
- Mitigating Low-Quality Reasoning in MLLMs: Self-Driven Refined Multimodal CoT with Selective Thinking and Step-wise Visual EnhancementChongjun Tu, Peng Ye 0006, Dongzhan Zhou, Tao Chen 0003, Wanli Ouyang. 9576-9584 [doi]
- Mass Concept Erasure in Diffusion Models with Concept HierarchyJiahang Tu, Ye Li 0043, Yiming Wu, Hanbin Zhao, Chao Zhang 0029, Hui Qian 0001. 9585-9593 [doi]
- DiLO: Disentangled Latent Optimization for Learning Shape and Deformation in Grouped Deforming 3D ObjectsMostofa Rafid Uddin, Jana Armouti, Umong Sain, Md. Asib Rahman, Xingjian Li 0002, Min Xu 0009. 9594-9602 [doi]
- NURBGen: High-Fidelity Text-to-CAD Generation Through LLM-Driven NURBS ModelingMuhammad Usama, Mohammad Sadil Khan, Didier Stricker, Muhammad Zeshan Afzal. 9603-9611 [doi]
- Guideline-Consistent Segmentation via Multi-Agent RefinementVanshika Vats, Ashwani Rathee, James Davis 0001. 9612-9620 [doi]
- Certified but Fooled! Breaking Certified Defenses with Ghost CertificatesViet Quoc Vo, Tashreque Mohammed Haq, Paul Montague, Tamas Abraham, Ehsan Abbasnejad, Damith C. Ranasinghe. 9621-9629 [doi]
- GUSLO: General and Unified Structured Light OptimizationTinglei Wan, Zhongjie Wang 0003, Tonghua Su. 9630-9638 [doi]
- Tuning Medical Foundation Models for Inner Ear Temporal CT Analysis with Plug-and-play Domain Knowledge AggregatorWeixun Wan, Xinyang Jiang, Zilong Wang 0006, Bei Li, Cairong Zhao. 9639-9647 [doi]
- Diffusion-based Personalized Pathology Disentanglement for Impaired Gait AnalysisXiaoyue Wan, Xu Zhao 0001. 9648-9656 [doi]
- Perceptual Quality Assessment of 3D Gaussian Splatting: A Subjective Dataset and Prediction MetricZhaolin Wan, Yining Diao, Jingqi Xu, Hao Wang 0073, Zhiyang Li 0001, Xiaopeng Fan 0001, Wangmeng Zuo, Debin Zhao. 9657-9665 [doi]
- Biologically-Inspired Evolutionary Domain Symbiosis for Few-shot and Zero-shot Point Cloud Semantic SegmentationChangshuo Wang 0001, Zhijian Hu, Xiang Fang, Zaiyang Yu, Yibin Wu, Mingkun Xu, Yusong Wang 0003, Xingyu Gao 0001, Prayag Tiwari. 9666-9674 [doi]
- AEDR: Training-Free AI-Generated Image Attribution via Autoencoder Double-ReconstructionChao Wang, Zijin Yang, Yaofei Wang, Weiming Zhang 0001, Kejiang Chen. 9675-9683 [doi]
- SGS-3D: High-Fidelity 3D Instance Segmentation via Reliable Semantic Mask Splitting and GrowingChaolei Wang, Yang Luo 0002, Jing Du, Siyu Chen 0004, Yiping Chen, Ting Han 0001. 9684-9692 [doi]
- S²Flow: Towards Fast and Authentic Training-Free High-Resolution Video GenerationChaoqun Wang 0011, Shaobo Min, Xu Yang 0019. 9693-9701 [doi]
- Axis-Aligned Document DewarpingChaoyun Wang, I-Chao Shen, Takeo Igarashi, Caigui Jiang. 9702-9710 [doi]
- Self-NPO: Data-Free Diffusion Model Enhancement via Truncated Diffusion Fine-TuningFu-Yun Wang, Keqiang Sun, Yao Teng, Xihui Liu, Jiale Yuan, Jiaming Song, Hongsheng Li 0001. 9711-9719 [doi]
- 3One2: One-Step Regression plus One-Step Diffusion for One-Hot Modulation in Dual-Path Video Snapshot Compressive ImagingGe Wang, Xing Liu, Xin Yuan. 9720-9728 [doi]
- UniMo: Unified Motion Generation and Understanding with Chain of ThoughtGuocun Wang, Kenkun Liu, Jing Lin, Guorui Song, Jian Li, Xiaoguang Han 0001. 9729-9737 [doi]
- Affordance-R1: Reinforcement Learning for Generalizable Affordance Reasoning in Multimodal Large Language ModelsHanqing Wang, Shaoyang Wang, Yiming Zhong 0001, Zemin Yang, Jiamin Wang, Zhiqing Cui, Jiahao Yuan, Yifan Han, Mingyu Liu, Yuexin Ma. 9738-9746 [doi]
- VIR-Bench: Evaluating Geospatial and Temporal Understanding of MLLMs via Travel Video Itinerary ReconstructionHao Wang, Eiki Murata, Lingfang Zhang, Ayako Sato, So Fukuda, Ziqi Yin, Wentao Hu, Keisuke Nakao, Yusuke Nakamura, Sebastian Zwirner, Yi-Chia Chen, Hiroyuki Otomo, Hiroki Ouchi, Daisuke Kawahara. 9747-9756 [doi]
- SOMA: Feature Gradient Enhanced Affine-Flow Matching for SAR-Optical RegistrationHaodong Wang, Tao Zhuo, Xiuwei Zhang 0001, Hanlin Yin, Wencong Wu, Yanning Zhang 0001. 9757-9765 [doi]
- READ: Real-time and Efficient Asynchronous Diffusion for Audio-driven Talking Head GenerationHaotian Wang, Yuzhe Weng, Jun Du 0002, Haoran Xu, Xiaoyan Wu, Shan He, Bing Yin, Cong Liu 0006, Jianqing Gao, Qingfeng Liu. 9766-9774 [doi]
- JoDiffusion: Jointly Diffusing Image with Pixel-Level Annotations for Semantic Segmentation PromotionHaoyu Wang 0016, Lei Zhang 0054, Wenrui Liu, Dengyang Jiang, Wei Wei 0008, Chen Ding 0002. 9775-9783 [doi]
- RTGaze: Real-Time 3D-Aware Gaze Redirection from a Single ImageHengfei Wang, Zhongqun Zhang, Yihua Cheng, Hyung Jin Chang. 9784-9792 [doi]
- Temporal-Consistent Video Restoration with Pre-trained Diffusion ModelsHengkang Wang, Yang Liu, Huidong Liu, Chien-Chih Wang, Yanhui Guo, Hongdong Li, Bryan Wang, Ju Sun. 9793-9801 [doi]
- HarmoQ: Harmonized Post-Training Quantization for High-Fidelity Image Super-ResolutionHongjun Wang 0007, Jiyuan Chen, Xuan Song 0001, Yinqiang Zheng. 9802-9810 [doi]
- FaceShield: Explainable Face Anti-Spoofing with Multimodal Large Language ModelsHongyang Wang 0001, Yichen Shi, Zhuofu Tao, Yuhao Gao, Liepiao Zhang, Xun Lin, Jun Feng, Xiaochen Yuan, Zitong Yu, Xiaochun Cao. 9811-9819 [doi]
- MoEA-Net: Modality-Incremental Expert Aggregation Network for Retinal Prognostic PredictionHua Wang, Xiaodan Zhang 0003, Yanzhao Shi, Chengxin Zheng, Wanyu Zhang, Zhen Wang, Jianing Wang, Xiaobing Yu. 9820-9828 [doi]
- Multi-Window Gabor Transform Network for Ground Penetrating Radar B-Scan Image ReconstructionHuabin Wang, Yu Yang, Xinran Zhong, Zilong Ling. 9829-9837 [doi]
- Towards Zero-Shot Diabetic Retinopathy Grading: Learning Generalized Knowledge via Prompt-Driven Matching and EmulatingHuan Wang, Haoran Li 0024, Yuxin Lin, Huaming Chen, Jun Yan 0005, Lijuan Wang, Jiahua Shi, QiHao Xu, Yongting Hu, Yong Xu 0001, Jun Shen 0001. 9838-9846 [doi]
- Semantic Feature Purification for Adversarially-Aware RGB-T TrackingJiahao Wang 0002, Fang Liu 0001, Hao Wang 0211, Shuo Li 0010, Xinyi Wang, Puhua Chen. 9847-9855 [doi]
- HTTrack: Learning to Perceive Targets via Historical Trajectories in Satellite Video TrackingJiahao Wang 0002, Fang Liu 0001, Licheng Jiao, Hao Wang 0211, Shuo Li 0010, Xinyi Wang, Lingling Li 0002, Puhua Chen, Xu Liu 0006. 9856-9866 [doi]
- Griffin: Aerial-Ground Cooperative Detection and Tracking Dataset and BenchmarkJiahao Wang 0005, Xiangyu Cao, Jiaru Zhong, Yuner Zhang, Zeyu Han, Haibao Yu, Chuang Zhang, Lei He, Shaobing Xu, Jianqiang Wang 0003. 9867-9875 [doi]
- SparseCoop: Cooperative Perception with Kinematic-Grounded QueriesJiahao Wang 0005, Zhongwei Jiang, Wenchao Sun, Jiaru Zhong, Haibao Yu, Yuner Zhang, Chenyang Lu 0011, Chuang Zhang, Lei He, Shaobing Xu, Jianqiang Wang 0003. 9876-9884 [doi]
- EPSegFZ: Efficient Point Cloud Semantic Segmentation for Few- and Zero-Shot Scenarios with Language GuidanceJiahui Wang, Haiyue Zhu, Haoren Guo, Abdullah-Al Mamun 0002, Cheng Xiang 0001, Tong Heng Lee. 9885-9893 [doi]
- Monte Carlo Diffusion for Generalizable Learning-Based RANSACJiale Wang, Chen Zhao 0025, Wei Ke 0003, Tong Zhang 0023. 9894-9902 [doi]
- Invisible Triggers, Visible Threats! Road-Style Adversarial Creation Attack for Visual 3D Detection in Autonomous DrivingJian Wang 0113, Lijun He 0001, Yixing Yong, Haixia Bi, Fan Li 0003. 9903-9911 [doi]
- SpaceVLLM: Endowing Multimodal Large Language Model with Spatio-Temporal Video Grounding CapabilityJiankang Wang, Zhihan Zhang, Zhihang Liu, Yang Li, Jiannan Ge, Hongtao Xie, Yongdong Zhang 0001. 9912-9920 [doi]
- SD-PSFNet: Sequential and Dynamic Point Spread Function Network for Image DerainingJiayu Wang, Haoyu Bian, Haoran Sun, Shaoning Zeng. 9921-9929 [doi]
- SEA-PACE: Semi-Supervised Underwater Image Enhancement via Gaussian Process-Assisted Self-Paced LearningJingyang Wang, Hengyue Bi, Jingchao Cao, Feng Gao 0005, Junyu Dong. 9930-9938 [doi]
- Taming Cascaded Mixture-of-Experts for Modality-missing Multi-modal Salient Object DetectionKunpeng Wang 0005, Feifan Sun, Keke Chen. 9939-9947 [doi]
- Temporal and Spatial Representation Learning for Multimodal Low-Beam 3D Object DetectionLin Wang 0053, Shiliang Sun, Jing Zhao 0015. 9948-9956 [doi]
- PointChain: Learning Generalizable Point Cloud Representations via Structural Chain ModelingLuyao Wang, Chuxin Wang, Qiao Li, Tianzhu Zhang 0001. 9957-9965 [doi]
- FantasyTalking2: Timestep-Layer Adaptive Preference Optimization for Audio-Driven Portrait AnimationMengchao Wang, Qiang Wang, Fan Jiang, Mu Xu. 9966-9974 [doi]
- Modality-Aware Bias Mitigation and Invariance Learning for Unsupervised Visible-Infrared Person Re-IdentificationMenglin Wang 0001, Xiaojin Gong, Jiachen Li, Genlin Ji. 9975-9983 [doi]
- PortraitSR: Artist-Inspired Prior Learning for Progressive Face Super-ResolutionMiaoqing Wang, Jiaxu Leng, Shuang Li, Changjiang Kuang, Long Sun. 9984-9992 [doi]
- MotionPhysics: Learnable Motion Distillation for Text-Guided SimulationMiaowei Wang, Jakub Zadrozny, Oisin Mac Aodha, Amir Vaxman. 9993-10001 [doi]
- CAST-LUT: Tokenizer-Guided HSV Look-Up Tables for Purple Flare RemovalPu Wang 0008, Shuning Sun, Jialang Lu, Chen Wu 0006, Zhihua Zhang, Youshan Zhang, Chenggang Shan, Dianjie Lu, Guijuan Zhang, Zhuoran Zheng. 10002-10010 [doi]
- Adversarial Fair Incomplete Multi-View ClusteringQianqian Wang 0001, Haiming Xu, Wei Feng 0010, Quanxue Gao. 10011-10019 [doi]
- Compression Artifacts Removal for VVC with Frequency Domain Mixture of Experts NetworkQijun Wang, Kang Wang, Jun Wang. 10020-10028 [doi]
- BokehCrafter: Taming Video Diffusion Models for Controllable Bokeh RenderingQiwen Wang, Liao Shen, Jiaqi Li 0007, Tianqi Liu 0003, Huiqiang Sun, Zihao Huang 0001, Yachuan Huang, Xianrui Luo, Zhiguo Cao 0001. 10029-10037 [doi]
- D3-RSMDE: 40× Faster and High-Fidelity Remote Sensing Monocular Depth EstimationRuizhi Wang, Weihan Li, Zunlei Feng, Haofei Zhang, Mingli Song, Jiayu Wang, Jie Song 0011, Li Sun. 10038-10046 [doi]
- PC-Flow: Preference Alignment in Flow Matching via ClassifierShaomeng Wang, He Wang, Longquan Dai, Jinhui Tang 0001. 10047-10055 [doi]
- EC-MVSNet: Enhanced Cascaded Multi-View Stereo with Cross-Scale Relevance IntegrationShaoqian Wang, Jiadai Sun, Bin Fan 0002, Qiang Wang 0023, Bin Lu, Yuchao Dai. 10056-10064 [doi]
- Exploring Modality-Aware Fusion and Decoupled Temporal Propagation for Multi-Modal Object TrackingShilei Wang 0001, Pujian Lai, Dong Gao, Jifeng Ning, Gong Cheng 0003. 10065-10073 [doi]
- MonoDream: Monocular Vision-Language Navigation with Panoramic DreamingShuo Wang 0015, Yongcai Wang, Zhaoxin Fan, Yucheng Wang, Maiyue Chen, Kaihui Wang, Zhizhong Su, Wanting Li, Xudong Cai, Yeying Jin, Deying Li 0001. 10074-10082 [doi]
- ObjecTok: Learning Holistic and Robust Object Tokens for MLLMsSihan Wang, Xiyao Liu 0002, Lianqing Liu, Zhi Han. 10083-10091 [doi]
- PipeDiT: Accelerating Diffusion Transformers in Video Generation with Task Pipelining and Model DecouplingSijie Wang, Qiang Wang 0022, Shaohuai Shi. 10092-10100 [doi]
- CaTFormer: Causal Temporal Transformer with Dynamic Contextual Fusion for Driving Intention PredictionSirui Wang, Zhou Guan, Bingxi Zhao, Tongjia Gu, Jie Liu. 10101-10108 [doi]
- GEWDiff: Geometric Enhanced Wavelet-based Diffusion Model for Hyperspectral Image Super-resolutionSirui Wang, Jiang He, Natàlia Blasco Andreo, Xiao Xiang Zhu 0001. 10109-10117 [doi]
- Slender3D: Curve-Guided Multi-View Reconstruction of Slender StructuresSuqin Wang, Zeyi Wang, Min Shi 0005, Zhaoxin Li, Qi Wang 0111, Xiujuan Chai, Dengming Zhu. 10118-10126 [doi]
- You Only Need One Stage: Novel-View Synthesis from a Single Blind Face ImageTaoyue Wang, Xiang Zhang, Xiaotian Li, Huiyuan Yang, Lijun Yin 0001. 10127-10135 [doi]
- Towards 3D Object-Centric Feature Learning for Semantic Scene CompletionWeihua Wang, Yubo Cui, Xiangru Lin, Zhiheng Li 0003, Zheng Fang 0001. 10136-10144 [doi]
- Debiased Multiplex Tokenizer for Efficient Map-Free Visual RelocalizationWenshuai Wang, Hong Liu 0008, Shengquan Li, Peifeng Jiang, Runwei Ding. 10145-10153 [doi]
- Semore: VLM-guided Enhanced Semantic Motion Representations for Visual Reinforcement LearningWentao Wang, Chunyang Liu, Kehua Sheng, Bo Zhang 0106, Yan Wang. 10154-10162 [doi]
- Noisy Correspondence Learning with Modality Gap Direction CorrectionWuyuqing Wang, Zeyuan Gu, Erkun Yang. 10163-10171 [doi]
- When Person Re-Identification Meets Event Camera: A Benchmark Dataset and an Attribute-Guided Re-Identification FrameworkXiao Wang 0014, Qian Zhu, Shujuan Wu, Bo Jiang 0002, Shiliang Zhang. 10172-10180 [doi]
- Blur-Robust Detection via Feature Restoration: An End-to-End Framework for Prior-Guided Infrared UAV Target DetectionXiaolin Wang 0006, Houzhang Fang, Qingshan Li, Lu Wang 0014, Yi Chang 0002, Luxin Yan. 10181-10189 [doi]
- Spectrally Adaptive Channel-aware Unrolling Network for Compressed SensingXiaoyang Wang, Hongping Gan. 10190-10198 [doi]
- UMNet: Uncertainty-guided Memory Network for Hyperspectral PansharpeningXiaozheng Wang, Yong Yang 0001, Shuying Huang, Nayu Liu, Ziyang Liu. 10199-10206 [doi]
- Bi-VLM: Binary Post-Training Quantization for Vision-Language ModelsXijun Wang 0002, Rayyan Abdalla, Junyun Huang, Chengyuan Zhang, Ruiqi Xian, Dinesh Manocha. 10207-10215 [doi]
- Generating Attribute-Aware Human Motions from Textual PromptXinghan Wang, Kun Xu 0005, Fei Li, Cao Sheng, Jiazhong Yu, Yadong Mu. 10216-10224 [doi]
- PointSLAM++: Robust Dense Neural Gaussian Point Cloud-based SLAMXu Wang, Boyao Han, Xiaojun Chen, Ying Liu, Ruihui Li. 10225-10233 [doi]
- X-MoGen: Unified Motion Generation Across Humans and AnimalsXuan Wang, Kai Ruan, Liyang Qian, Guo Zhi Zhi, Chang Su, Gaoang Wang. 10234-10242 [doi]
- InterCoser: Interactive 3D Character Creation with Disentangled Fine-Grained FeaturesYi Wang, Jian Ma, Zhuo Su 0006, Guidong Wang, Jingyu Yang 0002, Yu-Kun Lai, Kun Li 0001. 10243-10251 [doi]
- Deeply Seeking Boundary for Lunar Regolith SegmentationYifeng Wang, Lingxin Wang, Lu Zhang, Yang Li, Chao Xu, Weiwei Zhang 0009, Junyue Tang, Yanhong Zheng, Yong Pang, Shengyuan Jiang, Yi Zhao, Zongquan Deng. 10252-10260 [doi]
- ID-Splat: Propagating Object Identities for Segmenting 3D Aerial-view ScenesYijing Wang 0004, Xu Tang 0004, Xiangrong Zhang, Jingjing Ma 0001. 10261-10269 [doi]
- Radar-APLANC: Unsupervised Radar-based Heartbeat Sensing via Augmented Pseudo-Label and Noise ContrastYing Wang, Zhaodong Sun, Xu Cheng 0003, Zuxian He, Xiaobai Li. 10270-10278 [doi]
- TileGS: Adaptive Gaussian Densification Through Tile-Guided Perceptual AnalysisYiwen Wang, Ran Yi 0002, Lizhuang Ma. 10279-10287 [doi]
- A Pseudo-Label Optimization Method Based on Polar Coordinate Modeling and Prior ConstraintsYudi Wang, Hailan Shen, Yixiao Fu, Yuqi Li, Zeshi Lu, Zailiang Chen 0001. 10288-10296 [doi]
- PATexGS: Perceptual-Adaptive Texture Scheduling for Visual Coherence in Textured Gaussian SplattingYuesong Wang, Dounian Ma, Xiaoyu Chen, Tao Guan. 10297-10305 [doi]
- ASCD: Attention-Steerable Contrastive Decoding for Reducing Hallucination in MLLMYujun Wang, Aniri, Jinhe Bi, Sören Pirk, Yunpu Ma. 10306-10314 [doi]
- Topology-Aware Vision Transformers for Enhanced Scene RecognitionYunxi Wang, Shuaiyu Liu, Qiling Li, Yazhou Ren 0001, Xiaorong Pu. 10315-10322 [doi]
- TIME: Temporal-Sensitive Multi-Dimensional Instruction Tuning and Robust Benchmarking for Video-LLMsYunxiao Wang, Meng Liu 0006, Wenqi Liu, Xuemeng Song, Bin Wen, Fan Yang 0094, Tingting Gao, Di Zhang 0026, Guorui Zhou, Liqiang Nie. 10323-10331 [doi]
- Personalize Your Gaussian: Consistent 3D Scene Personalization from a Single ImageYuxuan Wang, Xuanyu Yi, Qingshan Xu 0001, Yuan Zhou 0016, Long Chen 0016, Hanwang Zhang. 10332-10340 [doi]
- MCGS: Markov Chain Gaussian Splatting for Dynamic Scenes ReconstructionYuzhong Wang, Wenmin Wang 0001, Shixiong Zhang, Xinxing Yu, Zhongheng Chen. 10341-10348 [doi]
- Verb Mirage: Unveiling and Assessing Verb Concept Hallucinations in Multimodal Large Language ModelsZehao Wang, Xinpeng Liu 0002, Yudonglin Zhang, Xiaoqian Wu, Zhou Fang, Yifan Fang, Junfu Pu, Cewu Lu, Yong-Lu Li 0001. 10349-10357 [doi]
- Minute-Long Videos with Dual ParallelismsZeqing Wang, Bowen Zheng, Xingyi Yang, Zhenxiong Tan, Yuecong Xu, Xinchao Wang. 10358-10366 [doi]
- Difficulty Controlled Diffusion Model for Synthesizing Effective Training DataZerun Wang, Jiafeng Mao, Xueting Wang, Toshihiko Yamasaki. 10367-10375 [doi]
- Breaking Task Boundaries: A Unified Model for 3D Medical Image Fusion and Segmentation Guided by Manifold PerspectiveZeyu Wang 0009, Jiayu Wang, Haiyu Song 0002. 10376-10384 [doi]
- SigFusion: Unified Signal-Level Self-Supervised Learning Paradigm for Image FusionZeyu Wang 0009, Jiawei Feng, Jiayu Wang, Pengjie Wang 0001, Haiyu Song 0002. 10385-10393 [doi]
- Fine-Grained Generalization via Structuralizing Concept and Feature Space into Commonality, Specificity and ConfoundingZhen Wang, Jiaojiao Zhao, Qilong Wang, Yongfeng Dong, Wenlong Yu. 10394-10402 [doi]
- SpatioTemporal Difference Network for Video Depth Super-ResolutionZhengxue Wang, Yuan Wu, Xiang Li 0041, Zhiqiang Yan 0001, Jian Yang 0003. 10403-10411 [doi]
- DiffusionPose: Markov-Optimized Diffusion Model for Human Pose EstimationZhigang Wang, Zhenguang Liu, Shaojing Fan, Sifan Wu 0001, Yingying Jiao. 10412-10420 [doi]
- Beyond Single-Point Perturbation: A Hierarchical, Manifold-Aware Approach to Diffusion AttacksZhijie Wang, Lin Wang, Zhenyu Wen, Cong Wang. 10421-10429 [doi]
- T-GVC: Trajectory-Guided Generative Video Coding at Ultra-Low BitratesZhitao Wang, Hengyu Man, Wenrui Li 0001, Xingtao Wang, Xiaopeng Fan 0001, Debin Zhao. 10430-10438 [doi]
- MTAttack: Multi-Target Backdoor Attacks Against Large Vision-Language ModelsZihan Wang, Guansong Pang, Wenjun Miao, Jin Zheng, Xiao Bai 0001. 10440-10448 [doi]
- RPGen: Robust and Differentially Private Synthetic Image GenerationZihao Wang 0001, Hao Peng 0001, Wei Dong, Yuecen Wei, Li Sun 0008, Zhengtao Yu 0001. 10449-10457 [doi]
- Efficient and Effective In-context Demonstration Selection with CoresetZihua Wang, Jiarui Wang, Haiyang Xu 0001, Ming Yan 0008, Fei Huang 0002, Xu Yang 0021, Xiu-Shen Wei, Siya Mi, Yu Zhang 0004. 10458-10466 [doi]
- VideoChat-A1: Thinking with Long Videos by Chain-of-Shot ReasoningZikang Wang, Boyu Chen, Zhengrong Yue, Yi Wang 0074, Yu Qiao 0001, Limin Wang 0002, Yali Wang 0001. 10467-10475 [doi]
- FlowAnyTime: Efficient Fine-tuning with Intra-Inter Frame Distillation for All-Weather Optical Flow EstimationZixu Wang, Hongye Chen, Xiaochun Zou, Congxuan Zhang, Zhen Chen 0004, Xinbo Zhao. 10476-10484 [doi]
- PosPrune: Visual Token Pruning with Positional Bias Correction for Efficient Large Vision-Language ModelsZiyang Wang, Mengwei Li, Hao Yin, Wenhao Liu, Zilei Wang. 10485-10493 [doi]
- Towards Privacy-Protected Generalized Gaze Estimation Using Diffusion Models and Domain Stability Adaptation FrameworkZiyi Wang, Shengcheng Ye, Faming Fang, Haichuan Song. 10494-10502 [doi]
- DreamRunner: Fine-Grained Compositional Story-to-Video Generation with Retrieval-Augmented Motion AdaptationZun Wang 0001, Jialu Li 0001, Han Lin, Jaehong Yoon, Mohit Bansal. 10503-10511 [doi]
- Video Mirror Detection with the Motion-in-Depth CueAlex Warren, Ke Xu 0010, Xin Tian 0015, Gary K. L. Tam, Benjamin W. Wah, Rynson W. H. Lau. 10512-10520 [doi]
- Towards High-Fidelity 3D Portrait Generation with Rich Details by Cross-View Prior-Aware DiffusionHaoran Wei, Wencheng Han, Xingping Dong, Jianbing Shen. 10521-10529 [doi]
- ST-SAM: Multimodal Scene Text Segmentation with Dense Visual and Sparse Textual Prompts via SAMJin Wei, Yaqiang Wu, Jiayi Yan, Zeng Li, Zhen Xu, Yu Zhou 0015, Lingling Zhang 0005, Qianying Wang 0002. 10530-10538 [doi]
- Point-SRA: Self-Representation Alignment for 3D Representation LearningLintong Wei, Jian Lu, Haozhe Cheng, Jihua Zhu, Kaibing Zhang. 10539-10547 [doi]
- Where It Moves, It Matters: Referring Surgical Instrument Segmentation via MotionMeng Wei, Kun Yuan 0004, Shi Li, Yue Zhou, Long Bai 0008, Nassir Navab, Hongliang Ren 0001, Hong Joo Lee 0001, Tom Vercauteren, Nicolas Padoy. 10548-10556 [doi]
- Asymmetric Cross-Modal Knowledge Distillation: Bridging Modalities with Weak Semantic ConsistencyRiling Wei, Kelu Yao, Chuanguang Yang, Jin Wang 0039, Zhuoyan Gao, Chao Li 0028. 10557-10565 [doi]
- Exploring High-order-aware Prompt Learning for Zero-shot Anomaly DetectionShun Wei, Jielin Jiang, Xiaolong Xu 0001. 10566-10574 [doi]
- PBR3DGen: A VLM-Guided Mesh Generation with High-Quality PBR TextureXiaokang Wei, Bowen Zhang, Xianghui Yang, Yuxuan Wang, Chunchao Guo, Xi Zhao, Yan Luximon. 10575-10583 [doi]
- Seeing Is Believing: Grounding Long-Video Understanding in Spatio-Temporal Visual EvidenceZhaoyang Wei, Guoliang Wang, Guohua Gao, Yanchao Hao, Mingda Li, Wenchao Ding 0007, Xi Chen, Shizhu He, Xuehui Yu. 10584-10592 [doi]
- Efficient Segmentation with Multimodal Large Language Model via Token RoutingChangsong Wen, Zelin Peng, Yu Huang, Wei Shen 0002. 10593-10602 [doi]
- CHARM: Collaborative Harmonization Across Arbitrary Modalities for Modality-Agnostic Semantic SegmentationLekang Wen, Jing Xiao 0004, Liang Liao, Jiajun Chen, Mi Wang. 10603-10611 [doi]
- Robust Long-Term Test-Time Adaptation for 3D Human Pose Estimation Through Motion DiscretizationYilin Wen 0008, Kechuan Dong, Yusuke Sugano. 10612-10620 [doi]
- ReAlign: Text-to-Motion Generation via Step-Aware Reward-Guided AlignmentWanjiang Weng, Xiaofeng Tan 0001, Junbo Wang 0003, Guo-Sen Xie, Pan Zhou, Hongsong Wang 0001. 10621-10629 [doi]
- HiFusion: Hierarchical Intra-Spot Alignment and Regional Context Fusion for Spatial Gene Expression Prediction from HistopathologyZiqiao Weng, Yaoyu Fang, Jiahe Qian, Xinkun Wang, Lee A. Cooper, Weidong Cai 0001, Bo Zhou 0009. 10630-10637 [doi]
- VTinker: Guided Flow Upsampling and Texture Mapping for High-Resolution Video Frame InterpolationChenyang Wu, Jiayi Fu, Chun-Le Guo, Shuhao Han, Chongyi Li. 10638-10645 [doi]
- A Study of Finetuning Video Transformers for Multi-view Geometry TasksHuimin Wu 0001, Kwang-Ting Cheng, Stephen Lin 0001, Zhirong Wu. 10646-10654 [doi]
- Deep Inverse Shading: Consistent Albedo and Surface Detail Recovery via Generative RefinementJiacheng Wu, Ruiqi Zhang, Jie Chen. 10655-10663 [doi]
- Promptus: Can Prompt Streaming Replace Video StreamingJiangkai Wu, Liming Liu, Yunpeng Tan, Junlin Hao, Liang Zhang, Xinggong Zhang. 10664-10672 [doi]
- Codebook-Empowered Analysis-Friendly Extreme Underwater Image CompressionJianhao Wu, Yudong Mao, Qiuping Jiang. 10673-10681 [doi]
- Gradient as Conditions: Rethinking HOG for All-in-one Image RestorationJiawei Wu, Zhifei Yang 0004, Zhe Wang, Zhi Jin. 10682-10690 [doi]
- MRGeo: Robust Cross-View Geo-Localization of Corrupted Images via Spatial and Channel Feature EnhancementLe Wu, Bo Lv, Songsong Ouyang, Yingying Zhu 0001. 10691-10699 [doi]
- ImagerySearch: Adaptive Test-Time Search for Video Generation Beyond Semantic Dependency ConstraintsMeiqi Wu, Jiashu Zhu, Xiaokun Feng, Chubin Chen, Chen Zhu, Bingze Song, Fangyuan Mao, Jiahong Wu 0005, Xiangxiang Chu, Kaiqi Huang. 10700-10708 [doi]
- Unsupervised Multi-Parameter Inverse Solving for Reducing Ring Artifacts in 3D X-Ray CBCTQing Wu 0001, Hongjiang Wei, Jingyi Yu 0001, Yuyao Zhang. 10709-10717 [doi]
- Better Matching, Less Forgetting: A Quality-Guided Matcher for Transformer-based Incremental Object DetectionQirui Wu, Shizhou Zhang, De Cheng, Yinghui Xing, Lingyan Ran, Dahu Shi, Peng Wang 0015. 10718-10726 [doi]
- Deblur4DGS: 4D Gaussian Splatting from Blurry Monocular VideoRenlong Wu, Zhilu Zhang 0001, Mingyang Chen, Zifei Yan, Wangmeng Zuo. 10727-10735 [doi]
- RcAE: Recursive Reconstruction Framework for Unsupervised Industrial Anomaly DetectionRongcheng Wu, Hao Zhu, Shiying Zhang, Mingzhe Wang, Zhidong Li, Hui Li, Jianlong Zhou, JiangTao Cui, Fang Chen 0001, Pingyang Sun, Qiyu Liao, Ye-Lin. 10736-10744 [doi]
- Dual Coding Theory in Action: Language-Assisted Human Pose Estimation in VideosSifan Wu 0001, Haipeng Chen 0002, Yingda Lyu, Shaojing Fan, Zhigang Wang, Zhenguang Liu, Yingying Jiao. 10745-10753 [doi]
- Attentive Keypoint Identification: Progressive Spatiotemporal Refinement for Video-based Human Pose EstimationSifan Wu 0001, Haipeng Chen 0002, Yingda Lyu, Shaojing Fan, Zhigang Wang, Zhenguang Liu, Yingying Jiao. 10754-10762 [doi]
- Explicit Modeling of Causal Factors and Confounders for Image ClassificationWei Wu, Lei Meng 0001, Zhuang Qi, Zixuan Li, Yachong Zhang, Xiaoshuo Yan, Xiangxu Meng. 10763-10771 [doi]
- MUTrack: A Memory-Aware Unified Representation Framework for Visual TrackingWeijing Wu, Qihua Liang, Bineng Zhong 0001, Xiaohu Tang, Yufei Tan, Ning Li 0044, Yuanliang Xue. 10772-10780 [doi]
- TOSC: Task-Oriented Shape Completion for Open-World Dexterous Grasp Generation from Partial Point CloudsWeishang Wu, Yifei Shi, Zhiping Cai. 10781-10789 [doi]
- Remodeling Semantic Relationships in Vision-Language Fine-TuningXiangyang Wu, Liu Liu 0014, Baosheng Yu, Jiayan Qiu, Zhenwei Shi. 10790-10798 [doi]
- CiNuSeg: Class Incremental Nuclei Segmentation via Anchor-driven Consistency Learning with Dual Region RegularizationXuexin Wu, Zhenhui Ding, Huisi Wu, Jing Qin 0001. 10799-10807 [doi]
- FRBAT: Conditionally-Visible Physical Backdoor Attack via FluorescenceYalun Wu, Liu Liu, Endong Tong, Yingxiao Xiang, Xiaoting Lyu, Zhen Han 0001, Jiqiang Liu. 10808-10816 [doi]
- DCAC: Dynamic Class-Aware Cache Creates Stronger Out-of-Distribution DetectorsYanqi Wu, Qichao Chen, Runhe Lai, Xinhua Lu, Jia-Xin Zhuang, Zhilin Zhao 0001, Weishi Zheng 0001, Ruixuan Wang. 10817-10825 [doi]
- BeyondSparse: Facilitating Mamba to Enhance Cross-Domain 3D Semantic Segmentation in Adverse WeatherYao Wu, Mingwei Xing, Yachao Zhang 0001, Fangyong Wang, Xiaopei Zhang, Yanyun Qu. 10826-10834 [doi]
- Learning Knowledge from Textual Descriptions for 3D Human Pose EstimationYi Wu 0019, Jingtian Li, Shangfei Wang, Guoming Li, Meng Mao, Linxiang Tan. 10835-10843 [doi]
- Explainable Synthetic Image Detection Through Diffusion Timestep EnsemblingYixin Wu 0005, Feiran Zhang, Tianyuan Shi, Ruicheng Yin, Zhenghua Wang, Zhenliang Gan, Xiaohua Wang, Changze Lv, Xiaoqing Zheng, Xuanjing Huang 0001. 10844-10852 [doi]
- FoundationSLAM: Unleashing the Power of Depth Foundation Models for End-to-End Dense Visual SLAMYuchen Wu, Jiahe Li 0007, Fabio Tosi, Matteo Poggi, Jin Zheng, Xiao Bai 0001. 10853-10861 [doi]
- Hybrid Vector-Occupancy Field for Robust Implicit 3D Surface ReconstructionYue Wu 0004, Zhigang Gao, Tengfei Xiao, Can Qin, Yongzhe Yuan, Hao Li 0009, Kaiyuan Feng, Wenping Ma 0001. 10862-10870 [doi]
- Language-Guided and Motion-Aware Gait Representation for Generalizable RecognitionZhengxian Wu, Chuanrui Zhang, Shenao Jiang, Hangrui Xu, Zirui Liao, Luyuan Zhang, Huaqiu Li, Peng Jiao, Haoqian Wang. 10871-10878 [doi]
- Incomplete Multi-view Diabetic Retinopathy Grading via Self-Supervised Inter- and Intra-View RestorationZhihao Wu 0002, Yuxin Lin, Jie Wen 0001, Wuzhen Shi, LinLin Shen. 10879-10887 [doi]
- DLVINet: Advancing Dual-Lens Video Inpainting Beyond Parallax ConstraintsZhiliang Wu, Kun Li 0008, Yunqiu Xu, Hehe Fan, Yi Yang 0001. 10888-10896 [doi]
- Injection Without Distortion: Geometrically Constrained Knowledge Enhancement for Vision-Language ModelsZhongze Wu, Xiu Su, Feng Yang, Shan You, Jun Long, Yueyi Luo. 10897-10905 [doi]
- Realism Control One-step Diffusion for Real-world Image Super ResolutionZongliang Wu, Siming Zheng, Peng-Tao Jiang, Xin Yuan 0002. 10906-10914 [doi]
- OmniVDiff: Omni Controllable Video Diffusion for Generation and UnderstandingDianbing Xi, Jiepeng Wang 0005, Yuanzhi Liang, Xi Qiu, Yuchi Huo, Rui Wang 0004, Chi Zhang 0012, Xuelong Li 0001. 10915-10923 [doi]
- PFAvatar: Pose-Fusion 3D Personalized Avatar Reconstruction from Real-World Outfit-of-the-Day PhotosDianbing Xi, Guoyuan An, Jingsen Zhu, Zhijian Liu, Yuan Liu, Ruiyuan Zhang, Jiayuan Lu, Yuchi Huo, Rui Wang 0004. 10924-10932 [doi]
- 3DDM: Physically-based Anisotropic 3D Diffusion Model with 3D Gaussian for Point Cloud CompletionLong Xi 0001, Jia Ma, ZhenYu Yuan, Tao Xue 0001, Wen Tang 0004, Wen Lv. 10933-10941 [doi]
- Towards Efficient Low-rate Image Compression with Frequency-aware Diffusion Prior RefinementYichong Xia, Yimin Zhou 0011, Jinpeng Wang 0002, Bin Chen 0011. 10942-10950 [doi]
- Probabilistic Deformation Consistency for Unsupervised Shape MatchingYifan Xia, Tianwei Ye, Jun Huang 0008, Xiaoguang Mei, Jiayi Ma 0001. 10951-10959 [doi]
- SWIFT: A General Sensitive Weight Identification Framework for Fast Sensor-Transfer PansharpeningZeyu Xia, Chenxi Sun, Tianyu Xin, Yubo Zeng, Haoyu Chen, Liang-Jian Deng. 10960-10968 [doi]
- VGGS: VGGT-guided Gaussian Splatting for Efficient and Faithful Sparse-View Surface ReconstructionPeng Xiang 0002, Liang Han, Hui Zhang 0013, Yu-Shen Liu, Zhizhong Han. 10969-10977 [doi]
- Not Just What's There: Enabling CLIP to Comprehend Negated Visual Descriptions Without Fine-TuningJunhao Xiao, Zhiyu Wu, Hao Lin, Yi Chen, Yahui Liu, Xiaoran Zhao, Zixu Wang, Zejiang He. 10978-10986 [doi]
- Training-free Boosting for Few-shot Segmentation via Generalizing Semantic MiningKangyu Xiao, Zilei Wang, Yixin Zhang, Junjie Li 0002. 10987-10995 [doi]
- DcSplat: Dual-Constraint Human Gaussian Splatting with Latent Multi-View ConsistencyTengfei Xiao, Yue Wu 0004, Zhigang Gao, Yongzhe Yuan, Can Qin, Hao Li 0009, Mingyang Zhang 0002. 10996-11004 [doi]
- A Hybrid Space Model for Misaligned Multi-modality Image FusionYi Xiao, Jia Wang, Zhu Liu 0004, Di Wang 0018, Jinyuan Liu 0001, Risheng Liu. 11005-11013 [doi]
- Unaligned UAV RGBT Tracking: A Largescale Benchmark and a Novel ApproachYun Xiao 0003, Yuhang Wang, Jiandong Jin, Wankang Zhang, Chenglong Li 0002. 11014-11022 [doi]
- UniMGS: Unifying Mesh and 3D Gaussian Splatting with Single-Pass Rasterization and Proxy-Based DeformationZeyu Xiao 0001, Mingyang Sun, Yimin Cong, Lintao Wang, Dongliang Kou, Zhenyi Wu, Dingkang Yang, Peng Zhai, Zeyu Wang, Lihua Zhang 0002. 11023-11031 [doi]
- Exploiting Blurry Representations for Event-guided Video Super-ResolutionZeyu Xiao 0002, Xinchao Wang. 11032-11041 [doi]
- PUNO: A Neural Operator Framework for Point Cloud UpsamplingZijian Xiao, Yining Xu 0001, Yingjie Huang 0001, Li Yao 0003. 11042-11050 [doi]
- SimROD: A Simple Baseline for Raw Object Detection with Global and Local EnhancementsHaiyang Xie, Xi Shen, Shihua Huang, Qirui Wang, Zheng Wang. 11051-11059 [doi]
- Retrieval-driven Reasoning for Deliberative Visual ClassificationJianye Xie, Lianyong Qi, Fan Wang 0020, Anqi Wang, Wenjuan Gong, Danxin Wang, Wanchun Dou, Yang Cao 0019, Shichao Pei, Xiaokang Zhou. 11060-11068 [doi]
- Unnoticed Yet Effective: A Hybrid Physical Camouflage Framework Against DNNs and Human PerceptionMingye Xie, Jiacheng Ruan, Xian Gao, Ting Liu 0016, Yuzhuo Fu. 11069-11077 [doi]
- Human2Robot: Learning Robot Actions from Paired Human-Robot VideosSicheng Xie, Haidong Cao, Zejia Weng, Zhen Xing, Haoran Chen 0003, Shiwei Shen, Jiaqi Leng 0002, Zuxuan Wu, Yu-Gang Jiang 0001. 11078-11086 [doi]
- Sonic4D: Spatial Audio Generation for Immersive 4D Scene ExplorationSiyi Xie, Hanxin Zhu, Xinyi Chen, Tianyu He, Xin Li 0082, Zhibo Chen 0001. 11087-11095 [doi]
- MGD: Mesh-guided Gaussians with Diffusion Priors for Dynamic Objects Reconstruction from Monocular RGB-D VideoWeixing Xie, Ying Ye, Xian Wu, Jintian Li, Bingchuan Li, Yanchen Lin, Junfeng Yao. 11096-11104 [doi]
- Unleashing the Potential of Large Language Models for Text-to-Image Generation Through Autoregressive Representation AlignmentXing Xie, Jiawei Liu 0003, Ziyue Lin, Huijie Fan, Zhi Han, Yandong Tang, Liangqiong Qu. 11105-11113 [doi]
- FCMO: A Flow-Curv Mamba Operator for Large-Scale 3D Vehicle AerodynamicsYuchen Xie, Yufeng Xie, Hanyu He, Yue Huang, Lijuan Sun, Hengyi Ren. 11114-11122 [doi]
- FilmSceneDesigner: Chaining Set Design for Procedural Film Scene GenerationZhifeng Xie, Keyi Zhang, Yiye Yan, Yuling Guo, Fan Yang, Jiting Zhou, Mengtian Li. 11123-11131 [doi]
- Revealing the Invisible: Latent Structure Modeling for Semantically Consistent Cloud RemovalJingwei Xin, Kai Guo, Jie Li 0001, Nannan Wang 0001. 11132-11140 [doi]
- Training and Inference Within 1 Second - Tackle Cross-Sensor Degradation of Real-World Pansharpening with Efficient Residual Feature TailoringTianyu Xin, Jin-Liang Xiao, Zeyu Xia, Shan Yin, Liang-Jian Deng. 11141-11149 [doi]
- OptMark: Robust Multi-bit Diffusion Watermarking via Inference Time OptimizationJiazheng Xing, Hai Ci, Hongbin Xu, Hangjie Yuan, Yong Liu, Mike Zheng Shou. 11150-11158 [doi]
- MAGIC: Mastering Physical Adversarial Generation in Context Through Collaborative LLM AgentsYun Xing, Nhat Chung, Jie Zhang 0002, Yue Cao, Ivor W. Tsang, Yang Liu 0003, Lei Ma 0003, Qing Guo 0005. 11159-11168 [doi]
- Gait Recognition via Collaborating Discriminative and Generative Diffusion ModelsHaijun Xiong, Bin Feng 0001, Bang Wang 0001, Xinggang Wang, Wenyu Liu 0001. 11169-11177 [doi]
- DocR1: Evidence Page-Guided GRPO for Multi-Page Document UnderstandingJunyu Xiong, Yonghui Wang, Weichao Zhao, Chenyu Liu, Bing Yin, Wengang Zhou 0001, Houqiang Li. 11178-11186 [doi]
- Disentangled Hypergraph-Guided Mamba Scanning for Fine-Grained Visual RecognitionZhongwei Xiong, Hao Wang, Xiaoyan Yu, Lingling Li, Xuezhuan Zhao, Taisong Jin. 11187-11195 [doi]
- Ultralight Polarity-Split Neuromorphic SNN for Event-Stream Super-ResolutionChuanzhi Xu, Haoxian Zhou, Langyi Chen, Yuk Ying Chung, Qiang Qu 0004. 11196-11204 [doi]
- BAT: Learning Event-based Optical Flow with Bidirectional Adaptive Temporal CorrelationGangwei Xu, Haotong Lin, Zhaoxing Zhang, Hongcheng Luo, Haiyang Sun, Xin Yang 0008. 11205-11213 [doi]
- Fine-Grained Representation for Lane Topology ReasoningGuoqing Xu, Yiheng Li, Yang Yang. 11214-11222 [doi]
- MoEG-HOI: Mixture of Expert Groups for One-Stage Hand-Object Interaction Motion Generation with Hand-Finger-Joint Semantic GuidanceHang Xu, Yang Xiao 0007, Changlong Jiang, Haohong Kuang, Kaidi Zhang, Min Du, Ran Wang 0005. 11223-11231 [doi]
- TRT: Harnessing Tensor Ring Transformer for Hyperspectral Image Super-ResolutionHonghui Xu 0002, Junwei Zhu, Yubin Gu, Yueqian Quan, Chuangjie Fang, Hong Qiu, Jianwei Zheng 0001. 11232-11240 [doi]
- MCMoE: Completing Missing Modalities with Mixture of Experts for Incomplete Multimodal Action Quality AssessmentHuangbiao Xu, Huanqi Wu 0001, Xiao Ke, Junyi Wu, Rui Xu 0028, Jinglin Xu. 11241-11249 [doi]
- S2-UniSeg: Fast Universal Agglomerative Pooling for Scalable Segment Anything Without SupervisionHuihui Xu, Jin Ye 0002, Hongqiu Wang, Changkai Ji, Jiashi Lin, Ming Hu, Ziyan Huang, Ying Chen, Chenglong Ma 0002, Tianbin Li, Lihao Liu, Junjun He, Lei Zhu 0003. 11250-11258 [doi]
- PulseMind: A Multi-Modal Medical Model for Real-World Clinical DiagnosisJiao Xu, Junwei Liu, Jiangwei Lao, Qi Zhu 0010, Yunpeng Zhao, Congyun Jin, Shinan Liu, Zhihong Lu 0002, Lihe Zhang, Xin Chen, Jian Wang 0108, Ping Wang. 11259-11268 [doi]
- VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video GenerationJiazheng Xu, Yu Huang, Jiale Cheng, Yuanming Yang, Jiajun Xu, Yuan Wang, Wenbo Duan, Shen Yang 0001, Qunlin Jin, Shurun Li, Jiayan Teng, Zhuoyi Yang, Wendi Zheng, Xiao Liu 0036, Dan Zhang, Ming Ding 0004, Xiaohan Zhang, Shiyu Huang 0001, Xiaotao Gu, Minlie Huang, Jie Tang 0001, Yuxiao Dong. 11269-11277 [doi]
- Identity-Aware Vision-Language Model for Explainable Face Forgery DetectionJunhao Xu, Jingjing Chen 0001, Yang Jiao, Jiacheng Zhang, Zhiyu Tan, Hao Li, Yu-Gang Jiang 0001. 11278-11286 [doi]
- EccoMamba: Enhanced Cross-hierarchical Continuity Orthogonal Mamba for Medical Image SegmentationJunlin Xu, Jincan Li, Feifei Cui, Zhuang Zhang, Jialiang Yang, Shuting Jin, Qiangguo Jin, Yajie Meng. 11287-11295 [doi]
- DeFT-LoRA: Decoupled and Fused Tuning with LoRA Experts for Universal Cross-Domain RetrievalKe Xu 0011, Xiaozheng Shen, Shanshan Wang 0008, Mengzhu Wang, Xun Yang 0001. 11296-11304 [doi]
- RealRep: Generalized SDR-to-HDR Conversion via Attribute-Disentangled Representation LearningLi Xu 0008, Siqi Wang, Kepeng Xu, Lin Zhang 0040, Gang He 0002, Weiran Wang, Yu-Wing Tai. 11305-11313 [doi]
- Heterogeneous Complementary DistillationLiuchi Xu, Hao Zheng, Lu Wang 0001, Lisheng Xu, Jun Cheng 0003. 11314-11322 [doi]
- Redundant Queries in DETR-Based 3D Detection Methods: Unnecessary and PrunableLizhen Xu, Zehao Wu, Wenzhao Qiu, Shanmin Pang, Xiuxiu Bai, Kuizhi Mei, Jianru Xue. 11323-11331 [doi]
- VP-Bench: A Comprehensive Benchmark for Visual Prompting in Multimodal Large Language ModelsMingjie Xu, Jinpeng Chen 0003, Yuzhi Zhao, Jason Chun Lok Li, Yue Qiu, Zekang Du, Mengyang Wu, Pingping Zhang, Kun Li 0015, Hongzheng Yang, Wenao Ma, Jiaheng Wei, Qinbin Li, Kangcheng Liu, Wenqiang Lei. 11332-11341 [doi]
- MuSASplat: Efficient Sparse-View 3D Gaussian Splats via Lightweight Multi-Scale AdaptationMuyu Xu, Fangneng Zhan, Xiaoqin Zhang 0002, Ling Shao 0001, Shijian Lu. 11343-11351 [doi]
- Pushing Rendering Boundaries: Hard Gaussian SplattingQingshan Xu 0001, Jiequan Cui, Xuanyu Yi, Yuxuan Wang, Yuan Zhou 0016, Yew-Soon Ong, Hanwang Zhang. 11352-11360 [doi]
- NeuSpring: Neural Spring Fields for Reconstruction and Simulation of Deformable Objects from VideosQingshan Xu 0001, Jiao Liu 0006, Shangshu Yu, Yuxuan Wang, Yuan Zhou 0016, Junbao Zhou, Jiequan Cui, Yew-Soon Ong, Hanwang Zhang. 11361-11369 [doi]
- OAD-Promoter: Enhancing Zero-Shot VQA Using Large Language Models with Object Attribute DescriptionQuanxing Xu, Ling Zhou 0005, Feifei Zhang, Rubing Huang, Jinyu Tian 0001. 11370-11378 [doi]
- SCALAR: Scale-wise Controllable Visual Autoregressive LearningRyan Xu, Dongyang Jin, Yancheng Bai, Rui Lan, Xu Duan, Lei Sun, Xiangxiang Chu. 11379-11387 [doi]
- CAG-GS: Consistent Anchor Guided Gaussian Splatting for Large-scale Scene RenderingShijie Xu, Qiulei Dong. 11388-11396 [doi]
- SRD: Reinforcement-Learned Semantic Perturbation for Backdoor Defense in VLMsShuhan Xu, Siyuan Liang 0004, Hongling Zheng, Aishan Liu, Xinbiao Wang, Yong Luo 0002, Fu Lin, Leszek Rutkowski, Dacheng Tao. 11397-11405 [doi]
- CNM-UNet: Continuous Ordinary Differential Equations for Medical Image SegmentationTianqi Xu, Yashi Zhu, Quansong He, Yue Cao, Kaishen Wang, Zhang Yi 0001, Tao He 0016. 11406-11414 [doi]
- ICLR: Inter-Chrominance and Luminance Interaction for Natural Color Restoration in Low-Light Image EnhancementXin Xu, Hao Liu, Wei Liu, Wei Wang, Jiayi Wu, Kui Jiang. 11415-11423 [doi]
- FedARKS: Federated Aggregation via Robust and Discriminative Knowledge Selection and Integration for Person Re-identificationXin Xu 0007, Binchang Ma, Zhixi Yu, Wei Liu 0183. 11424-11432 [doi]
- STMI: Segmentation-Guided Token Modulation with Cross-Modal Hypergraph Interaction for Multi-Modal Object Re-IdentificationXingguo Xu, Zhanyu Liu, Weixiang Zhou, Yuansheng Gao, Junjie Cao, Yuhao Wang, Jixiang Luo, Dell Zhang. 11433-11441 [doi]
- Dream-IF: Dynamic Relative EnhAnceMent for Image FusionXingxin Xu, Bing Cao 0002, Dongdong Li, Qinghua Hu, Pengfei Zhu 0001. 11442-11450 [doi]
- Improving the Convergence Rate of Ray Search Optimization for Query-Efficient Hard-Label AttacksXinjie Xu, Shuyu Cheng, Dongwei Xu, Qi Xuan 0001, Chen Ma 0003. 11451-11459 [doi]
- PMGS: Reconstruction of Projectile Motion Across Large Spatiotemporal Spans via 3D Gaussian SplattingYijun Xu, Jingrui Zhang, Yuhan Chen, Dingwen Wang, Lei Yu 0006, Chu He. 11460-11468 [doi]
- Dynamic Gaussian Scene Reconstruction from Unsynchronized VideosZhixin Xu, Hengyu Zhou, Yuan Liu 0025, Wenhan Xue, Hao Pan 0001, Wenping Wang 0001, Bin Wang 0021. 11469-11477 [doi]
- TSPE-GS: Probabilistic Depth Extraction for Semi-Transparent Surface Reconstruction via 3D Gaussian SplattingZhiyuan Xu, Nan Min, Yuhang Guo, Tong Wei. 11478-11486 [doi]
- Target-Balanced Score DistillationZhou Xu, Qi Wang, Yuxiao Yang, Luyuan Zhang, Zhang Liang, Yang Li. 11487-11495 [doi]
- VideoSeg-R1: Reasoning Video Object Segmentation via Reinforcement LearningZishan Xu, Yifu Guo, Yuquan Lu, Fengyu Yang, Junxin Li, Lihua Cai. 11496-11504 [doi]
- LinProVSR: Linguistics-Knowledge Guided Progressive Disambiguation Network for Visual Speech RecognitionFeng Xue 0002, Baochao Zhu, Wei Jia 0001, Shujie Li 0002, Yu Li 0053, Jinrui Zhang, Shengeng Tang, Dan Guo 0001. 11505-11513 [doi]
- Dual-View Inference Attack: Machine Unlearning Amplifies Privacy ExposureLulu Xue, Shengshan Hu, Linqiang Qian, Peijin Guo, Yechao Zhang, Minghui Li, Yanjun Zhang, Dayong Ye, Leo Yu Zhang. 11514-11522 [doi]
- SpaCRD: Multimodal Deep Fusion of Histology and Spatial Transcriptomics for Cancer Region DetectionShuailin Xue, Jun Wan 0005, Lihua Zhang, Wenwen Min. 11523-11531 [doi]
- UVLM: Benchmarking Video Language Model for Underwater World UnderstandingXizhe Xue, Yang Zhou, Dawei Yan, Lijie Tao, Junjie Li, Ying Li 0017, Haokui Zhang, Rong Xiao 0003. 11532-11540 [doi]
- Diff-NAT: Better Naturalistic and Aggressive Adversarial Attacks via Class-Optimized Diffusion for Object DetectionQinglong Yan, Tong Zou, Xunpeng Yi, Xinyu Xiang, Xuying Wu, Hao Zhang 0073, Jiayi Ma 0001. 11541-11549 [doi]
- Start Small, Think Big: Curriculum-based Relative Policy Optimization for Visual GroundingQingyang Yan, Guangyao Chen, Yixiong Zou. 11550-11558 [doi]
- Backtrace Mamba: Reviving Critical Temporal Contexts via Hierarchical Memory Compression for Online Action DetectionSu Yan, Jiahua Li, Kun Wei, Cheng Deng 0002. 11559-11567 [doi]
- OmniEvent: Unified Event Representation LearningWeiqi Yan 0005, Chenlu Lin, Youbiao Wang, Zhipeng Cai 0003, Xiuhong Lin, Yangyang Shi, Weiquan Liu, Yu Zang. 11568-11576 [doi]
- MedReasoner: Reinforcement Learning Drives Reasoning Grounding from Clinical Thought to Pixel-Level PrecisionZhonghao Yan, Muxi Diao, Yuxuan Yang, Ruoyan Jing, Jiayuan Xu, Kaizhou Zhang, Lele Yang, Yanxi Liu 0006, Kongming Liang, Zhanyu Ma. 11577-11585 [doi]
- Binary-Gaussian: Compact and Progressive Representation for 3D Gaussian SegmentationAn Yang, Chenyu Liu, Jun Du 0002, Jianqing Gao, Jia Pan, Jinshui Hu, Baocai Yin, Bing Yin, Cong Liu 0006. 11586-11594 [doi]
- PointDGRWKV: Generalizing RWKV-like Architecture to Unseen Domains for Point Cloud ClassificationHao Yang, Qianyu Zhou 0001, Haijia Sun, Xiangtai Li, Xuequan Lu, Lizhuang Ma, Shuicheng Yan. 11595-11603 [doi]
- Motion-Aware Object Tracking via Motion and Geometry-Aware CuesHongtao Yang, Bineng Zhong 0001, Qihua Liang, Xiantao Hu, Yufei Tan, Haiying Xia, Shuxiang Song 0001. 11604-11612 [doi]
- FIA-Edit: Frequency-Interactive Attention for Efficient and High-Fidelity Inversion-Free Text-Guided Image EditingKaixiang Yang, Boyang Shen, Xin Li 0001, Yuchen Dai, Yuxuan Luo, Yueran Ma, Wei Fang, Qiang Li 0018, Zhiwei Wang 0002. 11613-11621 [doi]
- Your AI-Generated Image Detector Can Secretly Achieve SOTA Accuracy, If CalibratedMuli Yang, Gabriel James Goenawan, Henan Wang, Huaiyuan Qin, Chenghao Xu, Yanhua Yang, Fen Fang, Ying Sun 0001, Joo-Hwee Lim, Hongyuan Zhu 0002. 11622-11630 [doi]
- Perceive, Act and Correct: Confidence Is Not Enough for Hyperspectral ClassificationMuzhou Yang, Wuzhou Quan, Mingqiang Wei. 11631-11639 [doi]
- UniHOI: Unified Human-Object Interaction Understanding via Unified Token SpacePanqi Yang, Haodong Jing, Nanning Zheng 0001, Yongqiang Ma. 11640-11648 [doi]
- WorldRFT: Latent World Model Planning with Reinforcement Fine-Tuning for Autonomous DrivingPengxuan Yang, Ben Lu, Zhongpu Xia, Chao Han, Yinfeng Gao, Teng Zhang, Kun Zhan, Xianpeng Lang, Yupeng Zheng, Qichao Zhang. 11649-11657 [doi]
- Rethinking Rainy 3D Scene Reconstruction via Perspective Transforming and Brightness TuningQianfeng Yang, Xiang Chen 0015, Pengpeng Li 0001, Qiyuan Guan, Guiyue Jin, Jiyu Jin. 11658-11666 [doi]
- MagicPaint: Operate Anything for Image Inpainting with Diffusion ModelQinhong Yang, Dongdong Chen 0001, Qi Chu 0001, Tao Gong, Qiankun Liu 0001, Zhentao Tan, Xulin Li, Huamin Feng, Nenghai Yu. 11667-11675 [doi]
- Learning Beyond Vision: Vision-Language Distillation and Edge-Aware Mix Diffusion in Semi-Supervised Semantic SegmentationRui Yang, Yunfei Bai, Yuehua Liu, Xiaomao Li, Shaorong Xie. 11676-11684 [doi]
- MaRS: A Multi-modality Very-high-resolution Remote Sensing Foundation Model with Cross-Granularity Meta-Modality LearningRuoyu Yang, Yinhe Liu, Heng Yan, Yiheng Zhou, Yihan Fu, Han Luo, Yanfei Zhong. 11685-11693 [doi]
- Look-Back: Implicit Visual Re-focusing in MLLM ReasoningShuo Yang, Yuwei Niu, Yuyang Liu, Yang Ye, Bin Lin 0014, Li Yuan 0007. 11694-11702 [doi]
- VAEVQ: Enhancing Discrete Visual Tokenization Through Variational ModelingSicheng Yang, Xing Hu 0010, Qiang Wu 0012, Dawei Yang. 11703-11711 [doi]
- MMG-VL: A Vision-Language Driven Approach for Multi-Person Motion GenerationSongyuan Yang, Wanrong Huang, Yinuo Liu, Kedi Zhang, Xihuai He, Shaowu Yang, Huibin Tan. 11712-11720 [doi]
- MonoCLUE: Object-Aware Clustering Enhances Monocular 3D Object DetectionSunghun Yang, Minhyeok Lee, Jungho Lee, Sangyoun Lee. 11721-11729 [doi]
- ACID-Style: An Adaptive Condition Injection Diffusion Model for Arbitrary Style TransferTing Yang 0009, Siyu Yang 0005, Xiyao Liu 0001, Songtao Wu, Gerald Schaefer, Kuanhong Xu, Hui Fang 0003. 11730-11738 [doi]
- SODiff: Semantic-Oriented Diffusion Model for JPEG Compression Artifacts RemovalTingyu Yang, Jue Gong, Jinpei Guo, Wenbo Li, Yong Guo, Yulun Zhang 0001. 11739-11747 [doi]
- StyleProto: Style-Augmented Prototype Learning for Cross-Domain Few-Shot Object DetectionXi Yang, Quantao Xie. 11748-11756 [doi]
- MedSAMix: A Training-Free Model Merging Approach for Medical Image SegmentationYanwu Yang, Guinan Su, Jiesi Hu, Francesco Sammarco, Jonas Geiping, Thomas Wolfers. 11757-11765 [doi]
- StreamingTalker: Audio-driven 3D Facial Animation with Autoregressive Diffusion ModelYifan Yang, Zhi Cen, Sida Peng, Xiangwei Chen, Yifu Deng, Xinyu Zhu, Fan Jia, Xiaowei Zhou 0001, Hujun Bao. 11766-11774 [doi]
- RobusTor3D: Robust Multimodal 3D Object Detector for Autonomous Driving by Vision-Language Knowledge BlendingYing Yang, Hui Yin 0002, Aixin Chong, Hui Wang, Zhengyin Liang. 11775-11783 [doi]
- FantasyStyle: Controllable Stylized Distillation for 3D Gaussian SplattingYitong Yang, Yinglin Wang, Changshuo Wang 0001, Huajie Wang, Shuting He. 11784-11792 [doi]
- SAR-DisentDM: A Semantic-Disentangled Diffusion Model for Limited-Data SAR Image SynthesisYue Yang, Song Tang, Qijun Zhao, Hailun Zhang, Xiwen Wang, Zijian Deng. 11793-11801 [doi]
- Endowing Vision-Language Models with System 2 Thinking for Fine-grained Visual RecognitionYutong Yang, Lifu Huang, Yijie Lin 0001, Xi Peng 0001, Mouxing Yang. 11802-11810 [doi]
- Beyond Quadratic: Linear-Time Change Detection with RWKVZhenyu Yang, Gensheng Pei, Tao Chen 0012, Xia Yuan, Haofeng Zhang 0001, Xiangbo Shu, Yazhou Yao. 11811-11819 [doi]
- LongT2IBench: A Benchmark for Evaluating Long Text-to-Image Generation with Graph-structured AnnotationsZhichao Yang 0013, Tianjiao Gu, Jianjie Wang, Feiyu Lin, Xiangfei Sheng, Pengfei Chen 0003, Leida Li. 11820-11828 [doi]
- HD²-SSC: High-Dimension High-Density Semantic Scene Completion for Autonomous DrivingZhiwen Yang, Yuxin Peng 0001. 11829-11837 [doi]
- MUSE: Multi-Scale Dense Self-Distillation for Nucleus Detection and ClassificationZijiang Yang 0009, Hanqing Chao, Bokai Zhao, Yelin Yang, Yunshuo Zhang, Dongmei Fu, Junping Zhang, Le Lu 0001, Ke Yan 0006, Dakai Jin, Minfeng Xu, Yun Bian, Hui Jiang. 11838-11847 [doi]
- Parameter-, Memory-, Time-Efficient Multi-Task Dense Vision AdaptationHaiming Yao, Wei Luo, Qiyu Chen 0002, Jianxing Liao, Wei You. 11848-11856 [doi]
- TDSS: Task Dynamic-Synergistic Skill Adaptation for Boosting Efficient and Scalable Multi-Task Learning in Dense Visual PredictionHaiming Yao, Qiyu Chen 0002, Wei Luo, Zheng Zhang, Jianxing Liao, Wei You. 11857-11865 [doi]
- Beyond Boundaries: Leveraging Vision Foundation Models for Source-Free Object DetectionHuizai Yao, Sicheng Zhao, Pengteng Li, Yi Cui, Shuo Lu, Weiyu Guo, Yunfan Lu, Yijie Xu, Hui Xiong 0001. 11866-11874 [doi]
- VK-Det: Visual Knowledge Guided Prototype Learning for Open-Vocabulary Aerial Object DetectionJianhang Yao, Yongbin Zheng, Siqi Lu, Wanying Xu, Peng Sun. 11875-11882 [doi]
- RemoteReasoner: Towards Unifying Geospatial Reasoning WorkflowLiang Yao, Fan Liu 0003, Hongbo Lu, Chuanyi Zhang, Rui Min, Shengxiang Xu, Shimin Di, Pai Peng. 11883-11891 [doi]
- Conditional Prompt Learning via Degradation Perception for Underwater Image EnhancementMingze Yao, Zhiying Jiang, XianPing Fu, Huibing Wang. 11892-11900 [doi]
- HOSIG: Full-Body Human-Object-Scene Interaction Generation with Hierarchical Scene PerceptionWei Yao, Yunlian Sun, Hongwen Zhang 0001, Yebin Liu, Jinhui Tang 0001. 11901-11909 [doi]
- DriveSuprim: Towards Precise Trajectory Selection for End-to-End PlanningWenhao Yao, Zhenxin Li, Shiyi Lan, Zi Wang, Xinglong Sun, José M. Álvarez 0004, Zuxuan Wu. 11910-11918 [doi]
- SAFE: Semantic- and Frequency-Enhanced Curriculum for Cross-Domain Deepfake DetectionYulin Yao, Kangfeng Zheng, Bin Wu 0012, Chunhua Wu, Jujie Wang, Jiaqi Gao, Minjiao Yang, Dan Luo. 11919-11927 [doi]
- Tell as You Want: Customizing Image Narrative with Knowledge and ThoughtsZiwei Yao, Qian Wang, Ruiping Wang 0001, Xilin Chen 0001. 11928-11936 [doi]
- RIS-LAD: A Benchmark and Model for Referring Image Segmentation in Low-Altitude Drone ImageryKai Ye, YingShi Luan, Zhudi Chen, Guangyue Meng, Pingyang Dai, Liujuan Cao. 11937-11945 [doi]
- OW-DAR: Dual-Granularity Adaptive Reconstruction-Error Modeling for Open-World Object DetectionLinhua Ye, Xing Xi, Ronghua Luo. 11946-11954 [doi]
- When Eyes and Ears Disagree: Can MLLMs Discern Audio-Visual Confusion?Qilang Ye, Wei Zeng, Meng Liu, Jie Zhang, Yupeng Hu, Zitong Yu, Yu Zhou 0015. 11955-11963 [doi]
- DcMatch: Unsupervised Multi-Shape Matching with Dual-Level ConsistencyTianwei Ye, Yong Ma 0001, Xiaoguang Mei. 11964-11972 [doi]
- HyperSign: Saliency-Aware Spatial Graphs and Temporal Hypergraphs for Continuous Sign Language RecognitionWeiyi Ye, Xu-Hua Yang 0001, Dong Wei, Gang-Feng Ma, Yujiao Huang, Xiao-Xin Li. 11973-11981 [doi]
- Conformable Convolution for Topologically Constrained Learning of Complex Anatomical StructuresYousef Yeganeh, Goktug Guvercin, Nassir Navab, Azade Farshad. 11982-11990 [doi]
- Beyond Simple Edits: X-Planner for Complex Instruction-Based Image EditingChun-Hsiao Yeh, Yilin Wang 0002, Nanxuan Zhao, Richard Zhang 0002, Yuheng Li, Yi Ma, Krishna Kumar Singh. 11991-11999 [doi]
- Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMsChun-Hsiao Yeh, Chenyu Wang, Shengbang Tong, Ta Ying Cheng, Ruoyu Wang 0014, Tianzhe Chu, Yuexiang Zhai, Yubei Chen, Shenghua Gao, Yi Ma 0001. 12000-12008 [doi]
- DiffRefiner: Coarse to Fine Trajectory Planning via Diffusion Refinement with Semantic Interaction for End to End Autonomous DrivingLiuhan Yin, Runkun Ju, Guodong Guo, Erkang Cheng. 12009-12017 [doi]
- Codebook-Centric Deep Hashing: End-to-End Joint Learning of Semantic Hash Centers and Neural Hash FunctionShuo Yin, Zhiyuan Yin, Yuqing Hou, Rui Liu 0007, Yong Chen 0008, Dell Zhang. 12018-12026 [doi]
- Learning to Tell Apart: Weakly Supervised Video Anomaly Detection via Disentangled Semantic AlignmentWenti Yin, Huaxin Zhang, Xiang Wang 0012, Yuqing Lu, Yicheng Zhang, Bingquan Gong, Jialong Zuo, Li Yu 0003, Changxin Gao, Nong Sang. 12027-12035 [doi]
- KPLM-STA: Physically-Accurate Shadow Synthesis for Human Relighting via Keypoint-Based Light ModelingXinhui Yin, Qifei Li, Yilin Guo, Hongxia Xie, Xiaoli Zhang. 12036-12043 [doi]
- Spatiotemporal-Untrammelled Mixture of Experts for Multi-Person Motion PredictionZheng Yin, Chengjian Li, Xiangbo Shu, Meiqi Cao, Rui Yan 0010, Jinhui Tang 0001. 12044-12052 [doi]
- RefSTAR: Blind Face Image Restoration with Reference Selection, Transfer, and ReconstructionZhicun Yin, Junjie Chen, Ming Liu 0018, Zhixin Wang, Fan Li, Renjing Pei, Xiaoming Li 0002, Rynson W. H. Lau, Wangmeng Zuo. 12053-12062 [doi]
- Exploiting All Mamba Fusion for Efficient RGB-D TrackingGe Ying, Dawei Zhang 0002, Chengzhuan Yang, Wei Liu 0044, Sang-Woon Jeon, Hua Wang 0002, Changqin Huang, Zhonglong Zheng. 12063-12071 [doi]
- PeriUn: Enhancing Unlearning by Selectively Forgetting Peripheral SamplesHee bin Yoo, Dong-Sig Han, Jaein Kim 0004, Byoung-Tak Zhang. 12072-12080 [doi]
- Through the Water: Refractive Gaussian Splatting for Water Surface ScenesYeonghun Yoon, Hojoon Jung, Jaeyoon Lee, Taegwan Kim, Gyu-Hyun Kim, Jongwon Choi. 12081-12089 [doi]
- Dense Cross-Scale Image Alignment with Fully Spatial Correlation and Just Noticeable Difference GuidanceJinkun You, Jiaxue Li, Jie Zhang, Yicong Zhou. 12090-12098 [doi]
- KPDM: Key Phrase Dynamic Masking for Robust Text-to-Image Person RetrievalShaofeng You, Tianle Miao, Qihang Chen, Xin Li, Zhuo Cheng, Dapeng Luo. 12099-12107 [doi]
- Knowledge Completes the Vision: A Multimodal Entity-aware Retrieval-Augmented Generation Framework for News Image CaptioningXiaoxing You, Qiang Huang, Lingyu Li, Chi Zhang, Xiaopeng Liu, Min Zhang 0005, Jun Yu 0002. 12108-12116 [doi]
- X-ReID: Multi-granularity Information Interaction for Video-Based Visible-Infrared Person Re-IdentificationChenyang Yu, Xuehu Liu, Pingping Zhang, Huchuan Lu. 12117-12125 [doi]
- Understanding Interaction as You Need: Intention-Driven Pedestrian Behavior PredictionHang Yu 0006, Yansen Yu, Jiayan Qiu. 12126-12134 [doi]
- IMAGGarment+: Efficient Attribute-Wise Diffusion for Garment GenerationJian Yu, Fei Shen, Cong Wang 0034, Yanpeng Sun, Hao Tang 0007, Qin Guo, Xiaoyu Du 0002. 12135-12143 [doi]
- FQ-PETR: Fully Quantized Position Embedding Transformation for Multi-View 3D Object DetectionJiangyong Yu, Changyong Shu, Sifan Zhou, Zichen Yu, Xing Hu 0010, Dawei Yang. 12144-12152 [doi]
- DehazeGS: Seeing Through Fog with 3D Gaussian SplattingJinze Yu, Yiqun Wang 0001, Aiheng Jiang, Zhengda Lu, Jianwei Guo, Yong Li 0023, Hongxing Qin, Xiaopeng Zhang 0001. 12153-12161 [doi]
- DGSAN: Dual-Graph Spatiotemporal Attention Network for Pulmonary Nodule Malignancy PredictionXiao Yu, Zhaojie Fang, Guanyu Zhou, Yin Shen, Huoling Luo, Ye Li, Ahmed Elazab, Xiang Wan, Ruiquan Ge, Changmiao Wang. 12162-12168 [doi]
- PointMC: Multi-view Consistent Encoding and Center-Global Feature Fusion for Point Clouds UnderstandingXinxing Yu, Ajian Liu 0001, Sunyuan Qiang, Yuzhong Wang, Hui Ma 0018, Yanyan Liang 0001. 12169-12177 [doi]
- EARG-Net: Edge-Aware Reconstruction-Guided Network for Image Manipulation Detection and LocalizationYanpu Yu, Zhaoxin Shi, Hanqing Zhao, Tianyi Wei, Wenbo Zhou 0004, Nenghai Yu. 12178-12186 [doi]
- CueBench: Advancing Unified Understanding of Context-Aware Video Anomalies in Real-WorldYating Yu, Congqi Cao, Zhaoying Wang, Weihua Meng, Jie Li, Yuxin Li, Zihao Wei, Zhongpei Shen, Jiajun Zhang. 12187-12195 [doi]
- End-to-End Multi-Person Pose Estimation with Pose-Aware Video TransformerYonghui Yu, Jiahang Cai, Xun Wang 0007, Wenwu Yang. 12196-12203 [doi]
- RealUHR: Harnessing Patch-Cascade Flows for Photorealistic Ultra-High-Resolution SynthesisYongsheng Yu, Haitian Zheng, Zhe Lin 0001, Connelly Barnes, YuQian Zhou, Zhifei Zhang, Jiebo Luo 0001. 12204-12212 [doi]
- Instruction-Guided Cross-Modal Clustering for Training-Free Visual Token Pruning in Vision-Language ModelsYunqian Yu, Biao Chen, Yunya Zhang, Tonglan Xie, Mengmeng Jing, Lin Zuo. 12213-12221 [doi]
- Semantics and Content Matter: Towards Multi-Prior Hierarchical Mamba for Image DerainingZhaocheng Yu, Kui Jiang, Junjun Jiang, Xianming Liu 0005, Guanglu Sun, Yi Xiao 0003. 12222-12230 [doi]
- Domain-Aware Suppression and Aggregation for Federated DG ReIDZhixi Yu, Wei Liu 0183, Wenke Huang 0003, Bin Yang 0026, Qian Bie, Guancheng Wan, Xin Xu 0007. 12231-12239 [doi]
- InfoCLIP: Bridging Vision-Language Pretraining and Open-Vocabulary Semantic Segmentation via Information-Theoretic Alignment TransferMuyao Yuan, Yuanhong Zhang, Weizhan Zhang, Lan Ma, Yuan Gao, Jiangyong Ying, Yudeng Xin. 12240-12248 [doi]
- Graph Smoothing for Enhanced Local Geometry Learning in Point Cloud AnalysisShangbo Yuan, Jie Xu 0044, Ping Hu 0001, Xiaofeng Zhu 0001, Na Zhao 0004. 12250-12258 [doi]
- Strip R-CNN: Large Strip Convolution for Remote Sensing Object DetectionXinbin Yuan, Zhaohui Zheng 0003, Yuxuan Li 0004, Xialei Liu, Li Liu 0004, Xiang Li 0041, Qibin Hou, Ming-Ming Cheng. 12259-12267 [doi]
- Geometry-Aware Noisy Correspondence Mitigation for Cross-Modal Text-Based Person RetrievalXinpan Yuan, Shaomin Xie, Liujie Hua, Chengyuan Zhang 0001, Guihu Zhao, Lin Yuanbo Wu. 12268-12276 [doi]
- UniMapGen: A Generative Framework for Large-Scale Map Construction from Multi-modal DataYujian Yuan, Changjie Wu, Xinyuan Chang, Sijin Wang, Hang Zhang, Shiyi Liang, Shuang Zeng, Mu Xu. 12277-12285 [doi]
- Decompose and Attribute: Boosting Generalizable Open-Set Object Detection via Objectness ScoreYuxuan Yuan, Lichen Wei, Luyao Tang, Chaoqi Chen, Zheyuan Cai, Yue Huang 0001, Xinghao Ding. 12286-12294 [doi]
- I2CD: An Invertible Causal Framework for Compositional Zero-Shot Learning via Disentangle-Compose-DisentangleZhaoquan Yuan, Zining Wang, Yuankang Pan, Ao Luo, Wei Li 0110, Xiao Wu 0001, Changsheng Xu. 12295-12303 [doi]
- Arbitrary-Scale 3D Gaussian Super-ResolutionHuimin Zeng, Yue Bai, Yun Fu 0001. 12304-12312 [doi]
- PriorDrive: Enhancing Online HD Mapping with Unified Vector PriorsShuang Zeng, Xinyuan Chang, Xinran Liu, Yujian Yuan, Shiyi Liang, Zheng Pan, Mu Xu, Xing Wei 0001. 12313-12321 [doi]
- PromptEmo: Learning Emotion with Bilateral Textual Prompts in Multi-Domain Open-set ScenariosXinyi Zeng, Yuxiang Yang 0009, Pinxian Zeng, Wenxia Yin, Bo Liu 0113, Xi Wu 0004, Yan Wang 0015. 12322-12330 [doi]
- Seeing Beyond Illusion: Generalized and Efficient Mirror DetectionMingfeng Zha, Guoqing Wang 0001, Tianyu Li 0003, Wei Dong 0010, Peng Wang 0023, Yang Yang 0002. 12331-12339 [doi]
- CASL: Curvature-Augmented Self-supervised Learning for 3D Anomaly DetectionYaohua Zha, Xue Yuerong, Chunlin Fan, Yuansong Wang, Tao Dai 0001, Ke Chen 0004, Shu-Tao Xia. 12340-12348 [doi]
- P-SLCR: Unsupervised Point Cloud Semantic Segmentation via Prototypes Structure Learning and Consistent ReasoningLixin Zhan, Jie Jiang 0017, Tianjian Zhou, Yukun Du, Yan Zheng, Xuehu Duan. 12349-12357 [doi]
- L2V-CoT: Cross-Modal Transfer of Chain-of-Thought Reasoning via Latent InterventionYu-Liang Zhan, Xinyu Tang 0004, Han Wan, Jian Li 0064, Jirong Wen, Hao Sun 0002. 12358-12366 [doi]
- TongUI: Internet-Scale Trajectories from Multimodal Web Tutorials for Generalized GUI AgentsBofei Zhang, Zirui Shang, Zhi Gao 0002, Wang Zhang, Rui Xie, Xiaojian Ma 0001, Tao Yuan, Xinxiao Wu, Song Chun Zhu, Qing Li 0003. 12367-12375 [doi]
- Perception in Plan: Coupled Perception and Planning for End-to-End Autonomous DrivingBozhou Zhang, Jingyu Li, Nan Song, Li Zhang 0040. 12376-12384 [doi]
- PanFlow: Decoupled Motion Control for Panoramic Video GenerationCheng Zhang, Hanwen Liang, Donny Y. Chen, Qianyi Wu, Konstantinos N. Plataniotis, Camilo Cruz Gambardella, Jianfei Cai 0001. 12385-12393 [doi]
- Event-Guided Super-Resolving Blurry Image via Asymmetric Integral Driven ConsistencyChi Zhang 0027, Xiang Zhang 0022, Lei Yu 0006, Gui-Song Xia, Yuming Fang, Wenhan Yang. 12394-12402 [doi]
- Enhancing Noise Resilience in Face Clustering via Sparse Differential TransformerDafeng Zhang, Yongqi Song, Shizhuo Liu. 12403-12411 [doi]
- D²Pruner: Debiased Importance and Structural Diversity for MLLM Token PruningEvelyn Zhang, Fufu Yu, Aoqi Wu, Zichen Wen, Ke Yan, Shouhong Ding, Biqing Qi, Linfeng Zhang 0001. 12412-12420 [doi]
- Decoding with Structured Awareness: Integrating Directional, Frequency-Spatial, and Structural Attention for Medical Image SegmentationFan Zhang, Zhiwei Gu, Hua Wang. 12421-12429 [doi]
- CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory AugmentationGuanghao Zhang, Tao Zhong, Yan Xia 0006, Mushui Liu, Zhelun Yu, Haoyuan Li 0002, Wanggui He, Dong She, Yi Wang, Hao Jiang 0014. 12430-12438 [doi]
- Aware Distillation for Robust Vision-Language Tracking Under Linguistic SparsityGuangtong Zhang, Bineng Zhong 0001, Shirui Yang, Yang Wang, Tian Bai 0002. 12439-12447 [doi]
- BEVDilation: LiDAR-Centric Multi-Modal Fusion for 3D Object DetectionGuowen Zhang, Chenhang He, Liyi Chen 0002, Lei Zhang 0006. 12448-12456 [doi]
- DSFedMed: Dual-Scale Federated Medical Image Segmentation via Mutual Distillation Between Foundation and Lightweight ModelsHanwen Zhang, Qiaojin Shen, Yuxi Liu 0005, Yuesheng Zhu, Guibo Luo. 12457-12465 [doi]
- Robust Fusion Controller: Degradation-Aware Image Fusion with Fine-Grained Language InstructionsHao Zhang 0073, Yanping Zha, Qingwei Zhuang, Zhenfeng Shao, Jiayi Ma 0001. 12466-12474 [doi]
- High-Speed FHD Full-Color Video Computer-Generated HolographyHaomiao Zhang, Miao Cao, Xuan Yu, Hui Luo, Yanling Piao, Mengjie Qin, Zhangyuan Li, Ping Wang 0029, Xin Yuan 0002. 12475-12483 [doi]
- Delving into Dynamic Scene Cue-Consistency for Robust 3D Multi-Object TrackingHaonan Zhang, Xinyao Wang, Boxi Wu, Tu Zheng, Wang Yunhua, Zheng Yang 0008. 12484-12492 [doi]
- FGM-HD: Boosting Generation Diversity of Fractal Generative Models through Hausdorff Dimension InductionHaowei Zhang, Yuanpei Zhao, Ji-Zhe Zhou 0001, Mao Li 0001. 12493-12501 [doi]
- Exo2Ego: Exocentric Knowledge Guided MLLM for Egocentric Video UnderstandingHaoyu Zhang, Qiaohui Chu, Meng Liu 0006, Haoxiang Shi, Yaowei Wang 0001, Liqiang Nie. 12502-12510 [doi]
- Towards Explainable Video Camouflaged Object Detection: SAM2 with Eventstream-Inspired DataHong Zhang 0018, Yixuan Lyu, Hanyang Liu, Jianbo Song, Ding Yuan 0001, Yifan Yang 0003. 12511-12519 [doi]
- VGD: Value-Guided Diffusion Toward High-Utility Medical Image SegmentationHongyu Zhang, Haipeng Chen 0002, Chengxin Yang, Yingda Lyu. 12520-12528 [doi]
- Proxy Zero-Shot Hashing with Multimodal Fusion via Stable DiffusionHui Zhang, Weikang Gao, Tao Yang, Yuan Cao 0005. 12529-12537 [doi]
- xMHashSeg: Cross-modal Hash Learning for Training-free Unsupervised LiDAR Semantic SegmentationJialong Zhang 0002, Yachao Zhang 0001, Yao Wu, Jiangming Shi, Fangyong Wang, Yanyun Qu. 12538-12546 [doi]
- SAQ-SAM: Semantically-Aligned Quantization for Segment Anything ModelJing Zhang, Zhikai Li, Chengzhi Hu, Xuewen Liu, Qingyi Gu. 12547-12555 [doi]
- Collaborative Transformers with Multi-Level Forensic Attention for Image Manipulation LocalizationJiwei Zhang 0007, Wenbo Feng, Siwei Wang, Feifei Kou, Haoyang Yu, Shaozhang Niu. 12556-12563 [doi]
- Frequency-Aware Vision-Language Multimodality Generalization Network for Remote Sensing Image ClassificationJunjie Zhang 0011, Feng Zhao 0005, Hanqiang Liu 0001, Jun Yu 0001. 12564-12572 [doi]
- Spherical Geometry Diffusion: Generating High-quality 3D Face Geometry via Sphere-anchored RepresentationsJunyi Zhang, Yiming Wang, Yunhong Lu, Qichao Wang, Wenzhe Qian, Xiaoyin Xu, David Gu, Min Zhang. 12573-12581 [doi]
- Geometry-Aware Stereo Matching via Monocular Disparity Distribution Prior and Gradient EnhancementJunze Zhang, Luoxi Jing, Yuanyuan Wang 0002, Xueqi Li, Guoli Yang, Songchang Jin, Chunping Qiu. 12582-12590 [doi]
- Top-Down Semantic Refinement for Image CaptioningJusheng Zhang, Kaitong Cai, Jing Yang, Jian Wang 0100, Chengpei Tang, Keze Wang. 12591-12599 [doi]
- ProCrop: Learning Aesthetic Image Cropping from Professional CompositionsKe Zhang, Tianyu Ding, Jiachen Jiang, Tianyi Chen, Ilya Zharkov, Vishal M. Patel, Luming Liang. 12600-12608 [doi]
- Gaussian Uncertainty-Driven Multi-Model Fitting with Graph Neural NetworkLigang Zhang, Jun Li, Qiming Li. 12609-12617 [doi]
- HCC-3D: Hierarchical Compensatory Compression for 98% 3D Token Reduction in Vision-Language ModelsLiheng Zhang, Jin Wang, Hui Li, Bingfeng Zhang, Weifeng Liu 0001. 12618-12626 [doi]
- What You See Is What You Reach: Towards Spatial Navigation with High-Level Human InstructionsLingFeng Zhang, Haoxiang Fu, Xiaoshuai Hao, Shuyi Zhang, Qiang Zhang 0029, Rui Liu, Long Chen, Wenbo Ding. 12627-12635 [doi]
- Simba: Towards High-Fidelity and Geometrically-Consistent Point Cloud Completion via Transformation DiffusionLirui Zhang, Zhengkai Zhao, Zhi Zuo, Pan Gao 0001, Jie Qin 0004. 12636-12644 [doi]
- MPI-Mamba: Latent Feature Fusion Mamba for Anisotropic Image Calibration and Deblurring in Magnetic Particle ImagingLiwen Zhang, Zhaoji Miao, Yusong Shen, Zechen Wei, Hui Hui, Jie Tian 0001. 12645-12653 [doi]
- CO²IF: Language-Bridging Hyperspectral-Multispectral Image Fusion with Coordinated and Cross-modal Optimal TransportMingjin Zhang, Zhongkai Yang, Fei Gao 0006. 12654-12662 [doi]
- Exploring Generalizable Remote Sensing Change Detection via Low-Rank Exchange Adaptation of Vision Foundation ModelMingwei Zhang, Jingtao Hu, Qiang Li 0042, Qi Wang 0009. 12663-12671 [doi]
- Unified Interaction Consistency Learning for Single-Source Domain-Generalized Object Detection in Urban ScenePeng Zhang 0121, Xiang Yuan, Gong Cheng 0003. 12672-12680 [doi]
- FaNe: Towards Fine-Grained Cross-Modal Contrast with False-Negative Reduction and Text-Conditioned Sparse AttentionPeng Zhang 0057, Zhihui Lai 0001, Wenting Chen, Xu Wu, Heng Kong. 12681-12689 [doi]
- LR-AdaInSeg: Adaptive Instance Segmentation of Incomplete 3D Scenes Driven by Low-Rank NetworksQin Zhang, Kun Zhou, Xulun Ye. 12690-12698 [doi]
- FashionMAC: Deformation-Free Fashion Image Generation with Fine-Grained Model Appearance CustomizationRong Zhang, Jinxiao Li, Jingnan Wang, Zhiwen Zuo, Jianfeng Dong, Wei Li 0111, Chi Wang 0004, Weiwei Xu 0003, Xun Wang 0007. 12699-12707 [doi]
- Zo3T: Zero-Shot 3D-Aware Trajectory-Guided Image-to-Video Generation via Test-Time TrainingRuicheng Zhang, Jun Zhou, Zunnan Xu, Zihao Liu, Jiehui Huang, Mingyang Zhang, Yu Sun, Xiu Li 0001. 12708-12716 [doi]
- AuthSig: Safeguarding Scanned Signatures Against Unauthorized Reuse in Paperless WorkflowsRuiqiang Zhang, Zehua Ma, Guanjie Wang, Chang Liu 0089, Hengyi Wang, Weiming Zhang 0001. 12717-12725 [doi]
- VCapsBench: A Large-scale Fine-grained Benchmark for Video Caption Quality EvaluationShi-Xue Zhang, Hongfa Wang, Duojun Huang, Xin Li, Xiaobin Zhu 0001, Xu-Cheng Yin. 12726-12734 [doi]
- FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video GenerationShilong Zhang, Wenbo Li, Shoufa Chen, Chongjian Ge, Peize Sun, Yifu Zhang, Yi Jiang 0009, Zehuan Yuan, Bingyue Peng, Ping Luo 0002. 12735-12743 [doi]
- YOLO-IOD: Towards Real Time Incremental Object DetectionShizhou Zhang, Xueqiang Lv, Yinghui Xing, Qirui Wu, Di Xu 0010, Chen Zhao, Yanning Zhang 0001. 12744-12752 [doi]
- SimpleDiffusion: A Lightweight and Efficient Conditional Diffusion Model for Multi-Modal Salient Object DetectionShuo Zhang 0013, Jiaming Huang, Wenbing Tang 0001, Jing Liu 0012, Li Han 0001, Jiandun Li, Hongchun Yuan, Zizhu Fan. 12753-12761 [doi]
- Tracking and Segmenting Anything in Any ModalityTianlu Zhang, Qiang Zhang 0020, Guiguang Ding, Jungong Han. 12762-12770 [doi]
- Unifying Locality of KANs and Feature Drift Compensation Projection for Data-Free Replay Based Continual Face Forgery DetectionTianshuo Zhang, Siran Peng, Li Gao, Haoyuan Zhang, Xiangyu Zhu 0001, Zhen Lei 0001. 12771-12779 [doi]
- Fine-Grained DINO Tuning with Dual Supervision for Face Forgery DetectionTianxiang Zhang, Peipeng Yu, Zhihua Xia, Longchen Dai, Xiaoyu Zhou, Hui Gao. 12780-12788 [doi]
- Beyond Illumination: Fine-Grained Detail Preservation in Extreme Dark Image RestorationTongshun Zhang, Pingping Liu, Zixuan Zhong, Zijian Zhang, Qiuzhan Zhou. 12789-12797 [doi]
- SPJFNet: Self-Mining Prior-Guided Joint Frequency Enhancement for Ultra-Efficient Dark Image RestorationTongshun Zhang, Pingping Liu, Zijian Zhang, Qiuzhan Zhou. 12798-12806 [doi]
- Dual-Path Knowledge-Augmented Contrastive Alignment Network for Spatially Resolved TranscriptomicsWei Zhang, Jiajun Chu, Xinci Liu, Chen Tong, Xinyue Li. 12807-12815 [doi]
- UniFit: Towards Universal Virtual Try-on with MLLM-Guided Semantic AlignmentWei Zhang 0196, Yeying Jin, Xin Li 0082, Yan Zhang 0004, Xiaofeng Cong, Cong Wang 0018, Fengcai Qiao, Zhichao Lian. 12816-12824 [doi]
- MolSight: Optical Chemical Structure Recognition with SMILES Pretraining, Multi-Granularity Learning and Reinforcement LearningWenrui Zhang, Xinggang Wang, Bin Feng 0001, Wenyu Liu 0001. 12825-12833 [doi]
- Mitigating Error Accumulation in Co-Speech Motion Generation via Global Rotation Diffusion and Multi-Level ConstraintsXiangyue Zhang, Jianfang Li, Jianqiang Ren, Jiaxu Zhang. 12834-12842 [doi]
- Learning Better UAV-Based Cross-View Object Geo-Localization from Multi-Modal Prompts: MoP-UAV Benchmark and MoPT FrameworkXiaohan Zhang, Zhangkai Shen, Si-Yuan Cao, Xiaokai Bai, Yiming Li, Zheheng Han, Zhe Wu, Qi Ming, Hui-Liang Shen. 12843-12851 [doi]
- Any2RSI: Controllable Remote Sensing Text-to-Image Generation via Any Control and Enriched DescriptionXu Zhang 0044, Jianzhong Huang, Lefei Zhang. 12852-12860 [doi]
- ClearAIR: A Human-Visual-Perception-Inspired All-in-One Image RestorationXu Zhang 0044, Huan Zhang 0008, Guoli Wang 0004, Qian Zhang 0009, Lefei Zhang. 12861-12869 [doi]
- VQ-Insight: Teaching VLMs for AI-Generated Video Quality Understanding via Progressive Visual Reinforcement LearningXuanyu Zhang, Weiqi Li, Shijie Zhao 0001, Junlin Li, Li Zhang 0006, Jian Zhang 0018. 12870-12878 [doi]
- Introducing Decomposed Causality with Spatiotemporal Object-Centric Representation for Video ClassificationYachong Zhang, Lei Meng 0001, Shuo Xu, Zhuang Qi, Wei Wu, Lei Wu 0002, Xiangxu Meng. 12879-12887 [doi]
- Adaptive Dynamic Dehazing via Instruction-Driven and Task-Feedback Closed-Loop Optimization for Diverse Downstream Task AdaptationYafei Zhang, Shuaitian Song, Huafeng Li 0001, Shujuan Wang, Yu Liu 0023. 12888-12896 [doi]
- PET2Rep: Towards Vision-Language Model-Drived Automated Radiology Report Generation for Positron Emission TomographyYichi Zhang 0007, Wenbo Zhang, Zehui Ling, Gang Feng, Sisi Peng, Deshu Chen, Yuchen Liu, Hongwei Zhang, Shuqi Wang, Lanlan Li, Limei Han, Yuan Cheng, Zixin Hu, Yuan Qi 0001, Le Xue. 12897-12906 [doi]
- Cyto-SSL: A Self-Supervised Pretraining Framework for Cytology Foundation ModelYiming Zhang, Rui Yan 0009, Xiaohua Wan, Yifan Zhao, Shuang Feng, Zhetao Xu, Ying Wang 0043, Fa Zhang 0001, Bin Hu 0001. 12907-12915 [doi]
- Causal-Tune: Mining Causal Factors from Vision Foundation Models for Domain Generalized Semantic SegmentationYin Zhang, Yongqiang Zhang, Yaoyue Zheng, Bogdan Raducanu, Dan Liu. 12916-12924 [doi]
- Real Noise Decoupling for Hyperspectral Image DenoisingYingkai Zhang, Tao Zhang 0042, Jing Nie, Ying Fu 0001. 12925-12933 [doi]
- LLaVA-UHD v2: Exploiting Hierarchical Vision Granularity in MLLMs via Inverse Semantic PyramidYipeng Zhang, Yifan Liu, Zonghao Guo, Yidan Zhang, Xuesong Yang, Xiaoying Zhang, Chi Chen 0005, Jun Song, Yuan Yao 0013, Tat-Seng Chua, Maosong Sun 0001. 12934-12942 [doi]
- Integrating Diverse Assignment Strategies into DETRsYiWei Zhang, Jin Gao, Hanshi Wang, Fudong Ge, Guan Luo, Weiming Hu 0004, Zhipeng Zhang. 12943-12951 [doi]
- SGAT: Learning Feature Matching with Singularity-enhanced Graph Attention NetworkYizhuo Zhang, Kun Sun 0002, Chang Tang, Yuanyuan Liu 0004, Xin Li 0005. 12952-12960 [doi]
- Anchor-Guided Discriminative Subspace Alignment and Clustering for Cross-Scene Hyperspectral ImageryYongshan Zhang, Zixuan Zhang, Xinxin Wang 0003, Lefei Zhang, Zhihua Cai. 12961-12969 [doi]
- Joint Implicit and Explicit Language Learning for Pedestrian Attribute RecognitionYukang Zhang, Lei Tan, Yang Lu 0009, Yan Yan 0001, Hanzi Wang. 12970-12978 [doi]
- M3SR: Multi-Scale Multi-Perceptual Mamba for Efficient Spectral ReconstructionYuze Zhang, Lingjie Li, Qiuzhen Lin, Zhong Ming 0001, Fei Yu 0016, Victor C. M. Leung. 12979-12987 [doi]
- InstructDubber: Instruction-based Alignment for Zero-shot Movie DubbingZhedong Zhang, Liang Li 0003, Gaoxiang Cong 0001, Chunshan Liu, YuHan Gao, Xiaowan Wang, Tao Gu, Yuankai Qi. 12988-12996 [doi]
- FourierPET: Deep Fourier-based Unrolled Network for Low-count PET ReconstructionZheng Zhang, Hao Tang 0007, Yingying Hu, Zhanli Hu, Jing Qin 0001. 12997-13005 [doi]
- Evolving Generalist Virtual Agents with Generative and Associative MemoryZhenkui Zhang, Wendong Bu, Kaihang Pan, Bingchen Miao, Wenqiao Zhang, Guoming Wang, Wei Ji 0008, Rui Tang, Juncheng Li 0006, Siliang Tang. 13006-13014 [doi]
- Adaptive Morph-Patch Transformer for Aortic Vessel SegmentationZhenxi Zhang, Fuchen Zheng, Adnan Iltaf, Yifei Han, Zhenyu Chen 0001, Yue Du, Bin Li, Tianyong Liu, Shoujun Zhou. 13015-13024 [doi]
- GEMA-Score: Granular Explainable Multi-Agent Scoring Framework for Radiology Report EvaluationZhenxuan Zhang, Kinhei Lee, Peiyuan Jing, Weihang Deng, Huichi Zhou, Zihao Jin, Jiahao Huang, Zhifan Gao, Dominic C. Marshall, Yingying Fang, Guang Yang 0006. 13025-13033 [doi]
- D²-VPR: A Parameter-efficient Visual-foundation-model-based Visual Place Recognition Method via Knowledge Distillation and Deformable AggregationZheyuan Zhang, Jiwei Zhang 0001, Boyu Zhou, Linzhimeng Duan, Hong Chen. 13034-13042 [doi]
- RPE-PAD: Relative Pose Estimation for Pose-agnostic Anomaly DetectionZhipeng Zhang, Mengzan Qi, Rongkang Ma, Yingying Fang, Guixu Zhang, Tieyong Zeng, Zhi Li 0080. 13043-13051 [doi]
- Disentangling for Transfer: Boosting Limited Modalities via Information-Theoretic Regularization and Cross-Modal ReconstructionZhiyun Zhang, Yan-Jie Zhou, Yujian Hu, Xiyao Ma, Zhouhang Yuan, Zirui Wang, Hongkun Zhang, Minfeng Xu. 13052-13060 [doi]
- Simulating Distribution Dynamics: Liquid Temporal Feature Evolution for Single-Domain Generalized Object DetectionZihao Zhang, Yang Li, Aming Wu, Yahong Han. 13061-13069 [doi]
- Diffusion Distillation with Direct Preference Optimization for Efficient 3D LiDAR Scene CompletionAn Zhao, Shengyuan Zhang, Zejian Li, Ling Yang 0006, Pei Chen 0005, Jiale Wu, Haoran Xu, Anyang Wei, Perry Pengyun Gu, Lingyun Sun. 13070-13078 [doi]
- CondDiff-AMO: Integrating Conditional Diffusion Mechanism for Unified Amodal Mask GenerationCaijie Zhao, Bob Zhang 0001. 13079-13087 [doi]
- MAISI-v2: Accelerated 3D High-Resolution Medical Image Synthesis with Rectified Flow and Region-specific Contrastive LossCan Zhao 0001, Pengfei Guo, Dong Yang 0005, Yufan He, Yucheng Tang, Benjamin Simon, Mason Belue, Stephanie A. Harmon, Baris Turkbey, Daguang Xu. 13088-13098 [doi]
- Real-time 3D Object Detection with Inference-Aligned LearningChenyu Zhao, Xianwei Zheng, Zimin Xia, Linwei Yue, Nan Xue 0006. 13099-13107 [doi]
- Learning to LEAP: Efficient Dense Point Tracking by Focusing Where It MattersChenzhi Zhao, Wufan Wang, Bo Zhang 0032, Wendong Wang 0003. 13108-13116 [doi]
- Partially Shared Concept Bottleneck ModelsDelong Zhao, Qiang Huang, Di Yan, Yiqun Sun, Jun Yu. 13117-13125 [doi]
- Towards Affordance-Aware Robotic Dexterous Grasping with Human-like PriorsHaoyu Zhao, Linghao Zhuang, Xingyue Zhao, Cheng Zeng, Haoran Xu, Yuming Jiang 0007, Jun Cen, Kexiang Wang, Jiayan Guo, Siteng Huang, Xin Li 0056, Deli Zhao, Hua Zou 0002. 13126-13134 [doi]
- Causal Decoupling Domain Generalization for Remote Sensing Change DetectionJiaqi Zhao 0001, Jianpeng Xie 0001, Yong Zhou 0003, Wen-Liang Du 0002, Hancheng Zhu, Rui Yao 0006. 13135-13143 [doi]
- Unified Representation Causal Prompt Distillation for Re-Inference-Free Lifelong Person Re-IdentificationJiaqi Zhao 0001, Jie Luo, Yong Zhou 0003, Wen-Liang Du 0002, Xixi Li, Rui Yao 0006. 13144-13152 [doi]
- CLIPDet3D: Vision-Language Collaborative Distillation for 3D Object DetectionJiaqi Zhao 0001, Huanfeng Hu, Yong Zhou 0003, Wen-Liang Du 0002, Kunyang Sun, Rui Yao 0006, Qigong Sun. 13154-13162 [doi]
- Facial Dynamics in Video: Instruction Tuning for Improved Facial Expression Perception and Contextual AwarenessJiaxing Zhao, Boyuan Sun 0001, Xiang Chen, Xihan Wei. 13163-13171 [doi]
- Learning Procedural-Aware Video Representations Through State-Grounded Hierarchy UnfoldingJinghan Zhao, Yifei Huang 0002, Feng Lu 0005. 13172-13180 [doi]
- Beyond Predictive Resampling: Learning Input-Agnostic Downsampling for Efficient Aligned Vision RecognitionKai Zhao 0012, Liting Ruan, Haoran Jiang, Xiaoqiang Zhu, Xianchao Zhang 0002, Dan Zeng 0001. 13181-13189 [doi]
- Cheating Stereo Matching in Full-Scale: Physical Adversarial Attack Against Binocular Depth Estimation in Autonomous DrivingKangqiao Zhao, Shuo Huai, Xurui Song, Jun Luo 0001. 13190-13198 [doi]
- ControlFuse: Instruction-guided Multi-Granularity Controllable Image FusionLibo Zhao, Xiaoli Zhang 0001, Zeyu Wang 0009. 13199-13207 [doi]
- Predicting Video Slot Attention Queries from Random Slot-Feature PairsRongzhen Zhao, Jian Li, Juho Kannala, Joni Pajarinen. 13208-13216 [doi]
- Adaptive-Smooth LiDAR-Camera Knowledge Distillation with Heterogeneous Fusion for Multi-View 3D Object DetectionRui Zhao, Shuoyao Wang, Xinhu Zheng, Shijian Gao. 13217-13225 [doi]
- Multi-Modal Assistance for Unsupervised Domain Adaptation on Point Cloud 3D Object DetectionShenao Zhao, Pengpeng Liang, Zhoufan Yang. 13226-13234 [doi]
- ObjectAdv: Object-Level Unrestricted Adversarial Attacks via Diffusion ModelsShijie Zhao, Zhenyu Liang, Xing Yang 0004, Haoqi Gao, Anjie Peng, Hui Zeng 0002. 13235-13243 [doi]
- KineST: A Kinematics-guided Spatiotemporal State Space Model for Human Motion Tracking from Sparse SignalsShuting Zhao, Zeyu Xiao, Xinrong Chen. 13244-13252 [doi]
- DialoGen: Towards Dialog Gesture Generation via Identity-Decoupled Style Guidance in Interactive Diffusion ModelWeiyu Zhao, Chenyang Wang, Liangxiao Hu, Zonglin Li, Wei Yu 0004, Shengping Zhang. 13253-13261 [doi]
- Good Gradients Poison Your Model: Evading Defenses in Federated Learning via Boundary-adaptive PerturbationXiaojie Zhao, Jinqiao Shi, Yi Li, Junmin Huang, Chongru Fan. 13262-13270 [doi]
- Studying Classifier(-Free) Guidance from a Classifier-Centric PerspectiveXiaoming Zhao 0001, Alex Schwing 0001. 13271-13279 [doi]
- GloTok: Global Perspective Tokenizer for Image Reconstruction and GenerationXuan Zhao, Zhongyu Zhang, Yuge Huang, Yuxi Mi, Guodong Mu, Shouhong Ding, Jun Wang 0006, Rizen Guo, Shuigeng Zhou. 13280-13288 [doi]
- OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene UnderstandingYoujun Zhao, Jiaying Lin 0001, Shuquan Ye, Qianshi Pang, Rynson W. H. Lau. 13289-13296 [doi]
- UDCH: Unsupervised Dynamic Weighted Cluster-cooperative Hashing for Cross-modal RetrievalYuanzhi Zhao, Fan Yang, Yudong Zhao, Xiaoyu Li. 13297-13304 [doi]
- SceneGenesis: 3D Scene Synthesis via Semantic Structural Priors and Mesh-Guided Video-Geometry FusionYueming Zhao, Hongyu Yang, Di Huang 0001. 13305-13313 [doi]
- Tackling Dual-stage Missing Modalities in Brain Tumor Segmentation via Robust Modality Reconstruction and Prompt-guided Modality AdaptationYunpeng Zhao, Cheng Chen 0013, Qing You Pang, Yibing Fu, Quanzheng Li, Carol Tang, Beng Ti Ang, Yueming Jin. 13314-13322 [doi]
- Temporal Calibrating and Distilling for Scene-Text Aware Text-Video RetrievalZhiqian Zhao, Liang Li 0003, Lei Shen, Xichun Sheng, Yaoqi Sun, Fang Kang, Chenggang Yan 0001. 13323-13331 [doi]
- CogStream: Context-guided Streaming Video Question AnsweringZiCheng Zhao, Kangyu Wang, Shijie Li, Rui Qian 0001, Weiyao Lin, Huabin Liu 0001. 13332-13341 [doi]
- ProxyTTT: Proxy-driven Test-Time Training for Multi-modal Re-identificationAihua Zheng, Zhaojun Liu, Xixi Wan, Chenglong Li 0002, Jin Tang 0001, Yan Yan 0002. 13342-13350 [doi]
- Progressive Multi-modal Knowledge Distillation for Multi-spectral Object Re-identificationAihua Zheng, Pengyu Li, Zi Wang 0013, Jin Tang 0001. 13351-13359 [doi]
- Semantic-Driven Visual Progressive Refinement for Aerial-Ground Person ReID: A Challenging Large-Scale BenchmarkAihua Zheng, Hao Xie, Xixi Wan, Zi Wang 0013, Shihao Li, Jin Tang 0001, Bin Luo 0001. 13360-13368 [doi]
- Manipulating the Mind's Eye: A-SAGE, the Attention-Based Attack on ViT ExplainabilityBoshi Zheng, Yan Li 0035, Jiabin Liu. 13369-13377 [doi]
- Open-World Deepfake Attribution via Confidence-Aware Asymmetric LearningHaiyang Zheng, Nan Pu, Wenjing Li 0005, Teng Long, Nicu Sebe, Zhun Zhong. 13378-13386 [doi]
- HiFC-GAN: Hierarchical Feature-Constrained GAN for Optical-to-SAR Transfer in SAR Target ClassificationHao Zheng 0009, Meiguang Zheng, Zhigang Hu 0001, Liu Yang 0015, Aikun Xu, Tingxuan Chen, Rongchang Zhao, Boyu Wang 0004. 13387-13395 [doi]
- Distilling Future Temporal Knowledge with Masked Feature Reconstruction for 3D Object DetectionHaowen Zheng, Hu Zhu, Lu Deng, Weihao Gu, Yang Yang 0062, Yanyan Liang 0001. 13396-13404 [doi]
- Hierarchical Dual-Domain Fusion with Frequency-Guided Spatial Modeling for Pan-SharpeningHuangqimei Zheng, Chengyi Pan, Qian Jiang, Wei Zhou 0011, Xin Jin 0005. 13405-13413 [doi]
- Universal Adversarial Purification with DDIM Metric Loss for Stable DiffusionLi Zheng, Liangbin Xie, Jiantao Zhou 0001, Yimin He. 13414-13422 [doi]
- E³SAM2: Entropy-Aware and Edge-Guided Adaptation of SAM2 for Echocardiography Video SegmentationLong Zheng, Zhi Li 0012, Weidong Wang, Zhenyu Dai, Shuyun Li. 13423-13431 [doi]
- WinMamba: Multi-Scale Shifted Windows in State Space Model for 3D Object DetectionLonghui Zheng, Qiming Xia, Xiaolu Chen, Zhaoliang Liu, Chenglu Wen. 13432-13440 [doi]
- Physically-Based LiDAR Smoke Simulation for Robust 3D Object DetectionShijun Zheng, Yu Guo, Weiquan Liu, Yu Zang, Siqi Shen, Ming Cheng 0002, Cheng Wang 0003. 13441-13448 [doi]
- Forecast Then Calibrate: Feature Caching as ODE for Efficient Diffusion TransformersShikang Zheng, Liang Feng, Xinyu Wang, Qinming Zhou, Peiliang Cai, Chang Zou, Jiacheng Liu, Yuqi Lin, Junjie Chen, Yue Ma, Linfeng Zhang 0001. 13449-13457 [doi]
- Content-aware Information Compression and Selection for Whole Slide Image AnalysisTingting Zheng, Hongxun Yao, Sicheng Zhao, Yi Xiao 0003. 13458-13466 [doi]
- Selective Diffusion Distillation for Real-World High-Scale Image Super-ResolutionWenli Zheng, Huiyuan Fu, Zekai Xu, Xin Wang 0001, Huadong Ma. 13467-13475 [doi]
- Oscillation Inversion: Training-Free Image and Video Enhancement Through Oscillated Latents in Large Flow ModelsYan Zheng, Zhenxiao Liang, Xiaoyan Cong, Yi Yang 0001, Lanqing Guo, Yuehao Wang, Peihao Wang, Zhangyang Wang. 13476-13484 [doi]
- GeoX-Bench: Benchmarking Cross-View Geo-Localization and Pose Estimation Capabilities of Large Multimodal ModelsYushuo Zheng, Jiangyong Ying, Huiyu Duan, Chunyi Li, Zicheng Zhang, Jing Liu 0002, Xiaohong Liu 0001, Guangtao Zhai. 13485-13493 [doi]
- Wavefront-Constrained Passive Obscured Object DetectionZhiwen Zheng, Yiwei Ouyang, Zhao Huang, Tao Zhang, Xiaoshuai Zhang, Huiyu Zhou 0001, Wenwen Tang, Shaowei Jiang, Jin Liu 0025, Xingru Huang. 13494-13502 [doi]
- OwlCap: Harmonizing Motion-Detail for Video Captioning via HMD-270K and Caption Set Equivalence RewardChunlin Zhong, Qiuxia Hou, Zhangjun Zhou, Yanhao Zhang, Shuang Hao 0015, Haonan Lu, He Tang 0002, Xiang Bai. 13503-13511 [doi]
- SE360: Semantic Edit in 360° Panoramas via Hierarchical Data ConstructionHaoyi Zhong, Fang-Lue Zhang, Andrew Chalmers, Taehyun Rhee. 13512-13520 [doi]
- Collaboratively "Copy & Paste" 2D-3D Features for Complex Video-to-Video Motion EditingJia-Xing Zhong, Shijie Zhao, Junlin Li, Li Zhang. 13521-13529 [doi]
- CompEvent: Complex-valued Event-RGB Fusion for Low-light Video Enhancement and DeblurringMingchen Zhong, Xin Lu 0006, Dong Liu 0002, Senyan Xu, Ruixuan Jiang, Xueyang Fu, Baocai Yin. 13530-13538 [doi]
- Empowering Sparse-Input Neural Radiance Fields with Dual-Level Semantic Guidance from Dense Novel ViewsYingji Zhong, Kaichen Zhou, Zhihao Li 0002, Lanqing Hong, Zhenguo Li, Dan Xu 0002. 13539-13547 [doi]
- MoReMouse: Monocular Reconstruction of Laboratory MouseYuan Zhong, Jingxiang Sun, Zhongbin Zhang, Liang An 0001, Yebin Liu. 13548-13556 [doi]
- TG-Field: Geometry-Aware Radiative Gaussian Fields for Tomographic ReconstructionYuxiang Zhong, Jun Wei, Chaoqi Chen, Senyou An, Hui Huang. 13557-13565 [doi]
- Splat-SAP: Feed-Forward Gaussian Splatting for Human-Centered Scene with Scale-Aware Point Map ReconstructionBoyao Zhou, Shunyuan Zheng, Zhanfeng Liao, Zihan Ma 0011, Hanzhang Tu, Boning Liu 0001, Yebin Liu. 13566-13574 [doi]
- M3ashy: Multi-Modal Material Synthesis via HyperdiffusionChenliang Zhou, Zheyuan Hu 0006, Alejandro Sztrajman, YanCheng Cai, Yaru Liu, Cengiz Öztireli. 13575-13583 [doi]
- Paper Folding Puzzles: Can Multimodal Large Language Models Perform Spatial Reasoning?Dibin Zhou, Yantao Xu, Zongming Huang, Zengwei Yan, Wenhao Liu, Yongwei Miao, Jianfeng Ren, Fuchang Liu. 13584-13592 [doi]
- IdentityStory: Taming Your Identity-Preserving Generator for Human-Centric Story GenerationDonghao Zhou, Jingyu Lin, Guibao Shen, Quande Liu, Jialin Gao, Lihao Liu, Lan Du 0002, Cunjian Chen, Chi-Wing Fu, Xiaowei Hu 0001, Pheng-Ann Heng. 13593-13601 [doi]
- Exploring Position Encoding Mechanism in Diffusion U-Net for Training-free High-resolution Image GenerationFeng Zhou, Pu Cao, Yiyang Ma, Lu Yang 0006, Yonghao Dang, Jianqin Yin. 13602-13610 [doi]
- Toward Real-World High-Precision Image Matting and SegmentationHaipeng Zhou, Zhaohu Xing, Hongqiu Wang, Jun Ma 0008, Ping Li 0016, Lei Zhu 0003. 13611-13619 [doi]
- MACS: Multi-source Audio-to-image Generation with Contextual Significance and Semantic AlignmentHao Zhou, Xiaobao Guo, Yuzhe Zhu, Adams Wai-Kin Kong. 13620-13628 [doi]
- Duplex Rewards Optimization for Test-Time Composed Image RetrievalHaoliang Zhou, Feifei Zhang, Changsheng Xu. 13629-13637 [doi]
- Seeing and Knowing in the Wild: Open-domain Visual Entity Recognition with Large-scale Knowledge Graphs via Contrastive LearningHongkuan Zhou, Lavdim Halilaj, Sebastian Monka, Stefan Schmid 0002, Yuqicheng Zhu, Jingcheng Wu, Nadeem Nazer, Steffen Staab. 13638-13646 [doi]
- T-Rex-Omni: Integrating Negative Visual Prompt in Generic Object DetectionJiazhou Zhou, Qing Jiang, Kanghao Chen, Lutao Jiang, Yuanhuiyi Lyu, Ying-Cong Chen, Lei Zhang. 13647-13655 [doi]
- StrokeFusion: Vector Sketch Generation via Joint Stroke-UDF Encoding and Latent Sequence DiffusionJin Zhou, Yi Zhou, Hongliang Yang, Pengfei Xu, Hui Huang. 13656-13664 [doi]
- Think Before You Segment: An Object-aware Reasoning Agent for Referring Audio-Visual SegmentationJinxing Zhou, Yanghao Zhou, Mingfei Han 0002, Tong Wang 0022, Xiaojun Chang, Hisham Cholakkal, Rao Muhammad Anwer. 13665-13673 [doi]
- CLASP: Cross-modal Salient Anchor-based Semantic Propagation for Weakly-supervised Dense Audio-Visual Event LocalizationJinxing Zhou, Ziheng Zhou, Yanghao Zhou, Yuxin Mao, Zhangling Duan, Dan Guo 0001. 13674-13682 [doi]
- VPHO: Joint Visual-Physical Cue Learning and Aggregation for Hand-Object Pose EstimationJun Zhou 0026, Chi Xu 0002, Kaifeng Tang, Yuting Ge, Tingrui Guo, Li Cheng 0001. 13683-13691 [doi]
- Preserving Topological and Geometric Embeddings for Point Cloud RecoveryKaiyue Zhou, Zelong Tan, Hongxiao Wang, Ya-Li Li, Shengjin Wang. 13692-13700 [doi]
- Certified L2-Norm Robustness of 3D Point Cloud Recognition in the Frequency DomainLiang Zhou, Qiming Wang, Tianze Chen. 13701-13709 [doi]
- TargetVAU: Multimodal Anomaly-Aware Reasoning for Target Behavior Understanding in VideosLingru Zhou, Peng Wu 0015, Manqing Zhang, QingSheng Wang, Guansong Pang, Peng Wang 0015. 13710-13718 [doi]
- Mitigating Entity Hallucinations in 3D Radiology Report Generation via Dual-Stream AlignmentLingyu Zhou, Yue Yu, Zhang Yi 0001, Xiuyuan Xu. 13719-13727 [doi]
- Hierarchical Prompt Learning for Image- and Text-Based Person Re-IdentificationLinhan Zhou, Shuang Li, Neng Dong, Yonghang Tai, Yafei Zhang, Huafeng Li 0001. 13728-13736 [doi]
- Thermal-Physics Guided Infrared Image Super-Resolution with Dynamic High-Frequency AmplificationMingxuan Zhou, Yirui Shen, Shuang Li, Jing Geng, Yutang Zhang, Shuigen Wang. 13737-13745 [doi]
- Reasoning via Implicit Self-supervised Emergence for Instruction SegmentationQing Zhou, Lichang Yang, Yuyu Jia, Junyu Gao 0001, Weiping Ni, Junzheng Wu, Qi Wang 0009. 13746-13754 [doi]
- Bridging Vision and Language for Robust Context-Aware Surgical Point Tracking: The VL-SurgPT Dataset and BenchmarkRulin Zhou, Wenlong He, An Wang 0007, Jianhang Zhang, Xuanhui Zeng, Xi Zhang, Chaowei Zhu, Haijun Hu, Hongliang Ren 0001. 13755-13763 [doi]
- ASSIST-3D: Adapted Scene Synthesis for Class-Agnostic 3D Instance SegmentationShengchao Zhou, Jiehong Lin, Jiahui Liu 0012, Shizhen Zhao, Chirui Chang, Xiaojuan Qi 0001. 13764-13772 [doi]
- CompTrack: Information Bottleneck-Guided Low-Rank Dynamic Token Compression for Point Cloud TrackingSifan Zhou, Yichao Cao, Jiahao Nie 0001, Yuqian Fu, Ziyu Zhao, Xiaobo Lu, Shuo Wang 0030. 13773-13781 [doi]
- OpenDriveVLA: Towards End-to-end Autonomous Driving with Large Vision Language Action ModelXingcheng Zhou, Xuyuan Han, Feng Yang, Yunpu Ma, Volker Tresp, Alois Knoll. 13782-13790 [doi]
- ReaSon: Reinforced Causal Search with Information Bottleneck for Video UnderstandingYuan Zhou 0023, Litao Hua, Shilong Jin, Wentao Huang, Haoran Duan 0001. 13791-13799 [doi]
- LookFlow: Training-Free and Efficient High-Resolution Image Synthesis via Dynamic Lookahead Guidance FlowYuan Zhou, Yan Zhang, Jianlong Chang, Xin Gu, Ying Wang 0008, Kun Ding 0001, Guangwen Yang, Shiming Xiang. 13800-13808 [doi]
- Beyond Counting: Evaluating Abstract and Emotional Reasoning in Vision-Language ModelsYuan Zhou, Yan Zhang, Jianlong Chang, Xin Gu, Ying Wang 0008, Kun Ding 0001, Guangwen Yang, Shiming Xiang. 13809-13817 [doi]
- DragNeXt: Rethinking Drag-Based Image EditingYuan Zhou 0016, Junbao Zhou, Qingshan Xu 0001, Kesen Zhao, Yuxuan Wang, Hao Fei 0001, Richang Hong, Hanwang Zhang. 13818-13825 [doi]
- Less Is More: Vision Representation Compression for Efficient Video Generation with Large Language ModelsYucheng Zhou 0001, Jihai Zhang 0002, Guanjie Chen, Jianbing Shen, Yu Cheng 0001. 13826-13834 [doi]
- Debiased Dual-Invariant Defense for Adversarially Robust Person Re-IdentificationYuhang Zhou, Yanxiang Zhao, Zhongyun Hua, Zhipu Liu, Zhaoquan Gu, Qing Liao 0001, Leo Yu Zhang. 13835-13843 [doi]
- Zero-Shot Open-Vocabulary Human Motion Grounding with Test-Time TrainingYunjiao Zhou, Xinyan Chen 0002, Junlang Qian, Lihua Xie 0001, Jianfei Yang. 13844-13852 [doi]
- Few-step Flow for 3D Generation via Marginal-Data Transport DistillationZanwei Zhou, Taoran Yi, Jiemin Fang, Chen Yang 0023, Lingxi Xie, Xinggang Wang, Wei Shen 0002, Qi Tian 0001. 13853-13861 [doi]
- Δt-Mamba3D: A Time‑Aware Spatio‑Temporal State‑Space Model for Breast Cancer Risk PredictionZhengbo Zhou, Dooman Arefan, Margarita L. Zuley, Shandong Wu. 13862-13870 [doi]
- Semantic Guided Part Relation-aware Network for Point Cloud CompletionZhensheng Zhou, Jianqing Liang, Jiye Liang, Zijin Du, Chenghao Fang 0001. 13871-13879 [doi]
- DICE: Distilling Classifier-Free Guidance into Text EmbeddingsZhenyu Zhou, Defang Chen 0001, Can Wang 0001, Chun Chen 0001, Siwei Lyu. 13880-13888 [doi]
- Content Diversity-guided Ambiguity Mitigation for Open-Set Noisy Label LearningZhihao Zhou, Rui Li 0059, Xueying Li. 13889-13897 [doi]
- HiTVideo: Hierarchical Tokenizers for Enhancing Text-to-Video Generation with Autoregressive Large Language ModelsZiqin Zhou, Yifan Yang 0004, Yuqing Yang 0001, Tianyu He, Houwen Peng, Kai Qiu, Qi Dai 0001, Lili Qiu, Chong Luo 0001, Lingqiao Liu. 13898-13906 [doi]
- Hierarchical Schedule Optimization for Fast and Robust Diffusion Model SamplingAihua Zhu, Rui Su, Qinglin Zhao, Li Feng 0001, Meng Shen 0001, Shibo He. 13907-13915 [doi]
- MedEyes: Learning Dynamic Visual Focus for Medical Progressive DiagnosisChunzheng Zhu, Yangfang Lin, Shen Chen, Yijun Wang, Jianxin Lin. 13916-13924 [doi]
- Self-Supervised Representation Learning with Joint Embedding Predictive Architecture for Automotive LiDAR Object DetectionHaoran Zhu, Zhenyuan Dong, Kristi Topollai, Beiyao Sha, Anna Choromanska. 13925-13933 [doi]
- Other Vehicle Trajectories Are Also Needed: A Driving World Model Unifies Ego-Other Vehicle Trajectories in Video Latent SpaceJian Zhu, Zhengyu Jia, Tian Gao, Jiaxin Deng, Shidi Li, Lang Zhang, Fu Liu, Peng Jia, Xianpeng Lang. 13934-13942 [doi]
- AnchorDS: Anchoring Dynamic Sources for Semantically Consistent Text-to-3D GenerationJiayin Zhu, Linlin Yang, Yicong Li 0004, Angela Yao. 13943-13951 [doi]
- LENS: Learning to Segment Anything with Unified Reinforced ReasoningLianghui Zhu, Bin Ouyang, Yuxuan Zhang, Tianheng Cheng, Rui Hu, Haocheng Shen, Longjin Ran, Xiaoxin Chen 0001, Li Yu 0003, Wenyu Liu 0001, Xinggang Wang. 13952-13960 [doi]
- Class Incremental Medical Image Segmentation via Prototype-Guided Calibration and Dual-Aligned DistillationShengqian Zhu, Chengrong Yu, Qiang Wang, Ying Song, Guangjun Li, Jiafei Wu, Xiaogang Xu 0002, Zhang Yi 0001, Junjie Hu 0004. 13961-13969 [doi]
- Graph-Driven Domain Co-Adaptation for Cross-Domain Image Quality AssessmentShun Zhu, Xichen Yang, Yan Zhang, Tianshu Wang 0001, Zhongyuan Mao, Tianyin Li, Zhuoyan Sun, Xiaobo Shen 0001. 13970-13978 [doi]
- VTD-CLIP: Video-to-Text Discretization via Prompting CLIPWencheng Zhu, Yuexin Wang, Hongxuan Li, Pengfei Zhu 0001. 13979-13987 [doi]
- SLCFormer: Spectral-Local Context Transformer with Physics-Grounded Flare Synthesis for Nighttime Flare RemovalXiyu Zhu, Wei Wang, Xin Yuan, Xiao Wang. 13988-13996 [doi]
- CapeNext: Rethinking and Refining Dynamic Support Information for Category-Agnostic Pose EstimationYu Zhu, Dan Zeng 0002, Shuiwang Li, Qijun Zhao, Qiaomu Shen, Bo Tang 0016. 13997-14004 [doi]
- Pixel-level Quality Assessment for Oriented Object DetectionYunhui Zhu, Buliao Huang. 14005-14013 [doi]
- SinBasis Networks: Matrix-Equivalent Feature Extraction for Wave-Like Optical SpectrogramsYuzhou Zhu, Zheng Zhang, Ruyi Zhang, Liang Zhou. 14014-14021 [doi]
- Training-Free Spatio-temporal Decoupled Reasoning Video Segmentation with Adaptive Object MemoryZhengtong Zhu, Jiaqing Fan, Zhixuan Liu, Fanzhang Li. 14022-14030 [doi]
- Q Cache: Visual Attention Is Valuable in Less than Half of Decode Layers for Multimodal Large Language ModelJiedong Zhuang, Lu Lu, Ming Dai, Rui Hu, Jian Chen, Qiang Liu, Haoji Hu. 14031-14039 [doi]
- Libra-MIL: Multimodal Prototypes Stereoscopic Infused with Task-specific Language Priors for Few-shot Whole Slide Image ClassificationZhenfeng Zhuang, Fangyu Zhou, Liansheng Wang 0002. 14040-14048 [doi]
- Tuning for Two Adversaries: Enhancing the Robustness Against Transfer and Query-Based Attacks Using Hyperparameter TuningPascal Zimmer, Ghassan Karame. 14049-14058 [doi]
- HouseTune: Two-Stage Floorplan Generation with LLM AssistanceZiyang Zong, Guanying Chen, Zhaohuan Zhan, Fengcheng Yu, Guang Tan. 14059-14067 [doi]
- EchoGen: Cycle-Consistent Learning for Unified Layout-Image Generation and UnderstandingKai Zou, Hongbo Liu, Dian Zheng, Jianxiong Gao, Zhiwei Zhao, Bin Liu 0016. 14068-14076 [doi]
- Appearance Discrepancy-guided Sequence Hybrid Masking for Robust Scene Text RecognitionShihao Zou, Wei Wei 0002, Leyang Xu, Kaihe Xu, Wenfeng Xie. 14077-14085 [doi]
- Turbo-VAED: Fast and Stable Transfer of Video-VAEs to Mobile DevicesYa Zou, Jingfeng Yao, Siyuan Yu, Shuai Zhang 0050, Wenyu Liu 0001, Xinggang Wang. 14086-14094 [doi]
- HATIR: Heat-Aware Diffusion for Turbulent Infrared Video Super-ResolutionYang Zou 0004, Xingyue Zhu, Kaiqi Han, Jun Ma, Xingyuan Li 0005, Zhiying Jiang, Jinyuan Liu 0001. 14095-14103 [doi]
- Boosting Adversarial Transferability via Ensemble Non-AttentionYipeng Zou, Qin Liu 0001, Jie Wu 0001, Yu Peng 0003, Guo Chen 0001, Hui Zhou 0014, Guanghui Ye. 14104-14112 [doi]
- Decoupling What to Count and Where to See for Referring Expression CountingYuda Zou, Zijian Zhang, Yongchao Xu. 14113-14121 [doi]
- CoordAR: One-Reference 6D Pose Estimation of Novel Objects via Autoregressive Coordinate Map GenerationDexin Zuo, Ang Li, Wei Wang, Wenxian Yu, Danping Zou. 14122-14130 [doi]
- Faster Symmetry Breaking Constraints for Abstract StructuresÖzgür Akgün, Mun See Chang, Ian P. Gent, Christopher Jefferson. 14132-14139 [doi]
- Faster Certified Symmetry Breaking Using Orders with Auxiliary VariablesMarkus Anders, Bart Bogaerts 0001, Benjamin Bogø, Arthur Gontier, Wietze Koops, Ciaran McCreesh, Magnus O. Myreen, Jakob Nordström, Andy Oertel, Adrian Rebola-Pardo, Yong Kiam Tan. 14140-14148 [doi]
- Learning with Structure: Computing Consistent Subsets on Structurally-Regular GraphsAritra Banik, Mano Prakash Parthasarathi, Venkatesh Raman 0001, Diya Roy, Abhishek Sahu. 14149-14156 [doi]
- Greedily Maximizing Ex-Ante FairnessRuben Becker, Bojana Kodric, Cosimo Vinci. 14157-14165 [doi]
- Ordered Objectives in Maximum SatisfiabilityJeremias Berg, André Schidler, Matti Järvisalo. 14166-14174 [doi]
- Proof Systems for Tensor-based Model CountingOlaf Beyersdorff, Joachim Giesen, Andreas Goral, Tim Hoffmann, Kaspar Kasche, Christoph Staudt. 14175-14183 [doi]
- Proof Systems That Tightly Characterise Model Counting AlgorithmsOlaf Beyersdorff, Tim Hoffmann, Kaspar Kasche. 14184-14191 [doi]
- Using Certifying Constraint Solvers for Generating Step-wise ExplanationsIgnace Bleukx, Maarten Flippo, Bart Bogaerts 0001, Emir Demirovic, Tias Guns. 14192-14200 [doi]
- Learning DFAs from Positive Examples Only via Word CountingBenjamin Bordais, Daniel Neider. 14201-14208 [doi]
- Aperiodic Tiling and Rhythmic Canons: A CP JourneyGuillaume Derval, Christophe Lecoutre. 14209-14216 [doi]
- Exact Algorithms for Distance to Unique Vertex CoverFoivos Fioravantes, Dusan Knop, Nikolaos Melissinos, Michal Opler, Manolis Vasilakis. 14217-14224 [doi]
- Preference Elicitation for Step-Wise Explanations in Logic PuzzlesMarco Foschini, Marianne Defresne, Emilio Gamba, Bart Bogaerts 0001, Tias Guns. 14225-14233 [doi]
- Model Counting for Dependency Quantified Boolean FormulasLong-Hin Fung, Che Cheng, Jie-Hong Roland Jiang, Friedrich Slivovsky, Tony Tan. 14234-14242 [doi]
- Constraint Optimization of MicroPlate DesignsRamiz Gindullin, María Andreína Francisco Rodríguez. 14243-14250 [doi]
- Efficient and Reliable Hitting-Set Computations for the Implicit Hitting Set ApproachHannes Ihalainen, Dieter Vandesande, André Schidler, Jeremias Berg, Bart Bogaerts 0001, Matti Järvisalo. 14251-14260 [doi]
- The Limitations and Power of NP-Oracle Based Functional Synthesis TechniquesBrendan Juba, Kuldeep S. Meel. 14261-14268 [doi]
- Graph Choosability via SAT: Beyond the NullstellensatzMarkus Kirchweger, Tomás Peitl, David Seka, Stefan Szeider. 14269-14277 [doi]
- Using Constraint Solvers to Construct Binary Codes with Good Error Correction PerformanceStepan Kochemazov, Oleg Zaikin 0002, Grigorii Trofimiuk, Kirill Antonov, Alexander A. Semenov. 14278-14286 [doi]
- Towards Single Exponential Time for Temporal and Spatial Reasoning: A Study via Redundancy and Dynamic ProgrammingVictor Lagerkvist, Johanna Groven, Leif Eriksson. 14287-14294 [doi]
- Scale-Net: A Hierarchical U-Net Framework for Cross-Scale Generalization in Multi-Task Vehicle RoutingSuyu Liu, Zhiguang Cao, Nan Yin, Yew-Soon Ong. 14295-14303 [doi]
- LLM-Guided Quantified SMT Solving over Uninterpreted FunctionsKunhang Lv, Yuhang Dong, Rui Han, Fuqi Jia, Feifei Ma, Jian Zhang 0001. 14304-14312 [doi]
- Constrained Molecule Generation Modelled Using the Grammar ConstraintDavid Saikali, Gilles Pesant. 14313-14321 [doi]
- Assignment Problems in Cost Function NetworksGuidio Sewa, David Allouche, Simon de Givry, George Katsirelos, Pierre Montalbano, Thomas Schiex. 14322-14330 [doi]
- A GPU-based Constraint Programming SolverPierre Talbot. 14331-14341 [doi]
- Certified Branch-and-Bound MaxSAT SolvingDieter Vandesande, Jordi Coll, Bart Bogaerts 0001. 14342-14351 [doi]
- Generative Branching for Mixed-Integer Linear ProgrammingRuobing Wang, Xin Li, Yangchuan Wang, Zijian Zhang, Mingzhong Wang. 14352-14360 [doi]
- Cubing for TuningHaoze Wu 0001, Clark W. Barrett, Nina Narodytska. 14361-14370 [doi]
- Co-Layout: LLM-driven Co-optimization for Interior LayoutChucheng Xiang, Ruchao Bao, Biyin Feng, Wenzheng Wu, Zhongyuan Liu, Yirui Guan, Ligang Liu 0001. 14371-14379 [doi]
- CSP4SDG: Constraint and Information-Theory Based Role Identification in Social Deduction Games with LLM-Enhanced InferenceKaijie Xu, Fandi Meng, Clark Verbrugge, Simon Mark Lucas. 14380-14387 [doi]
- Scalable Mixed-Integer Optimization with Neural Constraints via Dual DecompositionShuli Zeng, Sijia Zhang, Feng Wu 0001, Shaojie Tang 0001, Xiangyang Li 0001. 14388-14396 [doi]
- Right Branches Matter in Failure-based Variable Ordering HeuristicsYang Zhang, Hongbo Li 0005. 14397-14404 [doi]
- Improving Exact Algorithm for Pseudo Boolean Optimization with Two New Phase Selection HeuristicsYujiao Zhao, Yizhan Xiang, Jiangnan Li, Yiyuan Wang 0002, Minghao Yin. 14405-14413 [doi]
- Relational Verification for Cost-Aware Quantum Program OptimizationZiming Zhao 0008, Tingting Li 0004, Zhaoxuan Li, Jianwei Yin. 14414-14422 [doi]
- Exact Optimization for Minimum Dominating SetsEnqiang Zhu, Qiqi Bao, Yu Zhang 0231, Chanjuan Liu 0001, Pu Wu. 14423-14430 [doi]
- T-SKM-Net: Trainable Neural Network Framework for Linear Constraint Satisfaction via Sampling Kaczmarz-Motzkin MethodHaoyu Zhu, Yao Zhang, Jiashen Ren, Qingchun Hou. 14431-14439 [doi]
- Beyond Static: Related Questions Retrieval Through Conversations in Community Question AnsweringXiao Ao, Jie Zou 0001, Yibiao Wei, Peng Wang 0023, Weikang Guo. 14441-14449 [doi]
- Extracting Interaction-Aware Monosemantic Concepts in Recommender SystemsDor Arviv, Yehonatan Elisha, Oren Barkan, Noam Koenigstein. 14450-14458 [doi]
- Brownian Bridge Augmented Surrogate Simulation and Injection Planning for Geological CO2 StorageHaoyue Bai 0002, Guodong Chen 0002, Wangyang Ying, Xinyuan Wang 0011, Nanxu Gong, Sixun Dong, Giulia Pedrielli, Haoyu Wang 0003, Haifeng Chen, Yanjie Fu. 14459-14466 [doi]
- NP-MiSR: Neural Process-based Multi-Interest Learning for Session-Based RecommendationJun Bao, Junbo Wang, Yiheng Jiang, Xiangfeng Liu, Mingyang Lv, Yuanbo Xu. 14467-14474 [doi]
- DynaQuant: Dynamic Mixed-Precision Quantization for Learned Image CompressionYouneng Bao, Yulong Cheng, Yiping Liu, Yichen Yang, Peng Qin, Mu Li 0005, Yongsheng Liang 0001. 14475-14483 [doi]
- Fidelity-Aware Recommendation Explanations via Stochastic Path IntegrationOren Barkan, Yahlly Schein, Yehonatan Elisha, Veronika Bogina, Mikhail Baklanov, Noam Koenigstein. 14484-14492 [doi]
- F2RVLM: Boosting Fine-grained Fragment Retrieval for Multi-Modal Long-form Dialogue with Vision Language ModelHanbo Bi, Zhiqiang Yuan, Zexi Jia, Jiapei Zhang, Chongyang Li, Peixiang Luo, Ying Deng, Xiaoyue Duan, Jinchao Zhang 0001. 14493-14501 [doi]
- HCF: Hierarchical Cascade Framework for Distributed Multi-Stage Image CompressionJunhao Cai, Taegun An, Chengjun Jin, Sung-il Choi, Juhyun Park, Changhee Joo. 14502-14510 [doi]
- Augmenting Intra-Modal Understanding in MLLMs for Robust Multimodal Keyphrase GenerationJiajun Cao, Qinggang Zhang, Yunbo Tang, Zhishang Xiang, Chang Yang, Jinsong Su. 14511-14519 [doi]
- VBF++: Variational Bayesian Fusion with Context-Aware Priors and Recommendation-Guided Adversarial Refinement for Multimodal Video RecommendationZiyi Cao, Rui Liu 0007, Yong Chen 0008. 14520-14528 [doi]
- Learning to Compress Graphs via Dual Agents for Consistent Topological Robustness EvaluationQisen Chai, Yansong Wang, Junjie Huang, Tao Jia 0001. 14529-14537 [doi]
- MISF: MLLM Guided Iterative Sample Filtering for Data Fault DetectionGuoying Chen, Ruizhuo Zhao, Zhewei Xu, Bo Yang, Kunlong Wang. 14538-14546 [doi]
- Breaking the Aggregation Bottleneck in Federated Recommendation: A Personalized Model Merging ApproachJundong Chen 0003, Honglei Zhang 0002, Chunxu Zhang, Fangyuan Luo, Yidong Li. 14547-14555 [doi]
- Diffusion Reconstruction-based Data Likelihood Estimation for Core-Set SelectionMingyang Chen, Jiawei Du, Bo Huang, Yi Wang 0017, Xiaobo Zhang, Wei Wang 0011. 14556-14564 [doi]
- Transform-Free Feature Coding via Entropy-Constrained Vector QuantizationQiaoxi Chen, Changsheng Gao, Li Li 0040, Dong Liu 0002. 14565-14573 [doi]
- ProRec-Video: Guiding Hierarchical Interest Transitions for Proactive Short Video Recommendation with Dynamic Feedback AdaptationWeizhi Chen, Baoyun Peng, Bo Liu 0014, Xingkong Ma, Houjie Qiu. 14574-14582 [doi]
- Dual-Kernel Graph Community Contrastive LearningXiang Chen, Kun Yue, Wenjie Liu, Zhenyu Zhang, Liang Duan. 14583-14591 [doi]
- ARDiff: Anisotropic Residual Diffusion for Heterogeneous Graph LearningYong Chen, Li Li, Nannan Zong, Zhihui Liu, Song-Zhi Su. 14592-14600 [doi]
- TOPOGRAPH: Topology-Preserving Graph Reduction with Adaptive Structure for Persistent HomologyZonghao Chen, Yuncheng Jiang 0004, Gang Li. 14601-14609 [doi]
- GraphRAG-Induced Dual Knowledge Structure Graphs for Personalized Learning Path RecommendationXinghe Cheng, Zihan Zhang, Jiapu Wang, Liangda Fang, Chaobo He, Quanlong Guan, Shirui Pan, Weiqi Luo 0002. 14610-14620 [doi]
- Reinforced Rate Control for Neural Video Compression via Inter-Frame Rate-Distortion AwarenessWuyang Cong, Junqi Shi, Lizhong Wang, Weijing Shi, Ming Lu 0003, Hao Chen 0036, Zhan Ma 0001. 14621-14629 [doi]
- De-collapsing User Intent: Adaptive Diffusion Augmentation with Mixture-of-Experts for Sequential RecommendationXiaoxi Cui, Chao Zhao, Yurong Cheng, Xiangmin Zhou. 14630-14638 [doi]
- Intermediate N-Gramming: Deterministic and Fast N-Grams for Large N and Large DatasetsRyan R. Curtin, Fred Lu, Edward Raff, Priyanka Ranade. 14639-14647 [doi]
- Delayed Feedback Modeling with Influence FunctionsChenlu Ding, Jiancan Wu, Yancheng Yuan, Cunchun Li, Xiang Wang 0010, Dingxian Wang, Frank Yang, Andrew Rabinovich. 14648-14656 [doi]
- ARNS: Adaptive Relation-Aware Negative Sampling with Curriculum Learning for Inductive Knowledge Graph CompletionLing Ding 0001, Zhizhi Yu, Di Jin 0001, Lei Huang. 14657-14665 [doi]
- Scalable Semi-supervised Community Search via Graph Transformer on Attributed Heterogeneous Information NetworksLinLin Ding, Zhaosong Zhao, Mo Li 0004, Yishan Pan, Xin Wang 0030, Renata Borovica-Gajic. 14666-14674 [doi]
- Inductive Generative Recommendation via Retrieval-based SpeculationYijie Ding, Jiacheng Li 0003, Julian J. McAuley, Yupeng Hou. 14675-14683 [doi]
- Transferable Graph Condensation from the Causal PerspectiveHuaming Du, Yijie Huang, Su Yao, Yiying Wang, Yueyang Zhou, Jingwen Yang, Jinshi Zhang, Han Ji, Yu Zhao 0019, Guisong Liu, Hegui Zhang, Carl Yang 0001, Gang Kou. 14684-14692 [doi]
- HISE-KT: Synergizing Heterogeneous Information Networks and LLMs for Explainable Knowledge Tracing with Meta-Path OptimizationZhiyi Duan, Zixing Shi, Hongyu Yuan, Qi Wang 0078. 14693-14701 [doi]
- Multi-granularity Intent Modeling with Adversarial Robustness for Sequential RecommendationYangyi Fang, Haolin Shi. 14702-14710 [doi]
- Subgraph Encoding with Bicentric Sphere Node Labeling and Pooling for Link PredictionZhihong Fang, Shaolin Tan, Qiu Fang, Zhe Li 0050, Qing Gao 0001. 14711-14719 [doi]
- Stable and Adaptive Fusion for Multi-domain Multi-task RecommendationKe Fei, Da Luo, Kangyi Lin, Zibin Zhang, Jingjing Li 0001. 14720-14728 [doi]
- Quantifying the Potential to Escape Filter Bubbles: A Behavior-Aware Measure via Contrastive SimulationDifu Feng, Qianqian Xu 0001, Zitai Wang, Cong Hua, Zhiyong Yang 0001, Qingming Huang. 14729-14737 [doi]
- DeepRAHT: Learning Predictive RAHT for Point Cloud Attribute CompressionChunyang Fu, Tai Qin, Shiqi Wang 0001, Zhu Li 0001. 14738-14746 [doi]
- Towards LLM-Empowered Knowledge Tracing via LLM-Student Hierarchical Behavior Alignment in Hyperbolic SpaceXingcheng Fu, Shengpeng Wang, Yisen Gao, Xianxian Li, Chunpei Li, Qingyun Sun, Dongran Yu. 14747-14755 [doi]
- NeuralOM: Neural Ocean Model for Subseasonal-to-Seasonal SimulationYuan Gao, Hao Wu, Fan Xu 0009, Yanfei Xiang, Ruijian Gou, Ruiqi Shu, Qingsong Wen, Xian Wu 0001, Kun Wang 0056, Xiaomeng Huang. 14756-14764 [doi]
- Order-Preserving Dimension Reduction for Multimodal Semantic EmbeddingChengyu Gong, Gefei Shen, Luanzheng Guo, Nathan R. Tallent, Dongfang Zhao 0001. 14765-14773 [doi]
- OneSug: The Unified End-to-End Generative Framework for E-commerce Query SuggestionXian Guo, Ben Chen, Siyuan Wang, Ying Yang, Mingyue Cheng 0004, Chenyi Lei, Yuqing Ding, Han Li 0005. 14774-14782 [doi]
- DiffMM: Efficient Method for Accurate Noisy and Sparse Trajectory Map Matching via One Step DiffusionChenxu Han, Sean Bin Yang, Jilin Hu. 14783-14791 [doi]
- Beyond Single Transactions: D-EMAML - Dual-Edge Motif Neural Networks for Enhanced Anti-Money Laundering DetectionDongmei Han, Min Min, Yuchen Wang, Guoming Xu, Xiaofeng Zhou. 14792-14801 [doi]
- LLMTM: Benchmarking and Optimizing LLMs for Temporal Motif Analysis in Dynamic GraphsBing Hao, Minglai Shao 0001, Zengyi Wo, Yunlong Chu, Yuhang Liu 0006, Ruijie Wang 0004. 14802-14810 [doi]
- M²VAE: Multi-Modal Multi-View Variational Autoencoder for Cold-start Item RecommendationChuan He 0005, Yongchao Liu 0004, Qiang Li 0054, Chuntao Hong, Wenliang Zhong, Xin-Wei Yao 0001. 14811-14819 [doi]
- Exploiting Inter-Session Information with Frequency-enhanced Dual-Path Networks for Sequential RecommendationPeng He, Yanglei Gan, Tingting Dai, Run Lin, Xuexin Li, Yao Liu 0019, Qiao Liu 0003. 14820-14828 [doi]
- Multimodal Graph Representation Learning with Dynamic Information PathwaysXiaobin Hong 0002, Mingkai Lin, Xiaoli Wang, Chaoqun Wang 0012, Wenzhong Li. 14829-14837 [doi]
- RecCocktail: A Generalizable and Efficient Framework for LLM-Based RecommendationMin Hou, Chenxi Bai, Le Wu, Hao Liu 0003, Kai Zhang 0038, Weiwen Liu, Richang Hong, Ruiming Tang, Meng Wang 0001. 14838-14847 [doi]
- Tokenize Once, Recommend Anywhere: Unified Item Tokenization for Multi-domain LLM-based RecommendationYu Hou, Won-Yong Shin. 14848-14855 [doi]
- NTSFormer: A Self-Teaching Graph Transformer for Multimodal Isolated Cold-Start Node ClassificationJun Hu 0016, Yufei He, Yuan Li 0032, Bryan Hooi, Bingsheng He. 14856-14864 [doi]
- Echoless Label-Based Pre-computation for Memory-Efficient Heterogeneous Graph LearningJun Hu 0016, Shangheng Chen, Yufei He, Yuan Li 0032, Bryan Hooi, Bingsheng He. 14865-14873 [doi]
- From IDs to Semantics: A Generative Framework for Cross-Domain Recommendation with Adaptive Semantic TokenizationPeiyu Hu, Wayne Lu, Jia Wang 0009. 14874-14882 [doi]
- Emotion and Intention Guided Multi-Modal Learning for Sticker Response SelectionYuxuan Hu 0005, Jian Chen 0011, Yuhao Wang 0006, Zixuan Li 0001, Jing Xiong, Pengyue Jia, Wei Wang, Chengming Li, Xiangyu Zhao 0001. 14883-14891 [doi]
- BAG: Benchmarking Anomaly Detection on Dynamic GraphsFengrui Hua, Yiyan Qi, Zikai Wei, Yuxing Tian, Chengjin Xu, Xiaojun Wu, Jia Li, Jian Guo 0016. 14892-14900 [doi]
- Context-aware Graph Meta-learningNingbo Huang, Gang Zhou, Meng Zhang, Shunhang Li, Ling Wang, Shiyu Wang, Yi Xia. 14901-14909 [doi]
- FusedRec: Fused Embedding Communication for Distributed Recommendation Training on GPUsXuanteng Huang, Fan Li, Riyang Hu, Jianchang Zhang, Yuan Peng, Yang Zhou, Fangying Chen, XianWei Zhang. 14910-14918 [doi]
- DuoKD: Dual Knowledge Distillation from Large Language Models for Robust Graph Neural NetworksCuiying Huo, Xiaotong Huang, Dongxiao He, Yixuan Du, Wenhuan Lu, Di Jin 0001. 14919-14927 [doi]
- LLM-Aligned Geographic Item Tokenization for Local-Life RecommendationHao Jiang, Guoquan Wang, Donglin Zhou, Sheng Yu, Yang Zeng, Wencong Zeng, Kun Gai, Guorui Zhou. 14928-14936 [doi]
- SSCL: Adversarially Guided Image Compression via Semantic and Spectral Consistency LearningWei Jiang 0031, Yongqi Zhai, Jiayu Yang, Bohao Feng, Wenqiang Wang, Bo Huang, Lin Ding 0002, Ronggang Wang. 14937-14945 [doi]
- Towards Multimodal Continual Knowledge Embedding with Modality Forgetting ModulationXiaowen Jiang, Jing Yang 0051, Shundong Yang, Yuan Gao 0031, Xinfa Jiang, Laurence Tianruo Yang, Jieming Yang. 14946-14954 [doi]
- CAFU: Constrained Alignment and Filtered Uniformity for Denoising RecommendationXinzhe Jiang, Lei Sang 0001, Yi Zhang 0103, Kaibin Wang, Yiwen Zhang 0001. 14955-14963 [doi]
- Invariant Feature Learning for Counterfactual Watch-time Prediction in Video RecommendationChenghou Jin, Yixin Ren, Hongxu Ma 0001, Yewei Xia, Yi Guan, Hao Zhang 0079, Jiandong Ding, Jihong Guan, Shuigeng Zhou. 14964-14972 [doi]
- Mitigating Noise and Imbalance in Social Governance Graphs for Multi-Type Risk AssessmentDi Jin 0001, Haotian Zhao, Xiaobao Wang, Fengyu Yan, Dongxiao He. 14973-14981 [doi]
- Inference-time Scaling for Diffusion-based Audio Super-resolutionYizhu Jin, Zhen Ye 0006, Zeyue Tian, Haohe Liu, Qiuqiang Kong, Yike Guo, Wei Xue 0002. 14982-14990 [doi]
- Revisiting Contrastive Learning in Collaborative Filtering via Parallel Graph FiltersFang Kai, Yu Zhang, Kaibin Wang, Lei Sang, Yiwen Zhang 0001. 14991-14999 [doi]
- Memorize Early, Then Query: Inlier-Memorization-Guided Active Outlier DetectionMinseo Kang, Seunghwan Park, Dongha Kim. 15000-15008 [doi]
- PEOCH: Online Cross-Modal Hashing with Semi-Supervised Streaming Data Driving Prototype EvolutionXiao Kang, Xingbo Liu, Shuo Pan, Xuening Zhang, Xiushan Nie, Yilong Yin. 15009-15017 [doi]
- CANDI: Curated Test-Time Adaptation for Multivariate Time-Series Anomaly Detection Under Distribution ShiftHyungi Kim, Jisoo Mok, Hyungyu Lee, Juhyeon Shin, Sungroh Yoon. 15018-15026 [doi]
- SACO: Sequence-Aware Constrained Optimization Framework for Coupon Distribution in E-commerceLi Kong, Bingzhe Wang, Zhou Chen, Suhan Hu, Yuchao Ma, Qi Qi 0003, Suoyuan Song, Bicheng Jin. 15027-15035 [doi]
- MovSemCL: Movement-Semantics Contrastive Learning for Trajectory SimilarityZhichen Lai 0001, Hua Lu 0001, Huan Li 0003, Jialiang Li 0007, Christian S. Jensen. 15036-15043 [doi]
- Think Then Rewrite: Reasoning Enhanced Query Rewriting for Domain Specific RetrievalAng Li 0049, Yufei Shi, Yuxuan Si, Yiquan Wu 0001, Ming Cai, Xu Tan, Yi Wang, Changlong Sun, Xiaozhong Liu 0001, Kun Kuang 0001. 15045-15053 [doi]
- SGP4SR: Separated-Modality Guided User Preference Learning for Multimodal Sequential RecommendationChanghong Li, Zhiqiang Guo, Guohui Li, Zhong Yang, Chuhang Hong. 15054-15062 [doi]
- FreqTAD: Multi-scale Frequency Encoding and Time-Frequency Attention for Anomaly Detection in Dynamic GraphsChao Li 0022, Runshuo Liu, Zhongying Zhao 0001, Hui Zhou, Qingtian Zeng. 15063-15071 [doi]
- CL-DMDF: Dynamic Multimodal Data Fusion Model Based on Contrastive LearningDong Li, Lingling Zhang, Binghao Han, LinLin Ding, Yue Kou. 15072-15080 [doi]
- Interest-Shift-Aware Logical Reasoning for Efficient Long-Sequence RecommendationFei Li 0044, Qingyun Gao, Enneng Yang, Jianzhe Zhao, Guibing Guo. 15081-15089 [doi]
- Knowledge-Enhanced Image Captioning with Adaptive Graph-based Multimodal Alignment and LLMGuoyi Li, Die Hu 0004, Haozhe Li, Zhongjiang Yao, Wei Mi, Zongzhen Liu, Xiaodan Zhang 0004, Honglei Lyu. 15090-15098 [doi]
- Subspace-Aware Graph Construction and Contrastive Alignment for Multimodal Recommendation with Large Language ModelsHaodong Li, Lianyong Qi, Weiming Liu 0005, Fan Wang 0020, Chong Li, Shengye Pang, Wenwen Gong, Yanwei Xu 0003, Xiaoxiao Chi, Yang Zhang, Xiaokang Zhou. 15099-15107 [doi]
- Can Molecular Evolution Mechanism Enhance Molecular Representation?Kun Li 0009, Longtao Hu, Jiameng Chen, Hongzhi Zhang, Yida Xiong, Xiantao Cai, Wenbin Hu 0001, Jia Wu 0001. 15108-15116 [doi]
- Adaptive Diffusion-based Augmentation for RecommendationNa Li, Fanghui Sun, Yan Zou, Yangfu Zhu, Xiatian Zhu, Ying Ma. 15117-15125 [doi]
- Multiplex Heterogeneous Graph Neural Networks with Euclidean-Riemannian Mutual Space SynergyXiang Li 0111, Yuan Cao 0005, Zhongying Zhao 0001, Guoqing Chao, Yanwei Yu. 15126-15134 [doi]
- Self-Improving Sparse Retrieval Through Heuristic Representation Refinement and Representation-Focused LearningXiaojing Li, Bin Wang 0015, Xiaochun Yang 0001, Meng Luo. 15135-15143 [doi]
- Exploring Domain Generalization and Subpopulation Shift for Generalizable Graph-Level Anomaly DetectionXiaoxiang Li, Xihe Xie, Hai Wan, Xibin Zhao. 15144-15152 [doi]
- RGMP: Recurrent Geometric-prior Multimodal Policy for Generalizable Humanoid Robot ManipulationXuetao Li, Wenke Huang 0003, Nengyuan Pan, Kaiyan Zhao, Songhua Yang, Yiming Wang, Mengde Li, Mang Ye, Jifeng Xuan, Miao Li 0002. 15153-15161 [doi]
- Data-Centric Sequential Recommendation with Relation-Augmented GenerationYichen Li 0006, Yichen Tan, Yijing Shan, Haozhao Wang, Rui Zhang 0003, Imran Razzak, Ruixuan Li 0001. 15162-15170 [doi]
- DGP: A Dual-Granularity Prompting Framework for Fraud Detection with Graph-Enhanced LLMsYuan Li 0032, Jun Hu 0016, Bryan Hooi, Bingsheng He, Cheng Chen 0008. 15171-15179 [doi]
- APT: Affine Prototype-Timestamp for Time Series Forecasting Under Distribution ShiftYujie Li 0008, Zezhi Shao, Chengqing Yu, Yisong Fu, Tao Sun 0011, Yongjun Xu 0001, Fei Wang 0014. 15180-15188 [doi]
- BLADE: A Behavior-Level Data Augmentation Framework with Dual Fusion Modeling for Multi-Behavior Sequential RecommendationYupeng Li, Mingyue Cheng 0004, Yucong Luo, Yitong Zhou, Qingyang Mao, Shijin Wang 0001. 15189-15197 [doi]
- Capturing Dynamic User Interests Under Modality Imbalance for Multimodal Sequential RecommendationZilong Li 0002, Jia Zhu 0003, Chenglei Huang, Zhangze Chen, Hanghui Guo, Guoqing Ma, Jianxia Ling. 15198-15206 [doi]
- Towards Synthesizing High-Dimensional Tabular Data with Limited SamplesZuqing Li, Junhao Gan, Jianzhong Qi 0001. 15207-15215 [doi]
- Beyond Local Patterns: Multiscale Inconsistency Learning for Graph Anomaly DetectionJie Lian 0006, Zhihao Wu 0003, Jielong Lu, Jiajun Yu, Qianqian Shen, Haishuai Wang. 15216-15224 [doi]
- Sign-Aware Multimodal Graph RecommendationYahong Lian, Haotian Tian, Chunyao Song, Tingjian Ge. 15225-15233 [doi]
- AD-FM: Multimodal LLMs for Anomaly Detection via Multi-Stage Reasoning and Fine-Grained Reward OptimizationJingyi Liao, Yongyi Su, Rong-Cheng Tu, Zhao Jin, Wenhao Sun, Yiting Li, Xun Xu 0002, Dacheng Tao, XuLei Yang. 15234-15242 [doi]
- Stepwise Contrastive Reasoning for Retrieval-Augmented Generation over Knowledge GraphsChenxiao Lin, Ye Luo, KunHong Liu 0001, Qingqiang Wu 0001. 15243-15251 [doi]
- Learnable Matrix Profile for Motif Discovery on Multivariate Time SeriesMingkai Lin, Yinke Wang, Xiaobin Hong 0002, Wenzhong Li. 15252-15260 [doi]
- Comprehensive Urban Region Representation Learning via Multi-View Joint Learning and Contrastive LearningYingde Lin, Yuanbo Xu, Lu Jiang 0007, Pengyang Wang. 15261-15268 [doi]
- Multifaceted Scenario-Aware Hypergraph Learning for Next POI RecommendationYuxi Lin, Yongkang Li, Jie Xing, Zipei Fan. 15269-15277 [doi]
- RAGAR: Retrieval Augmented Personalized Image Generation Guided by RecommendationRun Ling, Wenji Wang, Yuting Liu 0003, Guibing Guo, Haowei Liu, Jian Lu, Quanwei Zhang, Yexing Xu, Shuo Lu, Yun Wang, Yihua Shao, Linying Jiang, Xingwei Wang 0001. 15278-15286 [doi]
- UrbanPG: An Efficient Framework with Personalized Context and General Backbone Interaction for Urban Spatio-Temporal LearningAoyu Liu, Yaying Zhang. 15287-15295 [doi]
- Relative Advantage Debiasing for Watch-Time Prediction in Short-Video RecommendationEmily Liu, Kuan Han, Minfeng Zhan, Bocheng Zhao, Guanyu Mu, Yang Song. 15296-15305 [doi]
- Diagnostic-Guided Dynamic Profile Optimization for LLM-based User Simulators in Sequential RecommendationHongyang Liu, Zhu Sun 0001, Tianjun Wei, Yan Wang 0002, Jiajie Zhu 0001, Xinghua Qu. 15306-15314 [doi]
- Graph2Video: Leveraging Video Models to Model Dynamic Graph EvolutionHua Liu, Yanbin Wei, Fei Xing, Tyler Derr, Haoyu Han 0001, Yu Zhang 0006. 15315-15323 [doi]
- MoToRec: Sparse-Regularized Multimodal Tokenization for Cold-Start RecommenderJialin Liu, Zhaorui Zhang, Ray C. C. Cheung. 15324-15332 [doi]
- Debate over Mixed-knowledge: A Robust Multi-Agent Reasoning Framework for Incomplete Knowledge Graph Question AnsweringJilong Liu, Pengyang Shao, Wei Qin, Fei Liu 0038, Yonghui Yang 0001, Richang Hong. 15333-15341 [doi]
- MACRec: A Multi-View Subspace Alignment Framework for Contrastive Sampling Calibration in RecommendationJunping Liu, Mingchao Yu, Xinrong Hu, Rui Yan, Wanqing Li 0001, Jie Yang 0009, Yi Guo 0001. 15342-15350 [doi]
- Hierarchical Attention Network with Correction for Cross-Domain User AssociationWenlong Liu, Ze Wang, Chenlong Wu, Yude Bai, Ji Zhang. 15351-15359 [doi]
- A Scalable and Exact Relaxation for Densest k-Subgraph via Error BoundsYa Liu, Junbin Liu, Wing-Kin Ma, Aritra Konar. 15360-15368 [doi]
- MIGDiff: Multi-attributes Imputations for Attribute-missing Graphs via Graph Denoising Diffusion ModelYe Liu 0014, Yang Chen, Hongmin Cai. 15369-15376 [doi]
- SEFEL: A Simple Yet Effective Framework for Fast Event LinkingYinan Liu 0001, Ziyang Zhang, Bin Wang 0015, Xiaochun Yang 0001. 15377-15385 [doi]
- PathMind: A Retrieve-Prioritize-Reason Framework for Knowledge Graph Reasoning with Large Language ModelsYu Liu 0118, Xixun Lin, Yanmin Shang, Yangxi Li, Shi Wang 0002, Yanan Cao 0001. 15386-15393 [doi]
- S²HyRec: Self-Supervised Hypergraph Sequential RecommendationYuchen Liu, Kunyu Ni, Zhongying Zhao 0001, Guoqing Chao, Yanwei Yu. 15394-15402 [doi]
- Hyperbolic-Enhanced Mixture-of-Experts Mamba for Sequential RecommendationYuwen Liu 0003, Lianyong Qi, Xingyuan Mao, Weiming Liu 0005, Xuhui Fan 0001, Qiang Ni, Xuyun Zhang, Yang Zhang 0095, Yuan Tian, Amin Beheshti. 15403-15411 [doi]
- Multi-dimensional Adaptive Mix-hop Contextual Learning Framework for Universal Graph Anomaly DetectionZhaowei Liu 0001, Leilei Jiang, Haitao Yang. 15412-15420 [doi]
- UniHR: Hierarchical Representation Learning for Unified Knowledge Graph Link PredictionZhiqiang Liu, Yin-Hua, Mingyang Chen 0002, Yichi Zhang 0009, Zhuo Chen 0007, Lei Liang 0002, Wen Zhang 0015. 15421-15429 [doi]
- Reimagining Anomalies: What If Anomalies Were Normal?Philipp Liznerski, Saurabh Varshneya, Ece Calikus, Puyu Wang, Alexander Bartscher, Sebastian Josef Vollmer, Sophie Fellenz, Marius Kloft. 15430-15438 [doi]
- Region-Point Joint Representation for Effective Trajectory Similarity LearningHao Long, Silin Zhou, Lisi Chen 0001, Shuo Shang. 15439-15447 [doi]
- A Novel Fine-Tuned CLIP-OOD Detection Method with Double Loss Constraint Through Optimal Transport Semantic AlignmentHengyang Lu, Xin Guo, Shuai Feng, Wenyu Jiang, Yuntao Du 0001, Chang Xia, Chenyou Fan. 15448-15456 [doi]
- MaskAD: Parallel Masked Autoencoder for Multi-class Unsupervised Anomaly DetectionRuiying Lu, Gang Liu, Kang Li, Long Tian, Junwei Zhang. 15457-15465 [doi]
- SciMKG: A Multimodal Knowledge Graph for Science Education with Text, Image, Video and AudioTong Lu 0005, Zhichun Wang, Yaoyu Zhou, Yiming Guan, Zhiyong Bai, Junsheng Du. 15466-15474 [doi]
- Revisiting Fairness-aware Interactive Recommendation: Item Lifecycle as a Control KnobYun Lu, Xiaoyu Shi 0001, Hong Xie 0004, Chongjun Xia, Zhenhui Gong, Mingsheng Shang 0001. 15475-15482 [doi]
- Privacy Auditing of Multi-Domain Graph Pre-Trained Model Under Membership Inference AttacksJiayi Luo, Qingyun Sun, Yuecen Wei, Haonan Yuan, Xingcheng Fu, Jianxin Li 0002. 15483-15491 [doi]
- LMGL-WD: LLM-Guided Multi-Task Graph Learning for Category-Level Warehouse Demand Prediction in E-CommerceWenjun Lyu, Fangyu Li, Yudong Zhang, Shuai Wang 0008, Yunhuai Liu, Tian He 0001, Desheng Zhang 0002. 15492-15500 [doi]
- SGMT: Social Generating with Multiview-Guided Tuning In Recommender SystemsJianghong Ma, Changran He, Dezhao Yang, Tianjun Wei, Haijun Zhang 0002, Xiaofeng Zhang 0002. 15501-15509 [doi]
- Hierarchical Frequency-Decomposition Graph Neural Networks for Road Network Representation LearningJingtian Ma, Jingyuan Wang 0001, Leong Hou U. 15510-15518 [doi]
- Multi-graph Fusion Cross-model Contrastive Learning for RecommendationShengjun Ma, Yuhai Zhao, Fenglong Ma, Baoyin Liu, Zhengkui Wang, Wen Shan. 15519-15527 [doi]
- Spatial-Temporal Feedback Diffusion Guidance for Controlled Traffic ImputationXiaowei Mao, Huihu Ding, Yan Lin 0006, Tingrui Wu, Shengnan Guo 0001, Dazhuo Qiu, Feiling Fang, Jilin Hu, Huaiyu Wan. 15528-15536 [doi]
- ImageBindDC: Compressing Multi-modal Data with ImageBind-based CondensationYue Min, Shaobo Wang 0001, Jiaze Li, Tianle Niu, Junxin Fan, Yongliang Miao, Lijin Yang, Linfeng Zhang 0001. 15537-15545 [doi]
- Information Theoretic Optimal Surveillance for Epidemic Prevalence in NetworksRitwick Mishra, Abhijin Adiga, Madhav V. Marathe, S. S. Ravi, Ravi Tandon, Anil Vullikanti. 15546-15554 [doi]
- ConvMix: A Mixed-Criteria Data Augmentation Framework for Conversational Dense RetrievalFengran Mo, Jinghan Zhang 0002, Yuchen Hui, Jia Ao Sun, Zhichao Xu 0001, Zhan Su 0002, Jian-Yun Nie. 15555-15563 [doi]
- Continual Out-of-Distribution Detection with Analytic Neural CollapseSaleh Momeni, Changnan Xiao, Bing Liu 0001. 15564-15572 [doi]
- R²D-LPCC: Relevance-Ranking Guided Region-Adaptive Dynamic LiDAR Point Cloud CompressionFangzhe Nan, Frederick W. B. Li, Gary K. L. Tam, Zhaoyi Jiang, Bailin Yang, Jingke Cui, Changshuo Wang 0001. 15573-15581 [doi]
- GenVidBench: A 6-Million Benchmark for AI-Generated Video DetectionZhenliang Ni, Qiangyu Yan, Mouxiao Huang, Tianning Yuan, Yehui Tang, Hailin Hu 0002, Xinghao Chen 0001, Yunhe Wang 0001. 15582-15590 [doi]
- Targeting Borderline Fraudsters: Multi-View Hypergraph Fraud Detection with LLM-Guided Contrastive LearningRui Ou, Kun Zhu 0024, Nana Zhang, Jiangtong Li, Chaochao Chen 0001, Yuhua Xu 0011, Changjun Jiang. 15591-15599 [doi]
- RoSA: Enhancing Parameter-Efficient Fine-Tuning via RoPE-aware Selective Adaptation in Large Language ModelsDayan Pan, Jingyuan Wang 0001, Yilong Zhou, Jiawei Cheng, Pengyue Jia, Xiangyu Zhao 0001. 15600-15608 [doi]
- Think Wise, Collaborate Effectively: A Rationale-Aware LLM-Based Recommender with Reinforcement Learning from Collaborative SignalsChung Park, Taesan Kim, Hyeongjun Yun, Dongjoon Hong, Junui Hong, Kijung Park, Mincheol Cho, Minsung Choi, Jihwan Seok, Jaegul Choo. 15609-15616 [doi]
- UNO! UNified Offline Training Paradigm for Learning Path RecommendationLinzhi Peng, Wentao Zhu, Ke Cheng 0003, Heng Chang, Junchen Ye, Bowen Du 0001, Weifeng Lv. 15617-15625 [doi]
- Quality-Aware Language-Conditioned Local Auto-Regressive Anomaly Synthesis and DetectionLong Qian, Bingke Zhu, Yingying Chen 0003, Ming Tang 0001, Jinqiao Wang. 15626-15634 [doi]
- WaveDiST: A Wavelet Diffusion Transformer for Spatio-Temporal Estimation on Unobserved LocationsHuiling Qin, Yuanxun Li, Weijia Jia 0001. 15635-15643 [doi]
- Unbiased Rectification for Sequential Recommender Systems Under Fake OrdersQiyu Qin, Yichen Li 0006, Haozhao Wang, Cheng Wang 0025, Rui Zhang 0003, Ruixuan Li 0001. 15644-15652 [doi]
- Adaptive Frequency Pathways for Spatiotemporal ForecastingYanjun Qin, Yuchen Fang 0001, Xinke Jiang, Hao Miao 0001, Xiaoming Tao 0001. 15653-15661 [doi]
- TGDD: Trajectory Guided Dataset Distillation with Balanced DistributionFengli Ran, Xiao Pu 0002, Bo Liu 0047, Xiuli Bi, Bin Xiao 0002. 15662-15670 [doi]
- Enhancing Conversational Recommender Systems with Tree-Structured Knowledge and Pretrained Language ModelsYongwen Ren, Chao Wang 0086, Peng Du, Chuan Qin 0002, Dazhong Shen, Hui Xiong 0001. 15671-15679 [doi]
- Bidirectional Counterfactual Distillation for Review-Based RecommendationSheng Sang, Shujie Li 0002, Shuaiyang Li 0001, Kang Liu 0024, Teng Li, Wei Jia 0001, Dan Guo 0001, Feng Xue 0002. 15680-15688 [doi]
- HyperD: Hybrid Periodicity Decoupling Framework for Traffic ForecastingMinlan Shao, Zijian Zhang, Yili Wang 0004, Yiwei Dai, Xu Shen 0002, Xin Wang 0035. 15689-15697 [doi]
- ContextGraph: Lifelog Intelligence Framework for Contextual Subgraph EvolutionAnil Sharma, Gunturi Venkata Sai Phani Kiran, Jayesh Rajkumar Vachhani, Sourabh Vasant Gothe, Ayon Chattopadhyay, Yashwant Saini, Parameswaranath Vadackupurath Mani, Barath Raj Kandur Raja. 15698-15706 [doi]
- Information-Theoretic Minimal Sufficient Representation for Multi-Domain Knowledge Graph CompletionJiawei Sheng, Taoyu Su, Weiyi Yang, Linghui Wang, Yongxiu Xu, Tingwen Liu. 15707-15715 [doi]
- SA²GFM: Enhancing Robust Graph Foundation Models with Structure-Aware Semantic AugmentationJunhua Shi, Qingyun Sun, Haonan Yuan, Xingcheng Fu. 15716-15724 [doi]
- SEQRET: Mining Rule Sets from Event SequencesAleena Siji, Joscha Cüppers, Osman Mian, Jilles Vreeken. 15725-15733 [doi]
- AnoStyler: Text-Driven Localized Anomaly Generation via Lightweight Style TransferYulim So, Seokho Kang 0001. 15734-15742 [doi]
- Hard vs. Noise: Resolving Hard-Noisy Sample Confusion in Recommender Systems via Large Language ModelsTianrui Song, Wen-Shuo Chao, Hao Liu. 15743-15751 [doi]
- Potent but Stealthy: Rethink Profile Pollution Against Sequential Recommendation via Bi-Level Constrained Reinforcement ParadigmJiajie Su, Zihan Nan, Yunshan Ma 0002, Xiaobo Xia, Xiaohua Feng 0002, Weiming Liu 0005, Xiang Chen, Xiaolin Zheng, Chaochao Chen 0001. 15752-15760 [doi]
- Generalising Traffic Forecasting to Regions Without Traffic ObservationsXinyu Su, Majid Sarvi, Feng Liu 0003, Egemen Tanin, Jianzhong Qi 0001. 15761-15769 [doi]
- Cross-modal Proxy Evolving for OOD Detection with Vision-Language ModelsHao Tang 0007, Yu Liu, Shuanglin Yan, Fei Shen, Shengfeng He, Jing Qin 0001. 15770-15778 [doi]
- Task-Aware Retrieval Augmentation for Dynamic RecommendationZhen Tao, Xinke Jiang, Qingshuai Feng, Haoyu Zhang, Lun Du, Yuchen Fang, Hao Miao, Bangquan Xie, Qingqiang Sun. 15779-15787 [doi]
- HiLoMix: Robust High- and Low-Frequency Graph Learning Framework for Mixing Address AssociationXiaofan Tu, Tiantian Duan, Shuyi Miao, Hanwen Zhang 0001, Yi Sun 0004. 15788-15796 [doi]
- OnlineBootKNN: An Unsupervised Framework for Detecting Anomalies in Spectral Data StreamsNicolas Rojas Varela, Julien Ah-Pine, Engelbert Mephu Nguifo. 15797-15805 [doi]
- TGCA-LLM: Time-Aware Graph-Text Contrastive Alignment for Enhancing LLMs in Temporal Knowledge Graph CompletionZexuan Wan, Bo Wang, Kuofei Fang, Bin Wu. 15806-15814 [doi]
- HFR-MKGC: Hierarchical Fusion Reasoning with MLLMs for Multi-modal Knowledge Graph CompletionDi Wang, Junping Du 0001, Zhe Xue, MeiYu Liang, Guanhua Ye, Yingxia Shao, Haisheng Li 0002. 15815-15823 [doi]
- TOP-RL: Task-Optimized Progressive Token Pruning with Reinforcement Learning for Vision Language ModelsHengyi Wang, Weiying Xie, Hui Jiang, Yaotao Wei, Kai Jiang 0001, Mingxiang Cao, Chenhe Hao, Leyuan Fang. 15824-15832 [doi]
- Task-Aware Meta-Learning on Heterogeneous Knowledge Graph for POI RecommendationJingyuan Wang, Zhichun Wang, Tong Lu 0005, Yiming Guan. 15833-15840 [doi]
- Robust Domain Adaptive Hashing via Structural Noise Modeling and CorrectionJunsheng Wang, Tiantian Gong, Yeyun Wu, Xiaobing Sun. 15841-15849 [doi]
- From Subtle to Significant: Prompt-Driven Self-Improving Optimization in Test-Time Graph OOD DetectionLuzhi Wang, Xuanshuo Fu, He Zhang 0012, Chuang Liu, Xiaobao Wang, Hongbo Liu. 15851-15859 [doi]
- Assessing LLMs for Serendipity Discovery in Knowledge Graphs: A Case for Drug RepurposingMengying Wang, Chenhui Ma, Ao Jiao, Tuo Liang, Pengjun Lu, Shrinidhi Hegde, Yu Yin 0001, Evren Gurkan-Cavusoglu, Yinghui Wu. 15860-15867 [doi]
- ArchRAG: Attributed Community-based Hierarchical Retrieval-Augmented GenerationShu Wang, Yixiang Fang, Yingli Zhou, Xilin Liu 0001, Yuchi Ma. 15868-15876 [doi]
- MSR-Rec: Multi-Step Reasoning-Enhanced LLM for Sequential RecommendationTuo Wang 0001, Meng Jian, Ge Shi 0002, Lifang Wu, Yashen Wang. 15877-15885 [doi]
- Sat2Flow: A Structure-Aware Diffusion Framework for Human Flow Generation from Satellite ImageryXiangxu Wang, Tianhong Zhao, Wei Tu 0001, Bowen Zhang 0005, Guanzhou Chen, Jinzhou Cao. 15886-15894 [doi]
- Bid Farewell to Seesaw: Towards Accurate Long-Tail Session-Based Recommendation via Dual Constraints of Hybrid IntentsXiao Wang 0055, Ke Qin, Dongyang Zhang 0001, Xiurui Xie, Shuang Liang 0002. 15895-15903 [doi]
- MoMoREC: A Multi-agent Motivation Generation Framework for Residual Semantic ID-Aware RecommendationYige Wang, Mingming Li, Li Wang, Kaichen Zhao, Wangming Li, Weipeng Jiang, Xueying Li. 15904-15914 [doi]
- Knowledge Graph Guided Heterogeneity-Informed Diffusion Model for Spatio-Temporal GenerationZi'ang Wang, Lei Chen, Yuanchang Jin, Pan Deng, Shuangshuang Pang, Junting Liu, Yu Zhao 0021. 15915-15923 [doi]
- T-Retriever: Tree-based Hierarchical Retrieval Augmented Generation for Textual GraphsChunyu Wei, Huaiyu Qin, Siyuan He, Yunhai Wang, Yueguo Chen. 15924-15932 [doi]
- DMGIN: How Multimodal LLMs Enhance Large Recommendation Models for Lifelong User Post-click BehaviorsZhuoxing Wei, Qingchen Xie, Qi Liu 0003, Jingsong Yu. 15933-15940 [doi]
- AirDDE: Multifactor Neural Delay Differential Equations for Air Quality ForecastingBinqing Wu, Zongjiang Shang, Shiyu Liu, Jianlong Huang, Jiahui Xu, Ling Chen 0001. 15941-15949 [doi]
- MoCast: Learning Turbulent Motions Under Physical Guidance for Precipitation NowcastingBinqing Wu, Weiqi Chen, Shiyu Liu, Zongjiang Shang, Haiou Wang, Liang Sun 0001, Ling Chen 0001. 15950-15958 [doi]
- Dual-Perspective Disentanglement: Learning Symmetric Group-Aware Representations for Cross-Domain RecommendationBorui Wu, Yuanbo Xu. 15959-15967 [doi]
- DARLING: Dual Hypergraph-Enhanced Curriculum-Guided Graph Structure Learning for Node ClassificationGuangkai Wu, Gen Liu 0001, Chao Li 0022, Qingtian Zeng, Hui Zhou, Zhongying Zhao 0001. 15968-15976 [doi]
- REACTION: Parameter-Efficient Learning for RecommendationSong-Li Wu, Zhaocheng Du, Qinglin Jia, Zhenhua Dong. 15977-15985 [doi]
- ICAD-LLM: One-for-All Anomaly Detection via In-Context Learning with Large Language ModelsZhongyuan Wu, Jingyuan Wang, Zexuan Cheng, Yilong Zhou, Weizhi Wang, Juhua Pu, Chao Li, Changqing Ma. 15986-15994 [doi]
- From Points to Coalitions: Hierarchical Contrastive Shapley Values for Prioritizing Data SamplesCanran Xiao, Jiabao Dou, Zhiming Lin, Zong Ke, Liwei Hou. 15995-16003 [doi]
- FT-MoE: Sustainable-learning Mixture of Experts for Fault-Tolerant ComputingWenjing Xiao, Wenhao Song, Miaojiang Chen, Min Chen 0003. 16004-16012 [doi]
- The Last Byte: Learning Just Enough for Machine-Oriented Image CompressionWuyuan Xie, Zhenming Li, Ye Liu 0005, Jian Jin, Yun Song, Miaohui Wang. 16013-16021 [doi]
- CLUHCS: Dual-View Contrastive Learning Enabled Unsupervised Heterogeneous Community Search with Meta-Path Behavior ModelingXiaoqin Xie, Bin Zhao, Mingzhu Chang, Shuai Han 0002, Wu Yang 0001. 16022-16030 [doi]
- Beyond Graph Priors: A Co-Evolving Framework Under Uncertainty for Enterprise Resilience AssessmentYanzhe Xie, Li Huang 0002, Qiang Gao 0003, Xueqin Chen 0002, Fan Zhou 0002, Kunpeng Zhang 0001. 16031-16039 [doi]
- MetaGPT: A Large Vision-Language Model for Meme Metaphor UnderstandingBo Xu 0009, Chenyuan Wang, Xinyu Chen, Hongfei Lin, Feng Xia 0001. 16040-16048 [doi]
- GUIDER: Uncertainty Guided Dynamic Re-ranking for Large Language Models Based Recommender SystemsCai Xu, Xujing Wang, Ziyu Guan, Wei Zhao 0019, Meng Yan 0013. 16049-16057 [doi]
- Wavelet Enhanced Adaptive Frequency Filter for Sequential RecommendationHuayang Xu, Huanhuan Yuan, Guanfeng Liu 0001, Junhua Fang, Lei Zhao 0001, Pengpeng Zhao 0001. 16058-16065 [doi]
- Bridging Optimization and Neural Networks for Efficient Multi-view ClusteringHui-Lang Xu, Xiang-Xiang Su, Simin Chen, Guang-yong Chen, Xing Chen. 16066-16074 [doi]
- IDK-S: Incremental Distributional Kernel for Streaming Anomaly DetectionYang Xu, Yixiao Ma, Kaifeng Zhang, Zuliang Yang, Kai Ming Ting. 16075-16082 [doi]
- SCoNE: Spherical Consistent Neighborhoods Ensemble for Effective and Efficient Multi-View Anomaly DetectionYang Xu, Hang Zhang 0003, Yixiao Ma, Ye Zhu 0002, Kai Ming Ting. 16083-16090 [doi]
- A Novel Retrieve-Read-Group Paradigm for Open Knowledge Base CanonicalizationBinhan Yang, Wei Shen 0004, Han Tian. 16091-16100 [doi]
- DRSoRec: Dual-Rectification of Social Networks for RecommendationLiangxun Yang, Tianzi Zang, Jiayi Sun, Juan Li 0011, Yicong Li 0016. 16101-16109 [doi]
- GIER: Addressing Class Imbalance in GNNs Through Experience ReplayLiu Yang, Chuyao Liu, Zidong Wang, Tingxuan Chen, Mengni Chen, Hongyu Zhang 0001. 16110-16118 [doi]
- Structural Entropy Guided Incremental Learning for Open-World Multimodal Social Event DetectionZhiwei Yang 0006, Haimei Qin, Xiaoyan Yu, Hao Peng 0001, Lei Jiang 0003, Li Sun 0008, Zhiqin Yang. 16119-16127 [doi]
- Bipartite Mode Matching for Vision Training Set Search from a Hierarchical Data ServerYue Yao, Ruining Yang, Tom Gedeon. 16128-16136 [doi]
- DiM-TS: Bridge the Gap Between Selective State Space Models and Time Series for Generative ModelingZihao Yao, Jiankai Zuo, Yaying Zhang. 16137-16144 [doi]
- Making Visual Dialogue More Engaging: A New Task, Method, and MetricGuanghui Ye, Huan Zhao 0003, Yingxue Gao, Zhixue Zhao, Kehan Wang, Xupeng Zha, Zhihua Jiang. 16145-16153 [doi]
- Align³GR: Unified Multi-Level Alignment for LLM-based Generative RecommendationWencai Ye, Mingjie Sun, Shuhang Chen, Wenjin Wu, Peng Jiang 0002. 16154-16162 [doi]
- FairGSE: Fairness-Aware Graph Neural Network Without High False Positive RatesZhenqiang Ye, Jinjie Lu, Tianlong Gu, Fengrui Hao, Xuemin Wang 0003. 16163-16171 [doi]
- NumCoKE: Ordinal-Aware Numerical Reasoning over Knowledge Graphs with Mixture-of-Experts and Contrastive LearningMing Yin, Zongsheng Cao, Qiqing Xia, Chenyang Tu, Neng Gao. 16172-16180 [doi]
- Neural Graph Navigation for Intelligent Subgraph MatchingYuchen Ying, Yiyang Dai, Wenda Li 0003, Wenjie Huang, Rui Wang 0076, Tongya Zheng, Yu Wang 0176, Hanyang Yuan, Mingli Song. 16181-16189 [doi]
- MM4Rec: Multi-Source and Multi-Scenario Recommender for Unified User PreferenceChu-Chun Yu, Ming-Yi Hong 0002, Miao-Chen Chiang, Min-Chen Hsieh, Che Lin. 16190-16198 [doi]
- Drift-aware Collaborative Assistance Mixture of Experts for Heterogeneous Multistream LearningEn Yu, Jie Lu 0001, Kun Wang 0050, Xiaoyu Yang, Guangquan Zhang 0001. 16199-16207 [doi]
- Rethinking Crystal Symmetry Prediction: A Decoupled PerspectiveLiheng Yu, Zhe Zhao 0008, Xucong Wang, Di Wu 0057, Pengkun Wang 0001. 16208-16216 [doi]
- Self-Supervised Cross-City Trajectory Representation Learning Based on Meta-LearningYanwei Yu, Hong Xia, Shaoxuan Gu, Xingyu Zhao 0006, Dongliang Chen, Yuan Cao 0005. 16217-16225 [doi]
- DRFGD: Disentangled Representation-Focused Generative Defense for Attack-Tolerant Cross-Modal HashingZhongqing Yu, Xin Liu 0011, Yiu-ming Cheung, Zhikai Hu, Wentao Fan 0001, Pan Zhou 0001. 16226-16234 [doi]
- Plug-and-Play Parameter-Efficient Tuning of Embeddings for Federated RecommendationHaochen Yuan 0001, Yang Zhang 0095, Xiang He 0002, Quan Z. Sheng, Zhongjie Wang 0003. 16235-16243 [doi]
- DHMRec: Collaboration-Guided Multimodal Disentanglement and Hierarchical Fusion for RecommendationXiaohan Zhan, Yuliang Shi, Jihu Wang, Shijun Liu, Fanyu Kong 0002, Zhiyong Chen. 16244-16252 [doi]
- Modeling Item-Level Dynamic Variability with Residual Diffusion for Bundle RecommendationDong Zhang, Lin Li 0001, Ming Li 0072, Amran Bhuiyan, Meng Sun, Xiaohui Tao 0001, Jimmy Huang 0001. 16253-16261 [doi]
- DiMA: Distinguishing Resident and Tourist Preferences via Multi-Modal LLM Alignment for Out-of-Town Cross-Domain RecommendationFan Zhang, Jinpeng Chen 0001, Tao Wang, Huan Li 0003, Senzhang Wang, Feifei Kou, Ye Ji 0002, Kaimin Wei, Zhenye Yang. 16262-16270 [doi]
- Multi-Aspect Cross-modal Quantization for Generative RecommendationFuwei Zhang, Xiaoyu Liu, Dongbo Xi, Jishen Yin, Huan Chen, Peng Yan, Fuzhen Zhuang, Zhao Zhang 0011. 16271-16279 [doi]
- Noise-Aware Graph-Based Cognitive Diagnostic Framework Through Low-Rank AlignmentGuixian Zhang, Yanmei Zhang, Guan Yuan, Shang Liu 0001, Xiaojing Du, Debo Cheng. 16280-16288 [doi]
- Sequence-Free for Compound Protein Interaction PredictionHongzhi Zhang, Jiameng Chen, Kun Li 0009, Yida Xiong, Xiantao Cai, Wenbin Hu 0001, Jia Wu 0001. 16289-16297 [doi]
- GT-SNT: A Linear-Time Transformer for Large-Scale Graphs via Spiking Node TokenizationHuizhe Zhang, Jintang Li, Yuchang Zhu, Huazhen Zhong, Liang Chen 0001. 16298-16306 [doi]
- MARS: Multi-Agent Adaptive Reasoning with Socratic Guidance for Automated Prompt OptimizationJian Zhang 0087, Zhangqi Wang, Haiping Zhu, Kangda Cheng, Kai He 0001, Bo Li, Qika Lin, Jun Liu 0002, Erik Cambria. 16307-16315 [doi]
- MAPS: Multi-Agent Personality Shaping for Collaborative ReasoningJian Zhang 0087, Zhiyuan Wang, Zhangqi Wang, Fangzhi Xu, Qika Lin, Lingling Zhang 0005, Rui Mao 0010, Erik Cambria, Jun Liu 0002. 16316-16324 [doi]
- Diversity Recommendation via Causal Deconfounding of Co-purchase Relations and Counterfactual ExposureJingmao Zhang, Zhiting Zhao, Yunqi Lin, Jianghong Ma, Tianjun Wei, Haijun Zhang 0002, Xiaofeng Zhang 0002. 16325-16333 [doi]
- Binary Message Passing for Generalizable Semi-Supervised Graph Anomaly DetectionJingyuan Zhang, Xin Wang, Lei Yu, Li Yang 0015, Fengjun Zhang. 16334-16342 [doi]
- Fashion Microscope: Pixel-Level Attribute Perception via Optimal Transport and Neural Semantic AggregationShuili Zhang, Hongzhang Mu, Jiawei Sheng, Qianqian Tong, Wenyuan Zhang 0002, Quangang Li, Tingwen Liu. 16343-16351 [doi]
- Evidence-aware Integration and Domain Identification of Spatial Transcriptomics DataWei Zhang, Siyu Yi, Lezhi Chen, Yifan Wang, Ziyue Qiao, Yongdao Zhou, Wei Ju 0001. 16352-16360 [doi]
- D-FCGS: Feedforward Compression of Dynamic Gaussian Splatting for Free-Viewpoint VideosWenkang Zhang, Yan Zhao 0041, Qiang Wang 0061, Zhixin Xu, Li Song 0001, Zhengxue Cheng. 16361-16369 [doi]
- TrajAgg: Dual-Scale Feature Aggregation with Hybrid Training for Trajectory Similarity Computation in Free SpaceXiao Zhang, Xingyu Zhao 0006, Yuan Cao 0005, Bin Wang 0045, Guiyuan Jiang, Yanwei Yu. 16370-16378 [doi]
- Towards OOD Generalization in Dynamic Graphs via Causal Invariant LearningXinxun Zhang, Pengfei Jiao, Mengzhou Gao 0001, Tianpeng Li, Xuan Guo 0005. 16379-16387 [doi]
- Dimension-Aware Active Annotation for Aesthetic Perception via Multi-Agent Human-AI CollaborationYe Zhang 0014, Jinlong He, Dongjie Wang, Yupeng Zhou, Minghao Yin. 16388-16396 [doi]
- Improving Region Representation Learning from Urban Imagery with Noisy Long-Caption SupervisionYimei Zhang 0003, Guojiang Shen, Kaili Ning, Tongwei Ren, Xuebo Qiu, Mengmeng Wang 0005, Xiangjie Kong 0001. 16397-16405 [doi]
- Personalize Before Retrieve: LLM-based Personalized Query Expansion for User-Centric RetrievalYingyi Zhang 0001, Pengyue Jia, Derong Xu, Yi Wen 0001, Xianneng Li, Yichao Wang 0002, Wenlin Zhang 0001, Xiaopeng Li 0014, Weinan Gan, Huifeng Guo, Yong Liu 0020, Xiangyu Zhao 0001. 16406-16414 [doi]
- Cross-Scale Collaboration between LLMs and Lightweight Sequential Recommenders with Domain-Specific Latent ReasoningYipeng Zhang 0003, Xin Wang 0019, Hong Chen 0011, Junwei Pan, Qian Li, Jun Zhang, Jie Jiang, Hong Mei 0001, Wenwu Zhu 0001. 16415-16423 [doi]
- Knowledge-Enhanced Explainable Hypergraph Convolution Network for Medication RecommendationZihan Zhang, Hongzhi Liu 0001, Xiaoshuang Guo, Tianqi Sun, Zhonghai Wu. 16424-16432 [doi]
- TriFusion-IDS: A Multimodal Graph-Tabular-Text Contrastive Framework for Cross-Dataset Intrusion DetectionQinxin Zhao, Sheng Zhong. 16433-16440 [doi]
- ST-VLM: A Spatial-to-Image Multimodal Spatial-Temporal Prediction Framework with Vision-Language ModelTong Zhao, Junping Du 0001, Zhe Xue, MeiYu Liang, Aijing Li, Xiaolong Meng, Dandan Liu. 16441-16449 [doi]
- CoS: Towards Optimal Event Scheduling via Chain-of-SchedulingYiming Zhao, Jiwei Tang, Shimin Di, Libin Zheng 0001, Jianxing Yu, Jian Yin 0001. 16450-16458 [doi]
- MusicRec: Multi-modal Semantic-Enhanced Identifier with Collaborative Signals for Generative RecommendationYuqiu Zhao, Lei Shi 0030, Yan Zhong, Feifei Kou, Pengfei Zhang, Jiwei Zhang 0007, Mingying Xu, Yanchao Liu. 16459-16467 [doi]
- Uplift Modeling with Delayed Feedback: Identifiability and AlgorithmsChunyuan Zheng 0001, Anpeng Wu, Chuan Zhou 0013, Taojun Hu, Qingying Chen, Hongyi Liu, Chenxi Li, Huiyou Jiang, Haoxuan Li 0001, Zhouchen Lin. 16468-16476 [doi]
- Unified Minimax Optimization Framework for Propensity Score Estimation in Debiased RecommendationChunyuan Zheng 0001, Haocheng Yang, Jinkun Chen, Shufeng Zhang, Tianyu Xia. 16477-16485 [doi]
- From Semantics to Spectrum: A New Lens on Graph Augmentation StrategyXiangping Zheng, Xiuxin Hao, Bo Wu 0026, Wei Li 0109, Bin Ren, Bin Tang, Yuhui Guo, Xun Liang 0001, Zhiwen Yu 0001. 16486-16494 [doi]
- HaNa: Hardness and Noise-Aware Robust Cross-modal RetrievalFangming Zhong, Haiquan Yu, Cun Zhu, Suhua Zhang. 16495-16503 [doi]
- IdeFN: Identifying Unclicked Space False Negatives via Relaxed Partial Optimal Transport for Conversion Rate PredictionWeiyi Zhong, Weiming Liu 0005, Lianyong Qi, Xiaoran Zhao 0001, Xiaolong Xu 0001, Haolong Xiang, Yang Cao 0019, Shichao Pei, Qiang Ni. 16504-16512 [doi]
- Exploiting Pre-trained Language Model for Cross-city Urban Flow Prediction Guided by Information-theoretic AnalysisQiang Zhou, Xudong Tong, Yuting Liu, Chuanxing Liu, Jingjing Gu. 16513-16521 [doi]
- Inference Scaling Law for Retrieval Augmented GenerationShu Zhou, Yuxuan Ao, Yunyang Xuan, Xin Wang, Tao Fan, Hao Wang. 16522-16530 [doi]
- EdgeMTSC: A Lightweight Large-Kernel ConvNet for Multivariate Time Series ClassificationXueyi Zhou, Zhenyu Li, Dong-Kyu Chae. 16531-16539 [doi]
- DFRec: Dual Fluctuation Modeling of Multi-level Intent Evolution for Next-Item RecommendationNengjun Zhu, Lingdan Sun, Qi Zhang 0020, Jian Cao, Hang Yu 0006. 16540-16547 [doi]
- Stage-Aware Graph Contrastive Learning with Node-oriented Mixture of ExpertsXiangkai Zhu, Yeyu Yan, Saiqin Long, Chao Li 0022, Guanwen Chen, Longsheng Su. 16548-16556 [doi]
- Boosting Fine-Grained Urban Flow Inference via Lightweight Architecture and Focalized OptimizationYuanshao Zhu, Xiangyu Zhao 0001, Zijian Zhang 0009, Xuetao Wei, James Jianqiao Yu. 16557-16565 [doi]
- Self-Correction Distillation for Structured Data Question AnsweringYushan Zhu, Wen Zhang 0015, Long Jin, Mengshu Sun, Ling Zhong, Zhiqiang Liu, Juan Li 0010, Lei Liang 0002, Chong Long, Chao Deng, Junlan Feng. 16566-16574 [doi]
- Global-Lens Transformers: Adaptive Token Mixing for Dynamic Link PredictionTao Zou 0003, Chengfeng Wu, Tianxi Liao, Junchen Ye, Bowen Du 0001. 16575-16583 [doi]
- Meta Dynamic Graph for Traffic Flow PredictionYiqing Zou, Hanning Yuan, Qianyu Yang, Ziqiang Yuan, Shuliang Wang 0001, Sijie Ruan. 16584-16592 [doi]
- Multi-Agent Pointer Transformer: Seq-to-Seq Reinforcement Learning for Multi-Vehicle Dynamic Pickup-Delivery ProblemsZengyu Zou, Jingyuan Wang, Yixuan Huang, Junjie Wu. 16593-16601 [doi]
- Metric Distortion with Preference IntensitiesMehrad Abbaszadeh, Ali Ansarifar, Mohamad Latifian, Masoud Seddighin. 16603-16611 [doi]
- Colonel Blotto with Battlefield GamesSalam Afiouni, Jakub Cerný, Chun Kai Ling, Christian Kroer. 16612-16620 [doi]
- Delta Matters: An Analytically Tractable Model for beta-delta Discounting AgentsYasunori Akagi, Takeshi Kurashima. 16621-16630 [doi]
- Computing Approximately Proportional Allocations of Indivisible Goods: Beyond Additive and Monotone ValuationsMartin Jupakkal Andersen, Ioannis Caragiannis, Anders Bo Ipsen, Alexander Søltoft. 16631-16638 [doi]
- How Hard Is It to Explain Preferences Using Few Boolean Attributes?Clemens Anzinger, Jiehua Chen 0001, Christian Hatschka, Manuel Sorge, Alexander Temper. 16639-16646 [doi]
- Designing Optimal Mechanisms to Locate Facilities with Insufficient Capacity for Bayesian AgentsGennaro Auricchio, Jie Zhang 0008. 16647-16655 [doi]
- Utilitarian Guarantees for the Method of Equal SharesAnton Baychkov, Markus Brill, Jannik Peters 0001. 16656-16663 [doi]
- Structural Approach to Guiding a Present-Biased AgentTatiana Belova, Yuriy Dementiev, Artur Ignatiev, Danil Sagunov. 16664-16672 [doi]
- On the Edge of Core (Non-)Emptiness: An Automated Reasoning Approach to Approval-Based Multi-Winner VotingRatip Emin Berker, Emanuel Tewolde, Vincent Conitzer, Mingyu Guo, Marijn Heule, Lirong Xia. 16673-16681 [doi]
- Best of Both Worlds Guarantees for Equitable AllocationsUmang Bhaskar, Vishwa Prakash HV, Aditi Sethia, Rakshitha. 16682-16690 [doi]
- Compensate to Not Deviate: On Subsidised EquilibriaVittorio Bilò, Gianpiero Monaco, Luca Moscardelli. 16691-16699 [doi]
- Approximately Envy-free and Equitable Allocations of Indivisible Items for Non-monotone ValuationsVittorio Bilò, Martin Loebl, Cosimo Vinci. 16700-16708 [doi]
- Understanding the Impact of Proportionality in Approval-Based Multiwinner ElectionsNiclas Boehmer, Lara Glessen, Jannik Peters 0001. 16709-16716 [doi]
- Picking a Representative Set of Solutions in Multiobjective Optimization: Axioms, Algorithms, and ExperimentsNiclas Boehmer, Maximilian T. Wittmann. 16717-16725 [doi]
- Putting Fair Division on the MapPaula Böhm, Robert Bredereck, Paul Gölz, Andrzej Kaczmarczyk 0001, Stanislaw Szufa. 16726-16734 [doi]
- Probing EFX via PMMS: (Non-)Existence Results in Discrete Fair DivisionJaroslaw Byrka, Franciszek Malinka, Tomasz Ponitka. 16735-16742 [doi]
- What Voting Rules Actually Do: A Data-Driven Analysis of Multi-Winner VotingJoshua Caiata, Ben Armstrong, Kate Larson. 16743-16751 [doi]
- Spatial Branch-and-Bound for Computing Multiplayer Nash EquilibriumJakub Cerný, Shuvomoy Das Gupta, Christian Kroer. 16752-16760 [doi]
- Exact and Approximate Maximin Share Allocations in Multi-GraphsGeorge Christodoulou 0001, Symeon Mastrakoulis. 16761-16769 [doi]
- Explaining Tournament Solutions with Minimal SupportsClément Contet, Umberto Grandi, Jérôme Mengin. 16770-16778 [doi]
- ElementaryNet: A Non-Strategic Neural Network for Predicting Human Behavior in Normal-Form GamesGreg d'Eon, Hala Murad, Kevin Leyton-Brown, James R. Wright. 16779-16786 [doi]
- Optimally Auditing Adversarial AgentsSanmay Das, Fang-Yi Yu, Yuang Zhang. 16787-16794 [doi]
- EFX and PO Allocation Exists for Two Types of GoodsVladimir Davidiuk, Yuriy Dementiev, Artur Ignatiev, Danil Sagunov. 16795-16802 [doi]
- Breaking Barriers, Finding Boundaries: Not Obviously Manipulable Budget-Feasible Mechanism DesignBart de Keijzer, Guido Schäfer, Artem Tsikiridis, Carmine Ventre. 16803-16811 [doi]
- Dividing Indivisible Items for the Benefit of All: It Is Hard to Be Fair Without Social AwarenessArgyrios Deligkas, Eduard Eiben, Tiger-Lily Goldsmith, Dusan Knop, Simon Schierreich. 16812-16820 [doi]
- Public Goods Games in Directed Networks with Constraints on SharingArgyrios Deligkas, Gregory Z. Gutin, Mark Jones 0001, Philip R. Neary, Anders Yeo. 16821-16828 [doi]
- Reconfiguring Proportional CommitteesChris Dong 0001, Fabian Frank, Jannik Peters 0001, Warut Suksompong. 16829-16837 [doi]
- Cost-Free Neutrality for the River MethodMichelle Döring, Jannes Malanowski, Stefan Neubert. 16838-16845 [doi]
- The River Voting MethodMichelle Döring, Markus Brill, Jobst Heitzig. 16846-16854 [doi]
- Optimal Welfare in Noncooperative Network Formation Under AttackNatan Doubez, Pascal Lenzner, Marcus Wunderlich. 16855-16862 [doi]
- Perturbing Best Responses in Zero-Sum GamesAdam Dziwoki, Rostislav Horcík. 16863-16870 [doi]
- Computing Equilibrium Nominations in Presidential ElectionsPiotr Faliszewski, Stanislaw Kazmierowski, Grzegorz Lisowski, Ildikó Schlotter, Paolo Turrini. 16871-16879 [doi]
- Diversity of Structured Domains via k-Kemeny ScoresPiotr Faliszewski, Krzysztof Sornat, Stanislaw Szufa, Tomasz Was. 16880-16888 [doi]
- Identifying Imperfect Clones in ElectionsPiotr Faliszewski, Lukasz Janeczko, Grzegorz Lisowski, Kristýna Pekárková, Ildikó Schlotter. 16889-16896 [doi]
- Fairness and Stability for Shared Resource Allocation ProblemsJiazhu Fang, Qizhi Fang, Minming Li, Wenjing Liu. 16897-16905 [doi]
- No-Regret Strategy Solving in Imperfect-Information Games via Pre-Trained EmbeddingYanchang Fu, Shengda Liu, Pei Xu 0003, Kaiqi Huang. 16906-16913 [doi]
- Designing Truthful Mechanisms for Asymptotic Fair DivisionJugal Garg, Vishnu V. Narayan, Yuang Eric Shen. 16914-16921 [doi]
- Existence of 2-EFX Allocations of ChoresJugal Garg, Aniket Murhekar. 16922-16929 [doi]
- Optimized Distortion in Linear Social ChoiceLuise Ge, Gregory Kehne, Yevgeniy Vorobeychik. 16930-16937 [doi]
- Fair Incentives for Early Arrival in 0-1 Cooperative GamesYaoxin Ge, Yao Zhang 0011, Dengji Zhao. 16938-16945 [doi]
- On the Approximation Ratio of Optimal Fixed-Price Mechanisms for Single and Multi-Unit Bilateral TradeGiordano Giambartolomei, Bart de Keijzer. 16946-16953 [doi]
- Fair Allocation of Indivisible Goods with Variable GroupsPaul Gölz, Ayumi Igarashi 0001, Pasin Manurangsi, Warut Suksompong. 16954-16962 [doi]
- Fair Division Among Couples and Small GroupsPaul Gölz, Hannane Yaghoubizade. 16963-16970 [doi]
- City Sampling for Citizens' AssembliesPaul Gölz, Jan Maly 0001, Ulrike Schmidt-Kraepelin, Markus Utke, Philipp C. Verpoort. 16971-16979 [doi]
- Multi-District School Choice: Playing on Several FieldsYannai A. Gonczarowski, Michael Yin, Shirley Zhang 0001. 16980-16988 [doi]
- Fair Diffusion AuctionsZixin Gu, Yaoxin Ge, Yao Zhang 0011, Dengji Zhao. 16989-16996 [doi]
- Minimizing Inequity in Facility Location GamesYuhang Guo 0003, Houyu Zhou. 16997-17004 [doi]
- Pricing Online LLM Services with Data-Calibrated Stackelberg Routing GameZhendong Guo, Wenchao Bai, Jiahui Jin 0001. 17005-17013 [doi]
- Edge-Binary Public Goods GamesThekla Hamm, Paloma T. Lima. 17014-17022 [doi]
- Scalable Solutions to Zero-Sum Partially Observable Stochastic Games Through Belief Aggregation with Approximation GuaranteesKim Hammar, Tansu Alpcan. 17023-17031 [doi]
- Improved Differentially Private Algorithms for Rank AggregationQuentin Hillebrand, Pasin Manurangsi, Vorapong Suppakitpaisarn, Phanu Vajanopath. 17032-17039 [doi]
- Stable Voting and the Splitting of CyclesWesley H. Holliday, Milan Mossé, Chase Norman, Eric Pacuit, Cynthia Wang. 17040-17049 [doi]
- Fair Societies: Algorithms for House AllocationsHadi Hosseini, Sanjukta Roy 0001, Aditi Sethia. 17050-17058 [doi]
- Non-Monotonicity in Fair Division of GraphsHadi Hosseini, Shraddha Pathak, Yu Zhou. 17059-17066 [doi]
- Fair and Efficient Balanced Allocation for Indivisible GoodsYasushi Kawase, Ryoga Mahara. 17067-17075 [doi]
- Sequential Selling with Sunk Cost BiasYasushi Kawase, Tomohiro Nakayoshi. 17076-17083 [doi]
- Algorithms for Structured Elections Under Thiele Voting RulesAlexandra Lassota, Krzysztof Sornat. 17084-17092 [doi]
- Facility Location for Congesting Commuters and Generalizing the Cost-Distance ProblemThanasis Lianeas, Marios Mertzanidis, Aikaterini Nikolidaki. 17093-17101 [doi]
- EFX Allocation in (Multi)HypergraphsThanasis Lianeas, Alkmini Sgouritsa, Minas Marios Sotiriou. 17102-17110 [doi]
- Fairness in Repeated Matching: A Maximin PerspectiveEugene Lim, Tzeh Yuan Neoh, Nicholas Teh. 17111-17119 [doi]
- Security Games with Layered Defenses: Adaptive Adversaries and Gittins IndicesChun Kai Ling, Jakub Cerný, Chin Hui Han, Garud Iyengar, Christian Kroer. 17120-17128 [doi]
- The Power of Initial Investigation in Audit GamesRen Liu, Weiran Shen. 17129-17136 [doi]
- Position Fair Mechanisms Allocating Indivisible GoodsRyoga Mahara, Ryuhei Mizutani, Taihei Oki, Tomohiko Yokoyama. 17137-17144 [doi]
- Area-Optimal Control Strategies for Heterogeneous Multi-Agent PursuitKamal Mammadov, Damith C. Ranasinghe. 17145-17152 [doi]
- On Condorcet's Jury Theorem with AbstentionReshef Meir, Ganesh Ghalme. 17153-17160 [doi]
- Faster Game Solving via Asymmetry of Step SizesLinjian Meng, Tianpei Yang, Youzhi Zhang 0001, Zhenxing Ge, Yang Gao 0001. 17161-17169 [doi]
- Learning in Zero-Sum Markov Games: Relaxing Strong Reachability and Mixing Time AssumptionsReda Ouhamma, Maryam Kamgarpour. 17170-17178 [doi]
- Group Fair Matchings Using Convex Cost FunctionsAtasi Panda, Harsh Sharma, Anand Louis, Prajakta Nimbhorkar. 17179-17187 [doi]
- Fairness in the Multi-Secretary ProblemGeorgios Papasotiropoulos, Zein Pishbin. 17188-17196 [doi]
- Shapley Value Approximation Based on k-Additive GamesGuilherme Dean Pelegrina, Patrick Kolpaczki, Eyke Hüllermeier. 17197-17205 [doi]
- Weakest Bidder Types and New Core-Selecting Combinatorial AuctionsSiddharth Prasad, Maria-Florina Balcan, Tuomas Sandholm. 17206-17214 [doi]
- Testing Under Strategic Manipulation: Mechanism Design for Human and AI InstitutionsXiaoyun Qiu, Liren Shan. 17215-17222 [doi]
- Truth, Justice, and Secrecy: Cake Cutting Under Privacy ConstraintsYaron Salman, Tamir Tassa, Omer Lev, Roie Zivan. 17223-17230 [doi]
- Promises Made, Promises Kept: Safe Pareto Improvements via Ex Post Verifiable CommitmentsNathaniel Sauerberg, Caspar Oesterheld. 17231-17241 [doi]
- Exclusion Zones of Instant Runoff VotingKiran Tomlinson, Johan Ugander, Jon M. Kleinberg. 17242-17249 [doi]
- The Publication Choice ProblemHaichuan Wang, Yifan Wu, Haifeng Xu. 17250-17258 [doi]
- Centralized Group Equitability and Individual Envy-Freeness in the Allocation of Indivisible ItemsYing Wang, Jiaqian Li, Tianze Wei, Hau Chan, Minming Li. 17259-17266 [doi]
- Online Fair Allocations with Binary Valuations and BeyondYuanyuan Wang, Tianze Wei. 17267-17275 [doi]
- How Hard Is It to Rig a Tournament When Few Players Can Beat or Be Beaten by the Favorite?Zhonghao Wang, Junqiang Peng 0001, Yuxi Liu, Mingyu Xiao 0001. 17276-17283 [doi]
- Deep (Predictive) Discounted Counterfactual Regret MinimizationHang Xu 0006, Kai Li 0022, Haobo Fu, Qiang Fu 0016, Junliang Xing, Jian Cheng 0001. 17284-17292 [doi]
- Inequality in the Age of PseudonymityAviv Yaish, Nir Chemaya, Dahlia Malkhi, Lin William Cong. 17293-17301 [doi]
- Pacing Equilibria in Second-Price Auctions with Few BuyersYonglei Yan, Zihe Wang 0001, Zhengyang Liu 0002. 17302-17309 [doi]
- Deviation Dynamics in Cardinal Hedonic GamesValentin Zech, Martin Bullinger. 17310-17318 [doi]
- Faster Game Solving via Hyperparameter SchedulesNaifeng Zhang, Stephen Marcus McAleer, Tuomas Sandholm. 17319-17326 [doi]
- Hide and Seek with LLMs: An Adversarial Game for Sneaky Error Generation and Self-Improving DiagnosisRui Zou, Mengqi Wei, Yutao Zhu 0001, Jirong Wen, Xin Zhao 0018, Jing Chen. 17327-17335 [doi]
- Align When They Want, Complement When They Need! Human-Centered Ensembles for Adaptive Human-AI CollaborationSyed Hasan Amin Mahmood, Ming Yin 0001, Rajiv Khanna. 17337-17346 [doi]
- Who Is Helping Whom? Analyzing Inter-Dependencies to Evaluate Cooperation in Human-AI TeamingUpasana Biswas, Vardhan Palod, Siddhant Bhambri, Subbarao Kambhampati. 17347-17356 [doi]
- Explaining Decentralized Multi-Agent Reinforcement Learning PoliciesKayla Boggess, Sarit Kraus, Lu Feng 0001. 17357-17365 [doi]
- EPIC: Explanation of Pretrained Image Classification Networks via PrototypesPiotr Borycki, Magdalena Tredowicz, Szymon Janusz, Jacek Tabor, Przemyslaw Spurek, Arkadiusz Lewicki, Lukasz Struski. 17366-17373 [doi]
- HuiduRep: A Robust Self-Supervised Framework for Learning Neural Representations from Extracellular RecordingsFeng Cao, Zishuo Feng, Jicong Zhang, Wei Shi. 17374-17383 [doi]
- Spontaneous Yet Predictable: Shapelet-Driven, Channel-Aware Intention Decoding from Multi-Region ECoGKeren Cao, Yuhang Tian, Kaizhong Zheng, Wei Xi, Xinjian Li, Liangjun Chen. 17384-17392 [doi]
- Counterfactual eXplainable AI (XAI) Method for Deep Learning-Based Multivariate Time Series ClassificationAlan G. Paredes Cetina, Kaouther Benguessoum, Raoni Lourenço, Sylvain Kubler. 17393-17400 [doi]
- Attribution Analysis-based Concept Alignment: A Human-in-the-loop Data Debugging FrameworkLei Chai, Lu Qi, Hailong Sun 0001, Jing Zhang, Jingxuan Xu. 17401-17409 [doi]
- GazeInterpreter: Parsing Eye Gaze to Generate Eye-Body-Coordinated NarrationsQing Chang, Zhiming Hu. 17410-17418 [doi]
- D-GARA: A Dynamic Benchmarking Framework for GUI Agent Robustness in Real-World AnomaliesSen Chen, Tong Zhao, Yi Bin, Fei Ma 0006, Wenqi Shao, Zheng Wang 0044. 17419-17426 [doi]
- EMOD: A Unified EEG Emotion Representation Framework Leveraging V-A Guided Contrastive LearningYuning Chen, Sha Zhao, Shijian Li, Gang Pan 0001. 17427-17435 [doi]
- Intention-Guided Cognitive Reasoning for Egocentric Long-Term Action AnticipationQiaohui Chu, Haoyu Zhang, Meng Liu 0006, Yisen Feng, Haoxiang Shi, Liqiang Nie. 17436-17444 [doi]
- Too Sure for Our Own Good: A User Study on AI Confidence and Human RelianceCaterina Fregosi, Lucia Vicente, Andrea Campagner, Federico Cabitza. 17445-17453 [doi]
- ParetoHqD: Fast Offline Multiobjective Alignment of Large Language Models Using Pareto High-Quality DataHaoran Gu, Handing Wang, Yi Mei 0001, Mengjie Zhang 0001, Yaochu Jin. 17454-17462 [doi]
- Cross-modal Prompting for Balanced Incomplete Multi-modal Emotion RecognitionWenjue He, Xiaofeng Zhu 0001, Zheng Zhang 0006. 17463-17471 [doi]
- Can Humans Teach Machines to Code?Céline Hocquette, Johannes Langer, Andrew Cropper, Ute Schmid. 17472-17480 [doi]
- Alfa: Attentive Low-Rank Filter Adaptation for Structure-Aware Cross-Domain Personalized Gaze EstimationHe-Yen Hsieh, Wei-Te Mark Ting, H. T. Kung 0001. 17481-17489 [doi]
- Graph Neural Field with Spatial-Correlation Augmentation for HRTF PersonalizationDe Hu, Junsheng Hu, Cuicui Jiang. 17490-17498 [doi]
- SSL-CST: Cell Segmentation for Single-Cell Spatial Transcriptome Based on Self-Supervised LearningWeiliang Huo, Shilin Zhang, Suixue Wang, Qingchen Zhang 0001. 17499-17507 [doi]
- Whole-Field Action Sensing via Wearable Single-Channel EMG Sensors and Resource-Efficient Motion NetworkXuanming Jiang, Dingyu Nie, Baoyi An 0001, Yuzhe Zheng, Yichuan Mao, Jialie Shen 0001, Xueming Qian, Zhiwen Jin, Wei Lan, Guoshuai Zhao. 17508-17516 [doi]
- Mitigating Length Bias in RLHF Through a Causal LensHyeonji Kim, Sujeong Oh, Sanghack Lee. 17517-17525 [doi]
- ESCA: An Emotional Support Conversation Agent for Enhancing Reasonable Strategy Planning and Effective ExpressionJing Li, Yanxin Luo, Donghong Han, Yimeng Zhan, Xiaoming Fu 0001, Baiyou Qiao, Gang Wu 0007. 17526-17534 [doi]
- ViTE: Virtual Graph Trajectory Expert Router for Pedestrian Trajectory PredictionRuochen Li 0002, Zhanxing Zhu, Tanqiu Qiao, Hubert P. H. Shum. 17535-17543 [doi]
- Belief-Driven Value Alignment for Human-Robot CollaborationSaisai Li, Bing Shi 0002, Yiming Xia, Xiao Su 0007. 17544-17552 [doi]
- GigaMoE: Sparsity-Guided Mixture of Experts for Efficient Gigapixel Object DetectionXiang Li, Wenxi Li, Yuetong Wang, Chenyang Lyu, Haozhe Lin, Guiguang Ding, Yuchen Guo. 17553-17561 [doi]
- Bridging Cognitive Gap: Hierarchical Description Learning for Artistic Image Aesthetics AssessmentHenglin Liu, Nisha Huang, Chang Liu 0071, Jiangpeng Yan, Huijuan Huang, Jixuan Ying, Tong-Yee Lee, Pengfei Wan 0001, Xiangyang Ji. 17562-17570 [doi]
- Do Large Language Models Reason About Uncertainty Like Humans? A Benchmark on Hurricane Forecast Visualization ComprehensionLe Liu 0008, Yuhao Wang, Bohan Shen, Wei Zeng 0004, Shizhou Zhang, Di Xu 0010, Peng Wang 0015. 17571-17579 [doi]
- Leveraging Visual Blur Perception Characteristics for EEG DecodingWenchao Liu 0004, Hongwei Li 0024, Zhouyang Xu, Lin Ma 0003, Haifeng Li 0001. 17580-17588 [doi]
- MindCross: Fast New Subject Adaptation with Limited Data for Cross-subject Video Reconstruction from Brain SignalsXuan-Hao Liu, Yan-Kai Liu, Tianyi Zhou, Bao-Liang Lu, Wei-Long Zheng. 17589-17597 [doi]
- Gracefully Air-Written: Enhancing the Legibility and Style Consistency of In-Air HandwritingYu Liu, Cunrui Wang, Lin Feng, Jianxin Zhang 0001, Bo Lu 0005. 17598-17607 [doi]
- UI-R1: Enhancing Efficient Action Prediction of GUI Agents by Reinforcement LearningZhengxi Lu, Yuxiang Chai, Yaxuan Guo, Xi Yin 0011, Liang Liu, Hao Wang 0251, Han Xiao 0010, Shuai Ren 0002, Pengxiang Zhao, Guangyi Liu, Guanjing Xiong, Hongsheng Li 0001. 17608-17616 [doi]
- BrainHGT: A Hierarchical Graph Transformer for Interpretable Brain Network AnalysisJiaJun Ma, Yongchao Zhang, Chao Zhang 0047, Zhao Lv, Shengbing Pei. 17617-17625 [doi]
- TRIPLE: Theory-Driven Integration of Planned and Habitual Behaviors for LLM-based PersonalizationTaehyung Noh, Seungwan Jin, Haein Yeo, Kyungsik Han. 17626-17634 [doi]
- FGD-Align: Pluralistic Alignment for Large Language Models via Fuzzy Group Decision-MakingWeihang Pan, Zhengxu Yu, Yong Wu, Xun Liang, Zhongming Jin 0001, Qiang Fu, Penghui Shang, Binbin Lin, Xiaofei He 0001, Jieping Ye. 17635-17643 [doi]
- BiO-HMC: Dynamic Human-Machine Collaboration for Consensus Decision-Making via Bilevel OptimizationYinghui Pan, Shuaijie Zhao, Shenbao Yu, Zongyang Liu, Yifeng Zeng, Han Liu, Mingwei Lin. 17644-17651 [doi]
- Human Cognitive Biases in Explanation-based Interaction: The Case of Within and Between Session Order EffectDario Pesenti, Alessandro Bogani, Katya Tentori, Stefano Teso. 17652-17660 [doi]
- Inferring Heterogeneous Private Valuations from Offline Market Data via Entropic Risk-Sensitive Utility MaximizationXingyu Qian, Haoran Yu. 17661-17669 [doi]
- Towards Reinforcement Learning from Neural Feedback: Mapping fNIRS Signals to Agent PerformanceJulia Santaniello, Matthew Russell, Benson Jiang, Donatello Sassaroli, Robert J. K. Jacob, Jivko Sinapov. 17670-17678 [doi]
- TuningIQA: Fine-Grained Blind Image Quality Assessment for Livestreaming Camera TuningXiangfei Sheng, Zhichao Duan 0002, Xiaofeng Pan, Yipo Huang, Zhichao Yang 0013, Pengfei Chen 0003, Leida Li. 17679-17687 [doi]
- Simulating Human-Like Counseling: A Path- and Scenario-Guided Framework for Psychological Support DialogueYuanchen Shi, Longyin Zhang, Maodong Li 0003, Yibin Zheng, Xiuhong Wang, Fang Kong 0001. 17688-17696 [doi]
- Treatment Stitching with Schrödinger Bridge for Enhancing Offline Reinforcement Learning in Adaptive Treatment StrategiesDong-Hee Shin, Deok-Joong Lee, Young-Han Son, Tae-Eui Kam. 17697-17705 [doi]
- PINet: Improving the Stability of Prototype Networks via Phantasia-Inspired Uncertain RepresentationsHo Kyung Shin, Soeun Bae, Sang-Min Kim, Byoung Chul Ko, Woo-Jeoung Nam. 17706-17714 [doi]
- Reducing Goal State Divergence with Environment DesignKelsey Sikes, Sarah Keren, Sarath Sreedharan. 17715-17723 [doi]
- Human-Centric Open-Future Task Discovery: Formulation, Benchmark, and Scalable Tree-Based SearchZijian Song, Xiaoxin Lin, Tao Pu 0002, Zhenlong Yuan, Guangrun Wang, Liang Lin. 17724-17732 [doi]
- EEG-DLite: Dataset Distillation for Efficient Large EEG Model TrainingYuting Tang, Weibang Jiang, Shanglin Li, Yong Li 0032, Chenyu Liu, Xinliang Zhou, Yi Ding 0012, Cuntai Guan. 17733-17741 [doi]
- AutoGameUI: Constructing High-Fidelity GameUI via Multimodal Correspondence MatchingZhongliang Tang, Qingrong Cheng, Mengchen Tan, Yongxiang Zhang 0003, Fei Xia. 17742-17750 [doi]
- Consensus-Driven Multi-Agent Cognitive Reasoning for Enhancing the Emotional Intelligence of Large Language ModelsGeng Tu, Dingming Li, Jun Huang, Ruifeng Xu. 17751-17759 [doi]
- Inferring Implicit Goals Across Differing Task ModelsSilvia Tulli, Stylianos Loukas Vasileiou, Mohamed Chetouani, Sarath Sreedharan. 17760-17768 [doi]
- Emotion-Conditioned Motion Sub-spaces with Flow Matching for Real-Time Audio-Driven Talking HeadsHaoyu Wang, Xiaozhe Xin, Xiaoyu Qin, Meiguang Jin, Junfeng Ma, Dan Xu, Jia Jia. 17769-17777 [doi]
- Perceive More with Less: LiDAR Point Cloud Compression at Just Recognizable Distortion for 3D Scene UnderstandingMiaohui Wang, Runnan Huang, Taojun Liu, Shuyuan Lin, Ye Liu 0005, Yun Song. 17778-17786 [doi]
- New Synthetic Goldmine: Hand Joint Angle-Driven EMG Data Generation Framework for Micro-Gesture RecognitionNana Wang, Suli Wang, Gen Li, Pengfei Ren, Hao Su. 17787-17795 [doi]
- A Brain-Inspired Saliency Prediction Framework for Human-AI Cognitive Consistency in AIGC Content via Multi-Region Liquid NeuronsShibo Wang, Yan Zhao 0012, Shigang Wang 0003, Jian Wei, Shuo Li. 17796-17804 [doi]
- BraSTORM: A Dual-Branch Self-Supervised Framework for EEG Representation Learning via Input-Level Spatio-Temporal DecompositionYifan Wang, Der Horng Lee, Bruce X. B. Yu. 17805-17813 [doi]
- Reasoning Shapes Alignment: Investigating Cultural Alignment in Large Reasoning Models with Cultural NormsYuhang Wang, YanXu Zhu, Jitao Sang. 17814-17822 [doi]
- QueryCraft: Transformer-Guided Query Initialization for Enhanced Human-Object Interaction DetectionYuxiao Wang 0003, Wolin Liang, Yu Lei, Weiying Xue, Nan Zhuang, Qi Liu 0005. 17823-17831 [doi]
- What-Meets-Where: Unified Learning of Action and Contact Localization in ImagesYuxiao Wang 0003, Yu Lei, Wolin Liang, Weiying Xue, Zhenao Wei, Nan Zhuang, Qi Liu 0005. 17832-17840 [doi]
- F.A.C.U.L.: Language-Based Interaction with AI Companions in GamingWenya Wei, Sipeng Yang, Qixian Zhou, Ruochen Liu, Xuelei Zhang, Yifu Yuan, Yan Jiang, Yongle Luo, Hailong Wang, Tianzhou Wang, Peipei Jin, Wangtong Liu, Zhou Zhao, Xiaogang Jin 0001, Elvis S. Liu. 17841-17849 [doi]
- State Mamba: Spatiotemporal EEG State-Space Model with Dynamic Brain Alignment for Cross-Subject RepresentationWeining Weng, Yang Gu 0001, Yuan Ma, Yuchen Liu, Yingwei Zhang 0002, Yiqiang Chen 0001. 17850-17858 [doi]
- Shrinking the Teacher: An Adaptive Teaching Paradigm for Asymmetric EEG-Vision AlignmentLukun Wu, Jie Li 0001, Ziqi Ren, Kaifan Zhang, Xinbo Gao 0001. 17859-17867 [doi]
- Multigranular Evaluation for Brain Visual DecodingWeihao Xia 0001, Cengiz Öztireli. 17868-17876 [doi]
- Automated Human Strategic Behavior Modeling via Large Language ModelsXiaohan Xie, Haoran Yu 0001, Biying Shou, Jianwei Huang 0001. 17877-17885 [doi]
- Point Cloud Quality Assessment via Multi-View Structure-Aware Feature FusionJian Xiong 0005, Lingxia Jiang, Xianzhong Long, Miaohui Wang, Hao Gao 0005. 17886-17894 [doi]
- CyC3D: Fine-grained Controllable 3D Generation via Cycle Consistency RegularizationHongbin Xu, Chaohui Yu, Feng Xiao, Jiazheng Xing, Hai Ci, Weitao Chen, Fan Wang 0019, Ming Li. 17895-17903 [doi]
- AR-Nav Benchmark: Augmented Reality Navigation with Vision and LanguageLiqi Yan, Yihao Wu, Chenyi Xu, Chao Yang, Jianhui Zhang, Pan Li 0001. 17904-17912 [doi]
- On Coresets for End-to-end Learning from CrowdsHang Yang, Zhiwu Li 0001, Witold Pedrycz. 17913-17920 [doi]
- Plug-and-Play Clarifier: A Zero-Shot Multimodal Framework for Egocentric Intent DisambiguationSicheng Yang, Yukai Huang, Weitong Cai, Shitong Sun, You He, Jiankang Deng, Hang Zhang, Jifei Song, Zhensong Zhang. 17921-17929 [doi]
- SUGAR: Learning Skeleton Representation with Visual-Motion Knowledge for Action RecognitionQilang Ye, Yu Zhou 0015, Lian He, Jie Zhang, Xuanming Guo, Jiayu Zhang, Mingkui Tan, Weicheng Xie 0001, Yue Sun 0001, Tao Tan 0002, Xiaochen Yuan, Ghada Khoriba, Zitong Yu. 17930-17938 [doi]
- Reconstruction Attack-Resistant Inference Paradigm for LLM Cloud ServicesZipeng Ye, Wenjian Luo, Qi Zhou, Yubo Tang. 17939-17947 [doi]
- TiCAL: Typicality-Based Consistency-Aware Learning for Multimodal Emotion RecognitionWen Yin, Siyu Zhan, Cencen Liu, Xin Hu, Guiduo Duan, Xiurui Xie, Yuan-Fang Li, Tao He 0007. 17948-17956 [doi]
- 2D-CrossScan Mamba: Enhancing State Space Models with Spatially Consistent Multi-Path 2D Information PropagationLongLong Yu, Wenxi Li, Yaoqi Sun, Hang Xu, Chenggang Yan 0001, Yuchen Guo. 17957-17965 [doi]
- MF-Speech: Achieving Fine-Grained and Compositional Control in Speech Generation via Factor DisentanglementXinyue Yu, Youqing Fang, Pingyu Wu, Guoyang Ye, Wenbo Zhou 0004, Weiming Zhang, Song Xiao. 17966-17974 [doi]
- ELSPR: Evaluator LLM Training Data Self-Purification on Non-Transitive Preferences via Tournament Graph ReconstructionYan Yu, Yilun Liu 0001, Minggui He, Shimin Tao, Weibin Meng, Xinhua Yang, Li Zhang, Hongxia Ma, Dengye Li, Daimeng Wei, Boxing Chen, Fuliang Li. 17975-17983 [doi]
- PressTrack-HMR: Pressure-Based Top-Down Multi-Person Global Human Mesh RecoveryJiayue Yuan, Fangting Xie, Guangwen Ouyang, Changhai Ma, Ziyu Wu, Heyu Ding, Quan Wan, Yi Ke, Yuchen Wu, Xiaohui Cai. 17984-17992 [doi]
- SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber WorldJiaqi Zhang, Chen Gao 0001, Liyuan Zhang, Quoc Viet Hung Nguyen, Hongzhi Yin. 17993-18001 [doi]
- SEBSFormer: A Spectral-Enhanced Bi-Stream Transformer for Robust EEG DecodingLin Zhang, Shikui Tu, Lei Xu 0001. 18002-18010 [doi]
- GATCL: An Adaptive Contrastive Learning Framework Based on MHGAT for Spatial Domain Identification in Spatial TranscriptomicsShilin Zhang, Weiliang Huo, Qingchen Zhang 0001, Xiulong Liu 0001. 18011-18019 [doi]
- SAMGTD: Spatial-Aware Masked Graph Transformer-Diffusion Model for Enhanced Cell Type Deconvolution in Spatial TranscriptomicsShilin Zhang, Suixue Wang, Qingchen Zhang 0001, Xiulong Liu 0001. 18020-18027 [doi]
- NeuroBridge: Bio-Inspired Self-Supervised EEG-to-Image Decoding via Cognitive Priors and Bidirectional Semantic AlignmentWenjiang Zhang, Sifeng Wang, Yuwei Su, Xinyu Li, Chen Zhang, Suyu Zhong. 18028-18036 [doi]
- Single-Stage fMRI-to-3D Reconstruction via Viewpoint-Aware Embedding and Hierarchical GuidanceXun Zhang, Weihao Xia 0001, Yulong Liu, Bo Yang 0027, Alessandro Bozzon, Pan Wang 0005. 18037-18045 [doi]
- SPARD: Single-step Inference with Adaptive Sampling in Residual Diffusion for Human Motion PredictionYiming Zhang, Baojia Han, Ximing Li 0002, Wei Pang 0001, Fausto Giunchiglia, Xiaoyue Feng, Renchu Guan. 18046-18054 [doi]
- Coverage-Constrained Human-AI Cooperation with Multiple ExpertsZheng Zhang 0046, Cuong C. Nguyen, Kevin Wells, Thanh-Toan Do, David Rosewarne, Gustavo Carneiro 0001. 18055-18062 [doi]
- EEG Agent: A Unified Framework for Automated EEG Analysis Using Large Language ModelsSha Zhao, Mingyi Peng, Haiteng Jiang, Tao Li, Shijian Li. 18063-18071 [doi]
- TrajEvo: Trajectory Prediction Heuristics Design via LLM-driven EvolutionZhikai Zhao, Chuanbo Hua, Federico Berto, Kanghoon Lee, Zihan Ma 0003, Jiachen Li 0001, Jinkyoo Park. 18072-18080 [doi]
- S³: Spiking Neurons as an Isolating Segmenter for Brain Signal DecodingQian Zheng, Ming Chen, Sha Zhao, Shi Gu, Peng Lin, De Ma, Huajin Tang, Gang Pan 0001. 18081-18089 [doi]
- CAT-Net: A Cross-Attention Tone Network for Cross-Subject EEG-EMG Fusion Tone DecodingYifan Zhuang, Calvin Huang, Zepeng Yu, Yongjie Zou, Jiawei Ju. 18090-18098 [doi]
- MindSight: A Bio-Inspired Neural Architecture for Visual Restoration via Cortical Electrical StimulationYongjie Zou, Haonan Niu, Bin Zhao, Guoliang Yi, Mengchuanzhi Yang, Jiawei Ju, Jiapeng Yin, Chengyu T. Li. 18099-18107 [doi]
- Enhancing the Knowledge Tracing via a Plug-In Guided Diffusion ModelShuaishuai Zu, Jihao Zhao, Biao Qin. 18108-18116 [doi]
- GRIM: Task-Oriented Grasping with Conditioning on Generative ExamplesShailesh, Alok Raj, Nayan Kumar, Priya Shukla, Andrew Melnik, Michael Beetz, Gora Chand Nandi. 18118-18125 [doi]
- Dexterous Manipulation Transfer via Progressive Kinematic-Dynamic AlignmentWenbin Bai, Qiyu Chen, Xiangbo Lin, Jw L, Quancheng Li, Hejiang Pan, Yi Sun. 18126-18134 [doi]
- H-RDT: Human Manipulation Enhanced Bimanual Robotic ManipulationHongzhe Bi, Lingxuan Wu, Tianwei Lin, Hengkai Tan, Zhizhong Su, Hang Su 0006, Jun Zhu 0001. 18135-18143 [doi]
- Steering Visuomotor Policy in Open Worlds via Cross-View Goal AlignmentShaofei Cai, Zhancun Mu, Anji Liu, Yitao Liang. 18144-18152 [doi]
- A Natural-Gradient Approach for Nonlinear Stochastic Systems with Parameter UncertaintyLiang Cao. 18153-18160 [doi]
- AerialVLA: A Vision-Language-Action Model for Aerial Navigation with Online DialogueJinyu Chen, HongYu Li, Zongheng Tang, Xiaoduo Li, Wenjun Wu, Si Liu 0001. 18161-18169 [doi]
- PIPHEN: Physical Interaction Prediction with Hamiltonian Energy NetworksKewei Chen, Yayu Long, Mingsheng Shang 0001. 18170-18179 [doi]
- FT-NCFM: An Influence-Aware Data Distillation Framework for Efficient VLA ModelsKewei Chen, Yayu Long, Shuai Li 0002, Mingsheng Shang 0001. 18180-18188 [doi]
- ManiLong-Shot: Interaction-Aware One-Shot Imitation Learning for Long-Horizon ManipulationZixuan Chen, Chongkai Gao, Lin Shao 0002, Jieqi Shi, Jing Huo, Yang Gao 0001. 18189-18197 [doi]
- STOLA: Self-Adaptive Touch-Language Framework for Tactile Commonsense Reasoning in Open-Ended ScenariosNing Cheng, Jinan Xu, Jialing Chen, Bin Fang 0003, Wenjuan Han. 18198-18206 [doi]
- PEOD: A Pixel-Aligned Event-RGB Benchmark for Object Detection Under Challenging ConditionsLuoping Cui, Hanqing Liu, Mingjie Liu, Endian Lin, Donghong Jiang, Yuhao Wang, Chuang Zhu. 18207-18215 [doi]
- RflyPano: A Panoramic Benchmark for Ultra-low Altitude UAV Localization Powered by RflySimDun Dai, Ze Lu, Xunhua Dai, Quan Quan. 18216-18224 [doi]
- History-Enhanced Two-Stage Transformer for Aerial Vision-and-Language NavigationXichen Ding, Jianzhe Gao, Cong Pan, Wenguan Wang, Jie Qin. 18225-18233 [doi]
- NaVLA$^2$: A Vision-Language-Audio-Action Model for Multimodal Instruction NavigationJugang Fan, Peihao Chen, Changhao Li, Qing Du, Jian Chen 0011, Mingkui Tan. 18234-18242 [doi]
- MHED-SLAM: Multi-Scale Hybrid Encoding-Based Decoupled SLAMDengfang Feng, Wenyang Qin, Zhongchen Shi, Wei Chen 0092, Yanhui Duan, Liang Xie 0012, Erwei Yin. 18243-18252 [doi]
- VPN: Visual Prompt NavigationShuo Feng, Zihan Wang, Yuchen Li 0006, Rui Kong, Hengyi Cai, Shuaiqiang Wang, Gim Hee Lee, Piji Li, Shuqiang Jiang. 18253-18261 [doi]
- Learning Diffusion Policy from Primitive Skills for Robot ManipulationZhihao Gu, Ming Yang, Difan Zou, Dong Xu. 18262-18270 [doi]
- Just Few States Are Enough: Randomized Sparse Feedback for Stability of Dynamical SystemsZaid Hadach, Hajar El Hammouti, El Houcine Bergou, Adnane Saoud. 18271-18278 [doi]
- SeqWalker: Sequential-Horizon Vision-and-Language Navigation with Hierarchical PlanningZebin Han, Xudong Wang, Baichen Liu, Qi Lyu, Zhenduo Shang, Jiahua Dong 0001, Lianqing Liu, Zhi Han. 18279-18287 [doi]
- Learning Object-Centric Motion Priors from Human for Robotic Dexterous ManipulationZhengdong Hong, Guofeng Zhang 0001. 18288-18296 [doi]
- LOG-Nav: Efficient Layout-Aware Object-Goal Navigation with Hierarchical PlanningJiawei Hou, Yuting Xiao, Xiangyang Xue 0001, Taiping Zeng. 18297-18305 [doi]
- Real Garment Benchmark (RGBench): A Comprehensive Benchmark for Robotic Garment Manipulation Featuring a High-Fidelity Scalable SimulatorWenkang Hu, Xincheng Tang, Yanzhi E, Yitong Li, Zhengjie Shu, Wei Li 0111, Huamin Wang 0001, Ruigang Yang. 18306-18314 [doi]
- UNeMo: Collaborative Visual-Language Reasoning and Navigation via a Multimodal World ModelChangxin Huang, Lv Tang, Zhaohuan Zhan, Lisha Yu, Runhao Zeng, Zun Liu, Zhengjie Wang, JianQiang Li. 18315-18323 [doi]
- GraphCoT-VLA: A 3D Spatial-Aware Reasoning Vision-Language-Action Model for Robotic Manipulation with Ambiguous InstructionsHelong Huang, Min Cen, Kai Tan, Xingyue Quan, Guowei Huang, Hong Zhang. 18324-18332 [doi]
- RENEW: Risk- and Energy-Aware Navigation in Dynamic WaterwaysMingi Jeong, Alberto Quattrini Li. 18333-18341 [doi]
- Towards Autonomous UAV Visual Object Search in City Space: Benchmark and Agentic MethodologyYatai Ji, Zhengqiu Zhu, Yong Zhao, Beidan Liu, Chen Gao 0001, Yihao Zhao, Sihang Qiu, Yue Hu 0016, Quanjun Yin. 18342-18350 [doi]
- PanoNav: Mapless Zero-Shot Object Navigation with Panoramic Scene Parsing and Dynamic MemoryQunchao Jin, Yilin Wu, Changhao Chen. 18351-18359 [doi]
- PhyPlan: Learning to Plan Tasks with Generalizable and Rapid Physical Reasoning for Embodied ManipulationAnkit Kanwar, Hartej Soin, Abhinav Barnawal, Mudit Chopra, Harshil Vagadia, Tamajit Banerjee, Shreshth Tuli, Rohan Paul, Souvik Chakraborty. 18360-18369 [doi]
- Lightweight Adaptive Topological Layout and Semantic Mapping in Vision-and-Language Navigation on WebsitesPingrui Lai, Zihao Xie, Hua Yang. 18370-18378 [doi]
- DiTEA: Mixture-of-Experts for Vision-Language-Action Model in Robotic ManipulationChengxuan Li, Xingwan Wang. 18379-18387 [doi]
- Towards Efficient and Robust Manipulation via Multi-Frame Vision-Language-Action ModelingHao Li 0069, Shuai Yang 0001, Yilun Chen, Xinyi Chen, Xiaoda Yang, Yang Tian, Hanqing Wang, Tai Wang, Dahua Lin, Feng Zhao 0004, Jiangmiao Pang. 18388-18396 [doi]
- SemanticVLA: Semantic-Aligned Sparsification and Enhancement for Efficient Robotic ManipulationWei Li, Renshan Zhang, Rui Shao 0001, Zhijian Fang, Kaiwen Zhou 0001, Zhuotao Tian, Liqiang Nie. 18397-18405 [doi]
- LiDARCrafter: Dynamic 4D World Modeling from LiDAR SequencesAlan Liang, Youquan Liu, Yu Yang, Dongyue Lu, Linfeng Li, Lingdong Kong, Huaici Zhao, Wei Tsang Ooi. 18406-18414 [doi]
- Cook and Clean Together: Teaching Embodied Agents for Parallel Task ExecutionDingkang Liang, Cheng Zhang 0020, Xiaopeng Xu, Jianzhong Ju, Zhenbo Luo, Xiang Bai. 18415-18424 [doi]
- A3D: Adaptive Affordance Assembly with Dual-Arm ManipulationJiaqi Liang, Yue Chen, Qize Yu, Yan Shen 0035, Haipeng Zhang 0006, Hao Dong 0003, Ruihai Wu. 18425-18433 [doi]
- Whole-Body Coordination for Dynamic Object Grasping with Legged ManipulatorsQiwei Liang, Boyang Cai, Rongyi He, Hui Li, Tao Teng, Haihan Duan, Changxin Huang, Runhao Zeng. 18434-18442 [doi]
- Affordance-Guided Coarse-to-Fine Exploration for Base Placement in Open-Vocabulary Mobile ManipulationTzu-Jung Lin, Jia-Fong Yeh, Hung-Ting Su, Chung-Yi Lin, Yi-Ting Chen, Winston H. Hsu. 18443-18451 [doi]
- TTF-VLA: Temporal Token Fusion via Pixel-Attention Integration for Vision-Language-Action ModelsChenghao Liu, Jiachen Zhang, Chengxuan Li, Zhimu Zhou, Shixin Wu, Songfang Huang, Huiling Duan. 18452-18459 [doi]
- FoAM: Foresight-Augmented Multi-Task Imitation Policy for Robotic ManipulationLitao Liu, Wentao Wang, Yifan Han, Zhuoli Xie, Pengfei Yi, Junyan Li, Wenzhao Lian. 18460-18468 [doi]
- Intention-Aware Diffusion Model for Pedestrian Trajectory PredictionYu Liu, Zhijie Liu, Xiao Ren, Youfu Li 0001, He Kong 0001. 18469-18477 [doi]
- SIAM: Towards Generalizable Articulated Object Modeling via Single Robot-Object InteractionYuyan Liu, Li Zhang 0104, Di Wu, Yan Zhang 0053, Anran Huang, Zhi Wang, Liu Liu 0012, Dan Guo 0001. 18478-18486 [doi]
- DNOI-4DRO: Deep 4D Radar Odometry with Differentiable Neural-Optimization IterationsShouyi Lu, Huanyu Zhou, Guirong Zhuo, Xiao Tang. 18487-18495 [doi]
- TouchFormer: A Robust Transformer-based Framework for Multimodal Material PerceptionKailin Lyu, Long Xiao, Jianing Zeng, Junhao Dong 0001, Xuexin Liu, Zhuojun Zou, Haoyue Yang, Lin Shu, Jie Hao. 18496-18504 [doi]
- UrbanNav: Learning Language-Guided Embodied Urban Navigation from Web-Scale Human TrajectoriesYanghong Mei, Yirong Yang, Longteng Guo, Qunbo Wang, Ming-Ming Yu, Xingjian He, Wenjun Wu 0001, Jing Liu 0001. 18505-18513 [doi]
- Autonomous Vehicle Path Planning by Searching with Differentiable SimulationAsen Nachkov, Jan-Nico Zaech, Danda Pani Paudel, Xi Wang 0021, Luc Van Gool. 18514-18522 [doi]
- Coordinated Humanoid Robot Locomotion with Symmetry Equivariant Reinforcement Learning PolicyBuqing Nie, Yang Zhang, Rongjun Jin, Zhanxiang Cao, Huangxuan Lin, Xiaokang Yang, Yue Gao 0005. 18523-18531 [doi]
- MP1: MeanFlow Tames Policy Learning in 1-step for Robotic ManipulationJuyi Sheng, Ziyi Wang, Peiming Li, Mengyuan Liu. 18532-18539 [doi]
- Real-Time Path Planning for UAVs in Windy Environments Without Computational Fluid DynamicsAbhudaya Shrivastava, Shelly Gupta 0001, Zoran Obradovic. 18540-18548 [doi]
- ReconVLA: Reconstructive Vision-Language-Action Model as Effective Robot PerceiverWenxuan Song, Ziyang Zhou, Han Zhao 0008, Jiayi Chen, Pengxiang Ding, Haodong Yan, Yuxin Huang, Feilong Tang, Donglin Wang, Haoang Li. 18549-18557 [doi]
- ManipLVM-R1: Reinforcement Learning for Reasoning in Embodied Manipulation with Large Vision-Language ModelsZirui Song, Guangxian Ouyang, Mingzhe Li 0001, Yuheng Ji, Chenxi Wang, Zixiang Xu, Zeyu Zhang, Xiaoqing Zhang 0017, Qian Jiang, Fengxian Ji, Zhenhao Chen, Zhongzhi Li, Xiuying Chen. 18558-18566 [doi]
- VirtualEnv: A Platform for Embodied AI ResearchKabir Swain, Sijie Han, Ayush Raina, Jin Zhang, Shuang Li 0013, Michael Stopa, Antonio Torralba 0001. 18567-18574 [doi]
- FARM: Frame-Accelerated Augmentation and Residual Mixture-of-Experts for Physics-Based High-Dynamic Humanoid ControlTan Jing, Shiting Chen, Yangfan Li, Weisheng Xu, Renjing Xu. 18575-18583 [doi]
- WorldAgen: Unified State-Action Prediction with Test-Time World Model TrainingChi Wan, Kangrui Wang, Yuan Si, Pingyue Zhang, Manling Li. 18584-18592 [doi]
- LatentVLA: Taming Latent Space for Generalizable and Long-Horizon Bimanual ManipulationJunming Wang 0001. 18593-18601 [doi]
- ODYSSEY: Open-World Quadrupeds Exploration and Manipulation for Long-Horizon TasksKaijun Wang, Liqin Lu, Mingyu Liu, Jianuo Jiang, Zeju Li, Bolin Zhang, Wancai Zheng, Xinyi Yu, Hao Chen, Chunhua Shen. 18602-18610 [doi]
- PLUM-Net: Prototype-Induced Label Structuring for Disentangled Multimodal Representation NetworkKehan Wang, Huan Zhao 0003, Yong Wei, Xupeng Zha, Guanghui Ye, Cheng Zhu, Yiming Liu, Zixing Zhang 0001. 18611-18619 [doi]
- Expand Your SCOPE: Semantic Cognition over Potential-Based Exploration for Embodied Visual NavigationNingnan Wang, Weihuang Chen, Liming Chen, Haoxuan Ji, Zhongyu Guo, Xuchong Zhang, Hongbin Sun 0001. 18620-18628 [doi]
- Lifelong Language-Conditioned Robotic Manipulation LearningXudong Wang, Zebin Han, Zhiyu Liu, Gan Li, Jiahua Dong 0001, Baichen Liu, Lianqing Liu, Zhi Han. 18629-18637 [doi]
- VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action ModelYihao Wang, Pengxiang Ding, Lingxiao Li, Can Cui 0008, Zirui Ge, Xinyang Tong, Wenxuan Song, Han Zhao 0008, Wei Zhao, Pengxu Hou, Siteng Huang, Yifan Tang, Wenhui Wang, Ru Zhang, Jianyi Liu, Donglin Wang. 18638-18646 [doi]
- Self-supervised Multiplex Consensus Mamba for General Image FusionYingying Wang 0005, Rongjin Zhuang, Hui Zheng 0003, Xuanhua He, Ke Cao, Xiaotong Tu, Xinghao Ding. 18647-18655 [doi]
- MMMamba: A Versatile Cross-Modal in Context Fusion Framework for Pan-Sharpening and Zero-Shot Image EnhancementYingying Wang 0005, Xuanhua He, Chen Wu, Jialing Huang, Suiyun Zhang, Rui Liu, Xinghao Ding, Haoxuan Che. 18656-18664 [doi]
- ForeDiffusion: Foresight-Conditioned Diffusion Policy via Future View Construction for Robot ManipulationWeize Xie, Yi Ding, Ying He 0006, LeiLei Wang, Binwen Bai, Zheyi Zhao, Chenyang Wang 0001, F. Richard Yu. 18665-18673 [doi]
- Firing Bits Where It Matters: Spiking-Guided Just Recognizable Distortion Modeling for Machine-Centric Video CodingWuyuan Xie, Zhenming Li, Yuwu Lu, Di Lin 0002, Yun Song, Miaohui Wang. 18674-18682 [doi]
- Zero-Shot Robotic Manipulation via 3D Gaussian Splatting-Enhanced Multimodal Retrieval-Augmented GenerationZilong Xie, Jingyu Gong, Xin Tan 0002, Zhizhong Zhang 0001, Yuan Xie 0006. 18683-18691 [doi]
- Actor-Critic for Continuous Action Chunks: A Reinforcement Learning Framework for Long-Horizon Robotic Manipulation with Sparse RewardJiarui Yang, Bin Zhu 0006, Jingjing Chen 0001, Yu-Gang Jiang 0001. 18692-18700 [doi]
- ReflexDiffusion: Reflection-Enhanced Trajectory Planning for High-lateral-acceleration Scenarios in Autonomous DrivingXuemei Yao, Xiao Yang, Jianbin Sun, Liuwei Xie, Xuebin Shao, Xiyu Fang, Hang Su, Kewei Yang. 18701-18709 [doi]
- Indoor Multi-View Radar Object Detection via 3D Bounding Box DiffusionRyoma Yataka, Pu Perry Wang, Petros Boufounos, Ryuhei Takahashi. 18710-18718 [doi]
- GraphGrasp: Lightweight and Efficient Graph-Guided 6-DoF Robotic Grasp Pose Estimation NetworkSheng Yu 0009, Di-Hua Zhai, Yuanqing Xia. 18719-18727 [doi]
- Learning from Human Gaze: Human-like Robot Social Navigation in Dense CrowdsZhecheng Yu, Yan Lyu, Chen Yang, Tao Chen, Yishuang Zhang, Bo Ling, Peng Wang, Guanyu Gao, Weiwei Wu, Brian Y. Lim. 18728-18736 [doi]
- CorrectNav: Self-Correction Flywheel Empowers Vision-Language-Action Navigation ModelZhuoyuan Yu, Yuxing Long, Zihan Yang, Chengyan Zeng, Hongwei Fan, Jiyao Zhang, Hao Dong 0003. 18737-18745 [doi]
- DIMM: Decoupled Multi-hierarchy Kalman Filter via Reinforcement LearningJirong Zha, Yuxuan Fan, Kai Li, Han Li, Chen Gao 0001, Xinlei Chen. 18746-18754 [doi]
- Balancing Signal and Variance: Adaptive Offline RL Post-Training for VLA Flow ModelsHongyin Zhang, Shiyuan Zhang, Junxi Jin, Qixin Zeng, Yifan Qiao, Hongchao Lu, Donglin Wang. 18755-18763 [doi]
- MoLe-VLA: Dynamic Layer-skipping Vision Language Action Model via Mixture-of-Layers for Efficient Robot ManipulationRongyu Zhang, Menghang Dong, Yuan Zhang 0020, Liang Heng, Xiaowei Chi, Gaole Dai, Li Du, Dan Wang 0002, Yuan Du, Shanghang Zhang. 18764-18772 [doi]
- RaLD: Generating High-Resolution 3D Radar Point Clouds with Latent DiffusionRuijie Zhang, Bixin Zeng, Shengpeng Wang, Fuhui Zhou, Wei Wang 0050. 18773-18781 [doi]
- Grounding Actions in Camera Space: Observation-Centric Vision-Language-Action PolicyTianyi Zhang, Haonan Duan 0001, Haoran Hao 0003, Yu Qiao 0001, Jifeng Dai, Zhi Hou. 18782-18790 [doi]
- Agent Journey Beyond RGB: Hierarchical Semantic-Spatial Representation Enrichment for Vision-and-Language NavigationXuesong Zhang, Yunbo Xu, Jia Li 0013, Ruonan Liu, Zhenzhen Hu 0004. 18791-18799 [doi]
- Keep On Going: Learning Robust Humanoid Motion Skills via Selective Adversarial TrainingYang Zhang, Zhanxiang Cao, Buqing Nie, Haoyang Li 0009, Jiangwei Zhong, Qiao Sun, Xiaoyi Hu, Xiaokang Yang, Yue Gao 0005. 18800-18808 [doi]
- Bridging Scale Discrepancies in Robotic Control via Language-Based Action RepresentationsYuchi Zhang, Churui Sun, Shiqi Liang, Diyuan Liu, Chao Ji, Weinan Zhang 0003, Ting Liu 0001. 18809-18817 [doi]
- Towards Adaptive Humanoid Control via Multi-Behavior Distillation and Reinforced Fine-TuningYingnan Zhao 0002, Xinmiao Wang, Dewei Wang, Xinzhe Liu, Dan Lu 0004, Qilong Han, Peng Liu 0008, Chenjia Bai. 18818-18826 [doi]
- CoEvoer: Collaborative Evolution Transformer for Upper-Body Expressive Human Pose and Shape EstimationYuxiang Zhao, Wei Huang 0050, Yujie Song, Liu Wang, Huan Zhao. 18827-18835 [doi]
- DexGraspVLA: A Vision-Language-Action Framework Towards General Dexterous GraspingYifan Zhong, Xuchuan Huang, Ruochong Li, Ceyao Zhang, Zhang Chen, Tianrui Guan, Fanlian Zeng, Ka Nam Lui, Yuyao Ye, Yitao Liang, Yaodong Yang 0001, Yuanpei Chen. 18836-18844 [doi]
- Run, Ruminate, and Regulate: A Dual-process Thinking System for Vision-and-Language NavigationYu Zhong, Zihao Zhang, Rui Zhang 0040, Lingdong Huang, Haihan Gao, Shuo Wang, Da Li, Ruijian Han, Jiaming Guo, Shaohui Peng, Di Huang, Yunji Chen. 18845-18854 [doi]
- Gentle Manipulation Policy Learning via Demonstrations from VLM Planned Atomic SkillsJiayu Zhou, Qiwei Wu 0001, Jian Li, Zhe Chen, Xiaogang Xiong, Renjing Xu. 18855-18863 [doi]
- Collaborative Representation Learning for Alignment of Tactile, Language, and Vision ModalitiesYiyun Zhou, Mingjing Xu, Jingwei Shi, Quanjiang Li, Jingyuan Chen. 18864-18872 [doi]
- Effective Robotic Cloth Grasping Through Suppressing False DiscoveriesXingyu Zhu 0014, Zhiwen Tu, Yan Wu 0002, Shan Luo 0001, Hechang Chen, Yixing Gao. 18873-18881 [doi]
- H-GAR: A Hierarchical Interaction Framework via Goal-Driven Observation-Action Refinement for Robotic ManipulationYijie Zhu, Rui Shao 0001, Ziyang Liu, Jie He, Jizhihui Liu, Jiuru Wang, Zitong Yu. 18882-18890 [doi]
- D²PPO: Diffusion Policy Policy Optimization with Dispersive LossGuowei Zou, Weibing Li, Hejun Wu, Yukun Qian, Yuhang Wang, Haitao Wang. 18891-18899 [doi]
- NoReGeo: Non-Reasoning Geometry BenchmarkIrina Abdullaeva, Anton Vasiliuk, Elizaveta Goncharova, Temurbek Rahmatullaev, Zagorulko Ivan, Maxim Kurkin, Andrey Kuznetsov. 18901-18909 [doi]
- Conditional Probabilistic Bipolar Argumentation Framework: Explanations, Complexity and ApproximationGianvincenzo Alfano, Sergio Greco, Domenico Mandaglio, Francesco Parisi, Irina Trubitsyna. 18910-18919 [doi]
- Hybrid Semantics Accounting for Argument TypesLeila Amgoud, Marco Hanocq, Marie-Christine Lagasquie-Schiex. 18920-18927 [doi]
- Expressive Recursive Answers for Ontological Knowledge BasesLuca Andolfi, Gianluca Cima, Marco Console, Maurizio Lenzerini. 18928-18935 [doi]
- Under-Approximating Semantics in Clustered Assumption-Based ArgumentationIosif Apostolakis, Johannes P. Wallner. 18936-18943 [doi]
- Argumentative Debates for Transparent Bias DetectionHamed Ayoobi, Nico Potyka, Anna Rapberger, Francesca Toni. 18944-18952 [doi]
- Data Complexity of Querying Description Logic Knowledge Bases Under Cost-Based SemanticsMeghyn Bienvenu, Quentin Manière. 18953-18960 [doi]
- Learning from Answer Sets via Single-Shot Disjunctive ASP EncodingRoberto Borelli, Agostino Dovier. 18961-18968 [doi]
- Automata-less Monitoring via Trace-CheckingAndrea Brunello, Luca Geatti, Angelo Montanari, Nicola Saccomanno. 18969-18976 [doi]
- VSPO: Validating Semantic Pitfalls in Ontology via LLM-Based CQ GenerationHyojun Choi, Seokju Hwang, Kyong-Ho Lee. 18977-18984 [doi]
- Geo2Vec: Shape- and Distance-Aware Neural Representation of Geospatial EntitiesChen Chu, Cyrus Shahabi. 18985-18993 [doi]
- Foundations of Formal Reasoning over Knowledge Bases Combining Symbolic and Sub-Symbolic KnowledgeGianluca Cima, Marco Console, Laura Papi. 18994-19002 [doi]
- Efficient Rule Induction by Ignoring Pointless RulesAndrew Cropper, David M. Cerna. 19003-19011 [doi]
- Symmetry Breaking for Inductive Logic ProgrammingAndrew Cropper, David M. Cerna, Matti Järvisalo. 19012-19020 [doi]
- Generalizing Analogical Inference from Boolean to Continuous DomainsFrancisco Cunha, Yves Lepage, Miguel Couceiro, Zied Bouraoui. 19021-19029 [doi]
- 2-ASP(Q) Solving Based on CEGARAndrea Cuteri, Giuseppe Mazzotta, Francesco Ricca. 19030-19038 [doi]
- A Topological Rewriting of Tarski's MereogeometryRichard Dapoigny. 19039-19046 [doi]
- Strategic Reasoning over Golog Programs in the Nondeterministic Situation CalculusGiuseppe De Giacomo, Yves Lespérance, Matteo Mancanelli. 19047-19054 [doi]
- GraphOracle: Efficient Fully-Inductive Knowledge Graph Reasoning via Relation-Dependency GraphsEnjun Du, Siyi Liu, Yongqi Zhang. 19055-19063 [doi]
- Decidable Multi-agent Epistemic Planning: A Situation Calculus ApproachQihui Feng, Gerhard Lakemeyer. 19064-19072 [doi]
- Computing Syntax Tree-based Minimal Unsatisfiable Cores of LTLf FormulasValeria Fionda, Antonio Ielo, Francesco Ricca. 19073-19081 [doi]
- Two Heads Are Better than One: Distilling Large Language Model Features into Small Models with Feature Decomposition and MixtureTianhao Fu, Xinxin Xu 0006, Weichen Xu 0001, Jue Chen, Ruilong Ren, Bowen Deng, Xinyu Zhao, Jian Cao 0002, Xixin Cao. 19082-19090 [doi]
- Active Learning of Symbolic Automata over Rational NumbersSebastián Hagedorn Gaete, Martin Muñoz, Cristian Riveros, Rodrigo Toro Icarte. 19091-19098 [doi]
- Formal Verification of Diffusion AuctionsRustam Galimullin, Munyque Mittelmann, Laurent Perrussel. 19099-19107 [doi]
- Matrix Editing Meets Fair Clustering: Parameterized Algorithms and ComplexityRobert Ganian, Hung P. Hoang 0001, Simon Wietheger. 19108-19116 [doi]
- Heterogeneous Graph Neural Networks for Assumption-Based ArgumentationPreesha Gehlot, Anna Rapberger, Fabrizio Russo 0002, Francesca Toni. 19117-19125 [doi]
- Non-Monotonic S4F Standpoint LogicPiotr Gorczyca, Hannes Strass. 19126-19134 [doi]
- The Correspondence Between Bounded Graph Neural Networks and Fragments of First-Order LogicBernardo Cuenca Grau, Eva Feng, Przemyslaw Andrzej Walega. 19135-19142 [doi]
- Extending Description Logics with Generic Concepts - the Case of TerminologiesJoshua Hirschbrunn, Yevgeny Kazakov. 19143-19151 [doi]
- Revisiting Conjunctive Query Entailment for SYazmín Ibáñez García, Jean Christoph Jung, Vincent Michielini, Filip Murlak. 19152-19159 [doi]
- Enumerating Minimal Unsatisfiable Cores of LTLf FormulaeAntonio Ielo, Giuseppe Mazzotta, Rafael Peñaloza, Francesco Ricca. 19160-19168 [doi]
- Enhancing Strategy Logic with Procedural RationalityRuiqi Jin 0002, Shuyi Li, Yongmei Liu 0001. 19169-19177 [doi]
- Causal, Strategic, and Combined Responsibility Attribution in Situation Calculus Concurrent Game StructuresMohammad Hossein Karimian, Shakil M. Khan 0001, Yves Lespérance. 19178-19188 [doi]
- Can You Tell the Difference? Contrastive Explanations for ABox EntailmentsPatrick Koopmann, Yasir Mahmood 0002, Axel-Cyrille Ngonga Ngomo, Balram Tiwari. 19189-19197 [doi]
- Tractable Weighted First-Order Model Counting with Bounded Treewidth Binary EvidenceVáclav Kula, Qipeng Kuang, Yuyi Wang 0001, Yuanhong Wang, Ondrej Kuzelka. 19198-19207 [doi]
- Robust Lazy Conflict Detection via Multi-Conflict Extraction and Genetic Diversity ControlViet Man Le, Lukas André Feldgrill, Alexander Felfernig. 19208-19215 [doi]
- Consistency-based Abductive Reasoning over Perceptual Errors of Multiple Pre-trained Models in Novel EnvironmentsMario A. Leiva, Noel Ngu, Joshua Shay Kricheli, Aditya Taparia, Ransalu Senanayake, Paulo Shakarian, Nathaniel D. Bastian, John Corcoran, Gerardo I. Simari. 19216-19223 [doi]
- SITA: A Framework for Structure-to-Instance Theorem AutoformalizationChenyi Li, Wanli Ma, Zichen Wang, Zaiwen Wen. 19224-19232 [doi]
- Discovering Latent Facts from Context to Construct Richer Open Knowledge GraphsJinpeng Li, Hang Yu 0006, Ziqi Ma, Peng Qi 0001. 19233-19241 [doi]
- Multi-Modal Fact Knowledge Generation for Imbalanced Cross-Source Entity AlignmentQian Li 0033, Cheng Ji 0001, Zhaoji Liang, Yuzheng Zhang, Zhuo Chen 0007, Siyuan Liang. 19242-19250 [doi]
- PCoKG: Personality-aware Commonsense Reasoning with DebateWeijie Li 0001, Zhongqing Wang, Guodong Zhou. 19251-19258 [doi]
- A Logical Analysis of an Information Filtering Architecture Based on Epistemic Trust InferenceXu Li 0037, Leendert van der Torre, Liuwen Yu. 19259-19266 [doi]
- From Dialogue to Destination: Geography-Aware Large Language Models with Multimodal Fusion for Conversational RecommendationYeming Li, Chenxi Liu 0003, Jie Zou 0001, Cheng Long 0001, Chaoning Zhang, Peng Wang 0023, Yang Yang 0002. 19267-19275 [doi]
- MyGram: Modality-aware Graph Transformer with Global Distribution for Multi-modal Entity AlignmentZhifei Li, Ziyue Qin, Xiangyu Luo, Xiaoju Hou, Yue Zhao, Miao Zhang, Zhifang Huang, Kui Xiao, Bing Yang. 19276-19284 [doi]
- KeenKT: Knowledge Mastery-State Disambiguation for Knowledge TracingZhifei Li, Lifan Chen, Jiali Yi, Xiaoju Hou, Yue Zhao, Wenxin Huang, Miao Zhang 0036, Kui Xiao, Bing Yang. 19285-19293 [doi]
- Structure-Aware Encodings of Argumentation Properties for Clique-widthYasir Mahmood 0002, Markus Hecher, Johanna Groven, Johannes Klaus Fichte. 19294-19302 [doi]
- Rational Revision of Group IntentionsNima Motamed, Natasha Alechina, Mehdi Dastani, Dragan Doder. 19303-19311 [doi]
- Variance Computation for Weighted Model Counting with Knowledge Compilation ApproachKengo Nakamura 0001, Masaaki Nishino, Norihito Yasuda. 19312-19320 [doi]
- Model Change for Description Logic ConceptsAna Ozaki, Jandson S. Ribeiro. 19321-19328 [doi]
- TFD-Net: Towards Intelligent Time-Frequency Mode Decomposition with Practical ApplicationsPingping Pan, Yunjian Zhang, Jinyi Liu. 19329-19336 [doi]
- Self-Supervised Inductive Logic ProgrammingStassa Patsantzis. 19337-19344 [doi]
- Aligning Cross-View Visual Geometries in LVLMs Through Human-Like Reasoning LearningYuming Qiao, Liang Luo, Dan Meng 0001, Yifan Yang, Qingyuan Wang, Juntuo Wang, Yuwei Zhang, Ru Zhen, Yanhao Zhang, Haonan Lu, Xudong Zhang. 19345-19353 [doi]
- Truth-Tracking Evaluation in Opinion-Based ArgumentationJuliete Rossie, Jérôme Delobelle, Sébastien Konieczny, Srdjan Vesic. 19354-19361 [doi]
- EchoEdit: Consistent Multi-Hop Question Answering via Ripple Control in Knowledge EditingJinwei Shi, Wenxuan Huang, Yu Xing, Yunhui Liu 0002, Tao Zheng, Bin Chong, Tieke He. 19362-19370 [doi]
- Description Logics with Two Types of Definite Descriptions: Complexity, Expressiveness, and Automated DeductionMichal Sochanski, Przemyslaw Andrzej Walega, Michal Zawidzki. 19371-19379 [doi]
- HierSearch: A Hierarchical Enterprise Deep Search Framework Integrating Local and Web SearchesJiejun Tan, Zhicheng Dou, Yan Yu, Jiehan Cheng, Lifeng Liu, Jian Xie, Jirong Wen. 19380-19388 [doi]
- Delphi: A Neuro-Symbolic Framework for Individualized, Safe and Interpretable Treatment RecommendationMuchan Tao, Haonan Qin, Yuqi Fang, Caifeng Shan, Tieniu Tan. 19389-19397 [doi]
- A Robust Unlearning Method with Adaptive Knowledge Guidance and Memory PreservationJingyuan Tian, Xiaofei Zhou. 19398-19405 [doi]
- A Knowledge Compilation Map for Quantum InformationLieuwe Vinkhuijzen, Tim Coopmans, Alfons Laarman. 19406-19414 [doi]
- Convergent Semantics for Weighted Bipolar ArgumentationZongshun Wang, Yuping Shen. 19415-19423 [doi]
- Encode Geometric Diagram as Geo-Graph in Geometry Problem SolvingWenjun Wu, Lingling Zhang 0005, Bo Zhao, Bo Li, Xinyu Zhang 0021, Yaqiang Wu. 19424-19432 [doi]
- Socrates or Smartypants: Testing Logic Reasoning Capabilities of Large Language Models with Logic Programming-Based Test OraclesZihao Xu, Junchen Ding, Yiling Lou, Kun Zhang, Dong Gong, Yuekang Li. 19433-19440 [doi]
- SpatialLogic-Bench: A Diagnostic Benchmark for Task-Oriented Spatiotemporal ReasoningXiaoda Yang, Shenzhou Gao, Can Wang, Jiahe Zhang, Menglan Tang, Jingyang Xue, Sheng Liu, Peijian Zhang, Yao Mu, Xiangyu Yue 0001. 19441-19449 [doi]
- Counterfactual Question Generation Uncovering Learner ContradictionsBo Zhang 0096, Hao Yu, Wenjie Dong, Yvhang Yang, Dezhuang Miao, Fengyi Song, Yanhui Gu, Xiaoming Zhang 0001, Junsheng Zhou. 19450-19457 [doi]
- Learning to Disentangle Latent Reasoning Rules with Language VAEs: A Systematic StudyYingji Zhang, Marco Valentino, Danilo S. Carvalho, André Freitas. 19458-19466 [doi]
- Incremental Maintenance of DatalogMTL MaterialisationsKaiyue Zhao, Dingqi Chen, Shaoyu Wang, Pan Hu 0001. 19467-19476 [doi]
- AgentODRL: A Large Language Model-based Multi-agent System for ODRL GenerationWanle Zhong, Keman Huang, Xiaoyong Du 0001. 19477-19485 [doi]
- FedGRPO: Privately Optimizing Foundation Models with Group-Relative Rewards from Domain ClientsGongxi Zhu, Hanlin Gu, Lixin Fan, Qiang Yang 0001, Yuxing Han 0001. 19487-19495 [doi]
- EfficientFlow: Efficient Equivariant Flow Policy Learning for Embodied AIJianlei Chang, Ruofeng Mei, Wei Ke, Xiangyu Xu. 19496-19504 [doi]
- Automatic Channel Pruning by Searching with Structure Embedding for Hash NetworkZifan Liu, Yuan Cao 0005, Yifan Sun, Yanwei Yu, Heng Qi. 19505-19513 [doi]
- DOS: Distilling Observable Softmaps of Zipfian Prototypes for Self-Supervised Point RepresentationMohamed Abdelsamad, Michael Ulrich, Bin Yang 0009, Miao Zhang 0043, Yakov Miron, Abhinav Valada. 19514-19523 [doi]
- Constrained Online Convex Optimization with Memory and PredictionsMohammed Abdullah, George Iosifidis, Salah-Eddine Elayoubi, Tijani Chahed. 19524-19532 [doi]
- Expressive Temporal Specifications for Reward MonitoringOmar Adalat, Francesco Belardinelli. 19533-19541 [doi]
- ProbLog4Fairness: A Neurosymbolic Approach to Modeling and Mitigating BiasRik Adriaensen, Lucas Van Praet, Jessa Bekker, Robin Manhaeve, Pieter Delobelle, Maarten Buyl. 19542-19550 [doi]
- RefiDiff: Progressive Refinement Diffusion for Efficient Missing Data ImputationMd. Atik Ahamed, Qiang Ye 0003, Qiang Cheng 0001. 19551-19559 [doi]
- Stabilizing Policy Gradient Methods via Reward ProfilingShihab Ahmed, El Houcine Bergou, Yue Wang, Aritra Dutta. 19560-19568 [doi]
- Expressive Power of Graph Transformers via LogicVeeti Ahvonen, Maurice Funk, Damian Heiman, Antti Kuusisto, Carsten Lutz. 19569-19579 [doi]
- PharmaQA: Prompt-Based Molecular Representation Learning via Pharmacophore-Oriented Question AnsweringChengwei Ai, Qiaozhen Meng, Mengwei Sun, Ruihan Dong, Hongpeng Yang, Shiqiang Ma, Xiaoyi Liu, Cheng Liang 0001, Fei Guo 0001. 19580-19588 [doi]
- PAGE: A Unified Approach for Federated Graph UnlearningYuming Ai, Xunkai Li, Jiaqi Chao, Bowen Fan, Zhengyu Wu, Yinlin Zhu, Rong-Hua Li, Guoren Wang. 19589-19597 [doi]
- InfoQ: Mixed-Precision Quantization via Global Information FlowMehmet Emre Akbulut, Hazem Hesham Yousef Shalby, Fabrizio Pittorino, Manuel Roveri. 19598-19606 [doi]
- Symmetric Aggregation of Conformity Scores for Efficient Uncertainty SetsNabil Alami, Jad Zakharia, Souhaib Ben Taieb. 19607-19614 [doi]
- Removing Box-Free Watermarks for Image-to-Image Models via Query-Based Reverse EngineeringHaonan An 0001, Guang Hua 0001, Hangcheng Cao, Zhengru Fang, Guowen Xu, Susanto Rahardja, Yuguang Fang. 19615-19622 [doi]
- FreDN: Spectral Disentanglement for Time Series Forecasting via Learnable Frequency DecompositionZhongde An, Jinhong You, Jiyanglin Li, Yiming Tang, Wen Li, Heming Du, Shouguo Du. 19623-19631 [doi]
- SOSControl: Enhancing Human Motion Generation Through Saliency-Aware Symbolic Orientation and Timing ControlHo Yin Au, Junkun Jiang, Jie Chen 0026. 19632-19639 [doi]
- Spectral Basis Learning for Expressive Graph Neural Networks in Link PredictionNiloofar Azizi, Nils M. Kriege, Nicholas J. A. Harvey, Horst Bischof. 19640-19648 [doi]
- Convergence of Fast Policy Iteration in Markov Games and Robust MDPsKeith Badger, Jefferson Huang, Marek Petrik. 19649-19656 [doi]
- Mechanistic Dissection of Cross-Attention Subspaces in Text-to-Image Diffusion ModelsJun-Hyun Bae, Wonyong Jo, Jaehyup Lee, Heechul Jung. 19657-19665 [doi]
- Medical Vision-Language Pretraining with LLM-Guided Temporal SupervisionLiang Bai 0001, Zhi Wang, Huimin Yan, Xian Yang 0001. 19666-19674 [doi]
- Multi-Level Domain Adaptation and Contrastive Domain Isolation with Bilinear Fusion for Patient Drug Response PredictionYuting Bai, Hanwen Lv, Wanwan Shi, Zhiyi Zou, Jiawei Luo 0001. 19675-19683 [doi]
- Collaborative Dual Representations for Semi-Supervised Partial Label LearningWei-Xuan Bao, Yong Rui, Min-Ling Zhang. 19684-19692 [doi]
- Revisiting (Un)Fairness in Recourse by Minimizing Worst-Case Social BurdenAinhize Barrainkua, Giovanni De Toni, José Antonio Lozano 0001, Novi Quadrianto. 19693-19701 [doi]
- Differentially Private Linear Programming: Reduced Sub-Optimality and Guaranteed Constraint SatisfactionAlexander Benvenuti, Brendan J. Bialy, Miriam E. Dennis, Matthew Hale 0001. 19702-19710 [doi]
- Shaping Without Tearing: Controllable Diffeomorphic Deformations for Topology-Preserving 3D Point Cloud AugmentationJian Bi, Qianliang Wu, Jianjun Qian, Lei Luo 0001, Jian Yang 0003. 19711-19719 [doi]
- Deep Clustering Based on Sparse Kolmogorov-Arnold Network and Spectral ConstraintZixuan Bi, Yang Zhao 0021, Ganchao Liu. 19720-19727 [doi]
- FedALT: Federated Fine-Tuning Through Adaptive Local Training with Rest-of-World LoRAJieming Bian, Lei Wang 0199, Letian Zhang, Jie Xu 0001. 19728-19736 [doi]
- DiffOP: Reinforcement Learning of Optimization-Based Control Policies via Implicit Policy GradientsYuexin Bian, Jie Feng 0006, Yuanyuan Shi. 19737-19745 [doi]
- MDBench: Benchmarking Data-Driven Methods for Model DiscoveryAmirmohammad Ziaei Bideh, Aleksandra Georgievska, Jonathan Gryak. 19746-19754 [doi]
- Condensed Data Expansion Using Model Inversion for Knowledge DistillationKuluhan Binici, Shivam Aggarwal, Cihan Acar, Nam Trung Pham, Karianto Leman, Gim Hee Lee, Tulika Mitra. 19755-19763 [doi]
- Asymptotic and Finite Sample Analysis of Nonexpansive Stochastic Approximations with Markovian NoiseEthan Blaser, Shangtong Zhang. 19764-19772 [doi]
- MechaFormer: Sequence Learning for Kinematic Mechanism Design AutomationDiana Bolanos, Mohammadmehdi Ataei, Pradeep Kumar Jayaraman. 19773-19780 [doi]
- Cancer Survival Prediction by Cyclic Generation and Multi-grained AlignmentYongqi Bu, Qinggang Niu, Zhen Li, Yanyu Xu, Jun Wang 0035, Guoxian Yu. 19781-19789 [doi]
- SafeMIL: Learning Offline Safe Imitation Policy from Non-Preferred TrajectoriesReturaj Burnwal, Nirav Pravinbhai Bhatt, Balaraman Ravindran. 19790-19798 [doi]
- Paths Not Taken: Structure-Based Pruning in PSDD Learning and InferenceCory J. Butz, Alejandro Santoscoy-Rivero, Camilla E. Lewis. 19799-19807 [doi]
- Constrained Best Arm Identification with Tests for FeasibilityTing Cai 0003, Kirthevasan Kandasamy. 19808-19816 [doi]
- HyperGOOD: Towards Out-of-Distribution Detection in HypergraphsTingyi Cai, Yunliang Jiang, Ming Li 0065, Changqin Huang, Yujie Fang, Chengling Gao, Zhonglong Zheng. 19817-19825 [doi]
- Rethinking Explanation Evaluation Under the Retraining SchemeYi Cai 0005, Thibaud Ardoin, Mayank Gulati, Gerhard Wunder. 19826-19834 [doi]
- HeadHunt-VAD: Hunting Robust Anomaly-Sensitive Heads in MLLM for Tuning-Free Video Anomaly DetectionZhaolin Cai, Fan Li 0003, Ziwei Zheng, Haixia Bi, Lijun He 0001. 19835-19843 [doi]
- Stabilizing Self-Consuming Diffusion Models with Latent Space FilteringZhongteng Cai, Yaxuan Wang, Yang Liu, Xueru Zhang. 19844-19852 [doi]
- Spiking Heterogeneous Graph Attention NetworksBuqing Cao, Qian Peng, Xiang Xie, Liang Chen 0001, Min Shi 0001, Jianxun Liu 0001. 19853-19861 [doi]
- ProCache: Constraint-Aware Feature Caching with Selective Computation for Diffusion Transformer AccelerationFanpu Cao, Yaofo Chen, Zeng You, Wei Luo. 19862-19870 [doi]
- PITE: Multi-Prototype Alignment for Individual Treatment Effect EstimationFuyuan Cao, Jiaxuan Zhang, Xiaoli Li 0001. 19871-19879 [doi]
- Causality-Aware Efficient Exploration for Cooperative Multi-Agent Reinforcement LearningHongye Cao, Tianpei Yang, Fan Feng, Hammadi Rafik Ouariachi, Yali Du 0001, Meng Fang, Jing Huo, Yang Gao 0001. 19880-19888 [doi]
- Provably Efficient Multi-Objective Bandit Algorithms Under Preference-Centric CustomizationLinfeng Cao, Ming Shi 0003, Ness B. Shroff. 19889-19897 [doi]
- PPFL: A Parameter Behavior-Driven Plug-in Personalization Engine for Federated LearningQianyue Cao, Zongwei Zhu, Zirui Lian, Rui Zhang, Boyu Li 0006, Yi Xiong 0003, Xuehai Zhou. 19898-19906 [doi]
- Unpacking the Implicit Norm Dynamics of Sharpness-Aware Minimization in Tensorized ModelsTianxiao Cao, Kyohei Atarashi, Hisashi Kashima. 19907-19915 [doi]
- LUCID: Learning-Enabled Uncertainty-Aware Certification of Stochastic Dynamical SystemsErnesto Casablanca, Oliver Schön, Paolo Zuliani, Sadegh Soudjani. 19916-19924 [doi]
- Covariance Scattering TransformsAndrea Cavallo, Ayushman Raghuvanshi, Sundeep Prabhakar Chepuri, Elvin Isufi. 19925-19933 [doi]
- Generalizing Fair Clustering to Multiple Groups: Algorithms and ApplicationsDiptarka Chakraborty, Kushagra Chatterjee, Debarati Das 0001, Tien Long Nguyen. 19934-19942 [doi]
- PADiff: Predictive and Adaptive Diffusion Policies for Ad Hoc TeamworkHohei Chan, Xinzhi Zhang 0009, Antao Xiang, Weinan Zhang 0001, Mengchen Zhao. 19943-19951 [doi]
- Enhancing Medical Large Vision-Language Models via Alignment DistillationAofei Chang, Ting Wang 0006, Fenglong Ma. 19952-19960 [doi]
- D3ToM: Decider-Guided Dynamic Token Merging for Accelerating Diffusion MLLMsShuochen Chang, Xiaofeng Zhang, Qingyang Liu, Li Niu. 19961-19969 [doi]
- Benchmarking Reinforcement Learning Algorithms for ICU Ventilator Settings: An Interpretable and Probabilistic Patient Environment for Doctor AgentsYa-Hsi Chang, Po-Chih Kuo. 19970-19977 [doi]
- LoRA in LoRA: Towards Parameter-Efficient Architecture Expansion for Continual Visual Instruction TuningChang Che, Ziqi Wang, Pengwan Yang, Cheems Wang, Hui Ma 0011, Zenglin Shi. 19978-19986 [doi]
- Dropout Prompt Learning: Towards Robust and Adaptive Vision-Language ModelsBiao Chen, Lin Zuo, Mengmeng Jing, Kunbin He, Yuchen Wang. 19987-19995 [doi]
- Extendable Planning via Multiscale DiffusionChang Chen, Hany Hamed, Doojin Baek, Taegu Kang, Samyeul Noh, Yoshua Bengio, Sungjin Ahn. 19996-20004 [doi]
- Edge Self-Adversarial Augmentation Enhances Graph Contrastive Learning Against Neighborhood InconsistencyChunchun Chen, Xing Wei, Jiayi Yang, Chenrun Wang, Yiwei Fu, Yuxing Zhang, Xin Sun 0003, Rui Fan 0001, Wei Ye 0001. 20005-20013 [doi]
- Cross-Domain Few-Shot Learning via Multi-View Collaborative Optimization with Vision-Language ModelsDexia Chen, Wentao Zhang 0005, Qianjie Zhu, Ping Hu, Weibing Li, Tong Zhang 0017, Ruixuan Wang. 20014-20022 [doi]
- OmniSparse: Training-Aware Fine-Grained Sparse Attention for Long-Video MLLMsFeng Chen, Yefei He, Shaoxuan He, Yuanyu He, Jing Liu 0048, Lequan Lin, Akide Liu, Zhaoyang Li, Jiyuan Zhang, Zhenbang Sun, Bohan Zhuang, Qi Wu 0001. 20023-20031 [doi]
- GINO-Q: Learning an Asymptotically Optimal Index Policy for Restless Multi-armed BanditsGongpu Chen, Soung Chang Liew, Deniz Gündüz. 20032-20040 [doi]
- VCGD: Visual Clue Guided Decoding with Caption Model for Mitigating Hallucination in Multimodal Large Language ModelsGuoqing Chen, Fu Zhang 0001, Bingqian Liu, Chenglong Lu, Jingwei Cheng. 20041-20049 [doi]
- Causality-Aligned Semantic Recovery for Incomplete Cross-Modal RetrievalHaipeng Chen 0002, Yu Liu 0004, Xun Yang 0001, Yuheng Liang, Yingda Lyu. 20050-20058 [doi]
- MS-PPO: Mean Standard Deviation Proximal Policy Optimization for Reliable Parking Space Search in Structured EnvironmentsHaoming Chen, Hongliang Guo. 20059-20066 [doi]
- Decentralized Non-convex Stochastic Optimization with Heterogeneous VarianceHongxu Chen, Ke Wei 0001, Luo Luo. 20067-20075 [doi]
- PROMISE: Prompt-Attentive Hierarchical Contrastive Learning for Robust Cross-Modal Representation with Missing ModalitiesJiajun Chen, Sai Cheng, Yutao Yuan, Yirui Zhang, Haitao Yuan, Peng Peng, Yi Zhong. 20076-20082 [doi]
- Do We Truly Need So Many Samples? Multi-LLM Repeated Sampling Efficiently Scales Test-Time ComputeJianhao Chen 0001, Zishuo Xun, Bocheng Zhou, Han Qi, Hangfan Zhang, Qiaosheng Zhang 0002, Yang Chen, Wei Hu, Yuzhong Qu, Shuyue Hu. 20083-20091 [doi]
- MARS: A Meta-Adaptive Reinforcement Learning Framework for Risk-Aware Multi-Agent Portfolio ManagementJiayi Chen, Jing Li 0025, Guiling Wang 0001. 20092-20099 [doi]
- Towards Multiple Missing Values-resistant Unsupervised Graph Anomaly DetectionJiazhen Chen, Xiuqin Liang, Sichao Fu, Zheng Ma 0011, Weihua Ou. 20100-20108 [doi]
- Conditional Distribution Learning for Graph ClassificationJie Chen 0065, Hua Mao 0001, Chuanbin Liu 0003, Zhu Wang 0007, Xi Peng 0001. 20109-20117 [doi]
- Offline Fictitious Self-Play for Competitive GamesJingxiao Chen, Weiji Xie, Weinan Zhang 0001, Yong Yu 0001, Ying Wen 0001. 20118-20126 [doi]
- DC-SPAN: A Dual Contrastive Attention Network for Multi-View ClusteringJingyi Chen, Zhibin Dong, Tiejun Li, Yibo Han. 20127-20135 [doi]
- MathSE: Improving Multimodal Mathematical Reasoning via Self-Evolving Iterative Reflection and Reward-Guided Fine-TuningJinhao Chen, Zhen Yang 0034, Jianxin Shi 0004, Tianyu Wo, Jie Tang 0001. 20136-20144 [doi]
- Pseudo Multi-view K-means ClusteringJinqian Chen, Jihua Zhu, Haoyu Tang 0002, Qinghai Zheng. 20145-20153 [doi]
- Sample-specific Modality Diagnosis and Cross-modal Enhancement for Incomplete Multimodal RepresentationsJunsong Chen, Jiyuan Liu 0003, Suyuan Liu, Wei Zhang 0049, Ao Li 0002, En Zhu, Xinwang Liu 0002. 20154-20162 [doi]
- Zero-shot Recommendation: Towards Class Semantic Relation Learning for Inferring Labels of Unseen Micro-videosJunyang Chen 0001, Huan Wang 0005, Yirui Wu, Qiuzhen Lin, Yunfeng Diao, Junkai Ji. 20163-20171 [doi]
- OmniDPO: A Preference Optimization Framework to Address Omni-Modal HallucinationJunzhe Chen 0001, Tianshu Zhang 0002, Shiyu Huang 0001, Yuwei Niu, Chao Sun, Rongzhou Zhang, Guanyu Zhou, Lijie Wen 0001. 20172-20180 [doi]
- SAPO: Self-Adaptive Process Optimization Makes Small Reasoners StrongerKaiYuan Chen, Guangmin Zheng 0001, Jin Wang 0008, Xiaobing Zhou, Xuejie Zhang 0002. 20181-20189 [doi]
- FIRM-MoE: Fine-Grained Expert Decomposition for Resource-Adaptive MoE InferenceKeyu Chen, Qihang Zhou, Bin Qian 0002, Zhenyu Wen, Wenchao Meng, Shibo He. 20190-20198 [doi]
- ChartEditor: A Reinforcement Learning Framework for Robust Chart EditingLiangyu Chen 0008, Yichen Xu, Jianzhe Ma, Yuqi Liu 0003, Donglu Yang, Liang Zhang, Zihao Yue, Wenxuan Wang 0001, Qin Jin. 20199-20207 [doi]
- Generating In-Distribution Counterfactual Explanation for Graph Neural NetworksLinmao Chen, Chaobo He, Junwei Cheng, Chunying Li, Quanlong Guan. 20208-20216 [doi]
- Spatial-Frequency Spiking Neural Network for Underwater Object DetectionLong Chen 0019, Wei Miao, Xin Gao, Yunzhi Zhuge, Hongming Xu 0002, Yaxin Li, Qi Xu 0008. 20217-20225 [doi]
- Connectivity-Guided Sparsification of 2-FWL GNNs: Preserving Full Expressivity with Improved EfficiencyRongqin Chen 0001, Fan Mo 0002, Pak Lon Ip, Shenghui Zhang, Dan Wu 0002, Ye Li 0002, Leong Hou U. 20226-20234 [doi]
- Extracting Multimodal Learngene in CLIP: Unveiling the Multimodal Generalizable KnowledgeRuiming Chen, Junming Yang, Shiyu Xia, Xu Yang 0021, Xin Geng 0001. 20235-20243 [doi]
- OTARo: Once Tuning for All Precisions Toward Robust On-Device LLMsShaoyuan Chen, Zhixuan Chen, Dawei Yang, Zhihang Yuan, Qiang Wu 0012. 20244-20252 [doi]
- FedMerge: Federated Model Merging for PersonalizationShutong Chen, Tianyi Zhou 0001, Guodong Long, Jing Jiang 0002, Chengqi Zhang. 20253-20261 [doi]
- When Top-ranked Recommendations Fail: Modeling Multi-Granular Negative Feedback for Explainable and Robust Video RecommendationSiran Chen, Boyu Chen, Chenyun Yu, Yi Ouyang, Lei Cheng 0005, Chengxiang Zhuo, Zang Li, Yali Wang. 20262-20270 [doi]
- Piercing the Fog: Disentangling Key Features for Vision Models in Multi-Degradation ScenariosSiyu Chen, Shiqiang Ma, Fei Guo 0001. 20271-20279 [doi]
- Horizontal and Vertical Federated Causal Structure Learning via Higher-order CumulantsWei Chen 0103, Wanyang Gu, Linjun Peng, Ting Yan, Ruichu Cai, Zhifeng Hao, Kun Zhang 0001. 20280-20288 [doi]
- Geometry-Aware Variational Information Maximization for Deep Incomplete Multi-view ClusteringWenlan Chen, Lu Gao, Daoyuan Wang, Fei Guo 0001, Cheng Liang 0001. 20289-20297 [doi]
- GCA: Geometry-aware Conditional Alignment for Partial Domain Adaptation with Coding Rate ReductionXiaohui Chen, Chuan-Xian Ren. 20298-20306 [doi]
- G-IR: Geometric Image Representation for LearningXin Chen, Qi Zhao, Wei Zeng, ZongBen Xu. 20307-20315 [doi]
- Prune&Comp: Free Lunch for Layer-Pruned LLMs via Iterative Pruning with Magnitude CompensationXinrui Chen 0001, Hongxing Zhang, Fanyi Zeng, Yongxian Wei, Yizhi Wang, Xitong Ling, Guanghao Li 0003, Chun Yuan 0003. 20316-20324 [doi]
- XLinear: A Lightweight and Accurate MLP-Based Model for Long-Term Time Series Forecasting with Exogenous InputsXinyang Chen, Huidong Jin 0001, Yu Huang, Zaiwen Feng. 20325-20335 [doi]
- Scaling Law Analysis in Federated Learning: How to Select the Optimal Model Size?Xuanyu Chen, Nan Yang, Shuai Wang, Dong Yuan 0001. 20336-20344 [doi]
- F2SST: Frequency-to-Spatial Semantic Transfer for Few-Shot Image ClassificationXueyi Chen, Bangjun Wang, Jiaqing Fan, Li Zhang 0004, Fanzhang Li. 20345-20353 [doi]
- Beyond Sharpness: A Flatness Decomposition Framework for Efficient Continual LearningYanan Chen, Tieliang Gong, Yunjiao Zhang, Wen Wen. 20354-20362 [doi]
- Active Multi-source Domain Adaptation for Multimodal Fake News DetectionYanping Chen, Weijie Shi, Mengze Li 0001, Yue Cui 0001, Jiaming Li, Ruiyuan Zhang, Hao Chen, Hanghui Guo, Shimin Di, Ziyi Liu, Jia Zhu 0003, Jiajie Xu 0001. 20363-20371 [doi]
- SchellingFormer: Laplacian Matrix-guided Geometric Transformer for Robust Schelling Point DetectionYihao Chen, Haobo Jiang, Liang Yu, Jianmin Zheng. 20372-20380 [doi]
- Detecting Unobserved Confounders: A Kernelized Regression ApproachYikai Chen, Yunxin Mao, Chunyuan Zheng 0001, Hao Zou 0001, Shanzhi Gu, Shixuan Liu, Yang Shi 0009, Wenjing Yang 0002, Kun Kuang 0001, Haotian Wang 0001. 20381-20389 [doi]
- Enhancing Kernel Power $K$-means: Scalable and Robust Clustering with Random Fourier Features and Possibilistic MethodYixi Chen, Weixuan Liang, Tianrui Liu, Jun-Jie Huang, Ao Li, Xueling Zhu, Xinwang Liu 0002. 20390-20398 [doi]
- Diffusion Model Based Signal Recovery Under 1-Bit QuantizationYouming Chen, Zhaoqiang Liu. 20400-20408 [doi]
- AV-SSAN: Audio-Visual Selective DOA Estimation Through Explicit Multi-Band Semantic-Spatial AlignmentYu Chen, Hongxu Zhu, Jiadong Wang, Kainan Chen, Xinyuan Qian 0001. 20409-20417 [doi]
- FedCure: Mitigating Participation Bias in Semi-Asynchronous Federated Learning with Non-IID DataYue Chen, Jianfeng Lu 0002, Shuqin Cao, Wei Wang 0170, Gang Li 0028, Guanghui Wen. 20418-20426 [doi]
- Multi-Modal Style Transfer-based Prompt Tuning for Efficient Federated Domain GeneralizationYuliang Chen, Xi Lin 0003, Jun Wu 0001, Xiangrui Cai, Qiaolun Zhang, Xichun Fan, Jiapeng Xu, Xiu Su. 20427-20435 [doi]
- SIFThinker: Spatially-Aware Image Focus for Visual ReasoningZhangquan Chen, Ruihui Zhao, Chuwei Luo, Mingze Sun, Xinlei Yu, Yangyang Kang, Ruqi Huang. 20436-20444 [doi]
- Gated Variational Graph Autoencoders as Experts with Competition and Consensus for Multi-view ClusteringZhaoliang Chen, William K. Cheung 0001, Hong-Ning Dai, Byron Choi, Jiming Liu 0001. 20445-20453 [doi]
- Combining LLM Semantic Reasoning with GNN Structural Modeling for Multi-View Multi-Label Feature SelectionZhiqi Chen, Yuzhou Liu, Jiarui Liu, Wanfu Gao. 20454-20462 [doi]
- INTENT: Invariance and Discrimination-aware Noise Mitigation for Robust Composed Image RetrievalZhiwei Chen 0003, Yupeng Hu 0003, Zhiheng Fu, Zixu Li 0001, Jiale Huang, Qinlei Huang, Yinwei Wei. 20463-20471 [doi]
- CoGenSAM: Codebook-Interactive Generative Labeling for Adapting SAM to Crack SegmentationZhuangzhuang Chen, Nuo Chen, Dachong Li, Zhiliang Lin, Xingyu Feng, Yifan Zhang, Jie Chen 0027, Jianqiang Li 0001. 20472-20480 [doi]
- Explanation-Preserving Augmentation for Semi-Supervised Graph Representation LearningZhuomin Chen, Jingchao Ni, Hojat Allah Salehi, Xu Zheng 0003, Esteban Schafir, Farhad Shirani 0001, Dongsheng Luo. 20481-20489 [doi]
- LSHFed: Robust and Communication-Efficient Federated Learning with Locally-Sensitive Hashing Gradient MappingGuanjie Cheng, Mengzhen Yang, Xinkui Zhao, Shuyi Yu, Tianyu Du, Yangyang Wu, Mengying Zhu, ShuiGuang Deng. 20490-20498 [doi]
- Personalized Federated Learning with Bidirectional Communication Compression via One-Bit Random SketchingJiacheng Cheng, Xu Zhang 0011, Guanghui Qiu, Yifang Zhang, Yinchuan Li, Kaiyuan Feng. 20499-20508 [doi]
- Sample-and-Search: An Effective Algorithm for Learning-Augmented k-Median Clustering in High DimensionsKangke Cheng, Shihong Song, Guanlin Mo, Hu Ding 0003. 20509-20517 [doi]
- DesireKV: Decoupling Sensitivity and Importance for Reasoning-Aware KV Cache CompressionPengyu Cheng, Jiacheng Wang, Tianle Chen, Bei Liu, Xiaofeng Hou, Jiacheng Liu 0001. 20518-20526 [doi]
- Score-Based Model for Low-Rank Tensor RecoveryZhengyun Cheng, Changhao Wang, Guanwen Zhang, Yi Xu 0008, Wei Zhou 0020, Xiangyang Ji. 20527-20535 [doi]
- Best Arm Identification with Biased ContextsJames Cheshire, Stéphan Clémençon. 20536-20543 [doi]
- Departures: Distributional Transport for Single-Cell Perturbation Prediction with Neural Schrödinger BridgesChangxi Chi, Yufei Huang 0002, Jun Xia 0001, Jiangbin Zheng 0002, Yunfan Liu 0002, Zelin Zang, Stan Z. Li. 20544-20552 [doi]
- Are Graph Transformers Necessary? Efficient Long-Range Message Passing with Fractal Nodes in MPNNsJeongwhan Choi 0002, Seungjun Park, Sumin Park, Sung-Bae Cho, Noseong Park. 20553-20561 [doi]
- TabGeoFlow: A Geometric Flow Matching Model for Tabular Data SynthesisJong-In Choi. 20562-20569 [doi]
- Sheaf Graph Neural Networks via PAC-Bayes Spectral OptimizationYoonhyuk Choi, Jiho Choi, Taewook Ko, Jongwook Kim, Chong-kwon Kim. 20570-20578 [doi]
- Let the Void Be Void: Robust Open-Set Semi-Supervised Learning via Selective Non-AlignmentYou Rim Choi, Subeom Park, Seojun Heo, Eunchung Noh, Hyung-Sin Kim. 20579-20587 [doi]
- LoGIC: Multi-LoRA Guided Importance Consensus for Multi-Task Pruning in Vision TransformersYu-Hong Chou, Rui Fang 0002, Hsi-Wen Chen, Ming-Syan Chen. 20588-20596 [doi]
- T3Time: Tri-Modal Time Series Forecasting via Adaptive Multi-Head Alignment and Residual FusionAbdul Monaf Chowdhury, Rabeya Akter 0001, Safaeid Hossain Arib. 20597-20605 [doi]
- Convex Clustering Redefined: Robust Learning with the Median of Means EstimatorKoustav Chowdhury, Bibhabasu Mandal, Sourav De, Sagar Ghosh, Swagatam Das, Debolina Paul, Saptarshi Chakraborty. 20606-20614 [doi]
- Towards Robust Edge Model Adaptation via Elastic Architecture SearchXianhang Chu, Xu Yang 0019, Kun Wei, Xi Wang. 20615-20623 [doi]
- Robust Watermarking on Gradient Boosting Decision TreesJun Woo Chung, Yingjie Lao, Weijie Zhao 0001. 20624-20633 [doi]
- CATAL: Causally Disentangled Task Representation Learning for Offline Meta-Reinforcement LearningShan Cong, Chao Yu, Xiangyuan Lan. 20634-20641 [doi]
- Causal Discovery from Interval-Based Event SequencesLénaïg Cornanguer, Joscha Cüppers, Jilles Vreeken. 20642-20649 [doi]
- Linear Time Algorithms for Individually Fair k-means via Multi-Swap Local SearchBeirong Cui, Qilong Feng, Junyu Huang. 20650-20657 [doi]
- ShaLa: Multimodal Shared Latent Generative ModellingJiali Cui, Yan-Ying Chen, Yanxia Zhang, Matthew Klenk 0001. 20658-20666 [doi]
- Learning Systems Expansion with Efficient Heterogeneity-aware Knowledge TransferGaole Dai, Huatao Xu, Yifan Yang 0004, Rui Tan 0001, Mo Li 0001. 20667-20675 [doi]
- MemoryART: Enhancing LLMs via Multi-Memory Models with Adaptive Resonance Theory for Healthcare AgentsRenke Dai, Hebin Hu, Jiahui Zhang, Yilin Kang 0001, Ah-Hwee Tan. 20676-20683 [doi]
- Veli: Unsupervised Method and Unified Benchmark for Low-Cost Air Quality Sensor CorrectionYahia Dalbah, Marcel Worring, Yen-Chia Hsu. 20684-20692 [doi]
- Correspondence Coverage Matters for Multi-Modal Dataset DistillationZhuohang Dang, Minnan Luo, Chengyou Jia, Hangwei Qian, Xinyu Zhang 0021, Xiaojun Chang, Ivor W. Tsang. 20693-20701 [doi]
- Statistically Robust Sparse High-order Interaction ModelDiptesh Das, Ichiro Takeuchi, Koji Tsuda. 20702-20710 [doi]
- Intervention-Aware Time Series Modeling: Capturing and Evaluating Feature DependenciesIbrahim Delibasoglu, Sanjay Chakraborty, Fredrik Heintz, Mattias Tiger. 20711-20718 [doi]
- FedSkeleton: Secure Multi-Party Graph Skeleton Construction for Privacy-Preserving Federated Time-Series ForecastingHenggang Deng, Yuchao Tang, Wenjie Fu 0005, Huandong Wang, Kun Chen, Tao Jiang 0002. 20719-20727 [doi]
- Efficiently Seeking Flat Minima for Better Generalization in Fine-Tuning Large Language Models and BeyondJiaxin Deng, Qingcheng Zhu, Junbiao Pang, Linlin Yang, Zhongqian Fu, Baochang Zhang 0001. 20728-20736 [doi]
- InfoDecom: Decomposing Information for Defending Against Privacy Leakage in Split InferenceRuijun Deng, Zhihui Lu 0002, Qiang Duan 0002. 20737-20745 [doi]
- Random Amalgamation of Adapters for Flatter Loss Landscapes: Towards Class-Incremental Learning with Better StabilityYao Deng, Xiang Xiang 0001, Jiaqi Gui. 20746-20754 [doi]
- Neural Tangent Kernels Under Stochastic Data AugmentationJoshua DeOliveira, Sajal Chakroborty, Walter Gerych, Elke A. Rundensteiner. 20755-20762 [doi]
- Demystifying Foreground-Background Memorization in Diffusion ModelsJimmy Z. Di, Yiwei Lu 0001, Yaoliang Yu, Gautam Kamath 0001, Adam Dziedzic, Franziska Boenisch. 20763-20771 [doi]
- CIP-Net: Continual Interpretable Prototype-based NetworkFederico Di Valerio, Michela Proietti, Alessio Ragno, Roberto Capobianco. 20772-20780 [doi]
- SimDiff: Simpler Yet Better Diffusion Model for Time Series Point ForecastingHang Ding, Xue Wang 0010, Tian Zhou 0004, Tao Yao. 20781-20789 [doi]
- TimeMosaic: Temporal Heterogeneity Guided Time Series Forecasting via Adaptive Granularity Patch and Segment-wise DecodingKuiye Ding, Fanda Fan, Chunyi Hou, Zheya Wang, Lei Wang 0004, Zhengxin Yang, Jianfeng Zhan. 20790-20798 [doi]
- Communication-efficient Multi-Agent Reinforcement Learning with Spatiotemporal Information HubLing Ding, Tianbai Lyu, Zhiliang Bi, Hao Wang 0013, Shanshan Feng 0001, Wei Yu. 20799-20807 [doi]
- PrefixGPT: Prefix Adder Optimization by a Generative Pre-trained TransformerRuogu Ding, Xin-ning, Ulf Schlichtmann, Weikang Qian. 20808-20815 [doi]
- Learning Time in Static ClassifiersXi Ding 0001, Lei Wang 0108, Piotr Koniusz, Yongsheng Gao 0001. 20816-20825 [doi]
- Sliding-Window Merging for Compacting Patch-Redundant Layers in LLMsXuan Ding, Rui Sun, Yunjian Zhang, Xiu Yan, Yueqi Zhou, Kaihao Huang, Suzhong Fu, Angelica I. Avilés-Rivero, Chuanlong Xie, Yao Zhu 0003. 20826-20834 [doi]
- RAC-DMVC: Reliability-Aware Contrastive Deep Multi-View Clustering Under Multi-Source NoiseShihao Dong, Yue Liu 0008, Xiaotong Zhou, Yuhui Zheng, Huiying Xu, Xinzhong Zhu. 20835-20843 [doi]
- Accelerating LLM Inference Throughput via Asynchronous KV Cache PrefetchingYanhao Dong, Yubo Miao, Weinan Li, Xiao Zheng, Chao Wang, Jiesheng Wu, Feng Lyu 0001. 20844-20851 [doi]
- EnViT: Enhancing the Performance of Early-Exit Vision Transformers via Exit-Aware Structured Dropout-Enabled Self-DistillationYonghao Dong, Qiang He 0001, Penghong Rui, Zhenzhe Zheng, Zhao Li, Feifei Chen 0001, Hai Jin 0001, Yun Yang 0001. 20852-20860 [doi]
- Learning Dynamics as Feedback: An Adaptive Entropy Flow Dynamics Framework for Long-tailed Human Action RecognitionYuan Dong, Zhe Zhao 0008, Liheng Yu, Di Wu 0057, Pengkun Wang 0001. 20861-20869 [doi]
- Constrained Particle Seeking: Solving Diffusion Inverse Problems with Just Forward PassesHongkun Dou, Zike Chen, ZeYu Li, Hongjue Li, Lijun Yang, Yue Deng 0001. 20870-20878 [doi]
- DSCF: Dual-Source Counterfactual Fusion for High-Dimensional Combinatorial InterventionsJitong Dou, Lingrui Luo, Bing Zhu, Hengliang Luo, Mingjun Zhong, Yurong Cheng. 20879-20886 [doi]
- Closer to Biological Mechanism: Drug-Drug Interaction Prediction from the Perspective of PharmacophoreMingliang Dou, Linfeng Wen, Jinyang Xie, Jijun Tang, Shiqiang Ma, Fei Guo 0001. 20887-20895 [doi]
- Predicting the Future by Retrieving the PastDazhao Du, Tao Han 0002, Song Guo 0001. 20896-20904 [doi]
- GCIB: Causal Intervention Guided Graph Information Bottleneck FrameworkHangyuan Du, Rong Wang, Lixin Cui, Gaoxia Jiang, Liang Bai 0001, Wenjian Wang. 20905-20913 [doi]
- LGAN: An Efficient High-Order Graph Neural Network via the Line Graph AggregationLin Du, Lu Bai 0001, Jincheng Li 0004, Lixin Cui, Hangyuan Du, Lichi Zhang, Yuting Chen, Zhao Li 0007. 20914-20922 [doi]
- Forget What Has Seen: Selective Concept Unlearning in Segmentation Foundation ModelsMiaozeng Du, Jiaqi Li 0031, Sirui Pan, Yi Zhan, Guilin Qi, Yuxin Zhang, Rihui Jin, Yinjia Shu, Qianshan Wei. 20923-20931 [doi]
- Flexible Concept Bottleneck ModelXingbo Du, Qiantong Dou, Lei Fan 0007, Rui Zhang. 20932-20940 [doi]
- Deep Incomplete Multi-View Clustering via Hierarchical Imputation and AlignmentYiming Du, Ziyu Wang, Jian Li, Rui Ning, Lusi Li. 20941-20949 [doi]
- Learning Intrinsic Hierarchy for Generalized Category DiscoveryYu Duan 0001, Junzhi He, Zhanxuan Hu, Mengda Ji, Rong Wang 0001, Quanxue Gao. 20950-20958 [doi]
- K-ProtoDiff: Key Prototypes-Guided Diffusion for Time Series GenerationYuhang Duan, Lin Lin 0008, Xiaoshuai Wu. 20959-20967 [doi]
- FlorE: Integrating Full Lorentz Group and Directional Offsets for Effective Knowledge Graph EmbeddingZehua Duo, Jiang Li 0013, Xiangdong Su, Guanglai Gao. 20968-20976 [doi]
- Simulation-Driven Railway Delay Prediction: An Imitation Learning ApproachClément Elliker, Jesse Read, Sonia Vanier, Albert Bifet. 20977-20984 [doi]
- Decomposing and Composing: Towards Efficient Vision-Language Continual Learning via Rank-1 Expert Pool in a Single LoRAZhan Fa, Yue Duan, Jian Zhang 0090, Lei Qi 0001, Wanqi Yang, Yinghuan Shi. 20985-20993 [doi]
- Efficient Multimodal Large Language Model via Dynamic KV Cache QuantizationJiahao Fan, Chien-Ming Chen. 20994-21001 [doi]
- Online Cross-Modal Hashing with Expanding Label SpaceWentao Fan 0003, Chao Zhang 0078, Chunlin Chen 0001, Huaxiong Li. 21002-21010 [doi]
- MoETTA: Test-Time Adaptation Under Mixed Distribution Shifts with MoE-LayerNormXiao Fan, Jingyan Jiang, Zhaoru Chen, Fanding Huang, Xiao Chen, Qinting Jiang, Bowen Zhang, Xing Tang, Zhi Wang. 21011-21019 [doi]
- TFRank: Think-Free Reasoning Enables Practical Pointwise LLM RankingYongqi Fan, Xiaoyang Chen 0001, Dezhi Ye, Jie Liu 0075, Haijin Liang, Jin Ma 0003, Ben He 0001, Yingfei Sun, Tong Ruan. 21020-21028 [doi]
- Hierarchical Structure-Property Alignment for Data-Efficient Molecular Generation and EditingZiyu Fan, Zhijian Huang, Yahan Li, Xiaowen Hu, Siyuan Shen, Yunliang Wang, Zeyu Zhong, Shuhong Liu, Shuning Yang, Shangqian Wu, Min Wu 0008, Lei Deng 0002. 21029-21037 [doi]
- Towards Illumination-Aware Restoration of Metalens-Captured Images: A New Dataset and a Strong BaselineFen Fang, Xinan Liang, Muli Yang, Jinghong Zheng 0001, Tobias Wilhelm W. Mass, Ying Sun 0001, XuLei Yang, Xuewu Xu, Zhengguo Li. 21038-21046 [doi]
- Graph Domain Adaptation via Homophily-Agnostic Reconstructing StructureRuiyi Fang, Shuo Wang, Ruizhi Pu, Qiuhao Zeng, Hao Zheng 0009, Ziyan Wang, Jiale Cai, Zhimin Mei, Song Tang, Charles Ling 0001, Boyu Wang 0004. 21047-21055 [doi]
- To Align or Not to Align: Strategic Multimodal Representation Alignment for Optimal PerformanceWanlong Fang, Tianle Zhang, Alvin Chan. 21056-21064 [doi]
- Goal-Oriented Time-Series Forecasting: Foundation Framework DesignLuca-Andrei Fechete, Mohamed Sana, Fadhel Ayed, Nicola Piovesan, Wenjie Li 0001, Antonio De Domenico, Tareq Si Salem. 21065-21073 [doi]
- WIET: Harmonizing Group-aware Model Weighting and Worker Allocation for Ensemble Temporal Prediction MaaSBinbin Feng, Shikun He, Yingxin Wang, Pengwei Wang 0001, Xiang Gao, Zhijun Ding. 21074-21082 [doi]
- Federated Incomplete Multi-View Clustering with Tensorized Low-Rank ConstraintWei Feng 0010, Danting Liu, Qianqian Wang 0001, Mengping Jiang, Bin Liu. 21083-21091 [doi]
- Behavior Tokens Speak Louder: Disentangled Explainable Recommendation with Behavior VocabularyXinshun Feng, Mingzhe Liu 0002, Yi Qiao, Tongyu Zhu, Leilei Sun, Shuai Wang. 21092-21100 [doi]
- Adaptive and Asymptotic Mean-based Subclass Discriminant AnalysisYuzhe Feng, Yunlong Gao 0001, Feiping Nie 0001. 21101-21110 [doi]
- EM-KD: Distilling Efficient Multimodal Large Language Model with Unbalanced Vision TokensZe Feng, Sen Yang, Boqiang Duan, Wankou Yang, Jingdong Wang 0001. 21111-21119 [doi]
- Statistical Learning Theory for Distributional ClassificationChristian Fiedler. 21120-21127 [doi]
- DeToNATION: Decoupled Torch Network-Aware Training on Interlinked Online NodesMogens Henrik From, Jacob Nielsen, Lukas Galke, Peter Schneider-Kamp. 21128-21135 [doi]
- ENHash: Error Notebook-Guided Fine-Grained Learning for Unsupervised Cross-Modal HashingHao Fu 0020, Zebing Yao, Chuangchuang Tan, Guanghua Gu. 21136-21144 [doi]
- Beyond Fixed Tasks: Unsupervised Environment Design for Task-Level PairsDaniel Furelos-Blanco, Charles Pert, Frederik Kelbel, Alex F. Spies, Alessandra Russo, Michael Dennis. 21145-21153 [doi]
- BIQ: Bisection Interval Quantization for Communication-efficient Federated LearningLuyang Gai, Shusen Yang, Xuebin Ren, Zihao Zhou. 21154-21162 [doi]
- Fairness-Aware Design for Contextual Experiments: Guaranteeing Reliability and Equity in Heterogeneous SubgroupsGuangyan Gan, Ling Zhang, Yanhua Cheng, Yongxiang Tang 0001, Kaiyuan Li, Xialong Liu, Peng Jiang 0002. 21163-21170 [doi]
- Learning Whom to Align With: Progressive Anomaly Combination Detection for Partially View-Aligned ClusteringHang Gao, Zuosong Cai, Yuze Li, Cheng Liu, Gaoyang Li, Ying Li 0004, Wei Du 0002, You Zhou 0008. 21171-21179 [doi]
- MARE: Multimodal Analogical Reasoning for Disease Evolution-Aware Radiology Report GenerationQingqing Gao, Tengfei Liu, Xiaoyan Li, Xiaodan Zhang 0003, Zhongfan Sun, Boyue Wang, Baocai Yin, Zhaohui Liu. 21180-21188 [doi]
- DiAPR: Dimensionally-Allocated Prototype Refinement for Non-Exemplar Class Incremental LearningRuixuan Gao, Qijun Zhao, Keren Fu. 21189-21197 [doi]
- CMedBench: A Comprehensive Benchmark for Efficient Medical Large Language ModelsShengbo Gao, Jinyang Guo, Lixian Su, Yifu Ding, Shiqiao Gu, Aishan Liu, Yuqing Ma, Zhiwang Zhang, Xianglong Liu 0001. 21198-21206 [doi]
- The Semantic Architect: How FEAML Bridges Structured Data and LLMs for Multi-Label TasksWanfu Gao, Zebin He, Jun Gao. 21207-21215 [doi]
- Talon: Breaking the Synchronization Barrier in Speculative Decoding with Hybrid Model-based and Retrieve-based DraftingXiangxiang Gao, Weisheng Xie, Lixin, Xuwei Fang, Chen Hang, Changqun Li, Yuhan Lin, Xiaolong Xu. 21216-21224 [doi]
- Reconcile Gradient Modulation for Harmony Multimodal LearningXiyuan Gao, Bing Cao 0002, Baoquan Gong, Pengfei Zhu 0001. 21225-21233 [doi]
- Visual Bridge: Universal Visual Perception Representations GeneratingYilin Gao, Shuguang Dou, Junzhou Li, Zhiheng Yu, Yin Li, Dongsheng Jiang, Shugong Xu. 21234-21242 [doi]
- Adaptive Momentum and EMA-weighted Modeling for Imbalanced Label Distribution LearningYongbiao Gao, Xiangcheng Sun, Chao Tan, Chunyu Hu, Guohua Lv. 21243-21251 [doi]
- Unified Structural Factors for Transfer Learning Generalization with PAC-Bayesian GuaranteesZiqi Gao. 21252-21259 [doi]
- TGCD: A Framework for Generalized Category Discovery in Time-Series DataChandan Gautam, Lew Choon Hean, Ankit Das, Xiaoli Li 0001, Savitha Ramasamy. 21260-21268 [doi]
- TLAGC: Taylor Linear Attention-Guided Graph Convolutions for Revealing Spatial Domains in Spatial Multi-Omics DataAoyun Geng, Chunyan Cui, Yunyun Su, Zhenjie Luo, Feifei Cui, Zilong Zhang. 21269-21277 [doi]
- ORVIT: Near-Optimal Online Distributionally Robust Reinforcement LearningDebamita Ghosh, George K. Atia, Yue Wang 0068. 21278-21286 [doi]
- SDE-HARL: Scalable Distributed Policy Execution for Heterogeneous-Agent Reinforcement LearningToan D. Gian, Mohammad Abdi, Nathaniel D. Bastian, Francesco Restuccia 0001. 21287-21295 [doi]
- HAMLET4Fairness: Enhancing Fairness in AI Pipelines Through Human-Centered AutoML and ArgumentationJoseph Giovanelli, Giuseppe Pisano, Roberta Calegari. 21296-21304 [doi]
- Rethinking Membership Inference Attacks for CLIPLluís Gómez. 21305-21313 [doi]
- 4D Point Cloud Segmentation via Active Test-Time AdaptationMingrong Gong, Chaoqi Chen, Luyao Tang, Yuxi Wang, Sergio Escalera. 21315-21323 [doi]
- Behaviour Policy Optimization: Provably Lower Variance Return Estimates for Off-Policy Reinforcement LearningAlexander W. Goodall, Edwin Hamel-De le Court, Francesco Belardinelli. 21324-21332 [doi]
- SineLoRA∆: Sine-Activated Delta CompressionCameron Gordon, Yiping Ji, Hemanth Saratchandran, Paul Albert, Simon Lucey. 21333-21342 [doi]
- Risk-Sensitive Exponential Actor CriticAlonso Granados Baca, Jason Pacheco. 21343-21351 [doi]
- Out-of-Distribution Generalization with a SPARC: Racing 100 Unseen Vehicles with a Single PolicyBram Grooten, Patrick MacAlpine, Kaushik Subramanian, Peter Stone 0001, Peter R. Wurman. 21352-21360 [doi]
- Constraint-Guided Clustering for Identifying in-Vehicle Electronic Control Units from Voltage DataBogdan Groza, Patricia Iosif, Lucian Popa 0003. 21361-21368 [doi]
- FLRQ: Faster LLM Quantization with Flexible Low-Rank Matrix SketchingHongyaoxing Gu, Lijuan Hu, Shuzi Niu, Fangfang Liu. 21369-21377 [doi]
- UniME-V2: MLLM-as-a-Judge for Universal Multimodal Embedding LearningTiancheng Gu, Kaicheng Yang 0002, Kaichen Zhang, Xiang An, Ziyong Feng, Yueyi Zhang, Weidong Cai 0001, Jiankang Deng, Lidong Bing. 21378-21386 [doi]
- DAVID: Dual-stage Adaptive Vision-text Integrated Decoupling for Multimodal KV Cache EvictionYifeng Gu, Jianxiu Jin, Kailing Guo, Xiangmin Xu. 21387-21395 [doi]
- Graph Masked Autoencoder for Multi-view Remote Sensing Data ClusteringRenxiang Guan, Junhong Li, Siwei Wang 0001, Tianrui Li 0001, Dayu Hu, Miaomiao Li 0001, Xinwang Liu 0002. 21396-21404 [doi]
- Deformable Polygonal Flow Matching with Informed Priors and Hierarchical Graph ConstraintsArnaud Gueze, Matthieu Ospici, Damien Rohmer, Marie-Paule Cani. 21405-21413 [doi]
- AirWino: Optimized Winograd Convolution for Accelerating CNN Inference on ARMv8 ProcessorsHaoyuan Gui, Xiaoyu Zhang, Yifan Zhang, Ximeng Fu, Shiqi Sun, Leisheng Li, Huiyuan Li. 21414-21422 [doi]
- Disturbance-based Discretization, Differentiable IDS Channel, and an IDS-Correcting Code for DNA-based StorageAlan J. X. Guo, Mengyi Wei, Yufan Dai, Yali Wei, Pengchen Zhang. 21423-21431 [doi]
- Poisoning with a Pill: Circumventing Detection in Federated LearningHanxi Guo, Hao Wang 0022, Tao Song 0003, Tianhang Zheng,