Abstract is missing.
- CausalStock: Deep End-to-end Causal Discovery for News-driven Multi-stock Movement PredictionShuqi Li, Yuebo Sun, Yuxin Lin, Xin Gao 0001, Shuo Shang, Rui Yan 0001. [doi]
- Wide Two-Layer Networks can Learn from Adversarial PerturbationsSoichiro Kumano, Hiroshi Kera, Toshihiko Yamasaki. [doi]
- Derandomizing Multi-Distribution LearningKasper Green Larsen, Omar Montasser, Nikita Zhivotovskiy. [doi]
- Federated Ensemble-Directed Offline Reinforcement LearningDesik Rengarajan, Nitin Ragothaman, Dileep Kalathil, Srinivas Shakkottai. [doi]
- Why Warmup the Learning Rate? Underlying Mechanisms and ImprovementsDayal Singh Kalra, Maissam Barkeshli. [doi]
- SubgDiff: A Subgraph Diffusion Model to Improve Molecular Representation LearningJiying Zhang, Zijing Liu, Yu Wang, Bin Feng, Yu Li. [doi]
- Can LLMs Solve Molecule Puzzles? A Multimodal Benchmark for Molecular Structure ElucidationKehan Guo, Bozhao Nan, Yujun Zhou 0002, Taicheng Guo, Zhichun Guo, Mihir Surve, Zhenwen Liang, Nitesh V. Chawla, Olaf Wiest, Xiangliang Zhang 0001. [doi]
- ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less ReparameterizationHaoran You, Yipin Guo, Yichao Fu, Wei Zhou, Huihong Shi, Xiaofan Zhang 0001, Souvik Kundu 0009, Amir Yazdanbakhsh, Yingyan (Celine) Lin. [doi]
- OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and UnderstandingTao Zhang, Xiangtai Li, Hao Fei 0001, Haobo Yuan, Shengqiong Wu, Shunping Ji, Chen Change Loy, Shuicheng Yan. [doi]
- Explanations that reveal all through the definition of encodingAahlad Manas Puli, Nhi Nguyen, Rajesh Ranganath. [doi]
- A Best-of-both-worlds Algorithm for Bandits with Delayed Feedback with Robustness to Excessive DelaysSaeed Masoudian, Julian Zimmert, Yevgeny Seldin. [doi]
- HEPrune: Fast Private Training of Deep Neural Networks With Encrypted Data PruningYancheng Zhang, Mengxin Zheng, Yuzhang Shang, Xun Chen, Qian Lou. [doi]
- DeltaDock: A Unified Framework for Accurate, Efficient, and Physically Reliable Molecular DockingJiaxian Yan, Zaixi Zhang, JinTao Zhu, Kai Zhang, Jianfeng Pei, Qi Liu. [doi]
- Learning to Reason Iteratively and Parallelly for Complex Visual Reasoning ScenariosShantanu Jaiswal, Debaditya Roy, Basura Fernando, Cheston Tan. [doi]
- DeepDRK: Deep Dependency Regularized Knockoff for Feature SelectionHongyu Shen, Yici Yan, Zhizhen Jane Zhao. [doi]
- Collaborative Video Diffusion: Consistent Multi-video Generation with Camera ControlZhengfei Kuang, Shengqu Cai, Hao He, Yinghao Xu, Hongsheng Li, Leonidas J. Guibas, Gordon Wetzstein. [doi]
- Energy-Guided Continuous Entropic Barycenter Estimation for General CostsAlexander Kolesov, Petr Mokrov, Igor Udovichenko, Milena Gazdieva, Gudmund Pammer, Anastasis Kratsios, Evgeny Burnaev, Aleksandr Korotin. [doi]
- Improved Algorithms for Contextual Dynamic PricingMatilde Tullii, Solenne Gaucher, Nadav Merlis, Vianney Perchet. [doi]
- The Limits of Transfer Reinforcement Learning with Latent Low-rank StructureTyler Sam, Yudong Chen 0001, Christina Lee Yu. [doi]
- Exploring Fixed Point in Image Editing: Theoretical Support and Convergence OptimizationChen Hang, Zhe Ma, Haoming Chen, Xuwei Fang, Vincent Xie, Faming Fang, Guixu Zhang, Hongbin Wang. [doi]
- Instance-Optimal Private Density Estimation in the Wasserstein DistanceVitaly Feldman, Audra McMillan, Satchit Sivakumar, Kunal Talwar. [doi]
- SPARKLE: A Unified Single-Loop Primal-Dual Framework for Decentralized Bilevel OptimizationShuchen Zhu, Boao Kong, Songtao Lu, Xinmeng Huang, Kun Yuan. [doi]
- A Simple and Adaptive Learning Rate for FTRL in Online Learning with Minimax Regret of $\Theta(T^{2/3})$ and its Application to Best-of-Both-WorldsTaira Tsuchiya, Shinji Ito. [doi]
- 3DCoMPaT200: Language Grounded Large-Scale 3D Vision Dataset for Compositional RecognitionMahmoud Ahmed, Xiang Li, Arpit Prajapati, Mohamed Elhoseiny. [doi]
- MoVA: Adapting Mixture of Vision Experts to Multimodal ContextZhuofan Zong, Bingqi Ma, Dazhong Shen, Guanglu Song, Hao Shao, Dongzhi Jiang, Hongsheng Li, Yu Liu 0015. [doi]
- Visual Perception by Large Language Model's WeightsFeipeng Ma, Hongwei Xue, Yizhou Zhou, Guangting Wang, Fengyun Rao, Shilin Yan, Yueyi Zhang, Siying Wu, Mike Zheng Shou, Xiaoyan Sun 0001. [doi]
- Interpolating Item and User Fairness in Multi-Sided RecommendationsQinyi Chen, Jason Cheuk Nam Liang, Negin Golrezaei, Djallel Bouneffouf 0001. [doi]
- 3DGS-Enhancer: Enhancing Unbounded 3D Gaussian Splatting with View-consistent 2D Diffusion PriorsXi Liu, Chaoyi Zhou, Siyu Huang. [doi]
- Theoretical Foundations of Deep Selective State-Space ModelsNicola Muca Cirone, Antonio Orvieto, Benjamin Walker 0001, Cristopher Salvi, Terry J. Lyons. [doi]
- RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion ModelsXinchen Zhang, Ling Yang, Yaqi Cai, Zhaochen Yu, Kai-Ni Wang, Jiake Xie, Ye Tian, Minkai Xu, Yong Tang, Yujiu Yang, Bin Cui 0001. [doi]
- Weight decay induces low-rank attention layersSeijin Kobayashi, Yassir Akram, Johannes von Oswald. [doi]
- Taming Cross-Domain Representation Variance in Federated Prototype Learning with Heterogeneous Data DomainsLei Wang 0108, Jieming Bian, Letian Zhang, Chen Chen 0001, Jie Xu 0001. [doi]
- Image Textualization: An Automatic Framework for Generating Rich and Detailed Image DescriptionsRenjie Pi, Jianshu Zhang, Jipeng Zhang, Rui Pan, Zhekai Chen, Tong Zhang 0001. [doi]
- Stress-Testing Capability Elicitation With Password-Locked ModelsRyan Greenblatt, Fabien Roger, Dmitrii Krasheninnikov, David Krueger 0001. [doi]
- A Simple Image Segmentation Framework via In-Context ExamplesYang Liu, Chenchen Jing, Hengtao Li, Muzhi Zhu, Hao Chen 0041, Xinlong Wang, Chunhua Shen. [doi]
- Generating Highly Designable Proteins with Geometric Algebra Flow MatchingSimon Wagner, Leif Seute, Vsevolod Viliuga, Nicolas Wolf, Frauke Gräter, Jan Stühmer. [doi]
- MonkeySee: Space-time-resolved reconstructions of natural images from macaque multi-unit activityLynn Le, Paolo Papale, Katja Seeliger, Antonio Lozano, Thirza Dado, Feng Wang, Pieter R. Roelfsema, Marcel A. J. van Gerven, Yagmur Güçlütürk, Umut Güçlü. [doi]
- MmCows: A Multimodal Dataset for Dairy Cattle MonitoringHien Vu, Omkar Prabhune, Unmesh Raskar, Dimuth Panditharatne, Hanwook Chung, Christopher Y. Choi, Younghyun Kim 0001. [doi]
- EEVR: A Dataset of Paired Physiological Signals and Textual Descriptions for Joint Emotion Representation LearningPragya Singh, Ritvik Budhiraja, Ankush Gupta, Anshul Goswami, Mohan Kumar, Pushpendra Singh 0001. [doi]
- EffiBench: Benchmarking the Efficiency of Automatically Generated CodeDong Huang 0005, Yuhao Qing, Weiyi Shang, Heming Cui, Jie Zhang 0050. [doi]
- Analysing Multi-Task Regression via Random Matrix Theory with Application to Time Series ForecastingRomain Ilbert, Malik Tiomoko, Cosme Louart, Ambroise Odonnat, Vasilii Feofanov, Themis Palpanas, Ievgen Redko. [doi]
- Leveraging partial stragglers within gradient codingAditya Ramamoorthy, Ruoyu Meng, Vrinda S. Girimaji. [doi]
- Dual Encoder GAN Inversion for High-Fidelity 3D Head Reconstruction from Single ImagesBahri Batuhan Bilecen, Ahmet Berke Gökmen, Aysegul Dundar. [doi]
- Scribbles for All: Benchmarking Scribble Supervised Segmentation Across DatasetsWolfgang Boettcher, Lukas Hoyer, Ozan Unal, Jan Eric Lenssen, Bernt Schiele. [doi]
- Uncertainty-based Offline Variational Bayesian Reinforcement Learning for Robustness under Diverse Data CorruptionsRui Yang, Jie Wang, Guoping Wu, Bin Li. [doi]
- Truthful High Dimensional Sparse Linear RegressionLiyang Zhu, Amina Manseur, Meng Ding, Jinyan Liu, Jinhui Xu 0001, Di Wang 0015. [doi]
- Transductive Active Learning: Theory and ApplicationsJonas Hübotter, Bhavya Sukhija, Lenart Treven, Yarden As, Andreas Krause 0001. [doi]
- How Diffusion Models Learn to Factorize and ComposeQiyao Liang, Ziming Liu, Mitchell Ostrow, Ila Fiete. [doi]
- RadarOcc: Robust 3D Occupancy Prediction with 4D Imaging RadarFangqiang Ding, Xiangyu Wen, Yunzhou Zhu, Yiming Li 0003, Chris Xiaoxuan Lu. [doi]
- Learning Action and Reasoning-Centric Image Editing from Videos and SimulationBenno Krojer, Dheeraj Vattikonda, Luis Lara, Varun Jampani, Eva Portelance, Chris Pal, Siva Reddy. [doi]
- Attack-Aware Noise Calibration for Differential PrivacyBogdan Kulynych, Juan Felipe Gómez, Georgios Kaissis, Flávio P. Calmon, Carmela Troncoso. [doi]
- Universal Exact Compression of Differentially Private MechanismsYanxiao Liu 0003, Wei-Ning Chen, Ayfer Özgür, Cheuk Ting Li. [doi]
- Not so griddy: Internal representations of RNNs path integrating more than one agentWilliam Redman, Francisco Acosta, Santiago Acosta-Mendoza, Nina Miolane. [doi]
- Can an AI Agent Safely Run a Government? Existence of Probably Approximately Aligned PoliciesFrédéric Berdoz, Roger Wattenhofer. [doi]
- Latent Learning Progress Drives Autonomous Goal Selection in Human Reinforcement LearningGaia Molinaro, Cédric Colas, Pierre-Yves Oudeyer, Anne Collins. [doi]
- GOMAA-Geo: GOal Modality Agnostic Active Geo-localizationAnindya Sarkar, Srikumar Sastry, Aleksis Pirinen, Chongjie Zhang, Nathan Jacobs, Yevgeniy Vorobeychik. [doi]
- Graph Neural Networks and Arithmetic CircuitsTimon Barlag, Vivian Holzapfel, Laura Strieker, Jonni Virtema, Heribert Vollmer. [doi]
- Navigating the Maze of Explainable AI: A Systematic Approach to Evaluating Methods and MetricsLukas Klein, Carsten T. Lüth, Udo Schlegel, Till J. Bungert, Mennatallah El-Assady, Paul F. Jaeger. [doi]
- Metric Space Magnitude for Evaluating the Diversity of Latent RepresentationsKatharina Limbeck, Rayna Andreeva, Rik Sarkar, Bastian Rieck. [doi]
- What Makes and Breaks Safety Fine-tuning? A Mechanistic StudySamyak Jain, Ekdeep Singh Lubana, Kemal Oksuz, Tom Joy, Philip Torr 0001, Amartya Sanyal, Puneet K. Dokania. [doi]
- UKnow: A Unified Knowledge Protocol with Multimodal Knowledge Graph Datasets for Reasoning and Vision-Language Pre-TrainingBiao Gong, Shuai Tan, Yutong Feng, Xiaoying Xie, Yuyuan Li, Chaochao Chen 0001, Kecheng Zheng, Yujun Shen, Deli Zhao. [doi]
- Non-asymptotic Approximation Error Bounds of Parameterized Quantum CircuitsZhan Yu, Qiuhao Chen, Yuling Jiao, Yinan Li, Xiliang Lu, Xin Wang, Jerry Zhijian Yang. [doi]
- Boosting Vision-Language Models with TransductionMaxime Zanella, Benoît Gérin, Ismail Ben Ayed. [doi]
- AutoSurvey: Large Language Models Can Automatically Write SurveysYidong Wang, Qi Guo, Wenjin Yao, Hongbo Zhang, Xin Zhang, Zhen Wu 0002, Meishan Zhang, Xinyu Dai, Min Zhang 0005, Qingsong Wen, Wei Ye 0004, Shikun Zhang, Yue Zhang 0004. [doi]
- GraphCroc: Cross-Correlation Autoencoder for Graph Structural ReconstructionShijin Duan, Ruyi Ding, Jiaxing He, Aidong Adam Ding, Yunsi Fei, Xiaolin Xu 0001. [doi]
- Inference via Interpolation: Contrastive Representations Provably Enable Planning and InferenceBenjamin Eysenbach, Vivek Myers, Ruslan Salakhutdinov, Sergey Levine. [doi]
- Quadratic Quantum Variational Monte CarloBaiyu Su, Qiang Liu. [doi]
- DePLM: Denoising Protein Language Models for Property OptimizationZeyuan Wang, Keyan Ding, Ming Qin, Xiaotong Li, Xiang Zhuang, Yu Zhao 0009, Jianhua Yao 0001, Qiang Zhang 0026, Huajun Chen. [doi]
- RL-GPT: Integrating Reinforcement Learning and Code-as-policyShaoteng Liu, Haoqi Yuan, Minda Hu, Yanwei Li, Yukang Chen, Shu Liu 0005, Zongqing Lu, Jiaya Jia. [doi]
- Exactly Minimax-Optimal Locally Differentially Private SamplingHyun Young Park, Shahab Asoodeh, Si-Hyeon Lee. [doi]
- FedLPA: One-shot Federated Learning with Layer-Wise Posterior AggregationXiang Liu, Liangxi Liu, Feiyang Ye 0004, Yunheng Shen, Xia Li, Linshan Jiang, Jialin Li. [doi]
- Nesterov acceleration despite very noisy gradientsKanan Gupta, Jonathan W. Siegel, Stephan Wojtowytsch. [doi]
- AdaptiveISP: Learning an Adaptive Image Signal Processor for Object DetectionYujin Wang, Tianyi Xu, Zhang Fan, Tianfan Xue, Jinwei Gu. [doi]
- ConceptFactory: Facilitate 3D Object Knowledge Annotation with Object ConceptualizationJianhua Sun 0003, Yuxuan Li, Longfei Xu, Nange Wang, Jiude Wei, Yining Zhang, Cewu Lu. [doi]
- Exploring Token Pruning in Vision State Space ModelsZheng Zhan 0001, Zhenglun Kong, Yifan Gong 0004, Yushu Wu, Zichong Meng, Hangyu Zheng, Xuan Shen, Stratis Ioannidis, Wei Niu 0002, Pu Zhao 0001, Yanzhi Wang. [doi]
- CAT: Coordinating Anatomical-Textual Prompts for Multi-Organ and Tumor SegmentationZhongzhen Huang, Yankai Jiang 0003, Rongzhao Zhang, Shaoting Zhang 0001, Xiaofan Zhang 0002. [doi]
- Query-Based Adversarial Prompt GenerationJonathan Hayase, Ema Borevkovic, Nicholas Carlini, Florian Tramèr, Milad Nasr. [doi]
- Learning Goal-Conditioned Representations for Language Reward ModelsVaskar Nath, Dylan Slack, Jeff Da, Yuntao Ma, Hugh Zhang, Spencer Whitehead, Sean Hendryx. [doi]
- EGODE: An Event-attended Graph ODE Framework for Modeling Rigid DynamicsJingyang Yuan, Gongbo Sun, Zhiping Xiao 0001, Hang Zhou 0008, Xiao Luo 0001, Junyu Luo 0002, Yusheng Zhao, Wei Ju, Ming Zhang 0004. [doi]
- Autobidder's Dilemma: Why More Sophisticated Autobidders Lead to Worse Auction EfficiencyYuan Deng, Jieming Mao, Vahab Mirrokni, Hanrui Zhang 0001, Song Zuo. [doi]
- MACM: Utilizing a Multi-Agent System for Condition Mining in Solving Complex Mathematical ProblemsBin Lei, Yi Zhang, Shan Zuo, Ali Payani, Caiwen Ding. [doi]
- Simple and Effective Masked Diffusion Language ModelsSubham S. Sahoo, Marianne Arriola, Yair Schiff, Aaron Gokaslan, Edgar Marroquin, Justin T. Chiu, Alexander Rush, Volodymyr Kuleshov. [doi]
- Fearless Stochasticity in Expectation PropagationJonathan So, Richard E. Turner. [doi]
- HumanVLA: Towards Vision-Language Directed Object Rearrangement by Physical HumanoidXinyu Xu, Yizheng Zhang, Yonglu Li 0001, Lei Han 0001, Cewu Lu. [doi]
- Semi-Truths: A Large-Scale Dataset of AI-Augmented Images for Evaluating Robustness of AI-Generated Image detectorsAnisha Pal, Julia Kruk, Mansi Phute, Manognya Bhattaram, Diyi Yang, Duen Horng Chau, Judy Hoffman. [doi]
- Neglected Hessian component explains mysteries in sharpness regularizationYann N. Dauphin, Atish Agarwala, Hossein Mobahi. [doi]
- A distributional simplicity bias in the learning dynamics of transformersRiccardo Rende, Federica Gerace, Alessandro Laio, Sebastian Goldt. [doi]
- Linear Causal Bandits: Unknown Graph and Soft InterventionsZirui Yan, Ali Tajer. [doi]
- Model Collapse Demystified: The Case of RegressionElvis Dohmatob, Yunzhen Feng, Julia Kempe. [doi]
- INDICT: Code Generation with Internal Dialogues of Critiques for Both Security and HelpfulnessHung Le, Doyen Sahoo, Yingbo Zhou, Caiming Xiong, Silvio Savarese. [doi]
- The Dormant Neuron Phenomenon in Multi-Agent Reinforcement Learning Value FactorizationHaoyuan Qin, Chennan Ma, Mian Deng, Zhengzhu Liu, Songzhu Mei, Xinwang Liu, Cheng Wang, Siqi Shen. [doi]
- Do's and Don'ts: Learning Desirable Skills with Instruction VideosHyunseung Kim, ByungKun Lee, HoJoon Lee, Dongyoon Hwang, Donghu Kim, Jaegul Choo. [doi]
- Achieving Tractable Minimax Optimal Regret in Average Reward MDPsVictor Boone, Zihan Zhang. [doi]
- Conditional Generative Models are Sufficient to Sample from Any Causal Effect EstimandMd. Musfiqur Rahman, Matt Jordan, Murat Kocaoglu. [doi]
- A teacher-teacher framework for clinical language representation learningFeiqing Huang, Shenghan Zhang, Sara Morini Sweet, Tianxi Cai. [doi]
- Toward Efficient Inference for Mixture of ExpertsHaiyang Huang 0003, Newsha Ardalani, Anna Y. Sun, Liu Ke 0001, Shruti Bhosale, Hsien-Hsin S. Lee, Carole-Jean Wu, Benjamin Lee. [doi]
- Brain Treebank: Large-scale intracranial recordings from naturalistic language stimuliChristopher Wang, Adam Uri Yaari, Aaditya Singh, Vighnesh Subramaniam, Dana Rosenfarb, Jan DeWitt, Pranav Misra, Joseph R. Madsen, Scellig Stone, Gabriel Kreiman, Boris Katz, Ignacio Cases, Andrei Barbu. [doi]
- Fast Best-of-N Decoding via Speculative RejectionHanshi Sun, Momin Haider, Ruiqi Zhang, Huitao Yang, Jiahao Qiu, Ming Yin, Mengdi Wang, Peter L. Bartlett, Andrea Zanette. [doi]
- Improved Bayes Regret Bounds for Multi-Task Hierarchical Bayesian Bandit AlgorithmsJiechao Guan, Hui Xiong 0001. [doi]
- RashomonGB: Analyzing the Rashomon Effect and Mitigating Predictive Multiplicity in Gradient BoostingHsiang Hsu, Ivan Brugere, Shubham Sharma, Freddy Lécué, Richard Chen. [doi]
- HaloScope: Harnessing Unlabeled LLM Generations for Hallucination DetectionXuefeng Du, Chaowei Xiao, Sharon Li 0001. [doi]
- Weak-to-Strong Search: Align Large Language Models via Searching over Small Language ModelsZhanhui Zhou, Zhixuan Liu, Jie Liu, Zhichen Dong, Chao Yang, Yu Qiao. [doi]
- WAGLE: Strategic Weight Attribution for Effective and Modular Unlearning in Large Language ModelsJinghan Jia, Jiancheng Liu, Yihua Zhang, Parikshit Ram, Nathalie Baracaldo, Sijia Liu 0001. [doi]
- Better by default: Strong pre-tuned MLPs and boosted trees on tabular dataDavid Holzmüller, Léo Grinsztajn, Ingo Steinwart. [doi]
- From an Image to a Scene: Learning to Imagine the World from a Million 360° VideosMatthew Wallingford, Anand Bhattad, Aditya Kusupati, Vivek Ramanujan, Matt Deitke, Aniruddha Kembhavi, Roozbeh Mottaghi, Wei-Chiu Ma, Ali Farhadi. [doi]
- MoGenTS: Motion Generation based on Spatial-Temporal Joint ModelingWeihao Yuan 0001, Yisheng He, Weichao Shen, Yuan Dong, Xiaodong Gu 0004, Zilong Dong, Liefeng Bo, Qixing Huang. [doi]
- PromptFix: You Prompt and We Fix the PhotoYongsheng Yu, Ziyun Zeng, Hang Hua, Jianlong Fu, Jiebo Luo 0001. [doi]
- Learning Disentangled Representations for Perceptual Point Cloud Quality Assessment via Mutual Information MinimizationZiyu Shan, Yujie Zhang, Yipeng Liu 0003, Yiling Xu. [doi]
- Hierarchy-Agnostic Unsupervised Segmentation: Parsing Semantic Image StructureSimone Rossetti, Fiora Pirri. [doi]
- Unleashing the Potential of the Diffusion Model in Few-shot Semantic SegmentationMuzhi Zhu, Yang Liu, Zekai Luo, Chenchen Jing, Hao Chen 0041, Guangkai Xu, Xinlong Wang, Chunhua Shen. [doi]
- Memory-Efficient Gradient Unrolling for Large-Scale Bi-level OptimizationQianli Shen, Yezhen Wang, Zhouhao Yang, Xiang Li, Haonan Wang, Yang Zhang, Jonathan Scarlett, Zhanxing Zhu, Kenji Kawaguchi. [doi]
- Sample-Efficient Geometry Reconstruction from Euclidean Distances using Non-Convex OptimizationIpsita Ghosh, Abiy Tasissa, Christian Kümmerle. [doi]
- Computing the Bias of Constant-step Stochastic Approximation with Markovian NoiseSebastian Allmeier, Nicolas Gast. [doi]
- Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model DisentanglementZhi Wang, Li Zhang, Wenhao Wu, Yuanheng Zhu, Dongbin Zhao, Chunlin Chen. [doi]
- Time-MMD: Multi-Domain Multimodal Dataset for Time Series AnalysisHaoxin Liu 0001, Shangqing Xu, Zhiyuan Zhao 0002, Lingkai Kong, Harshavardhan Kamarthi, Aditya B. Sasanur, Megha Sharma, Jiaming Cui, Qingsong Wen, Chao Zhang 0014, B. Aditya Prakash. [doi]
- Causal discovery with endogenous context variablesWiebke Günther, Oana-Iuliana Popescu, Martin Rabel, Urmi Ninad, Andreas Gerhardus, Jakob Runge. [doi]
- The Power of Hard Attention Transformers on Data Sequences: A formal language theoretic perspectivePascal Bergsträßer, Chris Köcher, Anthony Widjaja Lin, Georg Zetzsche. [doi]
- Analyzing & Reducing the Need for Learning Rate Warmup in GPT TrainingAtli Kosson, Bettina Messmer, Martin Jaggi. [doi]
- Unraveling the Gradient Descent Dynamics of TransformersBingqing Song, Boran Han, Shuai Zhang, Jie Ding 0002, Mingyi Hong 0001. [doi]
- Learning to be Smooth: An End-to-End Differentiable Particle SmootherAli Younis, Erik B. Sudderth. [doi]
- Elucidating the Design Space of Dataset CondensationShitong Shao, Zikai Zhou, Huanran Chen, Zhiqiang Shen. [doi]
- DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-SolvingYuxuan Tong, Xiwen Zhang, Rui Wang, Ruidong Wu, Junxian He. [doi]
- Optimistic Critic Reconstruction and Constrained Fine-Tuning for General Offline-to-Online RLQin-Wen Luo, Ming-Kun Xie, Ye-Wen Wang, Sheng-Jun Huang. [doi]
- QKFormer: Hierarchical Spiking Transformer using Q-K AttentionChenlin Zhou, Han Zhang, Zhaokun Zhou, Liutao Yu, Liwei Huang, Xiaopeng Fan, Li Yuan 0007, Zhengyu Ma, Huihui Zhou, Yonghong Tian 0001. [doi]
- Learning Noisy Halfspaces with a Margin: Massart is No Harder than RandomGautam Chandrasekaran, Vasilis Kontonis, Konstantinos Stavropoulos, Kevin Tian. [doi]
- FasMe: Fast and Sample-efficient Meta Estimator for Precision Matrix Learning in Small Sample SettingsXiao Tan 0005, Yiqin Wang, Yangyang Shen, Dian Shen, Meng Wang 0009, Peibo Duan, Beilun Wang. [doi]
- Bridging the Divide: Reconsidering Softmax and Linear AttentionDongchen Han, Yifan Pu, Zhuofan Xia, Yizeng Han, Xuran Pan, Xiu Li 0001, Jiwen Lu, Shiji Song, Gao Huang 0001. [doi]
- Atlas3D: Physically Constrained Self-Supporting Text-to-3D for Simulation and FabricationYunuo Chen, Tianyi Xie, Zeshun Zong, Xuan Li, Feng Gao, Yin Yang 0002, Ying Nian Wu, Chenfanfu Jiang. [doi]
- Policy Optimization for Robust Average Reward MDPsZhongchang Sun, Sihong He, Fei Miao, Shaofeng Zou. [doi]
- SyncTweedies: A General Generative Framework Based on Synchronized DiffusionsJaihoon Kim, Juil Koo, Kyeongmin Yeo, Minhyuk Sung. [doi]
- UniGAD: Unifying Multi-level Graph Anomaly DetectionYiqing Lin, Jianheng Tang, Chenyi Zi, H. Vicky Zhao, Yuan Yao, Jia Li. [doi]
- Action Gaps and Advantages in Continuous-Time Distributional Reinforcement LearningHarley Wiltzer, Marc G. Bellemare, David Meger, Patrick Shafto, Yash Jhaveri. [doi]
- No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen RepresentationsWalter Simoncini, Andrei Bursuc, Spyridon Gidaris, Yuki M. Asano. [doi]
- CYCLO: Cyclic Graph Transformer Approach to Multi-Object Relationship Modeling in Aerial VideosTrong Thuan Nguyen, Pha A. Nguyen, Xin Li 0005, Jackson David Cothren, Alper Yilmaz, Khoa Luu. [doi]
- Approximation-Aware Bayesian OptimizationNatalie Maus, Kyurae Kim, David Eriksson, Geoff Pleiss, John P. Cunningham, Jacob R. Gardner. [doi]
- A Method for Evaluating Hyperparameter Sensitivity in Reinforcement LearningJacob Adkins, Michael Bowling, Adam White 0001. [doi]
- Exploring DCN-like architecture for fast image generation with arbitrary resolutionShuai Wang, Zexian Li, Tianhui Song, Xubin Li, Tiezheng Ge, Bo Zheng 0007, Limin Wang 0002. [doi]
- Object segmentation from common fate: Motion energy processing enables human-like zero-shot generalization to random dot stimuliMatthias Tangemann, Matthias Kümmerer, Matthias Bethge. [doi]
- The Sample Complexity of Gradient Descent in Stochastic Convex OptimizationRoi Livni. [doi]
- Toxicity Detection for FreeZhanhao Hu, Julien Piet, Geng Zhao, Jiantao Jiao, David A. Wagner 0001. [doi]
- Regression under demographic parity constraints via unlabeled post-processingGayane Taturyan, Evgenii Chzhen, Mohamed Hebiri. [doi]
- Consensus Learning with Deep Sets for Essential Matrix EstimationDror Moran, Yuval Margalit, Guy Trostianetsky, Fadi Khatib, Meirav Galun, Ronen Basri. [doi]
- Semi-supervised Knowledge Transfer Across Multi-omic Single-cell DataFan Zhang, Tianyu Liu, Zihao Chen, Xiaojiang Peng, Chong Chen 0002, Xian-Sheng Hua 0001, Xiao Luo 0001, Hongyu Zhao. [doi]
- Segment Any ChangeZhuo Zheng, Yanfei Zhong, Liangpei Zhang 0001, Stefano Ermon. [doi]
- A theoretical design of concept sets: improving the predictability of concept bottleneck modelsMax Ruiz Luyten, Mihaela van der Schaar. [doi]
- Implicit Regularization Paths of Weighted Neural RepresentationsJin-Hong Du, Pratik Patil. [doi]
- A Modular Conditional Diffusion Framework for Image ReconstructionMagauiya Zhussip, Iaroslav Koshelev, Stamatios Lefkimmiatis. [doi]
- In Pursuit of Causal Label Correlations for Multi-label Image RecognitionZhao-Min Chen, Xin Jin, Yisu Ge, Sixian Chan. [doi]
- DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement LearningHao Bai, Yifei Zhou, Jiayi Pan, Mert Cemri, Alane Suhr, Sergey Levine, Aviral Kumar. [doi]
- FACT or Fiction: Can Truthful Mechanisms Eliminate Federated Free Riding?Marco Bornstein, Amrit Singh Bedi, Abdirisak Mohamed, Furong Huang. [doi]
- Unveiling The Matthew Effect Across Channels: Assessing Layer Width Sufficiency via Weight Norm VarianceYiting Chen 0003, Jiazi Bu, Junchi Yan. [doi]
- Smoke and Mirrors in Causal Downstream TasksRiccardo Cadei, Lukas Lindorfer, Sylvia Cremer, Cordelia Schmid, Francesco Locatello. [doi]
- Entity Alignment with Noisy Annotations from Large Language ModelsShengyuan Chen, Qinggang Zhang, Junnan Dong, Wen-hua, Qing Li 0001, Xiao Huang 0001. [doi]
- LoRA-GA: Low-Rank Adaptation with Gradient ApproximationShaowen Wang, Linxi Yu, Jian Li. [doi]
- Shopping MMLU: A Massive Multi-Task Online Shopping Benchmark for Large Language ModelsYilun Jin, Zheng Li 0018, Chenwei Zhang, Tianyu Cao 0001, Yifan Gao 0001, Pratik Jayarao, Mao Li, Xin Liu 0039, Ritesh Sarkhel, Xianfeng Tang, Haodong Wang, Zhengyang Wang, Wenju Xu, Jingfeng Yang 0001, Qingyu Yin, Xian Li, Priyanka Nigam, Yi Xu, Kai Chen 0005, Qiang Yang 0001, Meng Jiang 0001, Bing Yin. [doi]
- ChatQA: Surpassing GPT-4 on Conversational QA and RAGZihan Liu 0001, Wei Ping, Rajarshi Roy 0003, Peng Xu 0008, Chankyu Lee, Mohammad Shoeybi, Bryan Catanzaro. [doi]
- FreqMark: Invisible Image Watermarking via Frequency Based Optimization in Latent SpaceYiyang Guo, Ruizhe Li, Mude Hui, Hanzhong Guo, Chen Zhang, Chuangjian Cai, Le Wan, Shangfei Wang. [doi]
- Flow Priors for Linear Inverse Problems via Iterative Corrupted Trajectory MatchingYasi Zhang, Peiyu Yu, Yaxuan Zhu, Yingshan Chang, Feng Gao 0013, Ying Nian Wu, Oscar Leong. [doi]
- Regularized Adaptive Momentum Dual Averaging with an Efficient Inexact Subproblem Solver for Training Structured Neural NetworkZih-Syuan Huang, Ching-Pei Lee. [doi]
- Long-Range Feedback Spiking Network Captures Dynamic and Static Representations of the Visual Cortex under Movie StimuliLiwei Huang, Zhengyu Ma, Liutao Yu, Huihui Zhou, Yonghong Tian 0001. [doi]
- Chain of Agents: Large Language Models Collaborating on Long-Context TasksYusen Zhang, Ruoxi Sun 0002, Yanfei Chen, Tomas Pfister, Rui Zhang, Sercan Ö. Arik. [doi]
- MambaLLIE: Implicit Retinex-Aware Low Light Enhancement with Global-then-Local State SpaceJiangwei Weng, Zhiqiang Yan, Ying Tai, Jianjun Qian, Jian Yang, Jun Li. [doi]
- Test Where Decisions Matter: Importance-driven Testing for Deep Reinforcement LearningStefan Pranger, Hana Chockler, Martin Tappler, Bettina Könighofer. [doi]
- End-to-End Video Semantic Segmentation in Adverse Weather using Fusion Blocks and Temporal-Spatial Teacher-Student LearningXin Yang, Wending Yan, Michael Bi Mi, Yuan Yuan, Robby T. Tan. [doi]
- KFNN: K-Free Nearest Neighbor For CrowdsourcingWenjun Zhang 0012, Liangxiao Jiang, Chaoqun Li 0001. [doi]
- Semi-supervised Multi-label Learning with Balanced Binary Angular Margin LossXiming Li 0002, Silong Liang, Changchun Li, Pengfei Wang, Fangming Gu. [doi]
- Instance-Specific Asymmetric Sensitivity in Differential PrivacyDavid Durfee. [doi]
- Recognize Any RegionsHaosen Yang, Chuofan Ma, Bin Wen, Yi Jiang, Zehuan Yuan, Xiatian Zhu. [doi]
- SafeSora: Towards Safety Alignment of Text2Video Generation via a Human Preference DatasetJuntao Dai, Tianle Chen, Xuyao Wang, Ziran Yang, Taiye Chen, Jiaming Ji, Yaodong Yang 0001. [doi]
- Online Weighted Paging with Unknown WeightsOrin Levy, Noam Touitou, Aviv Rosenberg. [doi]
- A Pairwise Pseudo-likelihood Approach for Matrix Completion with Informative MissingnessJiangyuan Li, Jiayi Wang, Raymond K. W. Wong, Kwun Chuen Gary Chan. [doi]
- Online Iterative Reinforcement Learning from Human Feedback with General Preference ModelChenlu Ye, Wei Xiong 0015, Yuheng Zhang, Hanze Dong, Nan Jiang 0008, Tong Zhang 0001. [doi]
- UAV3D: A Large-scale 3D Perception Benchmark for Unmanned Aerial VehiclesHui Ye, Rajshekhar Sunderraman, Jonathan Shihao Ji. [doi]
- ReMAP: Neural Model Reprogramming with Network Inversion and Retrieval-Augmented Mapping for Adaptive Motion ForecastingSharmita Dey, Sarath Ravindran Nair. [doi]
- Relationship Prompt Learning is Enough for Open-Vocabulary Semantic SegmentationJiahao Li, Yang Lu 0009, Yuan Xie 0006, Yanyun Qu. [doi]
- On the Use of Anchoring for Training Vision ModelsVivek Sivaraman Narayanaswamy, Kowshik Thopalli, Rushil Anirudh, Yamen Mubarka, Wesam Sakla, Jayaraman J. Thiagarajan. [doi]
- UniMTS: Unified Pre-training for Motion Time SeriesXiyuan Zhang 0001, Diyan Teng, Ranak Roy Chowdhury, Shuheng Li, Dezhi Hong, Rajesh K. Gupta, Jingbo Shang. [doi]
- Mixture of Tokens: Continuous MoE through Cross-Example AggregationSzymon Antoniak, Michal Krutul, Maciej Pióro, Jakub Krajewski, Jan Ludziejewski, Kamil Ciebiera, Krystian Król, Tomasz Odrzygózdz, Marek Cygan, Sebastian Jaszczur. [doi]
- On Feature Learning in Structured State Space ModelsLeena Chennuru Vankadara, Jin Xu, Moritz Haas, Volkan Cevher. [doi]
- ReFT: Representation Finetuning for Language ModelsZhengxuan Wu, Aryaman Arora, Zheng Wang, Atticus Geiger, Dan Jurafsky, Christopher D. Manning, Christopher Potts. [doi]
- Flow Snapshot Neurons in Action: Deep Neural Networks Generalize to Biological Motion PerceptionShuangpeng Han, Ziyu Wang, Mengmi Zhang. [doi]
- Exploring Consistency in Graph Representations: from Graph Kernels to Graph Neural NetworksXuyuan Liu, Yinghao Cai, Qihui Yang, Yujun Yan. [doi]
- DAT: Improving Adversarial Robustness via Generative Amplitude Mix-up in Frequency DomainFengpeng Li, Kemou Li, Haiwei Wu, Jinyu Tian 0001, Jiantao Zhou 0001. [doi]
- VCR-GauS: View Consistent Depth-Normal Regularizer for Gaussian Surface ReconstructionHanlin Chen, Fangyin Wei, Chen Li 0038, Tianxin Huang, Yunsong Wang, Gim Hee Lee. [doi]
- ChatCam: Empowering Camera Control through Conversational AIXinhang Liu, Yu-Wing Tai, Chi-Keung Tang. [doi]
- Token Merging for Training-Free Semantic Binding in Text-to-Image SynthesisTaihang Hu, Linxuan Li, Joost van de Weijer 0001, Hongcheng Gao, Fahad Shahbaz Khan, Jian Yang, Ming-Ming Cheng, Kai Wang, Yaxing Wang. [doi]
- InstructG2I: Synthesizing Images from Multimodal Attributed GraphsBowen Jin, Ziqi Pang, Bingjun Guo, Yu-Xiong Wang, Jiaxuan You, Jiawei Han 0001. [doi]
- Evaluating the World Model Implicit in a Generative ModelKeyon Vafa, Justin Y. Chen, Ashesh Rambachan, Jon M. Kleinberg, Sendhil Mullainathan. [doi]
- Prior-itizing Privacy: A Bayesian Approach to Setting the Privacy Budget in Differential PrivacyZeki Kazan, Jerome P. Reiter. [doi]
- A scalable generative model for dynamical system reconstruction from neuroimaging dataEric Volkmann, Alena Brändle, Daniel Durstewitz, Georgia Koppe. [doi]
- On the Impact of Feature Heterophily on Link Prediction with Graph Neural NetworksJiong Zhu, Gaotang Li, Yao-An Yang, Jing Zhu 0005, Xuehao Cui, Danai Koutra. [doi]
- Off-policy estimation with adaptively collected data: the power of online learningJeonghwan Lee, Cong Ma 0001. [doi]
- Practical Bayesian Algorithm Execution via Posterior SamplingChu Xin Cheng, Raul Astudillo, Thomas A. Desautels, Yisong Yue. [doi]
- FNP: Fourier Neural Processes for Arbitrary-Resolution Data AssimilationKun Chen, Peng Ye, Hao Chen 0045, Kang Chen, Tao Han 0002, Wanli Ouyang, Tao Chen 0003, Lei Bai 0001. [doi]
- Deep Policy Gradient Methods Without Batch Updates, Target Networks, or Replay BuffersGautham Vasan, Mohamed Elsayed 0003, Seyed Alireza Azimi, Jiamin He, Fahim Shahriar, Colin Bellinger, Martha White, Rupam Mahmood. [doi]
- Dual Lagrangian Learning for Conic OptimizationMathieu Tanneau, Pascal Van Hentenryck. [doi]
- ENAT: Rethinking Spatial-temporal Interactions in Token-based Image SynthesisZanlin Ni, Yulin Wang, Renping Zhou, Yizeng Han, Jiayi Guo, Zhiyuan Liu 0001, Yuan Yao 0013, Gao Huang 0001. [doi]
- Everyday Object Meets Vision-and-Language Navigation Agent via BackdoorKeji He, Kehan Chen, Jiawang Bai, Yan Huang 0008, Qi Wu 0001, Shu-Tao Xia, Liang Wang 0001. [doi]
- Incorporating Test-Time Optimization into Training with Dual Networks for Human Mesh RecoveryYongwei Nie, Mingxian Fan, Chengjiang Long, Qing Zhang 0006, Jian Zhu 0001, Xuemiao Xu. [doi]
- PrefPaint: Aligning Image Inpainting Diffusion Model with Human PreferenceKendong Liu, Zhiyu Zhu, Chuanhao Li, Hui Liu 0032, Huanqiang Zeng, Junhui Hou. [doi]
- Stability and Generalization of Adversarial Training for Shallow Neural Networks with Smooth ActivationKaibo Zhang, Yunjuan Wang, Raman Arora. [doi]
- DiffusionFake: Enhancing Generalization in Deepfake Detection via Guided Stable DiffusionKe Sun 0016, Shen Chen, Taiping Yao, Hong Liu 0009, Xiaoshuai Sun, Shouhong Ding, Rongrong Ji. [doi]
- A Universal Growth Rate for Learning with Smooth Surrogate LossesAnqi Mao, Mehryar Mohri, Yutao Zhong 0002. [doi]
- Localized Adaptive Risk ControlMatteo Zecchin, Osvaldo Simeone. [doi]
- Uncovering, Explaining, and Mitigating the Superficial Safety of Backdoor DefenseRui Min, Zeyu Qin, Nevin L. Zhang, Li Shen 0008, Minhao Cheng. [doi]
- ClevrSkills: Compositional Language And Visual Reasoning in RoboticsSanjay Haresh, Daniel Dijkman, Apratim Bhattacharyya, Roland Memisevic. [doi]
- Real-Time Selection Under General Constraints via Predictive InferenceYuyang Huo, Lin Lu, Haojie Ren, Changliang Zou. [doi]
- SEEV: Synthesis with Efficient Exact Verification for ReLU Neural Barrier FunctionsHongchao Zhang, Zhizhen Qin, Sicun Gao, Andrew Clark 0001. [doi]
- Towards Multi-Domain Learning for Generalizable Video Anomaly DetectionMyeongAh Cho, Taeoh Kim, Minho Shim, Dongyoon Wee, Sangyoun Lee. [doi]
- EffiLearner: Enhancing Efficiency of Generated Code via Self-OptimizationDong Huang 0005, Jianbo Dai, Han Weng, Puzhen Wu, Yuhao Qing, Heming Cui, Zhijiang Guo, Jie Zhang 0050. [doi]
- Video Diffusion Models are Training-free Motion Interpreter and ControllerZeqi Xiao, Yifan Zhou, Shuai Yang, Xingang Pan. [doi]
- TuneTables: Context Optimization for Scalable Prior-Data Fitted NetworksBenjamin Feuer, Robin Schirrmeister, Valeriia Cherepanova, Chinmay Hegde, Frank Hutter, Micah Goldblum, Niv Cohen, Colin White. [doi]
- EMGBench: Benchmarking Out-of-Distribution Generalization and Adaptation for ElectromyographyJehan Yang, Maxwell Soh, Vivianna Lieu, Douglas J. Weber, Zackory Erickson. [doi]
- Differentiable Quantum Computing for Large-scale Linear ControlConnor Clayton, Jiaqi Leng, Gengzhi Yang, Yi-Ling Qiao, Ming C. Lin, Xiaodi Wu 0001. [doi]
- Déjà Vu Memorization in Vision-Language ModelsBargav Jayaraman, Chuan Guo 0001, Kamalika Chaudhuri. [doi]
- Pre-trained Text-to-Image Diffusion Models Are Versatile Representation Learners for ControlGunshi Gupta, Karmesh Yadav, Yarin Gal, Dhruv Batra, Zsolt Kira, Cong Lu, Tim G. J. Rudner. [doi]
- Pre-trained Large Language Models Use Fourier Features to Compute AdditionTianyi Zhou, Deqing Fu, Vatsal Sharan, Robin Jia. [doi]
- Ensemble Learning for Heterogeneous Large Language Models with Deep Parallel CollaborationYichong Huang, Xiaocheng Feng, Baohang Li, Yang Xiang, Hui Wang, Ting Liu, Bing Qin 0001. [doi]
- Guiding Neural Collapse: Optimising Towards the Nearest Simplex Equiangular Tight FrameEvan Markou, Thalaiyasingam Ajanthan, Stephen Gould. [doi]
- IllumiNeRF: 3D Relighting Without Inverse RenderingXiaoming Zhao, Pratul P. Srinivasan, Dor Verbin, Keunhong Park, Ricardo Martin-Brualla, Philipp Henzler. [doi]
- Human-3Diffusion: Realistic Avatar Creation via Explicit 3D Consistent Diffusion ModelsYuxuan Xue, Xianghui Xie, Riccardo Marin, Gerard Pons-Moll. [doi]
- Template-free Articulated Gaussian Splatting for Real-time Reposable Dynamic View SynthesisDiwen Wan, Yuxiang Wang, Ruijie Lu, Gang Zeng. [doi]
- Revisiting Differentially Private ReLU RegressionMeng Ding, Mingxi Lei, Liyang Zhu, Shaowei Wang 0003, Di Wang 0015, Jinhui Xu 0001. [doi]
- Tiny Time Mixers (TTMs): Fast Pre-trained Models for Enhanced Zero/Few-Shot Forecasting of Multivariate Time SeriesVijay Ekambaram, Arindam Jati, Pankaj Dayama 0001, Sumanta Mukherjee, Nam Nguyen, Wesley M. Gifford, Chandra Reddy, Jayant Kalagnanam. [doi]
- Model LEGO: Creating Models Like Disassembling and Assembling Building BlocksJiacong Hu, Jing Gao, Jingwen Ye, Yang Gao, Xingen Wang, Zunlei Feng, Mingli Song. [doi]
- ProbTS: Benchmarking Point and Distributional Forecasting across Diverse Prediction HorizonsJiawen Zhang, Xumeng Wen, Zhenwei Zhang, Shun Zheng, Jia Li 0009, Jiang Bian 0002. [doi]
- Vitron: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, EditingHao Fei 0001, Shengqiong Wu, Hanwang Zhang, Tat-Seng Chua, Shuicheng Yan. [doi]
- Automating Data Annotation under Strategic Human Agents: Risks and Potential SolutionsTian Xie, Xueru Zhang. [doi]
- Generalization of Hamiltonian algorithmsAndreas Maurer. [doi]
- Rethinking Score Distillation as a Bridge Between Image DistributionsDavid McAllister, Songwei Ge, Jia-Bin Huang 0001, David Jacobs 0001, Alexei A. Efros, Aleksander Holynski, Angjoo Kanazawa. [doi]
- MoME: Mixture of Multimodal Experts for Generalist Multimodal Large Language ModelsLeyang Shen, Gongwei Chen, Rui Shao, Weili Guan, Liqiang Nie. [doi]
- FERERO: A Flexible Framework for Preference-Guided Multi-Objective LearningLisha Chen, A. F. M. Saif, Yanning Shen, Tianyi Chen. [doi]
- Is One GPU Enough? Pushing Image Generation at Higher-Resolutions with Foundation ModelsAthanasios Tragakis, Marco Aversa, Chaitanya Kaul, Roderick Murray-Smith, Daniele Faccio. [doi]
- Span-Based Optimal Sample Complexity for Weakly Communicating and General Average Reward MDPsMatthew Zurek, Yudong Chen. [doi]
- Can Models Learn Skill Composition from Examples?Haoyu Zhao, Simran Kaur 0001, Dingli Yu, Anirudh Goyal, Sanjeev Arora. [doi]
- Bayesian Online Natural Gradient (BONG)Matt Jones 0002, Peter G. Chang, Kevin P. Murphy. [doi]
- UniTox: Leveraging LLMs to Curate a Unified Dataset of Drug-Induced Toxicity from FDA LabelsJacob Silberg, Kyle Swanson, Elana Simon, Angela Zhang, Zaniar Ghazizadeh, Scott Ogden, Hisham Hamadeh, James Y. Zou. [doi]
- Amortized Planning with Large-Scale Transformers: A Case Study on ChessAnian Ruoss, Grégoire Delétang, Sourabh Medapati, Jordi Grau-Moya, Kevin Li, Elliot Catt, John Reid, Cannada Lewis, Joel Veness, Tim Genewein. [doi]
- CRT-Fusion: Camera, Radar, Temporal Fusion Using Motion Information for 3D Object DetectionJisong Kim, Minjae Seong, Jun Won Choi. [doi]
- Fine Tuning Out-of-Vocabulary Item Recommendation with User Sequence ImaginationRuochen Liu, Hao Chen, Yuanchen Bei, Qijie Shen, Fangwei Zhong, Senzhang Wang, Jianxin Wang. [doi]
- Dissect Black Box: Interpreting for Rule-Based Explanations in Unsupervised Anomaly DetectionYu Zhang, Ruoyu Li, Nengwu Wu, Qing Li, Xinhan Lin, Yang Hu, Tao Li, Yong Jiang. [doi]
- Online Consistency of the Nearest Neighbor RuleGeelon So, Sanjoy Dasgupta. [doi]
- Constrained Synthesis with Projected Diffusion ModelsJacob K. Christopher, Stephen Baek, Ferdinando Fioretto. [doi]
- Universal Neural FunctionalsAllan Zhou, Chelsea Finn, James Harrison. [doi]
- Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic SegmentationRuihao Xia, Yu Liang, Peng-Tao Jiang, Hao Zhang, Bo Li, Yang Tang, Pan Zhou 0002. [doi]
- Regularized Conditional Diffusion Model for Multi-Task Preference AlignmentXudong Yu, Chenjia Bai, Haoran He, Changhong Wang, Xuelong Li 0001. [doi]
- Mean-Field Langevin Dynamics for Signed Measures via a Bilevel ApproachGuillaume Wang, Alireza Mousavi Hosseini, Lénaïc Chizat. [doi]
- Learning Plaintext-Ciphertext Cryptographic Problems via ANF-based SAT Instance RepresentationXinhao Zheng, Yang Li, Cunxin Fan, Huaijin Wu, Xinhao Song, Junchi Yan. [doi]
- Generative Semi-supervised Graph Anomaly DetectionHezhe Qiao, Qingsong Wen, Xiaoli Li 0001, Ee-Peng Lim, Guansong Pang. [doi]
- GENOT: Entropic (Gromov) Wasserstein Flow Matching with Applications to Single-Cell GenomicsDominik Klein, Théo Uscidda, Fabian J. Theis, Marco Cuturi. [doi]
- Ex Uno Pluria: Insights on Ensembling in Low Precision Number SystemsGiung Nam, Juho Lee 0001. [doi]
- LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPSZhiwen Fan, Kevin Wang, Kairun Wen, Zehao Zhu, Dejia Xu, Zhangyang Wang. [doi]
- TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance ControlWeichao Zeng, Yan Shu, Zhenhang Li, Dongbao Yang, Yu Zhou. [doi]
- An Analysis of Tokenization: Transformers under Markov DataNived Rajaraman, Jiantao Jiao, Kannan Ramchandran. [doi]
- PertEval: Unveiling Real Knowledge Capacity of LLMs with Knowledge-Invariant PerturbationsJiatong Li 0002, Renjun Hu, Kunzhe Huang, Yan Zhuang, Qi Liu 0003, Mengxiao Zhu 0001, Xing Shi, Wei Lin 0016. [doi]
- On Statistical Rates and Provably Efficient Criteria of Latent Diffusion Transformers (DiTs)Jerry Yao-Chieh Hu, Weimin Wu, Zhuoru Li, Sophia Pi, Zhao Song 0002, Han Liu 0001. [doi]
- On the cohesion and separability of average-link for hierarchical agglomerative clusteringEduardo Laber, Miguel Batista. [doi]
- CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMsZirui Wang, Mengzhou Xia, Luxi He, Howard Chen 0003, Yitao Liu, Richard Zhu, Kaiqu Liang, Xindi Wu, Haotian Liu, Sadhika Malladi, Alexis Chevalier, Sanjeev Arora, Danqi Chen 0001. [doi]
- Tangent Space Causal Inference: Leveraging Vector Fields for Causal Discovery in Dynamical SystemsKurt Butler, Daniel Waxman 0002, Petar M. Djuric. [doi]
- Training Data Attribution via Approximate UnrollingJuhan Bae, Wu Lin, Jonathan Lorraine, Roger B. Grosse. [doi]
- BECAUSE: Bilinear Causal Representation for Generalizable Offline Model-based Reinforcement LearningHaohong Lin, Wenhao Ding, Jian Chen, Laixi Shi, Jiacheng Zhu, Bo Li, Ding Zhao. [doi]
- Sample-Efficient Agnostic BoostingUdaya Ghai, Karan Singh. [doi]
- FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion ModelsTong Wu, Yinghao Xu, Ryan Po, Mengchen Zhang 0001, Guandao Yang, Jiaqi Wang 0003, Ziwei Liu 0002, Dahua Lin, Gordon Wetzstein. [doi]
- Confidence Calibration of Classifiers with Many ClassesAdrien Le-Coz, Stéphane Herbin, Faouzi Adjed. [doi]
- On the Necessity of Collaboration for Online Model Selection with Decentralized DataJunfan Li, Zheshun Wu, Zenglin Xu, Irwin King. [doi]
- SpecExec: Massively Parallel Speculative Decoding For Interactive LLM Inference on Consumer DevicesRuslan Svirschevski, Avner May, Zhuoming Chen, Beidi Chen, Zhihao Jia, Max Ryabinin. [doi]
- Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement LearningLanqing Li, Hai Zhang, Xinyu Zhang, Shatong Zhu, Yang Yu, Junqiao Zhao, Pheng-Ann Heng. [doi]
- Learning the Expected Core of Strictly Convex Stochastic Cooperative GamesNam Phuong Tran, The-Anh Ta, Shuqing Shi, Debmalya Mandal, Yali Du 0001, Long Tran-Thanh. [doi]
- Rethinking Out-of-Distribution Detection on Imbalanced Data DistributionKai Liu, Zhihang Fu, Sheng Jin 0002, Chao Chen, Ze Chen, Rongxin Jiang 0001, Fan Zhou 0007, Yaowu Chen, Jieping Ye. [doi]
- AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM AgentsEdoardo Debenedetti, Jie Zhang, Mislav Balunovic, Luca Beurer-Kellner, Marc Fischer 0002, Florian Tramèr. [doi]
- Learning 3D Garment Animation from Trajectories of A Piece of ClothYidi Shao, Chen Change Loy, Bo Dai 0002. [doi]
- Fight Back Against Jailbreaking via Prompt Adversarial TuningYichuan Mo, Yuji Wang, Zeming Wei, Yisen Wang 0001. [doi]
- A General Protocol to Probe Large Vision Models for 3D Physical UnderstandingGuanqi Zhan, Chuanxia Zheng, Weidi Xie, Andrew Zisserman. [doi]
- Improved Particle Approximation Error for Mean Field Neural NetworksAtsushi Nitanda. [doi]
- Speaking Your Language: Spatial Relationships in Interpretable Emergent CommunicationOlaf Lipinski, Adam J. Sobey, Federico Cerutti 0001, Timothy J. Norman. [doi]
- Is Programming by Example Solved by LLMs?Wen-Ding Li, Kevin Ellis. [doi]
- Why Go Full? Elevating Federated Learning Through Partial Network UpdatesHaolin Wang, Xuefeng Liu 0001, Jianwei Niu 0002, Wenkai Guo, Shaojie Tang 0001. [doi]
- Self-Labeling the Job Shop Scheduling ProblemAndrea Corsini, Angelo Porrello, Simone Calderara, Mauro Dell'Amico. [doi]
- ReFIR: Grounding Large Restoration Models with Retrieval AugmentationHang Guo, Tao Dai 0001, Zhihao Ouyang, Taolin Zhang 0003, Yaohua Zha, Bin Chen 0011, Shu-Tao Xia. [doi]
- Motion Graph Unleashed: A Novel Approach to Video PredictionYiqi Zhong, Luming Liang, Bohan Tang, Ilya Zharkov, Ulrich Neumann. [doi]
- A Theoretical Perspective for Speculative Decoding AlgorithmMing Yin 0003, Minshuo Chen, Kaixuan Huang, Mengdi Wang. [doi]
- ColJailBreak: Collaborative Generation and Editing for Jailbreaking Text-to-Image Deep GenerationYizhuo Ma, Shanmin Pang, Qi Guo, Tianyu Wei, Qing Guo 0005. [doi]
- Humanoid Locomotion as Next Token PredictionIlija Radosavovic, Bike Zhang, Baifeng Shi, Jathushan Rajasegaran, Sarthak Kamat, Trevor Darrell, Koushil Sreenath, Jitendra Malik. [doi]
- Preference Learning of Latent Decision Utilities with a Human-like Model of Preferential ChoiceSebastiaan De Peuter, Shibei Zhu, Yujia Guo, Andrew Howes, Samuel Kaski. [doi]
- MeMo: Meaningful, Modular Controllers via Noise InjectionMegan Tjandrasuwita, Jie Xu 0028, Armando Solar-Lezama, Wojciech Matusik. [doi]
- MALT Powers Up Adversarial AttacksOdelia Melamed, Gilad Yehudai, Adi Shamir. [doi]
- Transformer Doctor: Diagnosing and Treating Vision TransformersJiacong Hu, Hao Chen 0041, Kejia Chen 0007, Yang Gao 0001, Jingwen Ye, Xingen Wang, Mingli Song, Zunlei Feng. [doi]
- Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models and Time-Dependent Layer NormalizationQihao Liu, Zhanpeng Zeng, Ju He, Qihang Yu, Xiaohui Shen, Liang-Chieh Chen. [doi]
- Interventional Causal Discovery in a Mixture of DAGsBurak Varici, Dmitriy Katz, Dennis Wei, Prasanna Sattigeri, Ali Tajer. [doi]
- SCAFFLSA: Taming Heterogeneity in Federated Linear Stochastic Approximation and TD LearningPaul Mangold, Sergey Samsonov, Safwan Labbi, Ilya Levin, Réda Alami, Alexey Naumov, Eric Moulines. [doi]
- Mixture of Experts Meets Prompt-Based Continual LearningMinh Le, An Nguyen The, Huy Nguyen, Trang Nguyen, Trang Pham, Linh Ngo, Nhat Ho. [doi]
- SparseLLM: Towards Global Pruning of Pre-trained Language ModelsGuangji Bai, Yijiang Li, Chen Ling 0003, Kibaek Kim, Liang Zhao 0002. [doi]
- Rethinking The Training And Evaluation of Rich-Context Layout-to-Image GenerationJiaxin Cheng, Zixu Zhao, Tong He 0002, Tianjun Xiao, Zheng Zhang 0001, Yicong Zhou. [doi]
- On the Complexity of Learning Sparse Functions with Statistical and Gradient QueriesNirmit Joshi, Theodor Misiakiewicz, Nati Srebro. [doi]
- EMVP: Embracing Visual Foundation Model for Visual Place Recognition with Centroid-Free ProbingQibo Qiu, Shun Zhang, Haiming Gao, Honghui Yang, Haochao Ying, Wenxiao Wang 0001, Xiaofei He 0001. [doi]
- How to Continually Adapt Text-to-Image Diffusion Models for Flexible Customization?Jiahua Dong 0001, Wenqi Liang, Hongliu Li, Duzhen Zhang, Meng Cao, Henghui Ding, Salman H. Khan 0001, Fahad Shahbaz Khan. [doi]
- FUGAL: Feature-fortified Unrestricted Graph AlignmentAditya Bommakanti, Harshith Reddy Vonteri, Konstantinos Skitsas, Sayan Ranu, Davide Mottin, Panagiotis Karras. [doi]
- Conformal Alignment: Knowing When to Trust Foundation Models with GuaranteesYu Gui, Ying Jin, Zhimei Ren. [doi]
- Maximum Entropy Reinforcement Learning via Energy-Based Normalizing FlowChen-Hao Chao, Chien Feng, Wei-Fang Sun, Cheng-Kuang Lee, Simon See, Chun-Yi Lee. [doi]
- Leveraging Hallucinations to Reduce Manual Prompt Dependency in Promptable SegmentationJian Hu 0002, Jiayi Lin 0002, Junchi Yan, Shaogang Gong. [doi]
- Mitigating Biases in Blackbox Feature Extractors for Image Classification TasksAbhipsa Basu, Saswat Subhajyoti Mallick, R. Venkatesh Babu. [doi]
- Weak Supervision Performance Evaluation via Partial IdentificationFelipe Maia Polo, Subha Maity, Mikhail Yurochkin, Moulinath Banerjee, Yuekai Sun. [doi]
- Opponent Modeling based on Subgoal InferenceXiaopeng Yu, Jiechuan Jiang, Zongqing Lu. [doi]
- LG-VQ: Language-Guided Codebook LearningGuotao Liang, Baoquan Zhang, Yaowei Wang 0001, Yunming Ye, Xutao Li, Huaibin Wang, Chuyao Luo, Kola Ye, Linfeng Luo. [doi]
- Axioms for AI Alignment from Human FeedbackLuise Ge, Daniel Halpern 0002, Evi Micha, Ariel D. Procaccia, Itai Shapira, Yevgeniy Vorobeychik, Junlin Wu 0001. [doi]
- Reawakening knowledge: Anticipatory recovery from catastrophic interference via structured trainingYanlai Yang, Matt Jones 0001, Michael C. Mozer, Mengye Ren. [doi]
- Learning to Understand: Identifying Interactions via the Möbius TransformJustin Singh Kang, Yigit Efe Erginbas, Landon Butler, Ramtin Pedarsani, Kannan Ramchandran. [doi]
- Improving Temporal Link Prediction via Temporal Walk Matrix ProjectionXiaodong Lu, Leilei Sun, Tongyu Zhu, Weifeng Lv. [doi]
- Benchmarking Counterfactual Image GenerationThomas Melistas, Nikos Spyrou, Nefeli Gkouti, Pedro Sanchez, Athanasios Vlontzos, Yannis Panagakis, Giorgos Papanastasiou, Sotirios A. Tsaftaris. [doi]
- Imprecise Label Learning: A Unified Framework for Learning with Various Imprecise Label ConfigurationsHao Chen 0102, Ankit Shah 0001, Jindong Wang 0001, Ran Tao 0013, Yidong Wang, Xiang Li 0106, Xing Xie 0001, Masashi Sugiyama, Rita Singh, Bhiksha Raj. [doi]
- An End-To-End Graph Attention Network Hashing for Cross-Modal RetrievalHuilong Jin, Yingxue Zhang, Lei Shi 0030, Shuang Zhang 0009, Feifei Kou, Jiapeng Yang, Chuangying zhu, Jia Luo 0001. [doi]
- Distributed Least Squares in Small Space via Sketching and Bias ReductionSachin Garg, Kevin Tan, Michal Derezinski. [doi]
- Optimal Classification under Performative Distribution ShiftEdwige Cyffers, Muni Sreenivas Pydi, Jamal Atif, Olivier Cappé. [doi]
- Generalized Tensor Decomposition for Understanding Multi-Output Regression under Combinatorial ShiftsAndong Wang, Yuning Qiu, Mingyuan Bai, Zhong Jin, GuoXu Zhou, Qibin Zhao. [doi]
- Benchmarking LLMs via Uncertainty QuantificationFanghua Ye 0001, Mingming Yang, Jianhui Pang, Longyue Wang, Derek F. Wong, Emine Yilmaz, Shuming Shi 0001, Zhaopeng Tu. [doi]
- Improving Robustness of 3D Point Cloud Recognition from a Fourier PerspectiveYibo Miao, Yinpeng Dong, Jinlai Zhang, Lijia Yu, Xiao Yang, Xiao-Shan Gao. [doi]
- Calibrated Self-Rewarding Vision Language ModelsYiyang Zhou, Zhiyuan Fan, Dongjie Cheng, Sihan Yang, Zhaorun Chen, Chenhang Cui, Xiyao Wang, Yun Li, Linjun Zhang, Huaxiu Yao. [doi]
- SpeechAlign: Aligning Speech Generation to Human PreferencesDong Zhang, Zhaowei Li, Shimin Li, Xin Zhang, Pengyu Wang, Yaqian Zhou, Xipeng Qiu. [doi]
- Oracle-Efficient Differentially Private Learning with Public DataAdam Block, Mark Bun, Rathin Desai, Abhishek Shetty, Zhiwei Steven Wu. [doi]
- Learning World Models for Unconstrained Goal NavigationYuanlin Duan, Wensen Mao, He Zhu 0001. [doi]
- SpaFL: Communication-Efficient Federated Learning With Sparse Models And Low Computational OverheadMinsu Kim 0003, Walid Saad, Mérouane Debbah, Choong Seon Hong. [doi]
- Tighter Convergence Bounds for Shuffled SGD via Primal-Dual PerspectiveXufeng Cai, Cheuk Yin Lin, Jelena Diakonikolas. [doi]
- Convergence of $\text{log}(1/\epsilon)$ for Gradient-Based Algorithms in Zero-Sum Games without the Condition Number: A Smoothed AnalysisIoannis Anagnostides, Tuomas Sandholm. [doi]
- BoNBoN Alignment for Large Language Models and the Sweetness of Best-of-n SamplingLin Gui, Cristina Garbacea, Victor Veitch. [doi]
- BLoB: Bayesian Low-Rank Adaptation by Backpropagation for Large Language ModelsYibin Wang 0005, Haizhou Shi, Ligong Han, Dimitris N. Metaxas, Hao Wang 0014. [doi]
- Revisiting K-mer Profile for Effective and Scalable Genome Representation LearningAbdulkadir Çelikkanat, Andrés R. Masegosa, Thomas Nielsen. [doi]
- Training Binary Neural Networks via Gaussian Variational Inference and Low-Rank Semidefinite ProgrammingLorenzo Orecchia, Jiawei Hu, Xue He, Wang Mark, XuLei Yang, Min Wu 0008, Xue Geng. [doi]
- APEBench: A Benchmark for Autoregressive Neural Emulators of PDEsFelix Koehler, Simon Niedermayr, Rüdiger Westermann, Nils Thuerey. [doi]
- AUC Maximization under Positive Distribution ShiftAtsutoshi Kumagai, Tomoharu Iwata, Hiroshi Takahashi, Taishi Nishiyama, Yasuhiro Fujiwara. [doi]
- Learning Diffusion Priors from Observations by Expectation MaximizationFrançois Rozet, Gérôme Andry, François Lanusse, Gilles Louppe. [doi]
- Understanding Emergent Abilities of Language Models from the Loss PerspectiveZhengxiao Du, Aohan Zeng, Yuxiao Dong, Jie Tang 0001. [doi]
- ZSC-Eval: An Evaluation Toolkit and Benchmark for Multi-agent Zero-shot CoordinationXihuai Wang, Shao Zhang, Wenhao Zhang, Wentao Dong, Jingxiao Chen, Ying Wen 0001, Weinan Zhang 0001. [doi]
- MetaLA: Unified Optimal Linear Approximation to Softmax Attention MapYuhong Chou, Man Yao, Kexin Wang, Yuqi Pan, Rui-Jie Zhu 0003, Jibin Wu, Yiran Zhong, Yu Qiao, Bo Xu 0002, Guoqi Li. [doi]
- SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image ClassificationBenjamin Feuer, Jiawei Xu, Niv Cohen, Patrick Yubeaton, Govind Mittal, Chinmay Hegde. [doi]
- Revealing Distribution Discrepancy by Sampling Transfer in Unlabeled DataZhilin Zhao 0001, Longbing Cao, Xuhui Fan 0001, Wei-Shi Zheng 0001. [doi]
- Seek Commonality but Preserve Differences: Dissected Dynamics Modeling for Multi-modal Visual RLYangru Huang, Peixi Peng, Yifan Zhao 0002, Guangyao Chen, Yonghong Tian 0001. [doi]
- Sequential Decision Making with Expert Demonstrations under Unobserved HeterogeneityVahid Balazadeh Meresht, Keertana Chidambaram, Viet Nguyen, Rahul G. Krishnan, Vasilis Syrgkanis. [doi]
- No Filter: Cultural and Socioeconomic Diversity in Contrastive Vision-Language ModelsAngéline Pouget, Lucas Beyer, Emanuele Bugliarello, Xiao Wang 0038, Andreas Steiner, Xiaohua Zhai, Ibrahim M. Alabdulmohsin. [doi]
- LocCa: Visual Pretraining with Location-aware CaptionersBo Wan, Michael Tschannen, Yongqin Xian, Filip Pavetic, Ibrahim M. Alabdulmohsin, Xiao Wang 0038, André Susano Pinto, Andreas Steiner, Lucas Beyer, Xiaohua Zhai. [doi]
- Gradient-based Discrete Sampling with Automatic Cyclical SchedulingPatrick Pynadath, Riddhiman Bhattacharya, Arun Hariharan, Ruqi Zhang. [doi]
- ProTransformer: Robustify Transformers via Plug-and-Play ParadigmZhichao Hou, Weizhi Gao, Yuchen Shen, Feiyi Wang, Xiaorui Liu. [doi]
- Can Large Language Model Agents Simulate Human Trust Behavior?Chengxing Xie, Canyu Chen, Feiran Jia, Ziyu Ye, Shiyang Lai, Kai Shu, Jindong Gu, Adel Bibi, Ziniu Hu, David Jurgens, James Evans, Philip Torr 0001, Bernard Ghanem, Guohao Li 0001. [doi]
- RoleAgent: Building, Interacting, and Benchmarking High-quality Role-Playing Agents from ScriptsJiaheng Liu, Zehao Ni, Haoran Que, Tao Sun, Noah Wang, Jian Yang 0030, Jiakai Wang, Hongcheng Guo, Zhongyuan Peng, Ge Zhang, Jiayi Tian, Xingyuan Bu, Ke Xu 0001, Wenge Rong, Junran Peng, Zhaoxiang Zhang 0001. [doi]
- ProgressGym: Alignment with a Millennium of Moral ProgressTianyi Qiu, Yang Zhang, Xuchuan Huang, Jasmine Xinze Li, Jiaming Ji, Yaodong Yang 0001. [doi]
- If You Want to Be Robust, Be Wary of InitializationSofiane Ennadir, Johannes F. Lutzeyer, Michalis Vazirgiannis, El Houcine Bergou. [doi]
- Unveiling Causal Reasoning in Large Language Models: Reality or Mirage?Haoang Chi, He Li, Wenjing Yang 0002, Feng Liu 0003, Long Lan, Xiaoguang Ren, Tongliang Liu, Bo Han 0003. [doi]
- Externally Valid Policy Evaluation from Randomized Trials Using Additional Observational DataSofia Ek, Dave Zachariah. [doi]
- Learning Optimal Lattice Vector Quantizers for End-to-end Neural Image CompressionXi Zhang, Xiaolin Wu. [doi]
- PUZZLES: A Benchmark for Neural Algorithmic ReasoningBenjamin Estermann, Luca A. Lanzendörfer, Yannick Niedermayr, Roger Wattenhofer. [doi]
- Statistical Multicriteria Benchmarking via the GSD-FrontChristoph Jansen, Georg Schollmeyer, Julian Rodemann, Hannah Blocher, Thomas Augustin 0001. [doi]
- Understanding and Improving Adversarial Collaborative Filtering for Robust RecommendationKaike Zhang, Qi Cao, Yunfan Wu, Fei Sun 0001, Huawei Shen, Xueqi Cheng. [doi]
- Mission Impossible: A Statistical Perspective on Jailbreaking LLMsJingtong Su, Julia Kempe, Karen Ullrich. [doi]
- Identifiability Guarantees for Causal Disentanglement from Purely Observational DataRyan Welch, Jiaqi Zhang, Caroline Uhler. [doi]
- Infusing Synthetic Data with Real-World Patterns for Zero-Shot Material State SegmentationSagi Eppel, Jolina Li, Manuel S. Drehwald, Alán Aspuru-Guzik. [doi]
- Online Adaptation of Language Models with a Memory of Amortized ContextsJihoon Tack, Jaehyung Kim 0001, Eric Mitchell, Jinwoo Shin, Yee Whye Teh, Jonathan Richard Schwarz. [doi]
- Policy-shaped prediction: avoiding distractions in model-based reinforcement learningMiles Hutson, Isaac Kauvar, Nick Haber. [doi]
- Unleashing the Denoising Capability of Diffusion Prior for Solving Inverse ProblemsJiawei Zhang, Jiaxin Zhuang, Cheng Jin, Gen Li, Yuantao Gu. [doi]
- Classification Diffusion Models: Revitalizing Density Ratio EstimationShahar Yadin, Noam Elata, Tomer Michaeli. [doi]
- A SARS-CoV-2 Interaction Dataset and VHH Sequence Corpus for Antibody Language ModelsHirofumi Tsuruta, Hiroyuki Yamazaki, Ryota Maeda, Ryotaro Tamura, Akihiro Imura. [doi]
- Sourcerer: Sample-based Maximum Entropy Source Distribution EstimationJulius Vetter, Guy Moss, Cornelius Schröder, Richard Gao, Jakob H. Macke. [doi]
- DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model FeaturesLetian Wang, Seung Wook Kim 0001, Jiawei Yang, Cunjun Yu, Boris Ivanovic, Steven L. Waslander, Yue Wang 0036, Sanja Fidler, Marco Pavone 0001, Péter Karkus. [doi]
- Iteration Head: A Mechanistic Study of Chain-of-ThoughtVivien Cabannes, Charles Arnal, Wassim Bouaziz, Xingyu Yang, François Charton, Julia Kempe. [doi]
- VERIFIED: A Video Corpus Moment Retrieval Benchmark for Fine-Grained Video UnderstandingHoulun Chen, Xin Wang 0019, Hong Chen, Zeyang Zhang, Wei Feng, Bin Huang, Jia Jia 0001, Wenwu Zhu 0001. [doi]
- On improved Conditioning Mechanisms and Pre-training Strategies for Diffusion ModelsTariq Berrada Ifriqi, Pietro Astolfi, Melissa Hall, Reyhane Askari Hemmat, Yohann Benchetrit, Marton Havasi, Matthew J. Muckley, Karteek Alahari, Adriana Romero-Soriano, Jakob Verbeek, Michal Drozdzal. [doi]
- Neuc-MDS: Non-Euclidean Multidimensional Scaling Through Bilinear FormsChengyuan Deng, Jie Gao, Kevin Lu, Feng Luo, Hongbin Sun, Cheng Xin. [doi]
- Spiking Neural Network as Adaptive Event Stream SlicerJiahang Cao, Mingyuan Sun, Ziqing Wang, Hao Cheng, Qiang Zhang 0029, Shibo Zhou, Renjing Xu. [doi]
- Connectivity-Driven Pseudo-Labeling Makes Stronger Cross-Domain SegmentersDong Zhao, Qi Zang, Shuang Wang 0001, Nicu Sebe, Zhun Zhong. [doi]
- Fast Last-Iterate Convergence of Learning in Games Requires Forgetful AlgorithmsYang Cai 0001, Gabriele Farina, Julien Grand-Clément, Christian Kroer, Chung-wei Lee, Haipeng Luo, Weiqiang Zheng. [doi]
- FinBen: A Holistic Financial Benchmark for Large Language ModelsQianqian Xie, Weiguang Han, Zhengyu Chen, Ruoyu Xiang, Xiao Zhang, Yueru He, Mengxi Xiao, Dong Li, Yongfu Dai, Duanyu Feng, Yijing Xu, Haoqiang Kang, Ziyan Kuang, Chenhan Yuan, Kailai Yang, Zheheng Luo, Tianlin Zhang, Zhiwei Liu 0001, Guojun Xiong, Zhiyang Deng, Yuechen Jiang, Zhiyuan Yao, Haohang Li, Yangyang Yu, Gang Hu 0003, Jiajia Huang, Xiao-Yang Liu, Alejandro Lopez-Lira, Benyou Wang, Yanzhao Lai, Hao Wang, Min Peng 0002, Sophia Ananiadou, Jimin Huang. [doi]
- QUEST: Quadruple Multimodal Contrastive Learning with Constraints and Self-PenalizationQi Song, Tianxiang Gong, Shiqi Gao, Haoyi Zhou, Jianxin Li 0002. [doi]
- VLM4Bio: A Benchmark Dataset to Evaluate Pretrained Vision-Language Models for Trait Discovery from Biological ImagesM. Maruf, Arka Daw, Kazi Sajeed Mehrab, Harish Babu Manogaran, Abhilash Neog, Medha Sawhney, Mridul Khurana, James P. Balhoff, Yasin Bakis, Bahadir Altintas, Matthew J. Thompson, Elizabeth G. Campolongo, Josef C. Uyeda, Hilmar Lapp, Henry L. Bart Jr., Paula M. Mabee, Yu Su 0001, Wei-Lun Chao, Charles V. Stewart, Tanya Y. Berger-Wolf, Wasila M. Dahdul, Anuj Karpatne. [doi]
- Randomized algorithms and PAC bounds for inverse reinforcement learning in continuous spacesAngeliki Kamoutsi, Peter Schmitt-Förster, Tobias Sutter, Volkan Cevher, John Lygeros. [doi]
- Learning Distributions on Manifolds with Free-Form FlowsPeter Sorrenson, Felix Draxler, Armand Rousselot, Sander Hummerich, Ullrich Köthe. [doi]
- A Primal-Dual-Assisted Penalty Approach to Bilevel Optimization with Coupled ConstraintsLiuyuan Jiang, Quan Xiao, Victor Tenorio, Fernando Real-Rojas, Antonio G. Marques, Tianyi Chen. [doi]
- Neural Pose Representation Learning for Generating and Transferring Non-Rigid Object PosesSeungwoo Yoo, Juil Koo, Kyeongmin Yeo, Minhyuk Sung. [doi]
- Expectation Alignment: Handling Reward Misspecification in the Presence of Expectation MismatchMalek Mechergui, Sarath Sreedharan. [doi]
- NeuralFuse: Learning to Recover the Accuracy of Access-Limited Neural Network Inference in Low-Voltage RegimesHao-Lun Sun, Lei Hsiung, Nandhini Chandramoorthy, Pin-Yu Chen, Tsung-Yi Ho. [doi]
- OpenSatMap: A Fine-grained High-resolution Satellite Dataset for Large-scale Map ConstructionHongbo Zhao 0006, Lue Fan, YunTao Chen, Haochen Wang, Yuran Yang, Xiaojuan Jin, Yixin Zhang, Gaofeng Meng, Zhao-Xiang Zhang. [doi]
- Self-Consuming Generative Models with Curated Data Provably Optimize Human PreferencesDamien Ferbach, Quentin Bertrand, Avishek Joey Bose, Gauthier Gidel. [doi]
- Renovating Names in Open-Vocabulary Segmentation BenchmarksHaiwen Huang, Songyou Peng, Dan Zhang, Andreas Geiger 0001. [doi]
- DECO-Bench: Unified Benchmark for Decoupled Task-Agnostic Synthetic Data ReleaseFarzaneh Askari, Lingjuan Lyu, Vivek Sharma 0001. [doi]
- LucidAction: A Hierarchical and Multi-model Dataset for Comprehensive Action Quality AssessmentLinfeng Dong, Wei Wang, Yu Qiao, Xiao Sun. [doi]
- Fast Iterative Hard Thresholding Methods with Pruning Gradient ComputationsYasutoshi Ida, Sekitoshi Kanai, Atsutoshi Kumagai, Tomoharu Iwata, Yasuhiro Fujiwara. [doi]
- Bias Detection via SignalingYiling Chen, Tao Lin 0013, Ariel D. Procaccia, Aaditya Ramdas, Itai Shapira. [doi]
- Imitating Language via Scalable Inverse Reinforcement LearningMarkus Wulfmeier, Michael Bloesch, Nino Vieillard, Arun Ahuja, Jorg Bornschein, Sandy H. Huang, Artem Sokolov, Matt Barnes 0001, Guillaume Desjardins, Alex Bewley, Sarah Bechtle, Jost Tobias Springenberg, Nikola Momchev, Olivier Bachem, Matthieu Geist, Martin A. Riedmiller. [doi]
- SciInstruct: a Self-Reflective Instruction Annotated Dataset for Training Scientific Language ModelsDan Zhang, Ziniu Hu, Sining Zhoubian, Zhengxiao Du, Kaiyu Yang, Zihan Wang, Yisong Yue, Yuxiao Dong, Jie Tang 0001. [doi]
- Tell What You Hear From What You See - Video to Audio Generation Through TextXiulong Liu, Kun Su, Eli Shlizerman. [doi]
- Recursive Introspection: Teaching Language Model Agents How to Self-ImproveYuxiao Qu, Tianjun Zhang, Naman Garg, Aviral Kumar. [doi]
- MLLMGuard: A Multi-dimensional Safety Evaluation Suite for Multimodal Large Language ModelsTianle Gu, Zeyang Zhou, Kexin Huang, Dandan Liang, Yixu Wang, Haiquan Zhao, Yuanqi Yao, Xingge Qiao, Keqing Wang, Yujiu Yang, Yan Teng, Yu Qiao, Yingchun Wang. [doi]
- HLM-Cite: Hybrid Language Model Workflow for Text-based Scientific Citation PredictionQianyue Hao, Jingyang Fan, Fengli Xu, Jian Yuan, Yong Li 0008. [doi]
- What is my quantum computer good for? Quantum capability learning with physics-aware neural networksDaniel Hothem, Ashe Miller, Timothy Proctor. [doi]
- Doob's Lagrangian: A Sample-Efficient Variational Approach to Transition Path SamplingYuanqi Du, Michael Plainer, Rob Brekelmans, Chenru Duan, Frank Noé, Carla P. Gomes, Alán Aspuru-Guzik, Kirill Neklyudov. [doi]
- Structured flexibility in recurrent neural networks via neuromodulationJulia Costacurta, Shaunak Bhandarkar, David M. Zoltowski, Scott W. Linderman. [doi]
- Automating Dataset Updates Towards Reliable and Timely Evaluation of Large Language ModelsJiahao Ying, Yixin Cao 0002, Yushi Bai, Qianru Sun, Bo Wang, Wei Tang 0015, Zhaojun Ding, Yizhe Yang, Xuanjing Huang 0001, Shuicheng Yan. [doi]
- 2: Overcoming Few Labels in Federated Semi-Supervised LearningSeungjoo Lee, Thanh-Long V. Le, Jaemin Shin 0005, Sung-Ju Lee. [doi]
- Disentangling Interpretable Factors with Supervised Independent Subspace Principal Component AnalysisJiayu Su, David A. Knowles, Raúl Rabadán. [doi]
- Scaling White-Box Transformers for VisionJinrui Yang, Xianhang Li, Druv Pai, Yuyin Zhou, Yi Ma 0001, Yaodong Yu, Cihang Xie. [doi]
- PEACE: A Dataset of Pharmaceutical Care for Cancer Pain Analgesia Evaluation and Medication DecisionYutao Dou, Huimin Yu, Wei Li, Jingyang Li, Fei Xia, Jian Xiao. [doi]
- Sequential Probability Assignment with Contexts: Minimax Regret, Contextual Shtarkov Sums, and Contextual Normalized Maximum LikelihoodZiyi Liu, Idan Attias, Dan Roy. [doi]
- The Group Robustness is in the Details: Revisiting Finetuning under Spurious CorrelationsTyler Labonte, John C. Hill, Xinchen Zhang, Vidya Muthukumar, Abhishek Kumar 0001. [doi]
- On Differentially Private U StatisticsKamalika Chaudhuri, Po-Ling Loh, Shourya Pandey, Purnamrita Sarkar. [doi]
- Epipolar-Free 3D Gaussian Splatting for Generalizable Novel View SynthesisZhiyuan Min, Yawei Luo, Jianwen Sun, Yi Yang 0001. [doi]
- Label Noise: Ignorance Is BlissYilun Zhu, Jianxin Zhang, Aditya Gangrade, Clayton Scott. [doi]
- An Image is Worth 32 Tokens for Reconstruction and GenerationQihang Yu, Mark Weber, Xueqing Deng, Xiaohui Shen, Daniel Cremers, Liang-Chieh Chen. [doi]
- Average gradient outer product as a mechanism for deep neural collapseDaniel Beaglehole, Peter Súkeník, Marco Mondelli, Misha Belkin. [doi]
- Dueling over Dessert, Mastering the Art of Repeated Cake CuttingSimina Brânzei, MohammadTaghi Hajiaghayi, Reed Phillips, Suho Shin, Kun Wang. [doi]
- Alleviating Hallucinations in Large Vision-Language Models through Hallucination-Induced OptimizationXinyu Lyu, Beitao Chen, Lianli Gao, Hengtao Shen, Jingkuan Song. [doi]
- Teach Better or Show Smarter? On Instructions and Exemplars in Automatic Prompt OptimizationXingchen Wan, Ruoxi Sun 0002, Hootan Nakhost, Sercan Ö. Arik. [doi]
- Metric from Human: Zero-shot Monocular Metric Depth Estimation via Test-time AdaptationYizhou Zhao, Hengwei Bian, Kaihua Chen, Pengliang Ji, Liao Qu, Shao-yu Lin, Weichen Yu, Haoran Li, Hao Chen, Jun Shen 0001, Bhiksha Raj, Min Xu. [doi]
- Uncertainty-aware Fine-tuning of Segmentation Foundation ModelsKangning Liu, Brian L. Price, Jason Kuen, Yifei Fan, Zijun Wei, Luis Figueroa, Krzysztof J. Geras, Carlos Fernandez-Granda. [doi]
- Read-ME: Refactorizing LLMs as Router-Decoupled Mixture of Experts with System Co-DesignRuisi Cai, Yeonju Ro, Geon Woo Kim, Peihao Wang, Babak Ehteshami Bejnordi, Aditya Akella, Zhangyang Wang. [doi]
- MedJourney: Benchmark and Evaluation of Large Language Models over Patient Clinical JourneyXian Wu 0001, Yutian Zhao, Yunyan Zhang, Jiageng Wu, Zhihong Zhu, Yingying Zhang, Yi Ouyang, Ziheng Zhang, Huimin Wang, Zhenxi Lin, Jie Yang 0039, Shuang Zhao, Yefeng Zheng 0001. [doi]
- On the Benefits of Public Representations for Private Transfer Learning under Distribution ShiftPratiksha Thaker, Amrith Setlur, Steven Z. Wu, Virginia Smith. [doi]
- DEPrune: Depth-wise Separable Convolution Pruning for Maximizing GPU ParallelismCheonjun Park, Mincheol Park, Hyunchan Moon, Myung Kuk Yoon, Seokjin Go, Suhyun Kim, Won Woo Ro. [doi]
- Fourier-enhanced Implicit Neural Fusion Network for Multispectral and Hyperspectral Image FusionYu-Jie Liang, Zihan Cao, Shangqi Deng, Hong-Xia Dou, Liang-Jian Deng. [doi]
- Prune and Repaint: Content-Aware Image Retargeting for any RatioFeihong Shen, Chao Li, Yifeng Geng, Yongjian Deng, Hao Chen 0034. [doi]
- RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference ContentJoão Monteiro 0002, Pierre-André Noël, Étienne Marcotte, Sai Rajeswar Mudumba, Valentina Zantedeschi, David Vázquez 0001, Nicolas Chapados, Chris Pal, Perouz Taslakian. [doi]
- Advancing Tool-Augmented Large Language Models: Integrating Insights from Errors in Inference TreesSijia Chen, Yibo Wang 0005, Yi-Feng Wu, Qingguo Chen, Zhao Xu, Weihua Luo, Kaifu Zhang, Lijun Zhang 0005. [doi]
- Proportional Fairness in Clustering: A Social Choice PerspectiveLeon Kellerhals, Jannik Peters 0001. [doi]
- GS-Hider: Hiding Messages into 3D Gaussian SplattingXuanyu Zhang, Jiarui Meng, Runyi Li, Zhipei Xu, Yongbing Zhang, Jian Zhang 0018. [doi]
- How Sparse Can We Prune A Deep Network: A Fundamental Limit PerspectiveQiaozhe Zhang, Ruijie Zhang, Jun Sun 0020, Yingzhuang Liu. [doi]
- Scene Graph Disentanglement and Composition for Generalizable Complex Image GenerationYunnan Wang, Ziqiang Li, Wenyao Zhang, Zequn Zhang, Baao Xie, Xihui Liu, Wenjun Zeng, Xin Jin. [doi]
- Text-Infused Attention and Foreground-Aware Modeling for Zero-Shot Temporal Action DetectionYearang Lee, Ho Joong Kim, Seong-Whan Lee. [doi]
- SocialGPT: Prompting LLMs for Social Relation Reasoning via Greedy Segment OptimizationWanhua Li 0001, Zibin Meng, Jiawei Zhou, Donglai Wei 0001, Chuang Gan, Hanspeter Pfister. [doi]
- Nearly Minimax Optimal Regret for Multinomial Logistic BanditJoongkyu Lee, Min-hwan Oh. [doi]
- UltraEdit: Instruction-based Fine-Grained Image Editing at ScaleHaozhe Zhao, Xiaojian (Shawn) Ma, Liang Chen 0024, Shuzheng Si, Rujie Wu, Kaikai An, Peiyu Yu, Minjia Zhang, Qing Li 0003, Baobao Chang. [doi]
- Optimal Multiclass U-Calibration Error and BeyondHaipeng Luo, Spandan Senapati, Vatsal Sharan. [doi]
- Unlocking the Capabilities of Masked Generative Models for Image Synthesis via Self-GuidanceJiwan Hur, Dong-Jae Lee, Gyojin Han, Jaehyun Choi, Yunho Jeon, Junmo Kim 0002. [doi]
- DART-Eval: A Comprehensive DNA Language Model Evaluation Benchmark on Regulatory DNAAman Patel, Arpita Singhal, Austin Wang, Anusri Pampari, Maya Kasowski, Anshul Kundaje. [doi]
- Disentangling the Roles of Distinct Cell Classes with Cell-Type Dynamical SystemsAditi Jha, Diksha Gupta, Carlos D. Brody, Jonathan W. Pillow. [doi]
- S-SOS: Stochastic Sum-Of-Squares for Parametric Polynomial OptimizationLicheng Zhu, Mathias Oster, Yuehaw Khoo. [doi]
- Enhancing Preference-based Linear Bandits via Human Response TimeShen Li 0003, Yuyang Zhang, Zhaolin Ren, Claire Liang, Na Li, Julie A. Shah. [doi]
- ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language ModelsMingrui Wu, Xinyue Cai, Jiayi Ji, Jiale Li, Oucheng Huang, Gen Luo, Hao Fei 0001, Guannan Jiang, Xiaoshuai Sun, Rongrong Ji. [doi]
- LLaNA: Large Language and NeRF AssistantAndrea Amaduzzi, Pierluigi Zama Ramirez, Giuseppe Lisanti, Samuele Salti, Luigi di Stefano. [doi]
- XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAXAlexander Nikulin, Vladislav Kurenkov, Ilya Zisman, Artem Agarkov, Viacheslav Sinii, Sergey Kolesnikov. [doi]
- SpeechForensics: Audio-Visual Speech Representation Learning for Face Forgery DetectionYachao Liang, Min Yu 0001, Gang Li 0009, Jianguo Jiang, Boquan Li 0002, Feng Yu, Ning Zhang, Xiang Meng, Weiqing Huang. [doi]
- Discrete Flow MatchingItai Gat, Tal Remez, Neta Shaul, Felix Kreuk, Ricky T. Q. Chen, Gabriel Synnaeve, Yossi Adi, Yaron Lipman. [doi]
- DMNet: Self-comparison Driven Model for Subject-independent Seizure DetectionShihao Tu, Linfeng Cao, Daoze Zhang, Junru Chen, Lvbin Ma, Yin Zhang 0006, Yang Yang 0009. [doi]
- EEG2Video: Towards Decoding Dynamic Visual Perception from EEG SignalsXuan-Hao Liu, Yan-Kai Liu, Yansen Wang, Kan Ren, Hanwen Shi, Zilong Wang, Dongsheng Li, Bao-Liang Lu, Wei-Long Zheng. [doi]
- Beyond Redundancy: Information-aware Unsupervised Multiplex Graph Structure LearningZhixiang Shen, Shuo Wang, Zhao Kang 0001. [doi]
- Protected Test-Time Adaptation via Online Entropy Matching: A Betting ApproachYarin Bar, Shalev Shaer, Yaniv Romano. [doi]
- Locating What You Need: Towards Adapting Diffusion Models to OOD Concepts In-the-WildJianan Yang, Chenchao Gao, Zhiqing Xiao, Junbo Zhao, Sai Wu, Gang Chen, Haobo Wang. [doi]
- MDAgents: An Adaptive Collaboration of LLMs for Medical Decision-MakingYubin Kim 0002, Chanwoo Park, Hyewon Jeong, Yik Siu Chan, Xuhai Xu, Daniel McDuff, Hyeonhoon Lee, Marzyeh Ghassemi, Cynthia Breazeal, Hae Won Park 0001. [doi]
- Cluster-wise Graph Transformer with Dual-granularity Kernelized AttentionSiyuan Huang 0003, Yunchong Song, Jiayue Zhou, Zhouhan Lin. [doi]
- PGN: The RNN's New Successor is Effective for Long-Range Time Series ForecastingYuxin Jia, Youfang Lin, Jing Yu, Shuo Wang, Tianhao Liu, Huaiyu Wan. [doi]
- Enhancing LLM Reasoning via Vision-Augmented PromptingZiyang Xiao, Dongxiang Zhang, Xiongwei Han, Xiaojin Fu, Wing Yin Yu, Tao Zhong, Sai Wu, Yuan Wang, Jianwei Yin, Gang Chen. [doi]
- FedGMark: Certifiably Robust Watermarking for Federated Graph LearningYuxin Yang, Qiang Li, Yuan Hong, Binghui Wang. [doi]
- Latent Plan Transformer for Trajectory Abstraction: Planning as Latent Space InferenceDeqian Kong, Dehong Xu, Minglu Zhao, Bo Pang 0004, Jianwen Xie, Andrew Lizarraga, Yuhao Huang, Sirui Xie, Ying Nian Wu. [doi]
- Active Sequential Posterior Estimation for Sample-Efficient Simulation-Based InferenceSam Griesemer, Defu Cao, Zijun Cui, Carolina Osorio, Yan Liu 0002. [doi]
- The Implicit Bias of Gradient Descent on Separable Multiclass DataHrithik Ravi, Clayton Scott, Daniel Soudry, Yutong Wang 0002. [doi]
- A Benchmark Dataset for Event-Guided Human Pose Estimation and Tracking in Extreme ConditionsHoonhee Cho, Taewoo Kim 0003, Yuhwan Jeong, Kuk-Jin Yoon. [doi]
- SHMT: Self-supervised Hierarchical Makeup Transfer via Latent Diffusion ModelsZhaoyang Sun, Shengwu Xiong 0001, Yaxiong Chen, Fei Du, Weihua Chen, Fan Wang, Yi Rong. [doi]
- Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single ImageKailu Wu, Fangfu Liu, Zhihan Cai, Runjie Yan, Hanyang Wang 0003, Yating Hu, Yueqi Duan, Kaisheng Ma. [doi]
- SplitNeRF: Split Sum Approximation Neural Field for Joint Geometry, Illumination, and Material EstimationJesus Zarzar, Bernard Ghanem. [doi]
- SS3DM: Benchmarking Street-View Surface Reconstruction with a Synthetic 3D Mesh DatasetYubin Hu 0001, Kairui Wen, Heng Zhou, Xiaoyang Guo, Yong-Jin Liu 0001. [doi]
- Interactive Deep Clustering via Value MiningHonglin Liu, Peng Hu, Changqing Zhang, Yunfan Li, Xi Peng. [doi]
- Automated Multi-level Preference for MLLMsMengxi Zhang, Wenhao Wu, Yu Lu, Yuxin Song, Kang Rong, Huanjin Yao, Jianbo Zhao, Fanglong Liu, Haocheng Feng, Jingdong Wang 0001, Yifan Sun 0003. [doi]
- Learning on Large Graphs using Intersecting CommunitiesBen Finkelshtein, Ismail Ilkan Ceylan, Michael M. Bronstein, Ron Levie. [doi]
- Unified Covariate Adjustment for Causal InferenceYonghan Jung, Jin Tian 0001, Elias Bareinboim. [doi]
- Wasserstein Distance Rivals Kullback-Leibler Divergence for Knowledge DistillationJiaming Lv, Haoyuan Yang, Peihua Li. [doi]
- Learning diffusion at lightspeedAntonio Terpin, Nicolas Lanzetti, Martín Gadea, Florian Dörfler. [doi]
- Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent ModelingJiatao Gu, Ying Shen 0006, Shuangfei Zhai, Yizhe Zhang 0002, Navdeep Jaitly, Joshua M. Susskind. [doi]
- GaussianCube: A Structured and Explicit Radiance Representation for 3D Generative ModelingBowen Zhang, Yiji Cheng, Jiaolong Yang, Chunyu Wang, Feng Zhao 0004, Yansong Tang, Dong Chen 0003, Baining Guo. [doi]
- MVSDet: Multi-View Indoor 3D Object Detection via Efficient Plane SweepsYating Xu, Chen Li, Gim Hee Lee. [doi]
- Separate and Reconstruct: Asymmetric Encoder-Decoder for Speech SeparationUi-Hyeop Shin, Sangyoun Lee, Taehan Kim, Hyung-Min Park. [doi]
- Segment Anything without SupervisionXudong Wang 0007, Jingfeng Yang, Trevor Darrell. [doi]
- TaskBench: Benchmarking Large Language Models for Task AutomationYongliang Shen 0001, Kaitao Song, Xu Tan 0003, Wenqi Zhang, Kan Ren, Siyu Yuan, Weiming Lu 0001, Dongsheng Li 0002, Yueting Zhuang. [doi]
- FouRA: Fourier Low-Rank AdaptationShubhankar Borse, Shreya Kadambi, Nilesh Prasad Pandey, Kartikeya Bhardwaj, Viswanath Ganapathy, Sweta Priyadarshi, Risheek Garrepalli, Rafael Esteves 0002, Munawar Hayat, Fatih Porikli. [doi]
- HardCore Generation: Generating Hard UNSAT Problems for Data AugmentationJoseph Cotnareanu, Zhanguang Zhang, Hui-Ling Zhen, Yingxue Zhang 0001, Mark Coates. [doi]
- How does PDE order affect the convergence of PINNs?Changhoon Song, Yesom Park, Myungjoo Kang. [doi]
- Benchmarking PtO and PnO Methods in the Predictive Combinatorial Optimization RegimeHaoyu Geng, Hang Ruan, Runzhong Wang, Yang Li, Yang Wang, Lei Chen, Junchi Yan. [doi]
- Neural Combinatorial Optimization for Robust Routing Problem with Uncertain Travel TimesPei Xiao, Zizhen Zhang, Jinbiao Chen, Jiahai Wang, Zhenzhen Zhang. [doi]
- Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem SolvingAniket Didolkar, Anirudh Goyal, Nan Rosemary Ke, Siyuan Guo, Michal Valko, Timothy P. Lillicrap, Danilo Jimenez Rezende, Yoshua Bengio, Michael C. Mozer, Sanjeev Arora. [doi]
- MC-DiT: Contextual Enhancement via Clean-to-Clean Reconstruction for Masked Diffusion ModelsGuanghao Zheng, Yuchen Liu 0006, Wenrui Dai, Chenglin Li, Junni Zou, Hongkai Xiong. [doi]
- Fine-grained Analysis of In-context Linear Estimation: Data, Architecture, and BeyondYingcong Li, Ankit Singh Rawat, Samet Oymak. [doi]
- Text-DiFuse: An Interactive Multi-Modal Image Fusion Framework based on Text-modulated Diffusion ModelHao Zhang 0073, Lei Cao, Jiayi Ma 0001. [doi]
- SciFIBench: Benchmarking Large Multimodal Models for Scientific Figure InterpretationJonathan Roberts 0004, Kai Han 0001, Neil Houlsby, Samuel Albanie. [doi]
- RG-SAN: Rule-Guided Spatial Awareness Network for End-to-End 3D Referring Expression SegmentationChangli Wu, Qi Chen, Jiayi Ji, Haowei Wang 0001, Yiwei Ma, You Huang, Gen Luo, Hao Fei 0001, Xiaoshuai Sun, Rongrong Ji. [doi]
- Opponent Modeling with In-context SearchYuheng Jing, Bingyun Liu, Kai Li, Yifan Zang 0001, Haobo Fu, Qiang Fu, Junliang Xing, Jian Cheng. [doi]
- Multi-Label Learning with Stronger Consistency GuaranteesAnqi Mao, Mehryar Mohri, Yutao Zhong 0002. [doi]
- WikiDBs: A Large-Scale Corpus Of Relational Databases From WikidataLiane Vogel, Jan-Micha Bodensohn, Carsten Binnig. [doi]
- Intruding with Words: Towards Understanding Graph Injection Attacks at the Text LevelRunlin Lei, Yuwei Hu, Yuchen Ren, Zhewei Wei. [doi]
- Deep Equilibrium Algorithmic ReasoningDobrik Georgiev, Joseph Wilson, Davide Buffelli, Pietro Lió. [doi]
- Learning Discrete Latent Variable Structures with Tensor Rank ConditionsZhengming Chen, Ruichu Cai, Feng Xie 0002, Jie Qiao, Anpeng Wu, Zijian Li 0001, Zhifeng Hao, Kun Zhang 0001. [doi]
- Training an Open-Vocabulary Monocular 3D Detection Model without 3D DataRui Huang, Henry Zheng, Yan Wang, Zhuofan Xia, Marco Pavone 0001, Gao Huang 0001. [doi]
- VLG-CBM: Training Concept Bottleneck Models with Vision-Language GuidanceDivyansh Srivastava, Ge Yan, Lily Weng. [doi]
- Freya PAGE: First Optimal Time Complexity for Large-Scale Nonconvex Finite-Sum Optimization with Heterogeneous Asynchronous ComputationsAlexander Tyurin, Kaja Gruntkowska, Peter Richtárik. [doi]
- Multi-Stage Predict+Optimize for (Mixed Integer) Linear ProgramsXinyi Hu, Jasper C. H. Lee, Jimmy H. M. Lee, Peter J. Stuckey. [doi]
- Mobility-LLM: Learning Visiting Intentions and Travel Preference from Human Mobility Data with Large Language ModelsLetian Gong, Yan Lin 0006, Xinyue Zhang, Yiwen Lu, Xuedi Han, Yichen Liu 0003, Shengnan Guo 0001, Youfang Lin, Huaiyu Wan. [doi]
- Latent Representation Matters: Human-like Sketches in One-shot Drawing TasksVictor Boutin, Rishav Mukherji, Aditya Agrawal, Sabine Muzellec, Thomas Fel, Thomas Serre, Rufin VanRullen. [doi]
- Mars: Situated Inductive Reasoning in an Open-World EnvironmentXiaojuan Tang, Jiaqi Li, Yitao Liang, Song Chun Zhu, Muhan Zhang, Zilong Zheng. [doi]
- Principled Probabilistic Imaging using Diffusion Models as Plug-and-Play PriorsZihui Wu, Yu Sun, Yifan Chen, Bingliang Zhang, Yisong Yue, Katherine L. Bouman. [doi]
- ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language ModelsYuzhe Gu, Ziwei Ji, Wenwei Zhang, Chengqi Lyu, Dahua Lin, Kai Chen 0026. [doi]
- FEDMEKI: A Benchmark for Scaling Medical Foundation Models via Federated Knowledge InjectionJiaqi Wang 0002, Xiaochen Wang 0002, Lingjuan Lyu, Jinghui Chen, Fenglong Ma. [doi]
- Causal Temporal Representation Learning with Nonstationary Sparse TransitionXiangchen Song, Zijian Li 0001, Guangyi Chen 0002, Yujia Zheng 0001, Yewen Fan, Xinshuai Dong, Kun Zhang 0001. [doi]
- Unified Graph Augmentations for Generalized Contrastive Learning on GraphsJiaming Zhuo, Yintong Lu, Hui Ning, Kun Fu, Bingxin Niu, Dongxiao He, Chuan Wang 0002, Yuanfang Guo, Zhen Wang 0004, Xiaochun Cao, Liang Yang 0002. [doi]
- DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effective for LMMsLingchen Meng, Jianwei Yang, Rui Tian, Xiyang Dai, Zuxuan Wu, Jianfeng Gao 0001, Yu-Gang Jiang 0001. [doi]
- Learning from Offline Foundation Features with Tensor AugmentationsEmir Konuk, Christos Matsoukas, Moein Sorkhei, Phitchapha Lertsiravarameth, Kevin Smith 0001. [doi]
- Addressing Hidden Confounding with Heterogeneous Observational Datasets for RecommendationYanghao Xiao, Haoxuan Li, Yongqiang Tang, Wensheng Zhang 0002. [doi]
- A Neuro-Symbolic Benchmark Suite for Concept Quality and Reasoning ShortcutsSamuele Bortolotti, Emanuele Marconato, Tommaso Carraro, Paolo Morettin, Emile van Krieken, Antonio Vergari, Stefano Teso, Andrea Passerini. [doi]
- Overcoming Common Flaws in the Evaluation of Selective Classification SystemsJeremias Traub, Till J. Bungert, Carsten T. Lüth, Michael Baumgartner 0001, Klaus H. Maier-Hein, Lena Maier-Hein, Paul F. Jaeger. [doi]
- Adversarially Robust Dense-Sparse Tradeoffs via Heavy-HittersDavid P. Woodruff, Samson Zhou. [doi]
- Tetrahedron Splatting for 3D GenerationChun Gu, Zeyu Yang, Zijie Pan, Xiatian Zhu, Li Zhang. [doi]
- Preference-based Pure ExplorationApurv Shukla 0001, Debabrota Basu. [doi]
- TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent CollaborationYiwei Guo, Shaobin Zhuang, Kunchang Li 0002, Yu Qiao 0001, Yali Wang 0001. [doi]
- Enhancing Robustness of Graph Neural Networks on Social Media with Explainable Inverse Reinforcement LearningYuefei Lyu, Chaozhuo Li 0001, Sihong Xie, Xi Zhang 0008. [doi]
- Light Unbalanced Optimal TransportMilena Gazdieva, Arip Asadulaev, Evgeny Burnaev, Aleksandr Korotin. [doi]
- Locally Private and Robust Multi-Armed BanditsXingyu Zhou, Komo (Wei) Zhang. [doi]
- Autoregressive Policy Optimization for Constrained Allocation TasksDavid Winkel, Niklas Strauß, Maximilian Bernhard, Zongyue Li, Thomas Seidl 0001, Matthias Schubert. [doi]
- A Siamese Transformer with Hierarchical Refinement for Lane DetectionZinan Lv, Dong Han, Wenzhe Wang, Danny Z. Chen. [doi]
- Towards Visual Text Design Transfer Across LanguagesYejin Choi 0001, Jiwan Chung, Sumin Shim, Giyeong Oh, Youngjae Yu. [doi]
- Provable Editing of Deep Neural Networks using Parametric Linear RelaxationZhe Tao, Aditya V. Thakur. [doi]
- Out-Of-Distribution Detection with Diversification (Provably)Haiyun Yao, Zongbo Han, Huazhu Fu, Xi Peng 0001, Qinghua Hu, Changqing Zhang. [doi]
- A Systematic Review of NeurIPS Dataset Management PracticesYiwei Wu, Leah Ajmani, Shayne Longpre, Hanlin Li. [doi]
- 3D Equivariant Pose Regression via Direct Wigner-D Harmonics PredictionJongmin Lee, Minsu Cho. [doi]
- Unified Gradient-Based Machine Unlearning with Remain Geometry EnhancementZhehao Huang, Xinwen Cheng, Jinghao Zheng, Haoran Wang, Zhengbao He, Tao Li, Xiaolin Huang. [doi]
- Mean-Field Analysis for Learning Subspace-Sparse Polynomials with Gaussian InputZiang Chen, Rong Ge. [doi]
- Language Grounded Multi-agent Reinforcement Learning with Human-interpretable CommunicationHuao Li, Hossein Nourkhiz Mahjoub, Behdad Chalaki, Vaishnav Tadiparthi, Kwonjoon Lee, Ehsan Moradi-Pari, Charles Lewis, Katia P. Sycara. [doi]
- SubjECTive-QA: Measuring Subjectivity in Earnings Call Transcripts' QA Through Six-Dimensional Feature AnalysisHuzaifa Pardawala, Siddhant Sukhani, Agam Shah, Veer Kejriwal, Abhishek Pillai, Rohan Bhasin, Andrew DiBiasio, Tarun Mandapati, Dhruv Adha, Sudheer Chava. [doi]
- N-agent Ad Hoc TeamworkCaroline Wang, Arrasy Rahman, Ishan Durugkar, Elad Liebman, Peter Stone 0001. [doi]
- EGonc : Energy-based Open-Set Node Classification with substitute UnknownsQin Zhang, Zelin Shi, Shirui Pan, Junyang Chen 0001, Huisi Wu, Xiaojun Chen 0006. [doi]
- Autonomous Driving with Spiking Neural NetworksRuijie Zhu 0003, Ziqing Wang, Leilani Gilpin, Jason Eshraghian. [doi]
- On Affine Homotopy between Language EncodersRobin Chan, Reda Boumasmoud, Anej Svete, Yuxin Ren, Qipeng Guo, Zhijing Jin 0001, Shauli Ravfogel, Mrinmaya Sachan, Bernhard Schölkopf, Mennatallah El-Assady, Ryan Cotterell. [doi]
- Can Large Language Models Analyze Graphs like Professionals? A Benchmark, Datasets and ModelsXin Li, Weize Chen, Qizhi Chu, Haopeng Li, Zhaojun Sun, Ran Li, Chen Qian, Yiwei Wei, Chuan Shi 0001, Zhiyuan Liu 0001, Maosong Sun 0001, Cheng Yang. [doi]
- D-CPT Law: Domain-specific Continual Pre-Training Scaling Law for Large Language ModelsHaoran Que, Jiaheng Liu, Ge Zhang, Chenchen Zhang, Xingwei Qu, Yinghao Ma, Feiyu Duan, Zhiqi Bai, Jiakai Wang, Yuanxing Zhang, Xu Tan 0003, Jie Fu 0001, Jiamang Wang, Lin Qu, Wenbo Su, Bo Zheng 0007. [doi]
- Learning Linear Causal Representations from General Environments: Identifiability and Intrinsic AmbiguityJikai Jin, Vasilis Syrgkanis. [doi]
- Logarithmic Smoothing for Pessimistic Off-Policy Evaluation, Selection and LearningOtmane Sakhi, Imad Aouali, Pierre Alquier, Nicolas Chopin. [doi]
- Warped Diffusion: Solving Video Inverse Problems with Image Diffusion ModelsGiannis Daras, Weili Nie, Karsten Kreis, Alex Dimakis, Morteza Mardani, Nikola B. Kovachki, Arash Vahdat. [doi]
- The Fine-Grained Complexity of Gradient Computation for Training Large Language ModelsJosh Alman, Zhao Song 0002. [doi]
- Stable Minima Cannot Overfit in Univariate ReLU Networks: Generalization by Large Step SizesDan Qiao 0002, Kaiqi Zhang 0002, Esha Singh, Daniel Soudry, Yu-Xiang Wang 0003. [doi]
- Linguistic Collapse: Neural Collapse in (Large) Language ModelsRobert Wu, Vardan Papyan. [doi]
- The Unmet Promise of Synthetic Training Images: Using Retrieved Real Images Performs BetterScott Geng, Cheng-Yu Hsieh, Vivek Ramanujan, Matthew Wallingford, Chun-Liang Li, Pang Wei Koh, Ranjay Krishna. [doi]
- Euclidean distance compression via deep random featuresBrett Leroux, Luis Rademacher. [doi]
- Online Non-convex Learning in Dynamic EnvironmentsZhipan Xu, Lijun Zhang. [doi]
- Pessimistic Backward Policy for GFlowNetsHyosoon Jang, Yunhui Jang, Minsu Kim, Jinkyoo Park, Sungsoo Ahn. [doi]
- GeoNLF: Geometry guided Pose-Free Neural LiDAR FieldsWeiyi Xue, Zehan Zheng, Fan Lu 0001, Haiyun Wei, Guang Chen 0001, Changjun Jiang. [doi]
- When are dynamical systems learned from time series data statistically accurate?Jeongjin Park, Nicole Yang, Nisha Chandramoorthy. [doi]
- Detecting Bugs with Substantial Monetary Consequences by LLM and Rule-based ReasoningBrian Zhang, Zhuo Zhang 0002. [doi]
- Invisible Image Watermarks Are Provably Removable Using Generative AIXuandong Zhao, Kexun Zhang, Zihao Su, Saastha Vasan, Ilya Grishchenko, Christopher Kruegel, Giovanni Vigna, Yu-Xiang Wang 0003, Lei Li 0005. [doi]
- Event-3DGS: Event-based 3D Reconstruction Using 3D Gaussian SplattingHaiqian Han, Jianing Li 0001, Henglu Wei, Xiangyang Ji. [doi]
- MoE Jetpack: From Dense Checkpoints to Adaptive Mixture of Experts for Vision TasksXingkui Zhu, Yiran Guan, Dingkang Liang, Yuchao Chen, Yuliang Liu, Xiang Bai. [doi]
- On the Power of Decision Trees in Auto-Regressive Language ModelingYulu Gan, Tomer Galanti, Tomaso A. Poggio, Eran Malach. [doi]
- On Weak Regret Analysis for Dueling BanditsEl Mehdi Saad, Alexandra Carpentier, Tomás Kocák, Nicolas Verzelen. [doi]
- OpenGaussian: Towards Point-Level 3D Gaussian-based Open Vocabulary UnderstandingYanmin Wu, Jiarui Meng, Haijie Li, Chenming Wu, Yahao Shi, Xinhua Cheng, Chen Zhao 0011, Haocheng Feng, Errui Ding, Jingdong Wang 0001, Jian Zhang 0018. [doi]
- AP-Adapter: Improving Generalization of Automatic Prompts on Unseen Text-to-Image Diffusion ModelsYuchen Fu, Zhiwei Jiang, Yuliang Liu, Cong Wang, Zexuan Deng, Zhaoling Chen, Qing Gu. [doi]
- ProvNeRF: Modeling per Point Provenance in NeRFs as a Stochastic FieldKiyohiro Nakayama, Mikaela Angelina Uy, Yang You 0004, Ke Li 0011, Leonidas J. Guibas. [doi]
- Pseudo-Siamese Blind-spot Transformers for Self-Supervised Real-World DenoisingYuhui Quan, Tianxiang Zheng, Hui Ji. [doi]
- Continual Learning with Global AlignmentXueying Bai, Jinghuan Shang, Yifan Sun, Niranjan Balasubramanian. [doi]
- Personalizing Reinforcement Learning from Human Feedback with Variational Preference LearningSriyash Poddar, Yanming Wan, Hamish Ivison, Abhishek Gupta, Natasha Jaques. [doi]
- ComBack: A Versatile Dataset for Enhancing Compiler Backend Development EfficiencyMing Zhong, Fang Lyu, Lulin Wang, Hongna Geng, Lei Qiu, Huimin Cui, Xiaobing Feng. [doi]
- How to Use Diffusion Priors under Sparse Views?Qisen Wang, Yifan Zhao, Jiawei Ma, Jia Li. [doi]
- Stepping Forward on the Last MileChen Feng, Jay Zhuo, Parker Zhang, Ramchalam Kinattinkara Ramakrishnan, Zhaocong Yuan, Andrew Zou Li. [doi]
- Brain-JEPA: Brain Dynamics Foundation Model with Gradient Positioning and Spatiotemporal MaskingZijian Dong, Ruilin Li, Yilei Wu, Thuan Tinh Nguyen, Joanna Su Xian Chong, Fang Ji, Nathanael Ren Jie Tong, Christopher Chen, Juan Helen Zhou. [doi]
- Enhancing Semi-Supervised Learning via Representative and Diverse Sample SelectionQian Shao, Jiangrui Kang, Qiyuan Chen 0003, Zepeng Li, Hongxia Xu, Yiwen Cao, Jiajuan Liang, Jian Wu 0001. [doi]
- From Linear to Linearizable Optimization: A Novel Framework with Applications to Stationary and Non-stationary DR-submodular OptimizationMohammad Pedramfar, Vaneet Aggarwal. [doi]
- DINTR: Tracking via Diffusion-based InterpolationPha A. Nguyen, Ngan Le, Jackson David Cothren, Alper Yilmaz, Khoa Luu. [doi]
- Learning from Snapshots of Discrete and Continuous Data StreamsPramith Devulapalli, Steve Hanneke. [doi]
- HourVideo: 1-Hour Video-Language UnderstandingKeshigeyan Chandrasegaran, Agrim Gupta, Lea M. Hadzic, Taran Kota, Jimming He, Cristóbal Eyzaguirre, Zane Durante, Manling Li, Jiajun Wu 0001, Li Fei-Fei 0001. [doi]
- Provably Optimal Memory Capacity for Modern Hopfield Models: Transformer-Compatible Dense Associative Memories as Spherical CodesJerry Yao-Chieh Hu, Dennis Wu, Han Liu 0001. [doi]
- MultiPull: Detailing Signed Distance Functions by Pulling Multi-Level Queries at Multi-StepTakeshi Noda, Chao Chen, Weiqi Zhang, Xinhai Liu, Yu-Shen Liu, Zhizhong Han. [doi]
- Private and Personalized Frequency Estimation in a Federated SettingAmrith Setlur, Vitaly Feldman, Kunal Talwar. [doi]
- Oracle-Efficient Reinforcement Learning for Max Value EnsemblesMarcel Hussing, Michael Kearns, Aaron Roth 0001, Sikata Bela Sengupta, Jessica Sorrell. [doi]
- Toward Global Convergence of Gradient EM for Over-Paramterized Gaussian Mixture ModelsWeihang Xu, Maryam Fazel, Simon S. Du. [doi]
- Score-Optimal Diffusion SchedulesChristopher Williams, Andrew Campbell, Arnaud Doucet, Saifuddin Syed. [doi]
- Multidimensional Fractional Programming for Normalized CutsYannan Chen, Beichen Huang, Licheng Zhao, Kaiming Shen. [doi]
- Easy2Hard-Bench: Standardized Difficulty Labels for Profiling LLM Performance and GeneralizationMucong Ding, Chenghao Deng, Jocelyn Choo, Zichu Wu, Aakriti Agrawal, Avi Schwarzschild, Tianyi Zhou 0001, Tom Goldstein, John Langford 0001, Animashree Anandkumar, Furong Huang. [doi]
- Learning to Solve Quadratic Unconstrained Binary Optimization in a Classification WayMing Chen, Jie Chun, Shang Xiang, Luona Wei, Yonghao Du, Qian Wan, Yuning Chen, Yingwu Chen. [doi]
- Rethinking Parity Check Enhanced Symmetry-Preserving AnsatzGe Yan 0001, Mengfei Ran, Ruocheng Wang, Kaisen Pan, Junchi Yan. [doi]
- Bias Amplification in Language Model Evolution: An Iterated Learning PerspectiveYi Ren, Shangmin Guo, Linlu Qiu, Bailin Wang, Danica J. Sutherland. [doi]
- xLSTM: Extended Long Short-Term MemoryMaximilian Beck, Korbinian Pöppel, Markus Spanring, Andreas Auer, Oleksandra Prudnikova, Michael Kopp 0001, Günter Klambauer, Johannes Brandstetter, Sepp Hochreiter. [doi]
- The Power of Resets in Online Reinforcement LearningZakaria Mhammedi, Dylan J. Foster, Alexander Rakhlin. [doi]
- One for All: Multi-Domain Joint Training for Point Cloud Based 3D Object DetectionZhenyu Wang, Yali Li, Hengshuang Zhao, Shengjin Wang. [doi]
- Efficiency of the First-Price Auction in the Autobidding WorldYuan Deng, Jieming Mao, Vahab Mirrokni, Hanrui Zhang 0001, Song Zuo. [doi]
- Distributional Preference Alignment of LLMs via Optimal TransportIgor Melnyk, Youssef Mroueh, Brian Belgodere, Mattia Rigotti, Apoorva Nitsure, Mikhail Yurochkin, Kristjan H. Greenewald, Jirí Navrátil 0001, Jarret Ross. [doi]
- Acceleration Exists! Optimization Problems When Oracle Can Only Compare Objective Function ValuesAleksandr V. Lobanov, Alexander V. Gasnikov, Andrey Krasnov. [doi]
- Perception of Knowledge Boundary for Large Language Models through Semi-open-ended Question AnsweringZhihua Wen, Zhiliang Tian, Zexin Jian, Zhen Huang 0006, Pei Ke, Yifu Gao, Minlie Huang, Dongsheng Li. [doi]
- Partial observation can induce mechanistic mismatches in data-constrained models of neural dynamicsWilliam Qian, Jacob A. Zavatone-Veth, Benjamin S. Ruben, Cengiz Pehlevan. [doi]
- DiffusionBlend: Learning 3D Image Prior through Position-aware Diffusion Score Blending for 3D Computed Tomography ReconstructionBowen Song, Jason Hu, Zhaoxu Luo, Jeffrey A. Fessler, Liyue Shen. [doi]
- Bifröst: 3D-Aware Image Compositing with Language InstructionsLingxiao Li, Kaixiong Gong, Weihong Li, Xili Dai, Tao Chen 0003, Xiaojun Yuan, Xiangyu Yue 0001. [doi]
- HOI-Swap: Swapping Objects in Videos with Hand-Object Interaction AwarenessZihui Xue, Romy Luo, Changan Chen, Kristen Grauman. [doi]
- Batched Energy-Entropy acquisition for Bayesian OptimizationFelix Teufel, Carsten Stahlhut, Jesper Ferkinghoff-Borg. [doi]
- dattri: A Library for Efficient Data AttributionJunwei Deng, Ting-Wei Li, Shiyuan Zhang, Shixuan Liu, Yijun Pan, Hao Huang, Xinhe Wang, Pingbang Hu, Xingjian Zhang, Jiaqi W. Ma. [doi]
- BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-HaystackYuri Kuratov, Aydar Bulatov, Petr Anokhin, Ivan Rodkin, Dmitry Sorokin, Artyom Y. Sorokin, Mikhail Burtsev 0001. [doi]
- Physics-Informed Variational State-Space Gaussian ProcessesOliver Hamelijnck, Arno Solin, Theodoros Damoulas. [doi]
- Optimal Multi-Fidelity Best-Arm IdentificationRiccardo Poiani, Rémy Degenne, Emilie Kaufmann, Alberto Maria Metelli, Marcello Restelli. [doi]
- On the Surprising Effectiveness of Attention Transfer for Vision TransformersAlexander C. Li, Yuandong Tian, Beidi Chen, Deepak Pathak, Xinlei Chen. [doi]
- Fine-grained Control of Generative Data Augmentation in IoT SensingTianshi Wang, Qikai Yang, Ruijie Wang 0004, Dachun Sun, Jinyang Li 0004, Yizhuo Chen, Yigong Hu, Chaoqi Yang, Tomoyoshi Kimura, Denizhan Kara, Tarek F. Abdelzaher. [doi]
- Large Language Model UnlearningYuanshun Yao, Xiaojun Xu, Yang Liu. [doi]
- One-shot Federated Learning via Synthetic Distiller-Distillate CommunicationJunyuan Zhang, Songhua Liu, Xinchao Wang. [doi]
- Preference Alignment with Flow MatchingMinu Kim, Yongsik Lee, Sehyeok Kang, Jihwan Oh, Song Chong, Se-Young Yun. [doi]
- Inverse Factorized Soft Q-Learning for Cooperative Multi-agent Imitation LearningThe Viet Bui, Tien Mai, Thanh Hong Nguyen. [doi]
- JourneyBench: A Challenging One-Stop Vision-Language Understanding Benchmark of Generated ImagesZhecan Wang, Junzhang Liu, Chia-Wei Tang, Hani AlOmari, Anushka Sivakumar, Rui Sun, Wenhao Li, Md. Atabuzzaman, Hammad A. Ayyubi, Haoxuan You, Alvi Md. Ishmam, Kai-Wei Chang, Shih-Fu Chang, Christopher Thomas 0004. [doi]
- CaptainCook4D: A Dataset for Understanding Errors in Procedural ActivitiesRohith Peddi, Shivvrat Arya, Bharath Challa, Likhitha Pallapothula, Akshay Vyas, Bhavya Gouripeddi, Qifan Zhang, Jikai Wang, Vasundhara Komaragiri, Eric D. Ragan, Nicholas Ruozzi, Yu Xiang 0001, Vibhav Gogate. [doi]
- MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space ModelsZunnan Xu, Yukang Lin, Haonan Han, Sicheng Yang, Ronghui Li, Yachao Zhang 0001, Xiu Li 0001. [doi]
- One Sample Fits All: Approximating All Probabilistic Values Simultaneously and EfficientlyWeida Li, Yaoliang Yu. [doi]
- SMART: Scalable Multi-agent Real-time Motion Generation via Next-token PredictionWei Wu 0021, Xiaoxin Feng, Ziyan Gao, Yuheng Kan. [doi]
- Multi-times Monte Carlo Rendering for Inter-reflection ReconstructionTengjie Zhu, Zhuo Chen, Jingnan Gao, Yichao Yan, Xiaokang Yang. [doi]
- RAGraph: A General Retrieval-Augmented Graph Learning FrameworkXinke Jiang, Rihong Qiu, Yongxin Xu, Wentao Zhang, Yichen Zhu, Ruizhe Zhang 0013, Yuchen Fang 0001, Chu Xu, Junfeng Zhao 0001, Yasha Wang. [doi]
- Causal language modeling can elicit search and reasoning capabilities on logic puzzlesKulin Shah, Nishanth Dikkala, Xin Wang, Rina Panigrahy. [doi]
- BenchX: A Unified Benchmark Framework for Medical Vision-Language Pretraining on Chest X-RaysYang Zhou, Tan Li Hui Faith, Yanyu Xu, Sicong Leng, Xinxing Xu, Yong Liu 0026, Rick Siow Mong Goh. [doi]
- Fairness without Harm: An Influence-Guided Active Sampling ApproachJinlong Pang, Jialu Wang, Zhaowei Zhu, Yuanshun Yao, Chen Qian, Yang Liu 0018. [doi]
- What do Graph Neural Networks learn? Insights from Tropical GeometryTuan Anh Pham, Vikas Garg 0001. [doi]
- FASTopic: Pretrained Transformer is a Fast, Adaptive, Stable, and Transferable Topic ModelXiaobao Wu, Thong Nguyen 0003, Delvin Zhang, William Yang Wang, Anh Tuan Luu. [doi]
- Segment, Shuffle, and Stitch: A Simple Layer for Improving Time-Series RepresentationsShivam Grover, Amin Jalali, Ali Etemad. [doi]
- CiteME: Can Language Models Accurately Cite Scientific Claims?Ori Press, Andreas Hochlehnert, Ameya Prabhu, Vishaal Udandarao, Ofir Press, Matthias Bethge. [doi]
- BitDelta: Your Fine-Tune May Only Be Worth One BitJames Liu, Guangxuan Xiao, Kai Li, Jason D. Lee, Song Han 0003, Tri Dao, Tianle Cai. [doi]
- The Empirical Impact of Neural Parameter Symmetries, or Lack ThereofDerek Lim, Theo Putterman, Robin Walters 0001, Haggai Maron, Stefanie Jegelka. [doi]
- AnyFit: Controllable Virtual Try-on for Any Combination of Attire Across Any ScenarioYuhan Li 0003, Hao Zhou, Wenxiang Shang, Ran Lin, Xuanhong Chen, Bingbing Ni. [doi]
- Sparse High Rank AdaptersKartikeya Bhardwaj, Nilesh Prasad Pandey, Sweta Priyadarshi, Viswanath Ganapathy, Shreya Kadambi, Rafael Esteves 0002, Shubhankar Borse, Paul N. Whatmough, Risheek Garrepalli, Mart van Baalen, Harris Teague, Markus Nagel. [doi]
- Discretely beyond 1/e: Guided Combinatorial Algortihms for Submodular MaximizationYixin Chen, Ankur Nath, Chunli Peng, Alan Kuhnle. [doi]
- Not Just Object, But State: Compositional Incremental Learning without ForgettingYanyi Zhang, Binglin Qiu, Qi Jia, Yu Liu, Ran He. [doi]
- Multi-scale Consistency for Robust 3D Registration via Hierarchical Sinkhorn TreeChengwei Ren, Yifan Feng, Weixiang Zhang, Xiao-Ping (Steven) Zhang, Yue Gao. [doi]
- The Map Equation Goes Neural: Mapping Network Flows with Graph Neural NetworksChristopher Blöcker, Chester Tan, Ingo Scholtes. [doi]
- Near-Optimal Distributionally Robust Reinforcement Learning with General $L_p$ NormsPierre Clavier, Laixi Shi, Erwan Le Pennec, Eric Mazumdar, Adam Wierman, Matthieu Geist. [doi]
- Stochastic Optimization Algorithms for Instrumental Variable Regression with Streaming DataXuxing Chen, Abhishek Roy 0005, Yifan Hu, Krishnakumar Balasubramanian 0002. [doi]
- FindingEmo: An Image Dataset for Emotion Recognition in the WildLaurent Mertens, Elahe Yargholi, Hans P. Op de Beeck, Jan Van den Stock, Joost Vennekens. [doi]
- Information-theoretic Limits of Online Classification with Noisy LabelsChanglong Wu, Ananth Grama, Wojciech Szpankowski. [doi]
- The Art of Saying No: Contextual Noncompliance in Language ModelsFaeze Brahman, Sachin Kumar 0009, Vidhisha Balachandran, Pradeep Dasigi, Valentina Pyatkin, Abhilasha Ravichander, Sarah Wiegreffe, Nouha Dziri, Khyathi Raghavi Chandu, Jack Hessel, Yulia Tsvetkov, Noah A. Smith, Yejin Choi 0001, Hanna Hajishirzi. [doi]
- BoostAdapter: Improving Vision-Language Test-Time Adaptation via Regional BootstrappingTaolin Zhang 0003, Jinpeng Wang 0002, Hang Guo, Tao Dai 0001, Bin Chen 0011, Shu-Tao Xia. [doi]
- Preference Learning Algorithms Do Not Learn Preference RankingsAngelica Chen, Sadhika Malladi, Lily H. Zhang, Xinyi Chen 0001, Qiuyi (Richard) Zhang, Rajesh Ranganath, KyungHyun Cho. [doi]
- DiMSUM: Diffusion Mamba - A Scalable and Unified Spatial-Frequency Method for Image GenerationHao Phung, Quan Dao, Trung Tuan Dao, Viet-Hoang Phan, Dimitris N. Metaxas, Anh Tuan Tran 0001. [doi]
- Retrieval-Augmented Diffusion Models for Time Series ForecastingJingwei Liu, Ling Yang 0006, Hongyan Li 0002, Shenda Hong. [doi]
- Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency modelJing Zhang, Linjiajie Fang, Kexin Shi, Wenjia Wang, Bingyi Jing. [doi]
- Exploring the Precise Dynamics of Single-Layer GAN Models: Leveraging Multi-Feature Discriminators for High-Dimensional Subspace LearningAndrew Bond, Zafer Dogan. [doi]
- Unified Lexical Representation for Interpretable Visual-Language AlignmentYifan Li, Yikai Wang 0002, Yanwei Fu 0001, Dongyu Ru, Zheng Zhang 0001, Tong He 0002. [doi]
- STimage-1K4M: A histopathology image-gene expression dataset for spatial transcriptomicsJiawen Chen, Muqing Zhou, Wenrong Wu, Jinwei Zhang, Yun Li, Didong Li. [doi]
- Efficiency for Free: Ideal Data Are Transportable RepresentationsPeng Sun, Yi Jiang, Tao Lin. [doi]
- Minimax Optimal and Computationally Efficient Algorithms for Distributionally Robust Offline Reinforcement LearningZhishuai Liu, Pan Xu 0002. [doi]
- Sim2Real-Fire: A Multi-modal Simulation Dataset for Forecast and Backtracking of Real-world Forest FireYanzhi Li, Keqiu Li, Li Guohui, Zumin Wang, Changqing Ji, Lubo Wang, Die Zuo, Qing Guo 0005, Feng Zhang, Manyu Wang, Di Lin 0002. [doi]
- Towards Global Optimal Visual In-Context Learning Prompt SelectionChengming Xu 0001, Chen Liu, Yikai Wang 0002, Yuan Yao 0011, Yanwei Fu 0001. [doi]
- MM-WLAuslan: Multi-View Multi-Modal Word-Level Australian Sign Language Recognition DatasetXin Shen, Heming Du, Hongwei Sheng, Shuyun Wang, Hui Chen, Huiqiang Chen, Zhuojie Wu, Xiaobiao Du, Jiaying Ying, Ruihan Lu, Qingzheng Xu, Xin Yu 0002. [doi]
- Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought ReasoningHao Shao, Shengju Qian, Han Xiao 0010, Guanglu Song, Zhuofan Zong, Letian Wang, Yu Liu 0015, Hongsheng Li 0001. [doi]
- NanoBaseLib: A Multi-Task Benchmark Dataset for Nanopore SequencingGuangzhao Cheng, Chengbo Fu, Lu Cheng. [doi]
- Worst-Case Offline Reinforcement Learning with Arbitrary Data SupportKohei Miyaguchi. [doi]
- ERBench: An Entity-Relationship based Automatically Verifiable Hallucination Benchmark for Large Language ModelsJio Oh, Soyeon Kim, JunSeok Seo, Jindong Wang 0001, Ruochen Xu, Xing Xie 0001, Steven Whang 0001. [doi]
- DI-MaskDINO: A Joint Object Detection and Instance Segmentation ModelZhixiong Nan, Xianghong Li, Tao Xiang, Jifeng Dai. [doi]
- A Sober Look at the Robustness of CLIPs to Spurious FeaturesQizhou Wang, Yong Lin, Yongqiang Chen 0002, Ludwig Schmidt, Bo Han 0003, Tong Zhang 0001. [doi]
- SpeedLoader: An I/O efficient scheme for heterogeneous and distributed LLM operationYiqi Zhang, Yang You. [doi]
- DenoiseRep: Denoising Model for Representation LearningZhengrui Xu, Guan'an Wang, Xiaowen Huang, Jitao Sang. [doi]
- Neuro-Vision to Language: Enhancing Brain Recording-based Visual Reconstruction and Language InteractionGuobin Shen, Dongcheng Zhao, Xiang He 0004, Linghao Feng, Yiting Dong, Jihang Wang, Qian Zhang, Yi Zeng 0001. [doi]
- Instruction Tuning With Loss Over InstructionsZhengxiang Shi, Adam X. Yang, Bin Wu, Laurence Aitchison, Emine Yilmaz, Aldo Lipani. [doi]
- BLURD: Benchmarking and Learning using a Unified Rendering and Diffusion ModelBoris Repasky, Ehsan Abbasnejad, Anthony R. Dick. [doi]
- Is the MMI Criterion Necessary for Interpretability? Degenerating Non-causal Features to Plain Noise for Self-RationalizationWei Liu, Zhiying Deng, Zhongyu Niu, Jun Wang, Haozhao Wang, Yuankai Zhang, Ruixuan Li 0001. [doi]
- A Benchmark Suite for Evaluating Neural Mutual Information Estimators on Unstructured DatasetsKyungeun Lee, Wonjong Rhee. [doi]
- Continual Audio-Visual Sound SeparationWeiguo Pian, Yiyang Nan, Shijian Deng, Shentong Mo, Yunhui Guo, Yapeng Tian. [doi]
- Provably Efficient Reinforcement Learning with Multinomial Logit Function ApproximationLong-Fei Li, Yu-Jie Zhang, Peng Zhao 0006, Zhi-Hua Zhou. [doi]
- FreeSplat: Generalizable 3D Gaussian Splatting Towards Free View Synthesis of Indoor ScenesYunsong Wang, Tianxin Huang, Hanlin Chen, Gim Hee Lee. [doi]
- Multimodal Task Vectors Enable Many-Shot Multimodal In-Context LearningBrandon Huang, Chancharik Mitra, Leonid Karlinsky, Assaf Arbelle, Trevor Darrell, Roei Herzig. [doi]
- Learning Segmentation from Point TrajectoriesLaurynas Karazija, Iro Laina, Christian Rupprecht 0001, Andrea Vedaldi. [doi]
- Private Stochastic Convex Optimization with Heavy Tails: Near-Optimality from Simple ReductionsHilal Asi, Daogao Liu, Kevin Tian. [doi]
- Corruption-Robust Linear Bandits: Minimax Optimality and Gap-Dependent MisspecificationHaolin Liu, Artin Tajdini, Andrew Wagenmaker, Chen-Yu Wei. [doi]
- Minimum Entropy Coupling with BottleneckM. Reza Ebrahimi, Jun Chen 0005, Ashish Khisti. [doi]
- FlowLLM: Flow Matching for Material Generation with Large Language Models as Base DistributionsAnuroop Sriram, Benjamin Kurt Miller, Ricky T. Q. Chen, Brandon M. Wood. [doi]
- T2V-Turbo: Breaking the Quality Bottleneck of Video Consistency Model with Mixed Reward FeedbackJiachen Li, Weixi Feng, Tsu-Jui Fu, Xinyi Wang 0003, Sugato Basu, Wenhu Chen, William Yang Wang. [doi]
- SCOREQ: Speech Quality Assessment with Contrastive RegressionAlessandro Ragano, Jan Skoglund, Andrew Hines. [doi]
- Adversarially Robust Decision TransformerXiaohang Tang, Afonso Marques, Parameswaran Kamalaruban, Ilija Bogunovic. [doi]
- Theoretical Analysis of Weak-to-Strong GeneralizationHunter Lang, David A. Sontag, Aravindan Vijayaraghavan. [doi]
- Stochastic Optimal Control for Diffusion Bridges in Function SpacesByoungwoo Park, Jungwon Choi, Sungbin Lim, Juho Lee. [doi]
- Hierarchical Uncertainty Exploration via Feedforward Posterior TreesElias Nehme, Rotem Mulayoff, Tomer Michaeli. [doi]
- Trading off Consistency and Dimensionality of Convex Surrogates for Multiclass ClassificationEnrique B. Nueve, Dhamma Kimpara, Bo Waggoner, Jessica Finocchiaro. [doi]
- Toward Robust Incomplete Multimodal Sentiment Analysis via Hierarchical Representation LearningMingcheng Li, Dingkang Yang, Yang Liu 0246, Shunli Wang 0001, Jiawei Chen 0012, Shuaibing Wang, Jinjie Wei, Yue Jiang, Qingyao Xu, Xiaolu Hou, Mingyang Sun, Ziyun Qian, Dongliang Kou, Lihua Zhang. [doi]
- Adversarially Robust Multi-task Representation LearningAustin Watkins, Thanh Nguyen-Tang, Enayat Ullah, Raman Arora. [doi]
- Compressing Large Language Models using Low Rank and Low Precision DecompositionRajarshi Saha, Naomi Sagan, Varun Srivastava, Andrea Goldsmith, Mert Pilanci. [doi]
- Compositional 3D-aware Video Generation with LLM DirectorHanxin Zhu, Tianyu He, Anni Tang, Junliang Guo, Zhibo Chen 0001, Jiang Bian 0002. [doi]
- Sparsity-Agnostic Linear Bandits with Adaptive AdversariesTianyuan Jin, Kyoungseok Jang, Nicolò Cesa-Bianchi. [doi]
- Language Models as Hierarchy EncodersYuan He 0008, Moy Yuan, Jiaoyan Chen 0001, Ian Horrocks 0001. [doi]
- Zero-to-Hero: Enhancing Zero-Shot Novel View Synthesis via Attention Map FilteringIdo Sobol, Chenfeng Xu, Or Litany. [doi]
- Deterministic Policies for Constrained Reinforcement Learning in Polynomial TimeJeremy McMahan. [doi]
- Coupled Mamba: Enhanced Multimodal Fusion with Coupled State Space ModelWenbing Li, Hang Zhou 0010, Junqing Yu, Zikai Song, Wei Yang 0034. [doi]
- Reciprocal LearningJulian Rodemann, Christoph Jansen, Georg Schollmeyer. [doi]
- IndicVoices-R: Unlocking a Massive Multilingual Multi-speaker Speech Corpus for Scaling Indian TTSAshwin Sankar, Srija Anand, Praveen Srinivasa Varadhan, Sherry Thomas, Mehak Singal, Shridhar Kumar, Deovrat Mehendale, Aditi Krishana, Giri Raju, Mitesh M. Khapra. [doi]
- Schur Nets: exploiting local structure for equivariance in higher order graph neural networksQingqi Zhang, Ruize Xu, Risi Kondor. [doi]
- Global Rewards in Restless Multi-Armed BanditsNaveen Raman, Zheyuan Shi, Fei Fang 0001. [doi]
- A Functional Extension of Semi-Structured NetworksDavid Rügamer, Bernard X. W. Liew, Zainab Altai, Almond Stöcker. [doi]
- FIFO-Diffusion: Generating Infinite Videos from Text without TrainingJihwan Kim, Junoh Kang, Jinyoung Choi, Bohyung Han. [doi]
- Quantum Algorithms for Non-smooth Non-convex OptimizationChengchang Liu, Chaowen Guan, Jianhao He, John C. S. Lui. [doi]
- RL in Latent MDPs is Tractable: Online Guarantees via Off-Policy EvaluationJeongyeol Kwon, Shie Mannor, Constantine Caramanis, Yonathan Efroni. [doi]
- Melting Pot Contest: Charting the Future of Generalized Cooperative IntelligenceRakshit Trivedi, Akbir Khan, Jesse Clifton, Lewis Hammond, Edgar A. Duéñez-Guzmán, Dipam Chakraborty, John P. Agapiou, Jayd Matyas, Alexander Sasha Vezhnevets, Barna Pásztor, Yunke Ao, Omar G. Younis, Jiawei Huang, Benjamin Swain, Haoyuan Qin, Mian Deng, Ziwei Deng, Utku Erdoganaras, Yue Zhao 0023, Marko Tesic, Natasha Jaques, Jakob Foerster, Vincent Conitzer, José Hernández-Orallo, Dylan Hadfield-Menell, Joel Z. Leibo. [doi]
- Distribution Learning with Valid Outputs Beyond the Worst-CaseNicholas Rittler, Kamalika Chaudhuri. [doi]
- Normalization Layer Per-Example Gradients are Sufficient to Predict Gradient Noise Scale in TransformersGavia Gray, Aman Tiwari, Shane Bergsma, Joel Hestness. [doi]
- On the Convergence of Loss and Uncertainty-based Active Learning AlgorithmsDaniel Haimovich, Dima Karamshuk, Fridolin Linder, Niek Tax, Milan Vojnovic. [doi]
- Inductive biases of multi-task learning and finetuning: multiple regimes of feature reuseSamuel Lippl, Jack W. Lindsey. [doi]
- GLBench: A Comprehensive Benchmark for Graph with Large Language ModelsYuhan Li 0001, Peisong Wang, Xiao Zhu, Aochuan Chen, Haiyun Jiang, Deng Cai 0002, Wai Kin (Victor) Chan, Jia Li 0009. [doi]
- Robust Neural Contextual Bandit against Adversarial CorruptionsYunzhe Qi, Yikun Ban, Arindam Banerjee 0001, Jingrui He. [doi]
- M$^3$GPT: An Advanced Multimodal, Multitask Framework for Motion Comprehension and GenerationMingshuang Luo, Ruibing Hou, Zhuo Li, Hong Chang 0001, Zimo Liu, Yaowei Wang, Shiguang Shan. [doi]
- WindsorML: High-Fidelity Computational Fluid Dynamics Dataset For Automotive AerodynamicsNeil Ashton, Jordan B. Angel, Aditya S. Ghate, Gaetan K. W. Kenway, Man Long Wong, Cetin C. Kiris, Astrid Walle, Danielle Maddix, Gary Page. [doi]
- Cross-modal Representation Flattening for Multi-modal Domain GeneralizationYunfeng Fan, Wenchao Xu 0001, Haozhao Wang, Song Guo 0001. [doi]
- PersonalSum: A User-Subjective Guided Personalized Summarization Dataset for Large Language ModelsLemei Zhang, Peng Liu 0025, Marcus Tiedemann Oekland Henriksboe, Even W. Lauvrak, Jon Atle Gulla, Heri Ramampiaro. [doi]
- Ask, Attend, Attack: An Effective Decision-Based Black-Box Targeted Attack for Image-to-Text ModelsQingyuan Zeng, Zhenzhong Wang, Yiu-ming Cheung, Min Jiang 0005. [doi]
- GVKF: Gaussian Voxel Kernel Functions for Highly Efficient Surface Reconstruction in Open ScenesGaochao Song, Chong Cheng, Hao Wang. [doi]
- Achieving Õ(1/ε) Sample Complexity for Constrained Markov Decision ProcessJiashuo Jiang, Yinyu Ye 0001. [doi]
- QBB: Quantization with Binary Bases for LLMsAdrian Bulat, Yassine Ouali, Georgios Tzimiropoulos. [doi]
- Dynamic Model Predictive Shielding for Provably Safe Reinforcement LearningArko Banerjee, Kia Rahmani, Joydeep Biswas, Isil Dillig. [doi]
- TACT: Advancing Complex Aggregative Reasoning with Information Extraction ToolsAvi Caciularu, Alon Jacovi, Eyal Ben-David, Sasha Goldshtein, Tal Schuster, Jonathan Herzig, Gal Elidan, Amir Globerson. [doi]
- LLM-AutoDA: Large Language Model-Driven Automatic Data Augmentation for Long-tailed ProblemsPengkun Wang, Zhe Zhao 0008, Haibin Wen, Fanfu Wang, Binwu Wang, Qingfu Zhang 0001, Yang Wang 0015. [doi]
- Kermut: Composite kernel regression for protein variant effectsPeter Mørch Groth, Mads Herbert Kerrn, Lars Olsen, Jesper Salomon, Wouter Boomsma. [doi]
- Slice-100K: A Multimodal Dataset for Extrusion-based 3D PrintingAnushrut Jignasu, Kelly O. Marshall, Ankush Kumar Mishra, Lucas Nerone Rillo, Baskar Ganapathysubramanian, Aditya Balu, Chinmay Hegde, Adarsh Krishnamurthy. [doi]
- PPLNs: Parametric Piecewise Linear Networks for Event-Based Temporal Modeling and BeyondChen Song, Zhenxiao Liang, Bo Sun, Qixing Huang. [doi]
- Advancing Training Efficiency of Deep Spiking Neural Networks through Rate-based BackpropagationChengting Yu, Lei Liu, Gaoang Wang, Erping Li, Aili Wang 0002. [doi]
- Memory-Efficient LLM Training with Online Subspace DescentKaizhao Liang, Bo Liu, Lizhang Chen, Qiang Liu. [doi]
- Rethinking Weight Decay for Robust Fine-Tuning of Foundation ModelsJunjiao Tian, Chengyue Huang, Zsolt Kira. [doi]
- SSDiff: Spatial-spectral Integrated Diffusion Model for Remote Sensing PansharpeningYu Zhong, Xiao Wu, Liang-Jian Deng, Zihan Cao, Hong-Xia Dou. [doi]
- LaSe-E2V: Towards Language-guided Semantic-aware Event-to-Video ReconstructionKanghao Chen, Hangyu Li, Jiazhou Zhou, Zeyu Wang, Lin Wang 0025. [doi]
- ViLCo-Bench: VIdeo Language COntinual learning BenchmarkTianqi Tang 0002, Shohreh Deldari, Hao Xue 0001, Celso de Melo, Flora D. Salim. [doi]
- CLUES: Collaborative Private-domain High-quality Data Selection for LLMs via Training DynamicsWanru Zhao, Hongxiang Fan, Shell Xu Hu, Wangchunshu Zhou, Nicholas D. Lane. [doi]
- Cost-aware Bayesian Optimization via the Pandora's Box Gittins IndexQian Xie 0005, Raul Astudillo, Peter I. Frazier, Ziv Scully, Alexander Terenin. [doi]
- PSL: Rethinking and Improving Softmax Loss from Pairwise Perspective for RecommendationWeiqin Yang 0002, Jiawei Chen 0007, Xin Xin 0003, Sheng Zhou 0004, Binbin Hu, Yan Feng, Chun Chen 0001, Can Wang 0001. [doi]
- Newton Losses: Using Curvature Information for Learning with Differentiable AlgorithmsFelix Petersen, Christian Borgelt, Tobias Sutter, Hilde Kuehne, Oliver Deussen, Stefano Ermon. [doi]
- FIRE: A Dataset for Feedback Integration and Refinement Evaluation of Multimodal ModelsPengxiang Li 0002, Zhi Gao, Bofei Zhang, Tao Yuan, Yuwei Wu 0001, Mehrtash Harandi, Yunde Jia, Song Chun Zhu, Qing Li 0003. [doi]
- SpGesture: Source-Free Domain-adaptive sEMG-based Gesture Recognition with Jaccard Attentive Spiking Neural NetworkWeiyu Guo, Ying Sun 0006, Yijie Xu, Ziyue Qiao, Yongkui Yang, Hui Xiong 0001. [doi]
- Multilingual Diversity Improves Vision-Language RepresentationsThao Nguyen, Matthew Wallingford, Sebastin Santy, Wei-Chiu Ma, Sewoong Oh, Ludwig Schmidt, Pang Wei W. Koh, Ranjay Krishna. [doi]
- Qualitative Mechanism IndependenceOliver Richardson, Spencer J. Peters, Joseph Y. Halpern. [doi]
- Learning to Reason via Program Generation, Emulation, and SearchNathaniel Weir, Muhammad Khalifa, Linlu Qiu, Orion Weller, Peter Clark. [doi]
- Fair and Welfare-Efficient Constrained Multi-Matchings under UncertaintyElita A. Lobo, Justin Payan, Cyrus Cousins, Yair Zick. [doi]
- Gradient-Free Methods for Nonconvex Nonsmooth Stochastic Compositional OptimizationZhuanghua Liu, Luo Luo, Bryan Kian Hsiang Low. [doi]
- Generalization Error Bounds for Two-stage Recommender Systems with Tree StructureJin Zhang, Ze Liu, Defu Lian, Enhong Chen. [doi]
- Schedule Your Edit: A Simple yet Effective Diffusion Noise Schedule for Image EditingHaonan Lin, Yan Chen 0031, Jiahao Wang, Wenbin An, Mengmeng Wang, Feng Tian 0002, Yong Liu, Guang Dai, Jingdong Wang, QianYing Wang. [doi]
- UGC: Universal Graph CoarseningMohit Kataria, Sandeep Kumar, Jayadeva. [doi]
- No-regret Learning in Harmonic Games: Extrapolation in the Face of Conflicting InterestsDavide Legacci, Panayotis Mertikopoulos, Christos H. Papadimitriou, Georgios Piliouras, Bary S. R. Pradelski. [doi]
- Separations in the Representational Capabilities of Transformers and Recurrent ArchitecturesSatwik Bhattamishra, Michael Hahn 0001, Phil Blunsom, Varun Kanade. [doi]
- Dense Associative Memory Through the Lens of Random FeaturesBenjamin Hoover, Duen Horng Chau, Hendrik Strobelt, Parikshit Ram, Dmitry Krotov. [doi]
- RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and LocalizationBing Yang, Changsheng Quan, Yabo Wang, Pengyu Wang, Yujie Yang, Ying Fang, Nian Shao, Hui Bu, Xin Xu, Xiaofei Li. [doi]
- Convolutional Differentiable Logic Gate NetworksFelix Petersen, Hilde Kuehne, Christian Borgelt, Julian Welzel, Stefano Ermon. [doi]
- Entropy testing and its application to testing Bayesian networksClément L. Canonne, Joy Qiping Yang. [doi]
- Contrastive losses as generalized models of global epistasisDavid H. Brookes, Jakub Otwinowski, Sam Sinai. [doi]
- Upping the Game: How 2D U-Net Skip Connections Flip 3D SegmentationXingru Huang, Yihao Guo, Jian Huang, Tianyun Zhang, Hong He, Shaowei Jiang, Yaoqi Sun. [doi]
- Remove that Square Root: A New Efficient Scale-Invariant Version of AdaGradSayantan Choudhury, Nazarii Tupitsa, Nicolas Loizou, Samuel Horváth, Martin Takác 0001, Eduard Gorbunov. [doi]
- Conditioning non-linear and infinite-dimensional diffusion processesElizabeth Louise Baker, Gefan Yang, Michael L. Severinsen, Christy Anna Hipsley, Stefan Sommer. [doi]
- Efficient and Private Marginal Reconstruction with Local Non-NegativityBrett Mullins, Miguel Fuentes, Yingtai Xiao, Daniel Kifer, Cameron Musco, Daniel R. Sheldon. [doi]
- MultiOOD: Scaling Out-of-Distribution Detection for Multiple ModalitiesHao Dong, Yue Zhao 0016, Eleni N. Chatzi, Olga Fink. [doi]
- Graph-based Unsupervised Disentangled Representation Learning via Multimodal Large Language ModelsBaao Xie, Qiuyu Chen, Yunnan Wang, Zequn Zhang, Xin Jin, Wenjun Zeng. [doi]
- Face2QR: A Unified Framework for Aesthetic, Face-Preserving, and Scannable QR Code GenerationXuehao Cui, Guangyang Wu, Zhenghao Gan, Guangtao Zhai, Xiaohong Liu 0001. [doi]
- Improved Regret of Linear Ensemble SamplingHarin Lee, Min-hwan Oh. [doi]
- CALVIN: Improved Contextual Video Captioning via Instruction TuningGowthami Somepalli, Arkabandhu Chowdhury, Jonas Geiping, Ronen Basri, Tom Goldstein, David Jacobs 0001. [doi]
- Mixture of Scales: Memory-Efficient Token-Adaptive Binarization for Large Language ModelsDongwon Jo, Taesu Kim, Yulhwa Kim, Jae-Joon Kim. [doi]
- SAND: Smooth imputation of sparse and noisy functional data with Transformer networksJu-Sheng Hong, Junwen Yao, Jonas W. Mueller, Jane-ling Wang. [doi]
- Automatically Learning Hybrid Digital Twins of Dynamical SystemsSamuel Holt, Tennison Liu, Mihaela van der Schaar. [doi]
- MADiff: Offline Multi-agent Learning with Diffusion ModelsZhengbang Zhu, Minghuan Liu, Liyuan Mao, Bingyi Kang, Minkai Xu, Yong Yu 0001, Stefano Ermon, Weinan Zhang 0001. [doi]
- QT-ViT: Improving Linear Attention in ViT with Quadratic Taylor ExpansionYixing Xu, Chao Li, Dong Li, Xiao Sheng, Fan Jiang, Lu Tian, Emad Barsoum. [doi]
- Learning Image Priors Through Patch-Based Diffusion Models for Solving Inverse ProblemsJason Hu, Bowen Song, Xiaojian Xu 0002, Liyue Shen, Jeffrey A. Fessler. [doi]
- From Trojan Horses to Castle Walls: Unveiling Bilateral Data Poisoning Effects in Diffusion ModelsZhuoshi Pan, Yuguang Yao, Gaowen Liu, Bingquan Shen, H. Vicky Zhao, Ramana Kompella, Sijia Liu 0001. [doi]
- Utilizing Human Behavior Modeling to Manipulate Explanations in AI-Assisted Decision Making: The Good, the Bad, and the ScaryZhuoyan Li, Ming Yin 0001. [doi]
- MKGL: Mastery of a Three-Word LanguageLingbing Guo, Zhongpu Bo, Zhuo Chen 0007, Yichi Zhang 0009, Jiaoyan Chen 0001, Yarong Lan, Mengshu Sun, Zhiqiang Zhang, Yangyifei Luo, Qian Li, Qiang Zhang, Wen Zhang, Huajun Chen. [doi]
- Abstract Reward Processes: Leveraging State Abstraction for Consistent Off-Policy EvaluationShreyas Chaudhari, Ameet Deshpande, Bruno C. da Silva 0001, Philip S. Thomas. [doi]
- SequentialAttention++ for Block Sparsification: Differentiable Pruning Meets Combinatorial OptimizationTaisuke Yasuda 0002, Kyriakos Axiotis, Gang Fu, Mohammad Hossein Bateni 0001, Vahab Mirrokni. [doi]
- DMesh: A Differentiable Mesh RepresentationSanghyun Son 0003, Matheus Gadelha, Yang Zhou, Zexiang Xu, Ming C. Lin, Yi Zhou 0023. [doi]
- OwMatch: Conditional Self-Labeling with Consistency for Open-World Semi-Supervised LearningShengjie Niu, Lifan Lin, Jian Huang, Chao Wang. [doi]
- Efficient Federated Learning against Heterogeneous and Non-stationary Client UnavailabilityMing Xiang, Stratis Ioannidis, Edmund Yeh, Carlee Joe-Wong, Lili Su. [doi]
- Interpretable Mesomorphic Networks for Tabular DataArlind Kadra, Sebastian Pineda-Arango, Josif Grabocka. [doi]
- Almost Surely Asymptotically Constant Graph Neural NetworksSam Adam-Day, Michael Benedikt, Ismail Ilkan Ceylan, Ben Finkelshtein. [doi]
- On the Expressive Power of Tree-Structured Probabilistic CircuitsLang Yin, Han Zhao 0002. [doi]
- Rethinking Exploration in Reinforcement Learning with Effective Metric-Based Exploration BonusYiming Wang, Kaiyan Zhao, Furui Liu, Leong Hou U. [doi]
- ConvBench: A Multi-Turn Conversation Evaluation Benchmark with Hierarchical Ablation Capability for Large Vision-Language ModelsShuo Liu, Kaining Ying, Hao Zhang, Yue Yang, Yuqi Lin, Tianle Zhang, Chuanhao Li, Yu Qiao 0001, Ping Luo 0002, Wenqi Shao, Kaipeng Zhang. [doi]
- G-Retriever: Retrieval-Augmented Generation for Textual Graph Understanding and Question AnsweringXiaoxin He, Yijun Tian 0001, Yifei Sun, Nitesh V. Chawla, Thomas Laurent 0001, Yann LeCun, Xavier Bresson, Bryan Hooi. [doi]
- WenMind: A Comprehensive Benchmark for Evaluating Large Language Models in Chinese Classical Literature and Language ArtsJiahuan Cao, Yang Liu, Yongxin Shi, Kai Ding 0009, Lianwen Jin. [doi]
- Do causal predictors generalize better to new domains?Vivian Y. Nastl, Moritz Hardt. [doi]
- Transforming Vision Transformer: Towards Efficient Multi-Task Asynchronous LearnerHanwen Zhong, Jiaxin Chen, Yutong Zhang, Di Huang, Yunhong Wang. [doi]
- WorkArena++: Towards Compositional Planning and Reasoning-based Common Knowledge Work TasksLéo Boisvert, Megh Thakkar, Maxime Gasse, Massimo Caccia, Thibault Le Sellier De Chezelles, Quentin Cappart, Nicolas Chapados, Alexandre Lacoste, Alexandre Drouin. [doi]
- MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse AttentionHuiqiang Jiang, Yucheng Li, Chengruidong Zhang, Qianhui Wu, Xufang Luo, Surin Ahn, Zhenhua Han, Amir Abdi, Dongsheng Li 0002, Chin-Yew Lin, Yuqing Yang 0001, Lili Qiu. [doi]
- Pard: Permutation-Invariant Autoregressive Diffusion for Graph GenerationLingxiao Zhao, Xueying Ding, Leman Akoglu. [doi]
- AdaNovo: Towards Robust \emph{De Novo} Peptide Sequencing in Proteomics against Data BiasesJun Xia 0001, Shaorong Chen, Jingbo Zhou, Xiaojun Shan, Wenjie Du, Zhangyang Gao, Cheng Tan 0012, Bozhen Hu, Jiangbin Zheng, Stan Z. Li. [doi]
- Exploring Low-Dimensional Subspace in Diffusion Models for Controllable Image EditingSiyi Chen, Huijie Zhang, Minzhe Guo, Yifu Lu, Peng Wang 0098, Qing Qu 0001. [doi]
- Reinforcement Learning with Lookahead InformationNadav Merlis. [doi]
- Clustering in Causal Attention MaskingNikita Karagodin, Yury Polyanskiy, Philippe Rigollet. [doi]
- Learning from Teaching Regularization: Generalizable Correlations Should be Easy to ImitateCan Jin, Tong Che, Hongwu Peng, Yiyuan Li, Dimitris N. Metaxas, Marco Pavone 0001. [doi]
- Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language ModelsYushi Hu, Weijia Shi, Xingyu Fu, Dan Roth, Mari Ostendorf, Luke Zettlemoyer, Noah A. Smith, Ranjay Krishna. [doi]
- FastSurvival: Hidden Computational Blessings in Training Cox Proportional Hazards ModelsJiachang Liu 0001, Rui Zhang, Cynthia Rudin. [doi]
- Adaptive Important Region Selection with Reinforced Hierarchical Search for Dense Object DetectionDingrong Wang, Hitesh Sapkota, Qi Yu 0001. [doi]
- Towards Diverse Device Heterogeneous Federated Learning via Task Arithmetic Knowledge IntegrationMahdi Morafah, Vyacheslav Kungurtsev, Hojin Chang, Chen Chen 0001, Bill Lin 0001. [doi]
- Generalization Bounds via Conditional f-InformationZiqiao Wang, Yongyi Mao. [doi]
- Diffusion Model with Cross Attention as an Inductive Bias for DisentanglementTao Yang, Cuiling Lan, Yan Lu 0001, Nanning Zheng 0001. [doi]
- Graph Diffusion Policy OptimizationYijing Liu, Chao Du, Tianyu Pang, Chongxuan Li, Min Lin, Wei Chen 0001. [doi]
- Equivariant Blurring Diffusion for Hierarchical Molecular Conformer GenerationJiwoong Park, Yang Shen. [doi]
- Terra: A Multimodal Spatio-Temporal Dataset Spanning the EarthWei Chen 0070, Xixuan Hao, Yuankai Wu, Yuxuan Liang. [doi]
- MoGU: A Framework for Enhancing Safety of LLMs While Preserving Their UsabilityYanrui Du, Sendong Zhao, Danyang Zhao, Ming Ma, Yuhan Chen 0002, Liangyu Huo, Qing Yang 0033, Dongliang Xu, Bing Qin 0001. [doi]
- Breaking Semantic Artifacts for Generalized AI-generated Image DetectionChende Zheng, Chenhao Lin, Zhengyu Zhao 0001, Hang Wang, Xu Guo, Shuai Liu, Chao Shen 0001. [doi]
- Long-tailed Object Detection Pretraining: Dynamic Rebalancing Contrastive Learning with Dual ReconstructionChen-Long Duan, Yong Li, Xiu-Shen Wei, Lin Zhao. [doi]
- Estimating Heterogeneous Treatment Effects by Combining Weak Instruments and Observational DataMiruna Oprescu, Nathan Kallus. [doi]
- Multi-Reward Best Policy IdentificationAlessio Russo, Filippo Vannella. [doi]
- Beyond Euclidean: Dual-Space Representation Learning for Weakly Supervised Video Violence DetectionJiaxu Leng, Zhanjie Wu, Mingpi Tan, Yiran Liu, Ji Gan, Haosheng Chen 0001, Xinbo Gao 0001. [doi]
- Zero-Shot Reinforcement Learning from Low Quality DataScott R. Jeen, Tom Bewley, Jonathan M. Cullen. [doi]
- MiniCache: KV Cache Compression in Depth Dimension for Large Language ModelsAkide Liu, Jing Liu, Zizheng Pan, Yefei He, Reza Haffari, Bohan Zhuang. [doi]
- Stochastic Taylor Derivative Estimator: Efficient amortization for arbitrary differential operatorsZekun Shi, Zheyuan Hu, Min Lin, Kenji Kawaguchi. [doi]
- Learning 1D Causal Visual Representation with De-focus Attention NetworksChenxin Tao, Xizhou Zhu, Shiqian Su, Lewei Lu, Changyao Tian, Xuan Luo, Gao Huang 0001, Hongsheng Li 0001, Yu Qiao 0001, Jie Zhou 0001, Jifeng Dai. [doi]
- Hybrid Mamba for Few-Shot SegmentationQianxiong Xu, Xuanyi Liu, Lanyun Zhu, Guosheng Lin, Cheng Long 0001, Ziyue Li 0002, Rui Zhao 0001. [doi]
- Stochastic Optimal Control MatchingCarles Domingo-Enrich, Jiequn Han, Brandon Amos, Joan Bruna, Ricky T. Q. Chen. [doi]
- In-Trajectory Inverse Reinforcement Learning: Learn Incrementally Before an Ongoing Trajectory TerminatesShicheng Liu, Minghui Zhu. [doi]
- Drift-Resilient TabPFN: In-Context Learning Temporal Distribution Shifts on Tabular DataKai Helli, David Schnurr, Noah Hollmann, Samuel Müller 0005, Frank Hutter. [doi]
- DiffCut: Catalyzing Zero-Shot Semantic Segmentation with Diffusion Features and Recursive Normalized CutPaul Couairon, Mustafa Shukor, Jean-Emmanuel Haugeard, Matthieu Cord, Nicolas Thome. [doi]
- Diffusion Actor-Critic with Entropy RegulatorYinuo Wang, Likun Wang, Yuxuan Jiang 0011, Wenjun Zou, Tong Liu, Xujie Song, Wenxuan Wang 0004, Liming Xiao, Jiang Wu, Jingliang Duan, Shengbo Li 0001. [doi]
- Training Dynamics of Transformers to Recognize Word Co-occurrence via Gradient Flow AnalysisHongru Yang, Bhavya Kailkhura, Zhangyang Wang, Yingbin Liang. [doi]
- BackdoorAlign: Mitigating Fine-tuning based Jailbreak Attack with Backdoor Enhanced Safety AlignmentJiongxiao Wang, Jiazhao Li, Yiquan Li, Xiangyu Qi, Junjie Hu, Sharon Li, Patrick McDaniel, Muhao Chen, Bo Li, Chaowei Xiao. [doi]
- Rethinking the Power of Timestamps for Robust Time Series Forecasting: A Global-Local Fusion PerspectiveChengsen Wang, Qi Qi, Jingyu Wang, Haifeng Sun 0001, Zirui Zhuang, Jinming Wu, Jianxin Liao. [doi]
- Image2Struct: Benchmarking Structure Extraction for Vision-Language ModelsJosselin Somerville Roberts, Tony Lee, Chi Heem Wong, Michihiro Yasunaga, Yifan Mai, Percy Liang. [doi]
- Adaptive Proximal Gradient Method for Convex OptimizationYura Malitsky, Konstantin Mishchenko. [doi]
- SpikedAttention: Training-Free and Fully Spike-Driven Transformer-to-SNN Conversion with Winner-Oriented Spike Shift for Softmax OperationSangwoo Hwang, Seunghyun Lee, Dahoon Park, Donghun Lee, Jaeha Kung. [doi]
- Continuous Partitioning for Graph-Based Semi-Supervised LearningChester Holtz, Pengwen Chen, Zhengchao Wan, Chung-Kuan Cheng, Gal Mishne. [doi]
- CoMix: A Comprehensive Benchmark for Multi-Task Comic UnderstandingEmanuele Vivoli, Marco Bertini 0001, Dimosthenis Karatzas. [doi]
- NVRC: Neural Video Representation CompressionHo Man Kwan, Ge Gao, Fan Zhang 0017, Andrew Gower, David Bull 0001. [doi]
- Implicit Regularization of Sharpness-Aware Minimization for Scale-Invariant ProblemsBingcong Li, Liang Zhang, Niao He. [doi]
- Learning to grok: Emergence of in-context learning and skill composition in modular arithmetic tasksTianyu He, Darshil Doshi, Aritra Das, Andrey Gromov. [doi]
- Length Optimization in Conformal PredictionShayan Kiyani, George J. Pappas, Hamed Hassani. [doi]
- Instruction-Guided Visual MaskingJinliang Zheng, Jianxiong Li, Sijie Cheng, Yinan Zheng, Jiaming Li, Jihao Liu, Yu Liu, Jingjing Liu, Xianyuan Zhan. [doi]
- Unsupervised Anomaly Detection in The Presence of Missing ValuesFeng Xiao, Jicong Fan 0001. [doi]
- Contrasting with Symile: Simple Model-Agnostic Representation Learning for Unlimited ModalitiesAdriel Saporta, Aahlad Manas Puli, Mark Goldstein, Rajesh Ranganath. [doi]
- Discrete-state Continuous-time Diffusion for Graph GenerationZhe Xu 0007, Ruizhong Qiu, Yuzhong Chen, Huiyuan Chen, Xiran Fan, Menghai Pan, Zhichen Zeng 0001, Mahashweta Das, Hanghang Tong. [doi]
- The tree autoencoder model, with application to hierarchical data visualizationMiguel Á. Carreira-Perpiñán, Kuat Gazizov. [doi]
- Goal-Conditioned On-Policy Reinforcement LearningXudong Gong, Dawei Feng, Kele Xu, Bo Ding, Huaimin Wang. [doi]
- Training-Free Adaptive Diffusion with Bounded Difference Approximation StrategyHancheng Ye, Jiakang Yuan, Renqiu Xia, Xiangchao Yan, Tao Chen 0003, Junchi Yan, Botian Shi, Bo Zhang 0069. [doi]
- How Does Message Passing Improve Collaborative Filtering?Mingxuan Ju, William Shiao, Zhichun Guo, Yanfang Ye 0001, Yozen Liu, Neil Shah, Tong Zhao 0003. [doi]
- Measuring Dejavu Memorization EfficientlyNarine Kokhlikyan, Bargav Jayaraman, Florian Bordes, Chuan Guo 0001, Kamalika Chaudhuri. [doi]
- Generative ForestsRichard Nock, Mathieu Guillame-Bert. [doi]
- YouDream: Generating Anatomically Controllable Consistent Text-to-3D AnimalsSandeep Mishra, Oindrila Saha, Alan C. Bovik. [doi]
- Can We Leave Deepfake Data Behind in Training Deepfake Detector?Jikang Cheng, Zhiyuan Yan 0002, Ying Zhang, Yuhao Luo, Zhongyuan Wang 0001, Chen Li. [doi]
- Recovering Complete Actions for Cross-dataset Skeleton Action RecognitionHanchao Liu, Yujiang Li, Tai-Jiang Mu, Shi-Min Hu 0001. [doi]
- Flexible mapping of abstract domains by grid cells via self-supervised extraction and projection of generalized velocity signalsAbhiram Iyer, Sarthak Chandra, Sugandha Sharma, Ila Fiete. [doi]
- Spherical Frustum Sparse Convolution Network for LiDAR Point Cloud Semantic SegmentationYu Zheng, Guangming Wang 0001, Jiuming Liu, Marc Pollefeys, Hesheng Wang 0001. [doi]
- CAT3D: Create Anything in 3D with Multi-View Diffusion ModelsRuiQi Gao, Aleksander Holynski, Philipp Henzler, Arthur Brussee, Ricardo Martin-Brualla, Pratul P. Srinivasan, Jonathan T. Barron, Ben Poole. [doi]
- Decision Mamba: Reinforcement Learning via Hybrid Selective Sequence ModelingSili Huang, Jifeng Hu, Zhejian Yang, Liwei Yang, Tao Luo, Hechang Chen, Lichao Sun 0001, Bo Yang. [doi]
- Piecewise-Stationary Bandits with KnapsacksXilin Zhang, Wang Chi Cheung. [doi]
- CoLoR-Filter: Conditional Loss Reduction Filtering for Targeted Language Model Pre-trainingDavid Brandfonbrener, Hanlin Zhang, Andreas Kirsch 0002, Jonathan Richard Schwarz, Sham M. Kakade. [doi]
- Fast samplers for Inverse Problems in Iterative Refinement modelsKushagra Pandey, Ruihan Yang, Stephan Mandt. [doi]
- SearchLVLMs: A Plug-and-Play Framework for Augmenting Large Vision-Language Models by Searching Up-to-Date Internet KnowledgeChuanhao Li, Zhen Li 0026, Chenchen Jing, Shuo Liu, Wenqi Shao, Yuwei Wu 0001, Ping Luo 0002, Yu Qiao 0001, Kaipeng Zhang. [doi]
- Understanding Transformers via N-Gram StatisticsTimothy Nguyen. [doi]
- Assouad, Fano, and Le Cam with Interaction: A Unifying Lower Bound Framework and Characterization for Bandit LearnabilityFan Chen, Dylan J. Foster, Yanjun Han, Jian Qian, Alexander Rakhlin, Yunbei Xu. [doi]
- Personalized Federated Learning via Feature Distribution AdaptationConnor McLaughlin, Lili Su. [doi]
- Discovering Creative Behaviors through DUPLEX: Diverse Universal Features for Policy ExplorationBorja G. León, Francesco Riccio, Kaushik Subramanian, Peter R. Wurman, Peter Stone 0001. [doi]
- Robust Reinforcement Learning with General UtilityZiyi Chen 0002, Yan Wen, Zhengmian Hu, Heng Huang. [doi]
- Reinforced Cross-Domain Knowledge Distillation on Time Series DataQing Xu 0015, Min Wu 0008, Xiaoli Li 0001, Kezhi Mao, Zhenghua Chen. [doi]
- Fairness in Social Influence Maximization via Optimal TransportShubham Chowdhary, Giulia De Pasquale, Nicolas Lanzetti, Ana-Andreea Stoica, Florian Dörfler. [doi]
- Online Classification with PredictionsVinod Raman, Ambuj Tewari. [doi]
- How Molecules Impact Cells: Unlocking Contrastive PhenoMolecular RetrievalPhilip Fradkin, Puria Azadi Moghadam, Karush Suri, Frederik Wenkel, Ali Bashashati, Maciej Sypetkowski, Dominique Beaini. [doi]
- On the Optimal Time Complexities in Decentralized Stochastic Asynchronous OptimizationAlexander Tyurin, Peter Richtárik. [doi]
- FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal AttentionYu Lu, Yuanzhi Liang, Linchao Zhu, Yi Yang 0001. [doi]
- A Consistency-Aware Spot-Guided Transformer for Versatile and Hierarchical Point Cloud RegistrationRenlang Huang, Yufan Tang, Jiming Chen, Liang Li. [doi]
- Federated Learning over Connected ModesDennis Grinwald, Philipp Wiesner, Shinichi Nakajima. [doi]
- Federated Learning under Periodic Client Participation and Heterogeneous Data: A New Communication-Efficient Algorithm and AnalysisMichael Crawshaw, Mingrui Liu. [doi]
- Invariant Tokenization of Crystalline Materials for Language Model Enabled GenerationKeqiang Yan, Xiner Li, Hongyi Ling, Kenna Ashen, Carl Edwards, Raymundo Arróyave, Marinka Zitnik, Heng Ji, Xiaofeng Qian, Xiaoning Qian, Shuiwang Ji. [doi]
- Is Multiple Object Tracking a Matter of Specialization?Gianluca Mancusi, Mattia Bernardi, Aniello Panariello, Angelo Porrello, Rita Cucchiara, Simone Calderara. [doi]
- Prompt Tuning Strikes Back: Customizing Foundation Models with Low-Rank Prompt AdaptationAbhinav Jain 0001, Swarat Chaudhuri, Thomas W. Reps, Christopher M. Jermaine. [doi]
- Understanding Hallucinations in Diffusion Models through Mode InterpolationSumukh K. Aithal, Pratyush Maini, Zachary C. Lipton, J. Zico Kolter. [doi]
- Pretraining Codomain Attention Neural Operators for Solving Multiphysics PDEsMd. Ashiqur Rahman, Robert Joseph George, Mogab Elleithy, Daniel V. Leibovici, Zongyi Li, Boris Bonev, Colin White, Julius Berner, Raymond A. Yeh, Jean Kossaifi, Kamyar Azizzadenesheli, Animashree Anandkumar. [doi]
- Evidential Stochastic Differential Equations for Time-Aware Sequential RecommendationKrishna Prasad Neupane, Ervine Zheng, Qi Yu 0001. [doi]
- Iterative Methods via Locally Evolving Set ProcessBaojian Zhou, Yifan Sun, Reza Babanezhad Harikandeh, Xingzhi Guo, Deqing Yang, Yanghua Xiao. [doi]
- Diffusion-based Curriculum Reinforcement LearningErdi Sayar, Giovanni Iacca, Ozgur S. Oguz, Alois Knoll. [doi]
- Explaining Datasets in Words: Statistical Models with Natural Language ParametersRuiqi Zhong, Heng Wang, Dan Klein, Jacob Steinhardt. [doi]
- Connecting Joint-Embedding Predictive Architecture with Contrastive Self-supervised LearningShentong Mo, Peter Tong. [doi]
- Learning-Augmented Algorithms for the Bahncard ProblemHailiang Zhao, Xueyan Tang, Peng Chen, ShuiGuang Deng. [doi]
- AlphaTablets: A Generic Plane Representation for 3D Planar Reconstruction from Monocular VideosYuze He, Wang Zhao 0001, Shaohui Liu, Yubin Hu 0001, Yushi Bai, Yu-Hui Wen, Yongjin Liu 0001. [doi]
- Improved Regret for Bandit Convex Optimization with Delayed FeedbackYuanyu Wan, Chang Yao, Mingli Song, Lijun Zhang 0005. [doi]
- A Study of Plasticity Loss in On-Policy Deep Reinforcement LearningArthur Juliani, Jordan T. Ash. [doi]
- UnlearnCanvas: Stylized Image Dataset for Enhanced Machine Unlearning Evaluation in Diffusion ModelsYihua Zhang, Chongyu Fan, Yimeng Zhang, Yuguang Yao, Jinghan Jia, Jiancheng Liu, Gaoyuan Zhang, Gaowen Liu, Ramana Kompella, Xiaoming Liu 0002, Sijia Liu 0001. [doi]
- Denoising Diffusion Path: Attribution Noise Reduction with An Auxiliary Diffusion ModelYiming Lei, Zilong Li, Junping Zhang, Hongming Shan. [doi]
- Does Worst-Performing Agent Lead the Pack? Analyzing Agent Dynamics in Unified Distributed SGDJie Hu, Yi-Ting Ma, Do Young Eun. [doi]
- Peri-midFormer: Periodic Pyramid Transformer for Time Series AnalysisQiang Wu, Gechang Yao, Zhixi Feng, Shuyuan Yang. [doi]
- Towards General Loop Invariant Generation: A Benchmark of Programs with Memory ManipulationChang Liu, Xiwei Wu, Yuan Feng 0001, Qinxiang Cao, Junchi Yan. [doi]
- Enhancing In-Context Learning Performance with just SVD-Based Weight Pruning: A Theoretical PerspectiveXinhao Yao, Xiaolin Hu, Shenzhi Yang, Yong Liu. [doi]
- Leveraging an ECG Beat Diffusion Model for Morphological Reconstruction from Indirect SignalsLisa Bedin, Gabriel Cardoso 0001, Josselin Duchateau, Rémi Dubois, Eric Moulines. [doi]
- Construction and Application of Materials Knowledge Graph in Multidisciplinary Materials Science via Large Language ModelYanpeng Ye, Jie Ren, Shaozhou Wang, Yuwei Wan, Imran Razzak, Bram Hoex, Haofen Wang, Tong Xie, Wenjie Zhang 0001. [doi]
- SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated DataJialu Li 0001, Jaemin Cho 0001, Yi-Lin Sung, Jaehong Yoon, Mohit Bansal. [doi]
- Towards the Dynamics of a DNN Learning Symbolic InteractionsQihan Ren, Junpeng Zhang, Yang Xu, Yue Xin, Dongrui Liu, Quanshi Zhang. [doi]
- Dissecting the Failure of Invariant Learning on GraphsQixun Wang 0002, Yifei Wang 0001, Yisen Wang 0001, Xianghua Ying. [doi]
- An Offline Adaptation Framework for Constrained Multi-Objective Reinforcement LearningQian Lin, Zongkai Liu, Danying Mo, Chao Yu 0004. [doi]
- Evaluating language models as risk scoresAndré F. Cruz, Moritz Hardt, Celestine Mendler-Dünner. [doi]
- Hyperbolic Embeddings of Supervised ModelsRichard Nock, Ehsan Amid, Frank Nielsen, Alexander Soen, Manfred K. Warmuth. [doi]
- Subject-driven Text-to-Image Generation via Preference-based Reinforcement LearningYanting Miao, William Loh, Suraj Kothawade, Pascal Poupart, Abdullah Rashwan, Yeqing Li. [doi]
- GTSinger: A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing TasksYu Zhang 0126, Changhao Pan, Wenxiang Guo, Ruiqi Li, Zhiyuan Zhu, Jialei Wang, Wenhao Xu, Jingyu Lu, Zhiqing Hong, Chuxin Wang, Lichao Zhang, Jinzheng He, Ziyue Jiang 0001, Yuxin Chen, Chen Yang, Jiecheng Zhou, Xinyu Cheng, Zhou Zhao. [doi]
- SCube: Instant Large-Scale Scene Reconstruction using VoxSplatsXuanchi Ren, Yifan Lu, Hanxue Liang, Jay Zhangjie Wu, Huan Ling, Mike Chen, Sanja Fidler, Francis Williams, Jiahui Huang. [doi]
- UrbanKGent: A Unified Large Language Model Agent Framework for Urban Knowledge Graph ConstructionYansong Ning, Hao Liu. [doi]
- TFGDA: Exploring Topology and Feature Alignment in Semi-supervised Graph Domain Adaptation through Robust ClusteringJun Dan, Weiming Liu 0005, Chunfeng Xie, Hua Yu 0006, Shunjie Dong, Yanchao Tan. [doi]
- ODGS: 3D Scene Reconstruction from Omnidirectional Images with 3D Gaussian SplattingsSuyoung Lee, Jaeyoung Chung, Jaeyoo Huh, Kyoung Mu Lee. [doi]
- CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker ConversationsLeying Zhang, Yao Qian, Long Zhou, Shujie Liu 0001, Dongmei Wang, Xiaofei Wang, Midia Yousefi, Yanmin Qian, Jinyu Li 0001, Lei He 0005, Sheng Zhao, Michael Zeng 0001. [doi]
- Don't Compress Gradients in Random Reshuffling: Compress Gradient DifferencesAbdurakhmon Sadiev, Grigory Malinovsky, Eduard Gorbunov, Igor Sokolov 0001, Ahmed Khaled 0001, Konstantin Burlachenko, Peter Richtárik. [doi]
- Nearly Tight Black-Box Auditing of Differentially Private Machine LearningMeenatchi Sundaram Muthu Selva Annamalai, Emiliano De Cristofaro. [doi]
- Dynamic Conditional Optimal Transport through Simulation-Free FlowsGavin Kerrigan, Giosue Migliorini, Padhraic Smyth. [doi]
- MECD: Unlocking Multi-Event Causal Discovery in Video ReasoningTieyuan Chen, Huabin Liu 0001, Tianyao He, Yihang Chen, Chaofan Gan, Xiao Ma, Cheng Zhong, Yang Zhang, Yingxue Wang, Hui Lin, Weiyao Lin. [doi]
- PANORAMIA: Privacy Auditing of Machine Learning Models without RetrainingMishaal Kazmi, Hadrien Lautraite, Alireza Akbari, Qiaoyue Tang, Mauricio Soroco, Tao Wang, Sébastien Gambs, Mathias Lécuyer. [doi]
- Distribution Guidance Network for Weakly Supervised Point Cloud Semantic SegmentationZhiyi Pan, Wei Gao 0003, Shan Liu 0001, Ge Li 0002. [doi]
- Model Reconstruction Using Counterfactual Explanations: A Perspective From Polytope TheoryPasan Dissanayake, Sanghamitra Dutta. [doi]
- Memorize What Matters: Emergent Scene Decomposition from MultitraverseYiming Li, Zehong Wang, Yue Wang 0036, Zhiding Yu, Zan Gojcic, Marco Pavone 0001, Chen Feng 0002, José M. Álvarez 0004. [doi]
- OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning DatasetShubham Toshniwal, Ivan Moshkov, Sean Narenthiran, Daria Gitman, Fei Jia, Igor Gitman. [doi]
- Reranking Laws for Language Generation: A Communication-Theoretic PerspectiveAntónio Farinhas, Haau-Sing Li, André Martins. [doi]
- Intrinsic Self-Supervision for Data Quality AuditsFabian Gröger, Simone Lionetti, Philippe Gottfrois, Álvaro González-Jiménez, Ludovic Amruthalingam, Matthew Groh, Alexander A. Navarini, Marc Pouly. [doi]
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human SupervisionZhiqing Sun, Longhui Yu, Yikang Shen, Weiyang Liu, Yiming Yang, Sean Welleck, Chuang Gan. [doi]
- Maximum Entropy Inverse Reinforcement Learning of Diffusion Models with Energy-Based ModelsSangwoong Yoon, Himchan Hwang, Dohyun Kwon 0002, Yung-Kyun Noh, Frank C. Park 0001. [doi]
- The Impact of Initialization on LoRA Finetuning DynamicsSoufiane Hayou, Nikhil Ghosh, Bin Yu 0001. [doi]
- DistrictNet: Decision-aware learning for geographical districtingCheikh Ahmed, Alexandre Forel, Axel Parmentier, Thibaut Vidal. [doi]
- Addressing Bias in Online Selection with Limited Budget of ComparisonsZiyad Benomar, Evgenii Chzhen, Nicolas Schreuder, Vianney Perchet. [doi]
- Noise-Aware Differentially Private Regression via Meta-LearningOssi Räisä, Stratis Markou, Matthew Ashman, Wessel P. Bruinsma, Marlon Tobaben, Antti Honkela, Richard E. Turner. [doi]
- BIOSCAN-5M: A Multimodal Dataset for Insect BiodiversityZahra Gharaee, Scott C. Lowe, ZeMing Gong, Pablo Millan Arias, Nicholas Pellegrino, Austin T. Wang, Joakim Bruslund Haurum, Iuliia Eyriay, Lila Kari, Dirk Steinke, Graham W. Taylor, Paul W. Fieguth, Angel X. Chang. [doi]
- Causal Inference in the Closed-Loop: Marginal Structural Models for Sequential Excursion EffectsAlexander Levis, Gabriel Loewinger, Francisco Pereira. [doi]
- Injecting Undetectable Backdoors in Obfuscated Neural Networks and Language ModelsAlkis Kalavasis, Amin Karbasi, Argyris Oikonomou, Katerina Sotiraki, Grigoris Velegkas, Manolis Zampetakis. [doi]
- The Evolution of Statistical Induction Heads: In-Context Learning Markov ChainsEzra Edelman, Nikolaos Tsilivis 0002, Benjamin L. Edelman, Eran Malach, Surbhi Goel. [doi]
- Beating Adversarial Low-Rank MDPs with Unknown Transition and Bandit FeedbackHaolin Liu, Zakaria Mhammedi, Chen-Yu Wei, Julian Zimmert. [doi]
- Learning General Parameterized Policies for Infinite Horizon Average Reward Constrained MDPs via Primal-Dual Policy Gradient AlgorithmQinbo Bai, Washim Uddin Mondal, Vaneet Aggarwal. [doi]
- Fast Graph Sharpness-Aware Minimization for Enhancing and Accelerating Few-Shot Node ClassificationYihong Luo, Yuhan Chen, Siya Qiu, Yiwei Wang, Chen Zhang 0013, Yan Zhou, Xiaochun Cao, Jing Tang 0004. [doi]
- Pearls from Pebbles: Improved Confidence Functions for Auto-labelingHarit Vishwakarma, Yi Chen, Sui Jiet Tay, Satya Sai Srinath Namburi, Frederic Sala, Ramya Korlakai Vinayak. [doi]
- A Global Depth-Range-Free Multi-View Stereo Transformer Network with Pose EmbeddingYitong Dong, Yijin Li, Zhaoyang Huang, Weikang Bian, Jingbo Liu, Hujun Bao, Zhaopeng Cui, Hongsheng Li 0001, Guofeng Zhang 0001. [doi]
- Multi-view Masked Contrastive Representation Learning for Endoscopic Video AnalysisKai Hu, Ye Xiao, Yuan Zhang, Xieping Gao. [doi]
- Matching the Statistical Query Lower Bound for k-Sparse Parity Problems with Sign Stochastic Gradient DescentYiwen Kou, Zixiang Chen, Quanquan Gu, Sham M. Kakade. [doi]
- On scalable oversight with weak LLMs judging strong LLMsZachary Kenton, Noah Y. Siegel, János Kramár, Jonah Brown-Cohen, Samuel Albanie, Jannis Bulian, Rishabh Agarwal, David Lindner, Yunhao Tang, Noah D. Goodman, Rohin Shah. [doi]
- Text2NKG: Fine-Grained N-ary Relation Extraction for N-ary relational Knowledge Graph ConstructionHaoran Luo 0001, Haihong E, Yuhao Yang 0006, Tianyu Yao, Yikai Guo, Zichen Tang, Wentai Zhang 0004, Shiyao Peng, Kaiyang Wan, Meina Song, Wei Lin, Yifan Zhu 0001, Anh Tuan Luu. [doi]
- Compositional Automata Embeddings for Goal-Conditioned Reinforcement LearningBeyazit Yalcinkaya, Niklas Lauffer, Marcell Vazquez-Chanlatte, Sanjit Seshia. [doi]
- Aligner-Encoders: Self-Attention Transformers Can Be Self-TransducersAdam Stooke, Rohit Prabhavalkar, Khe Chai Sim, Pedro Moreno Mengibar. [doi]
- Beyond Optimism: Exploration With Partially Observable RewardsSimone Parisi, Alireza Kazemipour, Michael Bowling. [doi]
- ActAnywhere: Subject-Aware Video Background GenerationBoxiao Pan, Zhan Xu, Chun-Hao Paul Huang, Krishna Kumar Singh, Yang Zhou 0009, Leonidas J. Guibas, Jimei Yang. [doi]
- Kraken: Inherently Parallel Transformers For Efficient Multi-Device InferenceRohan Baskar Prabhakar, Hengrui Zhang, David Wentzlaff. [doi]
- Quantifying Aleatoric Uncertainty of the Treatment Effect: A Novel Orthogonal LearnerValentyn Melnychuk, Stefan Feuerriegel, Mihaela van der Schaar. [doi]
- Self-Guiding Exploration for Combinatorial ProblemsZangir Iklassov, Yali Du 0001, Farkhad Akimov, Martin Takác 0001. [doi]
- SCaR: Refining Skill Chaining for Long-Horizon Robotic Manipulation via Dual RegularizationZixuan Chen, Ze Ji, Jing Huo, Yang Gao. [doi]
- CryoGEM: Physics-Informed Generative Cryo-Electron MicroscopyJiakai Zhang, Qihe Chen, Yan Zeng, Wenyuan Gao, Xuming He 0001, Zhijie Liu, Jingyi Yu. [doi]
- Fisher Flow Matching for Generative Modeling over Discrete DataOscar Davis, Samuel Kessler, Mircea Petrache, Ismail Ilkan Ceylan, Michael M. Bronstein, Avishek Joey Bose. [doi]
- A Globally Optimal Portfolio for m-Sparse Sharpe Ratio MaximizationYizun Lin, Zhao-Rong Lai, Cheng Li 0018. [doi]
- SurgicAI: A Hierarchical Platform for Fine-Grained Surgical Policy Learning and BenchmarkingJin Wu, Haoying Zhou, Peter Kazanzides, Adnan Munawar, Anqi Liu. [doi]
- Mitigating Reward Overoptimization via Lightweight Uncertainty EstimationXiaoying Zhang, Jean-Francois Ton, Wei Shen, Hongning Wang, Yang Liu 0018. [doi]
- A Bayesian Approach for Personalized Federated Learning in Heterogeneous SettingsDisha Makhija, Joydeep Ghosh, Nhat Ho. [doi]
- DiffHammer: Rethinking the Robustness of Diffusion-Based Adversarial PurificationKaibo Wang, Xiaowen Fu, Yuxuan Han, Yang Xiang. [doi]
- Derivatives of Stochastic Gradient Descent in parametric optimizationFranck Iutzeler, Edouard Pauwels, Samuel Vaiter. [doi]
- MimicTalk: Mimicking a personalized and expressive 3D talking face in minutesZhenhui Ye, Tianyun Zhong, Yi Ren 0006, Ziyue Jiang 0001, Jiawei Huang 0008, Rongjie Huang, Jinglin Liu, Jinzheng He, Chen Zhang 0020, Zehan Wang 0001, Xize Cheng, Xiang Yin 0006, Zhou Zhao. [doi]
- Noise Contrastive Alignment of Language Models with Explicit RewardsHuayu Chen, Guande He, Lifan Yuan, Ganqu Cui, Hang Su, Jun Zhu. [doi]
- Parameter Efficient Adaptation for Image Restoration with Heterogeneous Mixture-of-ExpertsHang Guo, Tao Dai 0001, Yuanchao Bai, Bin Chen 0011, Xudong Ren, Zexuan Zhu, Shu-Tao Xia. [doi]
- Active Learning with LLMs for Partially Observed and Cost-Aware ScenariosNicolás Astorga, Tennison Liu, Nabeel Seedat, Mihaela van der Schaar. [doi]
- Weak-eval-Strong: Evaluating and Eliciting Lateral Thinking of LLMs with Situation PuzzlesQi Chen, Bowen Zhang, Gang Wang, Qi Wu. [doi]
- Unified Guidance for Geometry-Conditioned Molecular GenerationSirine Ayadi, Leon Hetzel, Johanna Sommer, Fabian J. Theis, Stephan Günnemann. [doi]
- Capturing the denoising effect of PCA via compression ratioChandra Sekhar Mukherjee, Nikhil Deorkar, Jiapeng Zhang. [doi]
- Improving Subgroup Robustness via Data SelectionSaachi Jain, Kimia Hamidieh, Kristian Georgiev, Andrew Ilyas, Marzyeh Ghassemi, Aleksander Madry. [doi]
- Seeing the Image: Prioritizing Visual Correlation by Contrastive AlignmentXin Xiao, Bohong Wu, Jiacong Wang, Chunyuan Li, Xun Zhou, Haoyuan Guo. [doi]
- ROIDICE: Offline Return on Investment Maximization for Efficient Decision MakingWoosung Kim, Hayeong Lee, Jongmin Lee, Byung Jun Lee. [doi]
- Unveiling Induction Heads: Provable Training Dynamics and Feature Learning in TransformersSiyu Chen, Heejune Sheen, Tianhao Wang, Zhuoran Yang. [doi]
- Chain of Thoughtlessness? An Analysis of CoT in PlanningKaya Stechly, Karthik Valmeekam, Subbarao Kambhampati. [doi]
- Overcoming the Sim-to-Real Gap: Leveraging Simulation to Learn to Explore for Real-World RLAndrew Wagenmaker, Kevin Huang, Liyiming Ke, Kevin G. Jamieson, Abhishek Gupta 0004. [doi]
- John Ellipsoids via Lazy UpdatesDavid P. Woodruff, Taisuke Yasuda 0002. [doi]
- Efficient Sketches for Training Data Attribution and Studying the Loss LandscapeAndrea Schioppa. [doi]
- Improving Deep Learning Optimization through Constrained Parameter RegularizationJörg K. H. Franke, Michael Hefenbrock, Gregor Köhler, Frank Hutter. [doi]
- Provable Acceleration of Nesterov's Accelerated Gradient for Asymmetric Matrix Factorization and Linear Neural NetworksZhenghao Xu, Yuqing Wang, Tuo Zhao, Rachel Ward, Molei Tao. [doi]
- AdvAD: Exploring Non-Parametric Diffusion for Imperceptible Adversarial AttacksJin Li, Ziqiang He, Anwei Luo, Jian-Fang Hu, Z. Jane Wang 0001, Xiangui Kang. [doi]
- Markovian Flow Matching: Accelerating MCMC with Continuous Normalizing FlowsAlberto Cabezas, Louis Sharrock, Christopher Nemeth. [doi]
- PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language ModelsFanxu Meng, Zhaohui Wang, Muhan Zhang. [doi]
- Improving Decision SparsityYiyang Sun, Tong Wang 0011, Cynthia Rudin. [doi]
- SM3-Text-to-Query: Synthetic Multi-Model Medical Text-to-Query BenchmarkSithursan Sivasubramaniam, Cedric Osei-Akoto, Yi Zhang, Kurt Stockinger, Jonathan Fürst. [doi]
- On Socially Fair Low-Rank Approximation and Column Subset SelectionZhao Song 0002, Ali Vakilian, David P. Woodruff, Samson Zhou. [doi]
- Customized Subgraph Selection and Encoding for Drug-drug Interaction PredictionHaotong Du, Quanming Yao, Juzheng Zhang, Yang Liu, Zhen Wang. [doi]
- Octopus: A Multi-modal LLM with Parallel Recognition and Sequential UnderstandingChuyang Zhao, Yuxin Song, Junru Chen, Kang Rong, Haocheng Feng, Gang Zhang, Shufan Ji, Jingdong Wang 0001, Errui Ding, Yifan Sun 0003. [doi]
- Public-data Assisted Private Stochastic Optimization: Power and LimitationsEnayat Ullah, Michael Menart, Raef Bassily, Cristóbal Guzmán, Raman Arora. [doi]
- Understanding Visual Feature Reliance through the Lens of ComplexityThomas Fel, Louis Béthune, Andrew K. Lampinen, Thomas Serre, Katherine L. Hermann. [doi]
- Neural Embeddings Rank: Aligning 3D latent dynamics with movementsChenggang Chen, Zhiyu Yang, Xiaoqin Wang. [doi]
- Variational Distillation of Diffusion Policies into Mixture of ExpertsHongyi Zhou, Denis Blessing, Ge Li, Onur Celik, Xiaogang Jia, Gerhard Neumann, Rudolf Lioutikov. [doi]
- Breaking Determinism: Fuzzy Modeling of Sequential Recommendation Using Discrete State Space Diffusion ModelWenjia Xie, Hao Wang 0076, Luankang Zhang, Rui Zhou, Defu Lian, Enhong Chen. [doi]
- Revisiting Ensembling in One-Shot Federated LearningYoussef Allouah, Akash Dhasade, Rachid Guerraoui, Nirupam Gupta, Anne-Marie Kermarrec, Rafael Pinot, Rafael Pires 0001, Rishi Sharma 0001. [doi]
- Physics-Regularized Multi-Modal Image Assimilation for Brain Tumor LocalizationMichal Balcerak, Tamaz Amiranashvili, Andreas Wagner, Jonas Weidner, Petr Karnakov, Johannes C. Paetzold, Ivan Ezhov, Petros Koumoutsakos, Benedikt Wiestler, Bjoern H. Menze. [doi]
- Generate Universal Adversarial Perturbations for Few-Shot LearningYiman Hu, Yixiong Zou, Ruixuan Li 0001, Yuhua Li 0003. [doi]
- FlashAttention-3: Fast and Accurate Attention with Asynchrony and Low-precisionJay Shah, Ganesh Bikshandi, Ying Zhang, Vijay Thakkar, Pradeep Ramani, Tri Dao. [doi]
- ProSST: Protein Language Modeling with Quantized Structure and Disentangled AttentionMingchen Li, Yang Tan, Xinzhu Ma, Bozitao Zhong, Huiqun Yu, Ziyi Zhou, Wanli Ouyang, Bingxin Zhou, Pan Tan, Liang Hong. [doi]
- Amnesia as a Catalyst for Enhancing Black Box Pixel Attacks in Image Classification and Object DetectionDongsu Song, Daehwa Ko, Jay Hoon Jung. [doi]
- WildGaussians: 3D Gaussian Splatting In the WildJonas Kulhanek, Songyou Peng, Zuzana Kukelova, Marc Pollefeys, Torsten Sattler. [doi]
- Transfer Q-star : Principled Decoding for LLM AlignmentSouradip Chakraborty, Soumya Suvra Ghosal, Ming Yin 0003, Dinesh Manocha, Mengdi Wang, Amrit Singh Bedi, Furong Huang. [doi]
- Adaptive Layer Sparsity for Large Language Models via Activation Correlation AssessmentWei Li, Lujun Li, Mark Lee, Shengjie Sun. [doi]
- Navigating the Safety Landscape: Measuring Risks in Finetuning Large Language ModelsShengyun Peng, Pin-Yu Chen, Matthew Hull, Duen Horng Chau. [doi]
- Predictive Attractor ModelsRamy Mounir, Sudeep Sarkar. [doi]
- S-STE: Continuous Pruning Function for Efficient 2: 4 Sparse Pre-trainingYuezhou Hu, Jun Zhu, Jianfei Chen. [doi]
- CURE4Rec: A Benchmark for Recommendation Unlearning with Deeper InfluenceChaochao Chen, Jiaming Zhang, Yizhao Zhang, Li Zhang, Lingjuan Lyu, Yuyuan Li, Biao Gong, Chenggang Yan. [doi]
- Adversarial Moment-Matching Distillation of Large Language ModelsChen Jia. [doi]
- How does Architecture Influence the Base Capabilities of Pre-trained Language Models? A Case Study Based on FFN-Wider and MoE TransformersXin Lu, Yanyan Zhao, Bing Qin 0001, Liangyu Huo, Qing Yang 0033, Dongliang Xu. [doi]
- Quantum Deep Equilibrium ModelsPhilipp Schleich, Marta Skreta, Lasse Bjørn Kristensen, Rodrigo A. Vargas-Hernández, Alán Aspuru-Guzik. [doi]
- Tackling Uncertain Correspondences for Multi-Modal Entity AlignmentLiyi Chen, Ying Sun, Shengzhe Zhang, Yuyang Ye, Wei Wu, Hui Xiong 0001. [doi]
- AMOR: A Recipe for Building Adaptable Modular Knowledge Agents Through Process FeedbackJian Guan 0002, Wei Wu 0014, Zujie Wen, Peng Xu, Hongning Wang, Minlie Huang. [doi]
- A Concept-Based Explainability Framework for Large Multimodal ModelsJayneel Parekh, Pegah Khayatan, Mustafa Shukor, Alasdair Newson, Matthieu Cord. [doi]
- Pure Message Passing Can Estimate Common Neighbor for Link PredictionKaiwen Dong, Zhichun Guo, Nitesh V. Chawla. [doi]
- Beyond Accuracy: Tracking more like Human via Visual SearchDailing Zhang, Shiyu Hu, Xiaokun Feng, Xuchen Li, Meiqi Wu, Jing Zhang, Kaiqi Huang. [doi]
- Pretrained Optimization Model for Zero-Shot Black Box OptimizationXiaobin Li, Kai Wu 0003, Yujian Betterest Li, Xiaoyu Zhang 0010, Handing Wang, Jing Liu 0006. [doi]
- Rad-NeRF: Ray-decoupled Training of Neural Radiance FieldLidong Guo, Xuefei Ning, Yonggan Fu, Tianchen Zhao, Zhuoliang Kang, Jincheng Yu, Yingyan (Celine) Lin, Yu Wang 0002. [doi]
- ReGS: Reference-based Controllable Scene Stylization with Gaussian SplattingYiqun Mei, Jiacong Xu, Vishal M. Patel. [doi]
- Value-Based Deep Multi-Agent Reinforcement Learning with Dynamic Sparse TrainingPihe Hu, Shaolong Li, Zhuoran Li, Ling Pan, Longbo Huang. [doi]
- Conformalized Time Series with Semantic FeaturesBaiting Chen, Zhimei Ren, Lu Cheng. [doi]
- ActSort: An active-learning accelerated cell sorting algorithm for large-scale calcium imaging datasetsYiqi Jiang, Hakki O. Akengin, Ji Zhou, Mehmet Aslihak, Yang Li, Radoslaw Chrapkiewicz, Oscar Hernandez, Sadegh Ebrahimi, Omar Jaidar, Yanping Zhang, Hakan Inan, Christopher Miranda, Fatih Dinc, Marta Blanco-Pozo, Mark J. Schnitzer. [doi]
- Swift Sampler: Efficient Learning of Sampler by 10 ParametersJiawei Yao, Chuming Li, Canran Xiao. [doi]
- Graph neural networks and non-commuting operatorsMauricio Velasco, Kaiying O'Hare, Bernardo Rychtenberg, Soledad Villar. [doi]
- MTGS: A Novel Framework for Multi-Person Temporal Gaze Following and Social Gaze PredictionAnshul Gupta, Samy Tafasca, Arya Farkhondeh, Pierre Vuillecard, Jean-Marc Odobez. [doi]
- Advancing Video Anomaly Detection: A Concise Review and a New DatasetLiyun Zhu, Lei Wang, Arjun Raj, Tom Gedeon, Chen Chen. [doi]
- HelpSteer 2: Open-source dataset for training top-performing reward modelsZhilin Wang, Yi Dong, Olivier Delalleau, Jiaqi Zeng, Gerald Shen, Daniel Egert, Jimmy Zhang, Makesh Narsimhan Sreedhar, Oleksii Kuchaiev. [doi]
- A Huber Loss Minimization Approach to Mean Estimation under User-level Differential PrivacyPuning Zhao, Lifeng Lai, Li Shen, Qingming Li, Jiafei Wu, Zhe Liu. [doi]
- Probabilistic Decomposed Linear Dynamical Systems for Robust Discovery of Latent Neural DynamicsYenho Chen, Noga Mudrik, Kyle A. Johnsen, Sankaraleengam Alagapan, Adam S. Charles, Christopher Rozell. [doi]
- A Single-Step, Sharpness-Aware Minimization is All You Need to Achieve Efficient and Accurate Sparse TrainingJie Ji, Gen Li 0012, Jingjing Fu, Fatemeh Afghah, Linke Guo, Xiaoyong Yuan, Xiaolong Ma. [doi]
- Gradual Domain Adaptation via Manifold-Constrained Distributionally Robust OptimizationSeyed Amir Saberi, Amir Najafi 0001, Amin Behjati, Ala Emrani, Yasaman Zolfimoselo, Mahdi Shadrooy, Abolfazl S. Motahari, Babak H. Khalaj. [doi]
- Active, anytime-valid risk controlling prediction setsZiyu Xu, Nikos Karampatziakis, Paul Mineiro. [doi]
- Invariant subspaces and PCA in nearly matrix multiplication timeAleksandros Sobczyk, Marko Mladenovic, Mathieu Luisier. [doi]
- Discovering Preference Optimization Algorithms with and for Large Language ModelsChris Lu 0001, Samuel Holt, Claudio Fanconi, Alex J. Chan, Jakob N. Foerster, Mihaela van der Schaar, Robert T. Lange. [doi]
- Conformalized Multiple Testing after Data-dependent SelectionXiaoning Wang, Yuyang Huo, Liuhua Peng, Changliang Zou. [doi]
- Identifying General Mechanism Shifts in Linear Causal RepresentationsTianyu Chen, Kevin Bello, Francesco Locatello, Bryon Aragam, Pradeep Ravikumar. [doi]
- Scalable Bayesian Optimization via Focalized Sparse Gaussian ProcessesYunyue Wei, Vincent Zhuang, Saraswati Soedarmadji, Yanan Sui. [doi]
- Beyond Slow Signs in High-fidelity Model ExtractionHanna Foerster, Robert D. Mullins, Ilia Shumailov, Jamie Hayes. [doi]
- CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision MakingZibin Dong, Yifu Yuan, Jianye Hao, Fei Ni 0001, Yi Ma 0005, Pengyi Li, Yan Zheng 0002. [doi]
- cPAPERS: A Dataset of Situated and Multimodal Interactive Conversations in Scientific PapersAnirudh Sundar, Jin Xu, William Gay, Christopher Richardson, Larry Heck. [doi]
- DiffSF: Diffusion Models for Scene Flow EstimationYushan Zhang, Bastian Wandt, Maria Magnusson, Michael Felsberg. [doi]
- Provable Benefits of Complex Parameterizations for Structured State Space ModelsYuval Ran-Milo, Eden Lumbroso, Edo Cohen-Karlik, Raja Giryes, Amir Globerson, Nadav Cohen. [doi]
- Unveiling the Hidden: Online Vectorized HD Map Construction with Clip-Level Token Interaction and PropagationNayeon Kim, Hongje Seong, Daehyun Ji, Sujin Jang. [doi]
- Task Confusion and Catastrophic Forgetting in Class-Incremental Learning: A Mathematical Framework for Discriminative and Generative ModelingsMilad Khademi Nori, Il-Min Kim 0001. [doi]
- Geometric Analysis of Nonlinear Manifold ClusteringNimita Shinde, Tianjiao Ding, Daniel P. Robinson, René Vidal. [doi]
- Reinforcement Learning Policy as Macro Regulator Rather than Macro PlacerKe Xue 0001, Ruo-Tong Chen, Xi Lin, Yunqi Shi, Shixiong Kai, Siyuan Xu, Chao Qian 0001. [doi]
- FuseMoE: Mixture-of-Experts Transformers for Fleximodal FusionXing Han, Huy Nguyen, Carl Harris, Nhat Ho, Suchi Saria. [doi]
- $\epsilon$-Softmax: Approximating One-Hot Vectors for Mitigating Label NoiseJialiang Wang, Xiong Zhou, Deming Zhai, Junjun Jiang, Xiangyang Ji, Xianming Liu. [doi]
- Empowering Visible-Infrared Person Re-Identification with Large Foundation ModelsZhangyi Hu, Bin Yang 0026, Mang Ye. [doi]
- Quality-Improved and Property-Preserved Polarimetric Imaging via Complementarily FusingChu Zhou, Yixing Liu, Chao Xu, Boxin Shi. [doi]
- Symmetries in Overparametrized Neural Networks: A Mean Field ViewJavier Maass Martínez, Joaquín Fontbona. [doi]
- Hydra: Bidirectional State Space Models Through Generalized Matrix MixersSukjun Hwang, Aakash Sunil Lahoti, Ratish Puduppully, Tri Dao, Albert Gu. [doi]
- Improving self-training under distribution shifts via anchored confidence with theoretical guaranteesTaejong Joo, Diego Klabjan. [doi]
- Non-asymptotic Analysis of Biased Adaptive Stochastic ApproximationSobihan Surendran, Adeline Fermanian, Antoine Godichon-Baggioni, Sylvain Le Corff. [doi]
- Instance-adaptive Zero-shot Chain-of-Thought PromptingXiaosong Yuan, Chen Shen 0003, Shaotian Yan, Xiaofeng Zhang, Liang Xie, Wenxiao Wang 0001, Renchu Guan, Ying Wang, Jieping Ye. [doi]
- Low-Rank Optimal Transport through Factor Relaxation with Latent CouplingPeter Halmos, Xinhao Liu 0009, Julian Gold, Benjamin J. Raphael. [doi]
- Truth is Universal: Robust Detection of Lies in LLMsLennart Bürger, Fred A. Hamprecht, Boaz Nadler. [doi]
- E2E-MFD: Towards End-to-End Synchronous Multimodal Fusion DetectionJiaqing Zhang, Mingxiang Cao, Weiying Xie, Jie Lei 0001, Daixun Li, Wenbo Huang, Yunsong Li, Xue Yang. [doi]
- Harmonizing Stochasticity and Determinism: Scene-responsive Diverse Human Motion PredictionZhenyu Lou, Qiongjie Cui, Tuo Wang, Zhenbo Song, Luoming Zhang, Cheng Cheng, Haofan Wang, Xu Tang, Huaxia Li, Hong Zhou. [doi]
- $C^2M^3$: Cycle-Consistent Multi-Model MergingDonato Crisostomi, Marco Fumero, Daniele Baieri, Florian Bernard, Emanuele Rodolà. [doi]
- Self-Calibrating Conformal PredictionLars van der Laan, Ahmed M. Alaa. [doi]
- Convergence of No-Swap-Regret Dynamics in Self-PlayRenato Paes Leme, Georgios Piliouras, Jon Schneider. [doi]
- Validating Climate Models with Spherical Convolutional Wasserstein DistanceRobert C. Garrett, Trevor Harris, Zhuo Wang, Bo Li. [doi]
- Attention Temperature Matters in ViT-Based Cross-Domain Few-Shot LearningYixiong Zou, Ran Ma, Yuhua Li 0003, Ruixuan Li 0001. [doi]
- APIGen: Automated PIpeline for Generating Verifiable and Diverse Function-Calling DatasetsZuxin Liu, Thai-Hoang, Jianguo Zhang, Ming Zhu, Tian Lan, Shirley Kokane, Juntao Tan, Weiran Yao, Zhiwei Liu 0001, Yihao Feng, Rithesh R. N., Liangwei Yang, Silvio Savarese, Juan Carlos Niebles, Huan Wang 0016, Shelby Heinecke, Caiming Xiong. [doi]
- BendVLM: Test-Time Debiasing of Vision-Language EmbeddingsWalter Gerych, Haoran Zhang 0003, Kimia Hamidieh, Eileen Pan, Maanas K. Sharma, Tom Hartvigsen, Marzyeh Ghassemi. [doi]
- An Accelerated Algorithm for Stochastic Bilevel Optimization under Unbounded SmoothnessXiaochuan Gong, Jie Hao, Mingrui Liu. [doi]
- Is Cross-validation the Gold Standard to Estimate Out-of-sample Model Performance?Garud Iyengar, Henry Lam, Tianyu Wang. [doi]
- End-to-End Ontology Learning with Large Language ModelsAndy Lo, Albert Q. Jiang, Wenda Li, Mateja Jamnik. [doi]
- On the Efficiency of ERM in Feature LearningAyoub El Hanchi, Chris J. Maddison, Murat A. Erdogdu. [doi]
- Cooperation, Competition, and Maliciousness: LLM-Stakeholders Interactive NegotiationSahar Abdelnabi, Amr Gomaa, Sarath Sivaprasad, Lea Schönherr, Mario Fritz. [doi]
- Graph-based Uncertainty Metrics for Long-form Language Model GenerationsMingjian Jiang, Yangjun Ruan, Prasanna Sattigeri, Salim Roukos, Tatsunori B. Hashimoto. [doi]
- Transferable Adversarial Attacks on SAM and Its Downstream ModelsSong Xia, Wenhan Yang, Yi Yu 0011, Xun Lin, Henghui Ding, Lingyu Duan, Xudong Jiang 0001. [doi]
- VISA: Variational Inference with Sequential Sample-Average ApproximationsHeiko Zimmermann, Christian Andersson Naesseth, Jan-Willem van de Meent. [doi]
- Dynamic Subgroup Identification in Covariate-adjusted Response-adaptive Randomization ExperimentsYanping Li, Jingshen Wang, Waverly Wei. [doi]
- Artificial Generational Intelligence: Cultural Accumulation in Reinforcement LearningJonathan Cook 0004, Chris Lu 0001, Edward Hughes 0001, Joel Z. Leibo, Jakob Foerster. [doi]
- Generalized Protein Pocket Generation with Prior-Informed Flow MatchingZaixi Zhang, Marinka Zitnik, Qi Liu 0003. [doi]
- DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset CurationYuang Ai, Xiaoqiang Zhou, Huaibo Huang, Xiaotian Han, Zhengyu Chen, Quanzeng You, Hongxia Yang. [doi]
- ManiPose: Manifold-Constrained Multi-Hypothesis 3D Human Pose EstimationCédric Rommel, Victor Letzelter, Nermin Samet, Renaud Marlet, Matthieu Cord, Patrick Pérez, Eduardo Valle. [doi]
- CLIP in Mirror: Disentangling text from visual images through reflectionTiancheng Wang, Yuguang Yang 0007, Linlin Yang, Shaohui Lin, Juan Zhang, Guodong Guo, Baochang Zhang 0001. [doi]
- Code Repair with LLMs gives an Exploration-Exploitation TradeoffHao Tang, Keya Hu, Jin Zhou, Sicheng Zhong, Wei-Long Zheng, Xujie Si, Kevin Ellis. [doi]
- Learning Better Representations From Less Data For Propositional SatisfiabilityMohamed Ghanem, Frederik Schmitt, Julian Siber, Bernd Finkbeiner. [doi]
- ChaosBench: A Multi-Channel, Physics-Based Benchmark for Subseasonal-to-Seasonal Climate PredictionJuan Nathaniel, Yongquan Qu, Tung Nguyen, Sungduk Yu, Julius Busecke, Aditya Grover, Pierre Gentine. [doi]
- Bridge the Modality and Capability Gaps in Vision-Language Model SelectionChao Yi, Yuhang He, De-Chuan Zhan, Han-Jia Ye. [doi]
- DeepITE: Designing Variational Graph Autoencoders for Intervention Target EstimationHongyuan Tao, Hang Yu, Jianguo Li. [doi]
- Randomized Truthful Auctions with Learning AgentsGagan Aggarwal, Anupam Gupta 0001, Andrés Perlroth, Grigoris Velegkas. [doi]
- Bayes-optimal learning of an extensive-width neural network from quadratically many samplesAntoine Maillard, Emanuele Troiani, Simon Martin 0008, Florent Krzakala, Lenka Zdeborová. [doi]
- SpaceByte: Towards Deleting Tokenization from Large Language ModelingKevin Slagle. [doi]
- Gradient Cuff: Detecting Jailbreak Attacks on Large Language Models by Exploring Refusal Loss LandscapesXiaomeng Hu, Pin-Yu Chen, Tsung-Yi Ho. [doi]
- Interventionally Consistent Surrogates for Complex Simulation ModelsJoel Dyer, Nicholas Bishop, Yorgos Felekis, Fabio Massimo Zennaro, Anisoara Calinescu, Theodoros Damoulas, Michael J. Wooldridge. [doi]
- StreamBench: Towards Benchmarking Continuous Improvement of Language AgentsCheng-Kuang Wu, Zhi Rui Tam, Chieh-Yen Lin, Yun-Nung Chen, Hung-yi Lee. [doi]
- Efficient Reinforcement Learning by Discovering Neural PathwaysSamin Yeasar Arnob, Riyasat Ohib, Sergey M. Plis, Amy Zhang 0001, Alessandro Sordoni, Doina Precup. [doi]
- Classification Done Right for Vision-Language Pre-TrainingZilong Huang, Qinghao Ye, Bingyi Kang, Jiashi Feng, Haoqi Fan 0001. [doi]
- NaturalBench: Evaluating Vision-Language Models on Natural Adversarial SamplesBaiqi Li, Zhiqiu Lin, Wenxuan Peng, Jean de Dieu Nyandwi, Daniel Jiang, Zixian Ma, Simran Khanuja, Ranjay Krishna, Graham Neubig, Deva Ramanan. [doi]
- MSAGPT: Neural Prompting Protein Structure Prediction via MSA Generative Pre-TrainingBo Chen 0026, Zhilei Bei, Xingyi Cheng, Pan Li, Jie Tang 0001, Le Song. [doi]
- Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in LLMsZhiyuan Hu, Chumin Liu, Xidong Feng, Yilun Zhao 0001, See-Kiong Ng, Anh Tuan Luu, Junxian He, Pang Wei W. Koh, Bryan Hooi. [doi]
- Inferring stochastic low-rank recurrent neural networks from neural dataMatthijs Pals, A Erdem Sagtekin, Felix Pei, Manuel Glöckler, Jakob H. Macke. [doi]
- A Boosting-Type Convergence Result for AdaBoost.MH with Factorized Multi-Class ClassifiersXin Zou, Zhengyu Zhou, Jingyuan Xu, Weiwei Liu. [doi]
- Connecting the Dots: LLMs can Infer and Verbalize Latent Structure from Disparate Training DataJohannes Treutlein, Dami Choi, Jan Betley, Samuel Marks, Cem Anil, Roger B. Grosse, Owain Evans. [doi]
- Two-way Deconfounder for Off-policy Evaluation in Causal Reinforcement LearningShuGuang Yu, Shuxing Fang, Ruixin Peng, Zhengling Qi, Fan Zhou, Chengchun Shi. [doi]
- Efficient Streaming Algorithms for Graphlet SamplingYann Bourreau, Marco Bressan 0002, T.-H. Hubert Chan, Qipeng Kuang, Mauro Sozio. [doi]
- Diffeomorphic interpolation for efficient persistence-based topological optimizationMathieu Carrière, Marc Theveneau, Théo Lacombe. [doi]
- Zero-shot Generalizable Incremental Learning for Vision-Language Object DetectionJieren Deng, Haojian Zhang, Kun Ding, Jianhua Hu, Xingxuan Zhang, Yunkuan Wang. [doi]
- Ultrafast classical phylogenetic method beats large protein language models on variant effect predictionSebastian Prillo, Wilson Wu, Yun Song. [doi]
- Rethinking Deep Thinking: Stable Learning of Algorithms using Lipschitz ConstraintsJay Bear, Adam Prügel-Bennett, Jonathon Hare. [doi]
- Discovering Sparsity Allocation for Layer-wise Pruning of Large Language ModelsLujun Li, Peijie Dong, Zhenheng Tang, Xiang Liu, Qiang Wang, Wenhan Luo, Wei Xue, Qifeng Liu, Xiaowen Chu, Yike Guo. [doi]
- Towards a Scalable Reference-Free Evaluation of Generative ModelsAzim Ospanov, Jingwei Zhang, Mohammad Jalali, Xuenan Cao, Andrej Bogdanov, Farzan Farnia. [doi]
- Analytically deriving Partial Information Decomposition for affine systems of stable and convolution-closed distributionsChaitanya Goswami, Amanda Merkley. [doi]
- DisCEdit: Model Editing by Identifying Discriminative ComponentsChaitanya Murti, Chiranjib Bhattacharyya. [doi]
- Noisy Dual Mirror Descent: A Near Optimal Algorithm for Jointly-DP Convex Resource AllocationDu Chen, Geoffrey A. Chua. [doi]
- Scalable Early Childhood Reading Performance PredictionZhongkai Shangguan, Zanming Huang, Eshed Ohn-Bar, Ola Ozernov-Palchik, Derek Kosty, Michael Stoolmiller, Hank Fien. [doi]
- Pre-Trained Multi-Goal Transformers with Prompt Optimization for Efficient Online AdaptationHaoqi Yuan, Yuhui Fu 0005, Feiyang Xie, Zongqing Lu. [doi]
- Boosting Weakly Supervised Referring Image Segmentation via Progressive ComprehensionZaiquan Yang, Yuhao Liu 0001, Jiaying Lin, Gerhard P. Hancke 0002, Rynson W. H. Lau. [doi]
- FIARSE: Model-Heterogeneous Federated Learning via Importance-Aware Submodel ExtractionFeijie Wu, XingChen Wang, Yaqing Wang, Tianci Liu 0003, Lu Su 0001, Jing Gao 0004. [doi]
- Learning to Assist Humans without Inferring RewardsVivek Myers, Evan Ellis, Sergey Levine, Benjamin Eysenbach, Anca D. Dragan. [doi]
- CycleNet: Enhancing Time Series Forecasting through Modeling Periodic PatternsShengsheng Lin, Weiwei Lin 0001, Xinyi Hu, Wentai Wu, Ruichao Mo, Haocheng Zhong. [doi]
- Arctique: An artificial histopathological dataset unifying realism and controllability for uncertainty quantificationJannik Franzen, Claudia Winklmayr, Vanessa Emanuela Guarino, Christoph Karg, Xiaoyan Yu, Nora Koreuber, Jan Philipp Albrecht, Philip Bischoff, Dagmar Kainmueller. [doi]
- Improved Sample Complexity for Multiclass PAC LearningSteve Hanneke, Shay Moran, Qian Zhang. [doi]
- Leveraging Environment Interaction for Automated PDDL Translation and Planning with Large Language ModelsSadegh Mahdavi, Raquel Aoki, Keyi Tang, Yanshuai Cao. [doi]
- HairDiffusion: Vivid Multi-Colored Hair Editing via Latent DiffusionYu Zeng, Yang Zhang 0012, Jiachen Liu, LinLin Shen, Kaijun Deng, Weizhao He, Jinbao Wang. [doi]
- Revisiting motion information for RGB-Event tracking with MOT philosophyTianlu Zhang, Kurt Debattista, Qiang Zhang 0020, Guiguang Ding, Jungong Han. [doi]
- Enhancing Feature Diversity Boosts Channel-Adaptive Vision TransformersChau Pham, Bryan A. Plummer. [doi]
- Piecewise deterministic generative modelsAndrea Bertazzi, Dario Shariatian, Umut Simsekli, Eric Moulines, Alain Durmus. [doi]
- The Poisson Midpoint Method for Langevin Dynamics: Provably Efficient Discretization for Diffusion ModelsSaravanan Kandasamy 0002, Dheeraj Nagaraj. [doi]
- RedPajama: an Open Dataset for Training Large Language ModelsMaurice Weber, Daniel Y. Fu, Quentin Anthony, Yonatan Oren, Shane Adams, Anton Alexandrov, Xiaozhong Lyu, Huu Nguyen, Xiaozhe Yao, Virginia Adams, Ben Athiwaratkun, Rahul Chalamala, Kezhen Chen, Max Ryabinin, Tri Dao, Percy Liang, Christopher Ré, Irina Rish, Ce Zhang 0001. [doi]
- Efficient Adversarial Training in LLMs with Continuous AttacksSophie Xhonneux, Alessandro Sordoni, Stephan Günnemann, Gauthier Gidel, Leo Schwinn. [doi]
- CRAG - Comprehensive RAG BenchmarkXiao Yang, Kai Sun, Hao Xin, Yushi Sun, Nikita Bhalla, Xiangsen Chen, Sajal Choudhary, Rongze Daniel Gui, Ziran Will Jiang, Ziyu Jiang, Lingkun Kong, Brian Moran, Jiaqi Wang, Yifan Xu, an Yan, Chenyu Yang, Eting Yuan, Hanwen Zha, Nan Tang 0001, Lei Chen 0002, Nicolas Scheffer, Yue Liu, Nirav Shah, Rakesh Wanga, Anuj Kumar, Scott Yih, Xin Dong 0001. [doi]
- TARP-VP: Towards Evaluation of Transferred Adversarial Robustness and Privacy on Label Mapping Visual Prompting ModelsZhen Chen, Yi Zhang, Fu Wang, Xingyu Zhao 0001, Xiaowei Huang 0001, Wenjie Ruan. [doi]
- FedSSP: Federated Graph Learning with Spectral Knowledge and Personalized PreferenceZihan Tan, Guancheng Wan, Wenke Huang, Mang Ye. [doi]
- Diversify, Contextualize, and Adapt: Efficient Entropy Modeling for Neural Image CodecJun Hyuk Kim, Seungeon Kim, Won-Hee Lee, Dokwan Oh. [doi]
- Depth Anything V2Lihe Yang, Bingyi Kang, Zilong Huang, Zhen Zhao 0001, Xiaogang Xu, Jiashi Feng, Hengshuang Zhao. [doi]
- MassSpecGym: A benchmark for the discovery and identification of moleculesRoman Bushuiev, Anton Bushuiev, Niek F. de Jonge, Adamo Young, Fleming Kretschmer, Raman Samusevich, Janne Heirman, Fei Wang, Luke Zhang, Kai Dührkop, Marcus Ludwig, Nils A. Haupt, Apurva Kalia, Corinna Brungs, Robin Schmid, Russell Greiner, Bo Wang, David S. Wishart, Liping Liu 0001, Juho Rousu, Wout Bittremieux, Hannes Rost, Tytus D. Mak, Soha Hassoun, Florian Huber, Justin J. J. van der Hooft, Michael A. Stravs, Sebastian Böcker, Josef Sivic, Tomás Pluskal. [doi]
- On Mesa-Optimization in Autoregressively Trained Transformers: Emergence and CapabilityChenyu Zheng, Wei Huang, Rongzhen Wang, Guoqiang Wu, Jun Zhu, Chongxuan Li. [doi]
- AutoManual: Constructing Instruction Manuals by LLM Agents via Interactive Environmental LearningMinghao Chen 0001, Yihang Li, Yanting Yang, Shiyu Yu, Binbin Lin, Xiaofei He 0001. [doi]
- A provable control of sensitivity of neural networks through a direct parameterization of the overall bi-LipschitznessYuri Kinoshita, Taro Toyoizumi. [doi]
- ReactZyme: A Benchmark for Enzyme-Reaction PredictionChenqing Hua, Bozitao Zhong, Sitao Luan, Liang Hong, Guy Wolf, Doina Precup, Shuangjia Zheng. [doi]
- A Geometric View of Data Complexity: Efficient Local Intrinsic Dimension Estimation with Diffusion ModelsHamidreza Kamkari, Brendan Leigh Ross, Rasa Hosseinzadeh, Jesse C. Cresswell, Gabriel Loaiza-Ganem. [doi]
- FUG: Feature-Universal Graph Contrastive Pre-training for Graphs with Diverse Node FeaturesJitao Zhao, Di Jin 0001, Meng Ge, Lianze Shan, Xin Wang 0030, Dongxiao He, Zhiyong Feng 0002. [doi]
- Nearly Optimal Approximation of Matrix Functions by the Lanczos MethodNoah Amsel, Tyler Chen, Anne Greenbaum, Cameron Musco, Christopher Musco. [doi]
- Semantic Density: Uncertainty Quantification for Large Language Models through Confidence Measurement in Semantic SpaceXin Qiu, Risto Miikkulainen. [doi]
- BioTrove: A Large Curated Image Dataset Enabling AI for BiodiversityChih-Hsuan Yang, Benjamin Feuer, Talukder Zaki Jubery, Zi K. Deng, Andre Nakkab, Md. Zahid Hasan, Shivani Chiranjeevi, Kelly O. Marshall, Nirmal Baishnab, Asheesh Kumar Singh, Arti Singh, Soumik Sarkar, Nirav C. Merchant, Chinmay Hegde, Baskar Ganapathysubramanian. [doi]
- Fully Explicit Dynamic Gaussian SplattingJunoh Lee, Changyeon Won, Hyunjun Jung, Inhwan Bae, Hae-Gon Jeon. [doi]
- Entrywise error bounds for low-rank approximations of kernel matricesAlexander Modell. [doi]
- Optimal Private and Communication Constraint Distributed Goodness-of-Fit Testing for Discrete Distributions in the Large Sample RegimeLasse Vuursteen. [doi]
- ID-to-3D: Expressive ID-guided 3D Heads via Score Distillation SamplingFrancesca Babiloni, Alexandros Lattas, Jiankang deng, Stefanos Zafeiriou. [doi]
- Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression LearningChenyu Yang, Xizhou Zhu, Jinguo Zhu, Weijie Su 0002, Junjie Wang, Xuan Dong, Wenhai Wang, Bin Li 0025, Jie Zhou 0001, Yu Qiao 0001, Jifeng Dai. [doi]
- The Implicit Bias of Gradient Descent toward Collaboration between Layers: A Dynamic Analysis of Multilayer PerceptionsZheng Wang 0074, Geyong Min, Wenjie Ruan. [doi]
- Transfer Learning for Diffusion ModelsYidong Ouyang, Liyan Xie, Hongyuan Zha, Guang Cheng. [doi]
- Bounds for the smallest eigenvalue of the NTK for arbitrary spherical data of arbitrary dimensionKedar Karhadkar, Michael Murray, Guido F. Montúfar. [doi]
- You Only Cache Once: Decoder-Decoder Architectures for Language ModelsYutao Sun, Li Dong 0010, Yi Zhu, Shaohan Huang, Wenhui Wang 0003, Shuming Ma, Quanlu Zhang, Jianyong Wang 0001, Furu Wei. [doi]
- Rethinking Memory and Communication Costs for Efficient Data Parallel Training of Large Language ModelsHanxiao Zhang, Lin Ju, Chan Wu, Jinjing Huang, Youshao Xiao, Zhenglei Zhou, Zhiming Fan, Zhaoxin Huan, Siyuan Li, Fanzhuang Meng, Lei Liang, Xiaolu Zhang, Jun Zhou. [doi]
- Med-Real2Sim: Non-Invasive Medical Digital Twins using Physics-Informed Self-Supervised LearningKeying Kuang, Frances Dean, Jack B. Jedlicki, David Ouyang, Anthony Philippakis, David A. Sontag, Ahmed M. Alaa. [doi]
- AmoebaLLM: Constructing Any-Shape Large Language Models for Efficient and Instant DeploymentYonggan Fu, Zhongzhi Yu, Junwei Li, Jiayi Qian, Yongan Zhang, Xiangchi Yuan, Dachuan Shi, Roman Yakunin, Yingyan (Celine) Lin. [doi]
- Continual learning with the neural tangent ensembleAri S. Benjamin, Christian-Gernot Pehle, Kyle Daruwalla. [doi]
- Mobile-Agent-v2: Mobile Device Operation Assistant with Effective Navigation via Multi-Agent CollaborationJunyang Wang 0001, Haiyang Xu, Haitao Jia, Xi Zhang, Ming Yan, Weizhou Shen, Ji Zhang 0011, Fei Huang 0004, Jitao Sang. [doi]
- Fast Encoder-Based 3D from Casual Videos via Point Track ProcessingYoni Kasten, Wuyue Lu 0004, Haggai Maron. [doi]
- Probing Social Bias in Labor Market Text Generation by ChatGPT: A Masked Language Model ApproachLei Ding 0013, Yang Hu, Nicole Denier, Enze Shi, Junxi Zhang, Qirui Hu, Karen D. Hughes, Linglong Kong, Bei Jiang. [doi]
- STaRK: Benchmarking LLM Retrieval on Textual and Relational Knowledge BasesShirley Wu, Shiyu Zhao, Michihiro Yasunaga, Kexin Huang, Kaidi Cao, Qian Huang, Vassilis N. Ioannidis, Karthik Subbian, James Y. Zou, Jure Leskovec. [doi]
- LACIE: Listener-Aware Finetuning for Calibration in Large Language ModelsElias Stengel-Eskin, Peter Hase, Mohit Bansal. [doi]
- Long-range Brain Graph TransformerShuo Yu, Shan Jin, Ming Li, Tabinda Sarwar, Feng Xia 0001. [doi]
- Kernel PCA for Out-of-Distribution DetectionKun Fang 0004, Qinghua Tao, Kexin Lv, Mingzhen He, Xiaolin Huang, Jie Yang 0002. [doi]
- Referring Human Pose and Mask Estimation In the WildBo Miao, Mingtao Feng, Zijie Wu, Mohammed Bennamoun, Yongsheng Gao 0001, Ajmal Mian. [doi]
- DiffGS: Functional Gaussian Splatting DiffusionJunsheng Zhou, Weiqi Zhang, Yu-Shen Liu. [doi]
- Soft Superpixel Neighborhood AttentionKent W. Gauen, Stanley H. Chan. [doi]
- VidMan: Exploiting Implicit Dynamics from Video Diffusion Model for Effective Robot ManipulationYoupeng Wen, Junfan Lin, Yi Zhu 0004, Jianhua Han, Hang Xu 0004, Shen Zhao, Xiaodan Liang. [doi]
- Sample Selection via Contrastive Fragmentation for Noisy Label RegressionChris Dongjoo Kim, Sangwoo Moon 0001, Jihwan Moon 0002, Dongyeon Woo, Gunhee Kim. [doi]
- SimPO: Simple Preference Optimization with a Reference-Free RewardYu Meng 0001, Mengzhou Xia, Danqi Chen 0001. [doi]
- Animate3D: Animating Any 3D Model with Multi-view Video DiffusionYanqin Jiang, Chaohui Yu, Chenjie Cao, Fan Wang 0019, Weiming Hu, Jin Gao. [doi]
- Continual Learning in the Frequency DomainRuiqi Liu, Boyu Diao, Libo Huang, Zijia An, Zhulin An, Yongjun Xu 0001. [doi]
- PROSPECT PTMs: Rich Labeled Tandem Mass Spectrometry Dataset of Modified Peptides for Machine Learning in ProteomicsWassim Gabriel, Omar Shouman, Eva Ayla Schröder, Florian Bößl, Mathias Wilhelm 0001. [doi]
- Learning to Decouple the Lights for 3D Face Texture ModelingTianxin Huang, Zhenyu Zhang 0005, Ying Tai, Gim Hee Lee. [doi]
- Fast Rates in Stochastic Online Convex Optimization by Exploiting the Curvature of Feasible SetsTaira Tsuchiya, Shinji Ito. [doi]
- Stochastic Zeroth-Order Optimization under Strongly Convexity and Lipschitz Hessian: Minimax Sample ComplexityQian Yu, Yining Wang, Baihe Huang, Qi Lei, Jason D. Lee. [doi]
- Symmetry-Informed Governing Equation DiscoveryJianke Yang, Wang Rao, Nima Dehmamy, Robin Walters, Rose Yu. [doi]
- Collaborative Cognitive Diagnosis with Disentangled Representation Learning for Learner ModelingWeibo Gao, Qi Liu, Linan Yue, Fangzhou Yao, Hao Wang, Yin Gu, Zheng Zhang. [doi]
- Targeted Sequential Indirect Experiment DesignElisabeth Ailer, Niclas Dern, Jason S. Hartford, Niki Kilbertus. [doi]
- GrounDiT: Grounding Diffusion Transformers via Noisy Patch TransplantationYuseung Lee, Taehoon Yoon, Minhyuk Sung. [doi]
- Multi-Instance Partial-Label Learning with Margin AdjustmentWei Tang, Yin-Fang Yang, Zhaofei Wang, Weijia Zhang, Min-Ling Zhang. [doi]
- Reconstruction Attacks on Machine Unlearning: Simple Models are VulnerableMartín Bertran, Shuai Tang, Michael Kearns, Jamie H. Morgenstern, Aaron Roth 0001, Steven Z. Wu. [doi]
- Flipped Classroom: Aligning Teacher Attention with Student in Generalized Category DiscoveryHaonan Lin, Wenbin An, Jiahao Wang, Yan Chen, Feng Tian, Mengmeng Wang, QianYing Wang, Guang Dai, Jingdong Wang 0001. [doi]
- Neural Gaffer: Relighting Any Object via DiffusionHaian Jin, Yuan Li, Fujun Luan, Yuanbo Xiangli, Sai Bi, Kai Zhang, Zexiang Xu, Jin Sun 0009, Noah Snavely. [doi]
- Surge Phenomenon in Optimal Learning Rate and Batch Size ScalingShuaipeng Li, Penghao Zhao, Hailin Zhang 0004, Xingwu Sun, Hao Wu, Dian Jiao, Weiyan Wang, Chengjun Liu, Zheng Fang, Jinbao Xue, Yangyu Tao, Bin Cui 0001, Di Wang. [doi]
- Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMsAbhimanyu Hans, John Kirchenbauer, Yuxin Wen, Neel Jain, Hamid Kazemi, Prajwal Singhania, Siddharth Singh, Gowthami Somepalli, Jonas Geiping, Abhinav Bhatele, Tom Goldstein. [doi]
- Faster Differentially Private Top-k Selection: A Joint Exponential Mechanism with PruningHao Wu, Hanwen Zhang. [doi]
- MR-Ben: A Meta-Reasoning Benchmark for Evaluating System-2 Thinking in LLMsZhongshen Zeng, Yinhong Liu, Yingjia Wan, Jingyao Li, Pengguang Chen, Jianbo Dai, Yuxuan Yao, Rongwu Xu, Zehan Qi, Wanru Zhao, Linling Shen, Jianqiao Lu, Haochen Tan, Yukang Chen, Hao Zhang, Zhan Shi, Bailin Wang, Zhijiang Guo, Jiaya Jia. [doi]
- Stabilized Proximal-Point Methods for Federated OptimizationXiaowen Jiang, Anton Rodomanov, Sebastian U. Stich. [doi]
- Tree of Attacks: Jailbreaking Black-Box LLMs AutomaticallyAnay Mehrotra, Manolis Zampetakis, Paul Kassianik, Blaine Nelson, Hyrum S. Anderson, Yaron Singer, Amin Karbasi. [doi]
- MetaUAS: Universal Anomaly Segmentation with One-Prompt Meta-LearningBin-Bin Gao. [doi]
- Hierarchical Visual Feature Aggregation for OCR-Free Document UnderstandingJaeyoo Park, Jin-Young Choi, Jeonghyung Park, Bohyung Han. [doi]
- TabularBench: Benchmarking Adversarial Robustness for Tabular Deep Learning in Real-world Use-casesThibault Simonetto, Salah Ghamizi, Maxime Cordy. [doi]
- LiT: Unifying LiDAR "Languages" with LiDAR TranslatorYixing Lao, Tao Tang, Xiaoyang Wu 0002, Peng Chen, Kaicheng Yu, Hengshuang Zhao. [doi]
- CNCA: Toward Customizable and Natural Generation of Adversarial Camouflage for Vehicle DetectorsLinye Lyu, Jiawei Zhou, Daojing He, Yu Li 0007. [doi]
- Demystify Mamba in Vision: A Linear Attention PerspectiveDongchen Han, Ziyi Wang, Zhuofan Xia, Yizeng Han, Yifan Pu, Chunjiang Ge, Jun Song, Shiji Song, Bo Zheng, Gao Huang 0001. [doi]
- Learning to Embed Distributions via Maximum Kernel EntropyOleksii Kachaiev, Stefano Recanatesi. [doi]
- Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot LearningHaoyi Zhu, Yating Wang, Di Huang, Weicai Ye, Wanli Ouyang, Tong He 0001. [doi]
- AdaSociety: An Adaptive Environment with Social Structures for Multi-Agent Decision-MakingYizhe Huang, Xingbo Wang, Hao Liu, Fanqi Kong, Aoyang Qin, Min Tang 0001, Xiaoxi Wang, Song Chun Zhu, Mingjie Bi, Siyuan Qi, Xue Feng. [doi]
- Nearest Neighbor Speculative Decoding for LLM Generation and AttributionMinghan Li 0002, Xilun Chen 0002, Ari Holtzman, Beidi Chen, Jimmy Lin, Scott Yih, Victoria Lin 0002. [doi]
- Safe Exploitative Play with Untrusted Type BeliefsTongxin Li, Tinashe Handina, Shaolei Ren, Adam Wierman. [doi]
- Higher-Order Causal Message Passing for Experimentation with Complex InterferenceMohsen Bayati, Yuwei Luo, William Overman, Mohamad Sadegh Shirani Faradonbeh, Ruoxuan Xiong. [doi]
- Differential Privacy in Scalable General Kernel Learning via $K$-means Nystr{\"o}m Random FeaturesBonwoo Lee, Jeongyoun Ahn, Cheolwoo Park. [doi]
- Goal Conditioned Reinforcement Learning for Photo Finishing TuningJiarui Wu, Yujin Wang, Lingen Li, Zhang Fan, Tianfan Xue. [doi]
- Keeping LLMs Aligned After Fine-tuning: The Crucial Role of Prompt TemplatesKaifeng Lyu, Haoyu Zhao, Xinran Gu, Dingli Yu, Anirudh Goyal, Sanjeev Arora. [doi]
- MedCalc-Bench: Evaluating Large Language Models for Medical CalculationsNikhil Khandekar, Qiao Jin 0001, Guangzhi Xiong, Soren Dunn, Serina S. Applebaum, Zain Anwar, Maame Sarfo-Gyamfi, Conrad W. Safranek, Abid A Anwar, Andrew Zhang, Aidan Gilson, Maxwell B. Singer, Amisha D. Dave, Andrew Taylor, Aidong Zhang, Qingyu Chen 0001, Zhiyong Lu. [doi]
- How to Boost Any Loss FunctionRichard Nock, Yishay Mansour. [doi]
- Adversarial Schrödinger Bridge MatchingNikita Gushchin, Daniil Selikhanovych, Sergei Kholkin, Evgeny Burnaev, Alexander Korotin. [doi]
- Beyond Accuracy: Ensuring Correct Predictions With Correct RationalesTang Li 0005, Mengmeng Ma 0002, Xi Peng 0005. [doi]
- AdaFlow: Imitation Learning with Variance-Adaptive Flow-Based PoliciesXixi Hu 0001, Qiang Liu, Xingchao Liu, Bo Liu. [doi]
- BertaQA: How Much Do Language Models Know About Local Culture?Julen Etxaniz, Gorka Azkune, Aitor Soroa, Oier Lopez de Lacalle, Mikel Artetxe. [doi]
- Optimization Algorithm Design via Electric CircuitsStephen Boyd, Tetiana Parshakova, Ernest K. Ryu, Jaewook J. Suh. [doi]
- Iteratively Refined Early Interaction Alignment for Subgraph Matching based Graph RetrievalAshwin Ramachandran, Vaibhav Raj, Indradyumna Roy, Soumen Chakrabarti, Abir De. [doi]
- Semidefinite Relaxations of the Gromov-Wasserstein DistanceJunyu Chen, Binh T. Nguyen, Shang Koh, Yong Sheng Soh. [doi]
- Doubly Hierarchical Geometric Representations for Strand-based Human Hairstyle GenerationYunlu Chen, Francisco Vicente Carrasco 0001, Christian Häne, Giljoo Nam, Jean Charles Bazin, Fernando De la Torre. [doi]
- Automated Efficient Estimation using Monte Carlo Efficient Influence FunctionsRaj Agrawal, Sam Witty, Andy Zane, Elias Bingham. [doi]
- No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model PerformanceVishaal Udandarao, Ameya Prabhu, Adhiraj Ghosh, Yash Sharma 0001, Philip Torr 0001, Adel Bibi, Samuel Albanie, Matthias Bethge. [doi]
- Identifying Selections for Unsupervised Subtask DiscoveryYiwen Qiu, Yujia Zheng 0001, Kun Zhang 0001. [doi]
- Identification of Analytic Nonlinear Dynamical Systems with Non-asymptotic GuaranteesNegin Musavi, Ziyao Guo, Geir E. Dullerud, Yingying Li. [doi]
- BAKU: An Efficient Transformer for Multi-Task Policy LearningSiddhant Haldar, Zhuoran Peng, Lerrel Pinto. [doi]
- Supra-Laplacian Encoding for Transformer on Dynamic GraphsYannis Karmim, Marc Lafon, Raphaël Fournier-S'niehotta, Nicolas Thome. [doi]
- Achieving Near-Optimal Convergence for Distributed Minimax Optimization with Adaptive StepsizesYan Huang 0036, Xiang Li, Yipeng Shen, Niao He, Jinming Xu 0002. [doi]
- Verified Code Transpilation with LLMsSahil Bhatia, Jie Qiu, Niranjan Hasabnis, Sanjit Seshia, Alvin Cheung. [doi]
- Bidirectional Recurrence for Cardiac Motion Tracking with Gaussian Process Latent CodingJiewen Yang, Yiqun Lin, Bin Pu, Xiaomeng Li. [doi]
- SureMap: Simultaneous mean estimation for single-task and multi-task disaggregated evaluationMisha Khodak, Lester Mackey, Alexandra Chouldechova, Miro Dudík. [doi]
- FastDrag: Manipulate Anything in One StepXuanjia Zhao, Jian Guan 0001, Congyi Fan, Dongli Xu, Youtian Lin, Haiwei Pan, Pengming Feng. [doi]
- Non-asymptotic Convergence of Training Transformers for Next-token PredictionRuiquan Huang, Yingbin Liang, Jing Yang 0002. [doi]
- Long-range Meta-path Search on Large-scale Heterogeneous GraphsChao Li, Zijie Guo, Qiuting He, Kun He 0001. [doi]
- Towards Understanding Evolving Patterns in Sequential DataQiuhao Zeng, Long-Kai Huang, Qi Chen, Charles X. Ling, Boyu Wang 0004. [doi]
- Is Your LiDAR Placement Optimized for 3D Scene Understanding?Ye Li, Lingdong Kong, Hanjiang Hu, Xiaohao Xu, Xiaonan Huang. [doi]
- Oja's Algorithm for Streaming Sparse PCASyamantak Kumar, Purnamrita Sarkar. [doi]
- Adaptable Logical Control for Large Language ModelsHonghua Zhang, Po-Nien Kung, Masahiro Yoshida, Guy Van den Broeck, Nanyun Peng 0001. [doi]
- Safe LoRA: The Silver Lining of Reducing Safety Risks when Finetuning Large Language ModelsChia-Yi Hsu, Yu-Lin Tsai, Chih-Hsun Lin, Pin-Yu Chen, Chia-Mu Yu, Chun-Ying Huang. [doi]
- Grasp as You Say: Language-guided Dexterous Grasp GenerationYi-Lin Wei, Jian-Jian Jiang, Chengyi Xing, Xiantuo Tan, Xiao-Ming Wu 0002, Hao Li 0076, Mark R. Cutkosky, Wei-Shi Zheng 0001. [doi]
- On Causal Discovery in the Presence of Deterministic RelationsLoka Li, Haoyue Dai, Hanin Al Ghothani, Biwei Huang, Jiji Zhang, Shahar Harel, Isaac Bentwich, Guangyi Chen 0002, Kun Zhang 0001. [doi]
- Robust Contrastive Multi-view Clustering against Dual Noisy CorrespondenceRuiming Guo, Mouxing Yang, Yijie Lin 0001, Xi Peng 0001, Peng Hu 0002. [doi]
- Deep Submodular Peripteral NetworksGantavya Bhatt, Arnav Das, Jeff A. Bilmes. [doi]
- Evidence of Learned Look-Ahead in a Chess-Playing Neural NetworkErik Jenner, Shreyas Kapur, Vasil Georgiev, Cameron Allen, Scott Emmons, Stuart J. Russell. [doi]
- Text-Aware Diffusion for Policy LearningCalvin Luo, Mandy He, Zilai Zeng, Chen Sun 0002. [doi]
- FineStyle: Fine-grained Controllable Style Personalization for Text-to-image ModelsGong Zhang 0011, Kihyuk Sohn, Meera Hahn, Humphrey Shi, Irfan Essa. [doi]
- Forgetting, Ignorance or Myopia: Revisiting Key Challenges in Online Continual LearningXinrui Wang, Chuanxing Geng, Wenhai Wan, Shao-Yuan Li, Songcan Chen. [doi]
- Robust Sparse Regression with Non-Isotropic DesignsChih-Hung Liu 0001, Gleb Novikov. [doi]
- Generative Fractional Diffusion ModelsGabriel Nobis, Maximilian Springenberg, Marco Aversa, Michael Detzel, Rembert Daems, Roderick Murray-Smith, Shinichi Nakajima, Sebastian Lapuschkin, Stefano Ermon, Tolga Birdal, Manfred Opper, Christoph Knochenhauer, Luis Oala, Wojciech Samek. [doi]
- Block Transformer: Global-to-Local Language Modeling for Fast InferenceNamgyu Ho, Sangmin Bae, Taehyeon Kim 0001, Hyunjik Jo, Yireun Kim, Tal Schuster, Adam Fisch, James Thorne, Se-Young Yun. [doi]
- Retrospective for the Dynamic Sensorium Competition for predicting large-scale mouse primary visual cortex activity from videosPolina Turishcheva, Paul G. Fahey, Michaela Vystrcilová, Laura Hansel, Rachel Froebe, Kayla Ponder, Yongrong Qiu, Konstantin Willeke, Mohammad Bashiri, Ruslan Baikulov, Yu Zhu, Lei Ma 0008, Shan Yu, Tiejun Huang 0001, Bryan Li, Wolf De Wulf, Nina Kudryashova, Matthias H. Hennig, Nathalie Rochefort, Arno Onken, Eric Y. Wang, Zhiwei Ding, Andreas S. Tolias, Fabian H. Sinz, Alexander S. Ecker. [doi]
- Generated and Pseudo Content guided Prototype Refinement for Few-shot Point Cloud SegmentationLili Wei, Congyan Lang, Ziyi Chen, Tao Wang 0011, Yidong Li, Jun Liu 0036. [doi]
- Con4m: Context-aware Consistency Learning Framework for Segmented Time Series ClassificationJunru Chen, Tianyu Cao, Jing Xu, Jiahe Li 0008, Zhilong Chen, Tao Xiao, Yang Yang 0009. [doi]
- Advancing Cross-domain Discriminability in Continual Learning of Vision-Language ModelsYicheng Xu, Yuxin Chen, Jiahao Nie 0002, Yusong Wang, Huiping Zhuang, Manabu Okumura. [doi]
- A Combinatorial Algorithm for the Semi-Discrete Optimal Transport ProblemPankaj K. Agarwal, Sharath Raghvendra, Pouyan Shirzadian, Keegan Yao. [doi]
- Risk-Averse Fine-tuning of Large Language ModelsSapana Chaudhary, Ujwal Dinesha, Dileep Kalathil, Srinivas Shakkottai. [doi]
- Zero-Shot Transfer of Neural ODEsTyler Ingebrand, Adam J. Thorpe, Ufuk Topcu. [doi]
- Is Mamba Compatible with Trajectory Optimization in Offline Reinforcement Learning?Yang Dai, Oubo Ma, Longfei Zhang, Xingxing Liang, Shengchao Hu, Mengzhu Wang, Shouling Ji, Jincai Huang 0001, Li Shen 0008. [doi]
- VeLoRA: Memory Efficient Training using Rank-1 Sub-Token ProjectionsRoy Miles, Pradyumna Reddy, Ismail Elezi, Jiankang deng. [doi]
- Separation and Bias of Deep Equilibrium Models on Expressivity and Learning DynamicsZhoutong Wu, Yimu Zhang, Cong Fang 0001, Zhouchen Lin. [doi]
- DRIP: Unleashing Diffusion Priors for Joint Foreground and Alpha Prediction in Image MattingXiaodi Li, Zongxin Yang, Ruijie Quan, Yi Yang. [doi]
- Mixture of Link Predictors on GraphsLi Ma 0012, Haoyu Han 0001, Juanhui Li, Harry Shomer, Hui Liu 0031, Xiaofeng Gao 0001, Jiliang Tang. [doi]
- Few-Shot Adversarial Prompt Learning on Vision-Language ModelsYiwei Zhou, Xiaobo Xia, Zhiwei Lin, Bo Han 0003, Tongliang Liu. [doi]
- MonoMAE: Enhancing Monocular 3D Detection through Depth-Aware Masked AutoencodersXueying Jiang, Sheng Jin 0002, Xiaoqin Zhang 0002, Ling Shao 0001, Shijian Lu. [doi]
- Hierarchical and Density-based Causal ClusteringKwangho Kim, Jisu Kim, Larry A. Wasserman, Edward H. Kennedy. [doi]
- Generating Origin-Destination Matrices in Neural Spatial Interaction ModelsIoannis Zachos, Mark Girolami, Theodoros Damoulas. [doi]
- Using Time-Aware Graph Neural Networks to Predict Temporal Centralities in Dynamic GraphsFranziska Heeg, Ingo Scholtes. [doi]
- Functionally Constrained Algorithm Solves Convex Simple Bilevel ProblemHuaqing Zhang, Lesi Chen, Jing Xu, Jingzhao Zhang. [doi]
- Pricing and Competition for Generative AIRafid Mahmood. [doi]
- 4Diffusion: Multi-view Video Diffusion Model for 4D GenerationHaiyu Zhang, Xinyuan Chen, Yaohui Wang, Xihui Liu, Yunhong Wang, Yu Qiao. [doi]
- Selective ExplanationsLucas Monteiro Paes, Dennis Wei, Flávio P. Calmon. [doi]
- Pretrained Transformer Efficiently Learns Low-Dimensional Target Functions In-ContextKazusato Oko, Yujin Song, Taiji Suzuki, Denny Wu. [doi]
- Protecting Your LLMs with Information BottleneckZichuan Liu, Zefan Wang, Linjie Xu, Jinyu Wang, Lei Song, Tianchun Wang, Chunlin Chen, Wei Cheng 0002, Jiang Bian. [doi]
- TSDS: Data Selection for Task-Specific Model FinetuningZifan Liu, Amin Karbasi, Theodoros Rekatsinas. [doi]
- Revisiting Adversarial Patches for Designing Camera-Agnostic Attacks against Person DetectionHui Wei 0004, Zhixiang Wang, Kewei Zhang, Jiaqi Hou, Yuanwei Liu, Hao Tang 0005, Zheng Wang 0007. [doi]
- Learning to compute Gröbner basesHiroshi Kera, Yuki Ishihara, Yuta Kambe, Tristan Vaccon, Kazuhiro Yokoyama. [doi]
- UniAudio 1.5: Large Language Model-Driven Audio Codec is A Few-Shot Audio Task LearnerDongchao Yang, Haohan Guo, Yuanyuan Wang, Rongjie Huang, Xiang Li, Xu Tan 0003, Xixin Wu, Helen Meng. [doi]
- Train-Attention: Meta-Learning Where to Focus in Continual Knowledge LearningYeongbin Seo, Dongha Lee 0003, Jinyoung Yeo. [doi]
- DISP-LLM: Dimension-Independent Structural Pruning for Large Language ModelsShangqian Gao, Chi-Heng Lin, Ting Hua, Zheng Tang, Yilin Shen, Hongxia Jin, Yen-Chang Hsu. [doi]
- A hierarchical decomposition for explaining ML performance discrepanciesHarvineet Singh, Fan Xia, Adarsh Subbaswamy, Alexej Gossmann, Jean Feng. [doi]
- Emergence of heavy tails in homogenized stochastic gradient descentZhezhe Jiao, Martin Keller-Ressel. [doi]
- A Local Method for Satisfying Interventional Fairness with Partially Known Causal GraphsHaoxuan Li, Yue Liu, Zhi Geng, Kun Zhang. [doi]
- E.T. Bench: Towards Open-Ended Event-Level Video-Language UnderstandingYe Liu, Zongyang Ma, Zhongang Qi, Yang Wu 0001, Ying Shan, Chang Wen Chen. [doi]
- The Space Complexity of Approximating Logistic LossGregory Dexter, Petros Drineas, Rajiv Khanna. [doi]
- LoFiT: Localized Fine-tuning on LLM RepresentationsFangcong Yin, Xi Ye, Greg Durrett. [doi]
- Learning-Augmented Dynamic Submodular MaximizationArpit Agarwal, Eric Balkanski. [doi]
- Transferability Bound Theory: Exploring Relationship between Adversarial Transferability and FlatnessMingyuan Fan 0003, Xiaodan Li, Cen Chen, Wenmeng Zhou, Yaliang Li. [doi]
- Quasi-Bayes meets VinesDavid Huk, Yuanhe Zhang, Ritabrata Dutta, Mark Steel. [doi]
- GaussianMarker: Uncertainty-Aware Copyright Protection of 3D Gaussian SplattingXiufeng Huang, Ruiqi Li, Yiu-ming Cheung, Ka-Chun Cheung, Simon See, Renjie Wan. [doi]
- Adaptive Exploration for Data-Efficient General Value Function EvaluationsArushi Jain, Josiah Hanna, Doina Precup. [doi]
- Decentralized Noncooperative Games with Coupled Decision-Dependent DistributionsWenjing Yan, Xuanyu Cao. [doi]
- APDDv2: Aesthetics of Paintings and Drawings Dataset with Artist Labeled Scores and CommentsXin Jin, Qianqian Qiao, Yi Lu, Huaye Wang, Heng Huang, Shan Gao, Jianfei Liu, Rui Li. [doi]
- DomainGallery: Few-shot Domain-driven Image Generation by Attribute-centric FinetuningYuxuan Duan, Yan Hong 0001, Bo Zhang 0075, Jun Lan, Huijia Zhu, Weiqiang Wang, Jianfu Zhang 0003, Li Niu 0002, Liqing Zhang 0001. [doi]
- Hallo3D: Multi-Modal Hallucination Detection and Mitigation for Consistent 3D Content GenerationHongbo Wang, Jie Cao 0002, Jin Liu, Xiaoqiang Zhou, Huaibo Huang, Ran He 0001. [doi]
- A Cat Is A Cat (Not A Dog!): Unraveling Information Mix-ups in Text-to-Image Encoders through Causal Analysis and Embedding OptimizationChieh-Yun Chen, Chiang Tseng, Li-Wu Tsao, Hong-Han Shuai. [doi]
- Advancing Fine-Grained Classification by Structure and Subject Preserving AugmentationEyal Michaeli, Ohad Fried. [doi]
- Binding in hippocampal-entorhinal circuits enables compositionality in cognitive mapsChristopher J. Kymn, Sonia Mazelet, Anthony Thomas, Denis Kleyko, Edward Paxon Frady, Fritz Sommer, Bruno A. Olshausen. [doi]
- Occupancy-based Policy Gradient: Estimation, Convergence, and OptimalityAudrey Huang, Nan Jiang 0008. [doi]
- Non-Euclidean Mixture Model for Social Network EmbeddingRoshni G. Iyer, Yewen Wang, Wei Wang 0010, Yizhou Sun. [doi]
- NovoBench: Benchmarking Deep Learning-based \emph{De Novo} Sequencing Methods in ProteomicsJingbo Zhou, Shaorong Chen, Jun Xia 0001, Sizhe Liu, Tianze Ling, Wenjie Du, Yue Liu 0008, Jianwei Yin, Stan Z. Li. [doi]
- MARPLE: A Benchmark for Long-Horizon InferenceEmily Jin, Zhuoyi Huang, Jan-Philipp Fränken, Weiyu Liu, Hannah Cha, Erik Brockbank, Sarah Wu, Ruohan Zhang, Jiajun Wu 0001, Tobias Gerstenberg. [doi]
- MMBench-Video: A Long-Form Multi-Shot Benchmark for Holistic Video UnderstandingXinYu Fang, Kangrui Mao, Haodong Duan, Xiangyu Zhao, Yining Li, Dahua Lin, Kai Chen 0026. [doi]
- Implicit Bias of Mirror Flow on Separable DataScott Pesme, Radu-Alexandru Dragomir, Nicolas Flammarion. [doi]
- Policy Learning from Tutorial Books via Understanding, Rehearsing and IntrospectingXiong-Hui Chen, Ziyan Wang, Yali Du 0001, Shengyi Jiang, Meng Fang, Yang Yu, Jun Wang 0012. [doi]
- A Separation in Heavy-Tailed Sampling: Gaussian vs. Stable Oracles for Proximal SamplersYe He 0003, Alireza Mousavi Hosseini, Krishnakumar Balasubramanian 0001, Murat A. Erdogdu. [doi]
- Making Offline RL Online: Collaborative World Models for Offline Visual Reinforcement LearningQi Wang, Junming Yang, Yunbo Wang, Xin Jin, Wenjun Zeng, Xiaokang Yang. [doi]
- Provably Mitigating Overoptimization in RLHF: Your SFT Loss is Implicitly an Adversarial RegularizerZhihan Liu, Miao Lu, Shenao Zhang, Boyi Liu 0001, Hongyi Guo, Yingxiang Yang, Jose H. Blanchet, Zhaoran Wang 0001. [doi]
- Rethinking Misalignment in Vision-Language Model Adaptation from a Causal PerspectiveYanan Zhang, Jiangmeng Li, Lixiang Liu, Wenwen Qiang. [doi]
- Leveraging Separated World Model for Exploration in Visually Distracted EnvironmentsKaichen Huang, Shenghua Wan, Minghao Shao, Hai-Hang Sun, Le Gan, Shuai Feng, De-Chuan Zhan. [doi]
- MicroAdam: Accurate Adaptive Optimization with Low Space Overhead and Provable ConvergenceIonut-Vlad Modoranu, Mher Safaryan, Grigory Malinovsky, Eldar Kurtic, Thomas Robert 0007, Peter Richtárik, Dan Alistarh. [doi]
- FAST: A Dual-tier Few-Shot Learning Paradigm for Whole Slide Image ClassificationKexue Fu, Xiaoyuan Luo, Linhao Qu, Shuo Wang, Ying Xiong, Ilias Maglogiannis, Longxiang Gao, Manning Wang. [doi]
- Neural Assets: 3D-Aware Multi-Object Scene Synthesis with Image Diffusion ModelsZiyi Wu, Yulia Rubanova, Rishabh Kabra, Drew A. Hudson, Igor Gilitschenski, Yusuf Aytar, Sjoerd van Steenkiste, Kelsey R. Allen, Thomas Kipf. [doi]
- Semantic Feature Learning for Universal Unsupervised Cross-Domain RetrievalLixu Wang, Xinyu Du, Qi Zhu 0002. [doi]
- Federated Model Heterogeneous Matryoshka Representation LearningLiping Yi, Han Yu 0001, Chao Ren 0006, Gang Wang, Xiaoguang Liu 0001, Xiaoxiao Li. [doi]
- OctreeOcc: Efficient and Multi-Granularity Occupancy Prediction Using Octree QueriesYuhang Lu, Xinge Zhu, Tai Wang, Yuexin Ma. [doi]
- SS1: Accelerating Inference with Fast and Expressive Sketch Structured TransformAditya Desai, Kimia Saedi, Apoorv Walia, Jihyeong Lee, Keren Zhou 0001, Anshumali Shrivastava. [doi]
- Can Graph Neural Networks Expose Training Data Properties? An Efficient Risk Assessment ApproachHanyang Yuan, Jiarong Xu, Renhong Huang, Mingli Song, Chunping Wang 0001, Yang Yang 0009. [doi]
- LLM Dataset Inference: Did you train on my dataset?Pratyush Maini, Hengrui Jia, Nicolas Papernot, Adam Dziedzic. [doi]
- Harnessing small projectors and multiple views for efficient vision pretrainingArna Ghosh, Kumar Krishna Agrawal, Shagun Sodhani, Adam Oberman, Blake A. Richards. [doi]
- Simplified and Generalized Masked Diffusion for Discrete DataJiaxin Shi, Kehang Han, Zhe Wang, Arnaud Doucet, Michalis K. Titsias. [doi]
- Aligning to Thousands of Preferences via System Message GeneralizationSeongyun Lee, Sue Hyun Park, Seungone Kim, Minjoon Seo. [doi]
- Improved Sample Complexity Bounds for Diffusion Model TrainingShivam Gupta 0002, Aditya Parulekar, Eric Price 0001, Zhiyang Xun. [doi]
- Pedestrian Trajectory Prediction with Missing Data: Datasets, Imputation, and BenchmarkingPranav Singh Chib, Pravendra Singh. [doi]
- Learning to Edit Visual Programs with Self-SupervisionR. Kenny Jones, Renhao Zhang, Aditya Ganeshan, Daniel Ritchie. [doi]
- Improving the Learning Capability of Small-size Image Restoration Network by Deep Fourier ShiftingMan Zhou. [doi]
- HC-GAE: The Hierarchical Cluster-based Graph Auto-Encoder for Graph Representation LearningLu Bai 0001, Zhuo Xu, Lixin Cui, Ming Li, Yue Wang 0014, Edwin R. Hancock. [doi]
- Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon TasksZaijing Li, Yuquan Xie, Rui Shao, Gongwei Chen, Dongmei Jiang, Liqiang Nie. [doi]
- Geometric-Averaged Preference Optimization for Soft Preference LabelsHiroki Furuta, Kuang-Huei Lee, Shixiang Shane Gu, Yutaka Matsuo, Aleksandra Faust, Heiga Zen, Izzeddin Gur. [doi]
- Adaptive Domain Learning for Cross-domain Image DenoisingZian Qian, Chenyang Qi, Ka Lung Law, Hao Fu, Chenyang Lei, Qifeng Chen. [doi]
- Robust Reinforcement Learning from Corrupted Human FeedbackAlexander Bukharin, Ilgee Hong, Haoming Jiang, Zichong Li, Qingru Zhang, Zixuan Zhang, Tuo Zhao. [doi]
- HumanSplat: Generalizable Single-Image Human Gaussian Splatting with Structure PriorsPanwang Pan, Zhuo Su 0006, Chenguo Lin, Zhen Fan 0015, Yongjie Zhang, Zeming Li, Tingting Shen, Yadong Mu, Yebin Liu. [doi]
- CemiFace: Center-based Semi-hard Synthetic Face Generation for Face RecognitionZhonglin Sun, Siyang Song, Ioannis Patras, Georgios Tzimiropoulos. [doi]
- A Data-Centric Perspective on Evaluating Machine Learning Models for Tabular DataAndrej Tschalzev, Sascha Marton, Stefan Lüdtke, Christian Bartelt, Heiner Stuckenschmidt. [doi]
- Optimal Batched Best Arm IdentificationTianyuan Jin, Yu Yang 0001, Jing Tang 0004, Xiaokui Xiao, Pan Xu 0002. [doi]
- Evaluating alignment between humans and neural network representations in image-based learning tasksCan Demircan, Tankred Saanum, Leonardo Pettini, Marcel Binz, Blazej M. Baczkowski, Christian F. Doeller, Mona M. Garvert, Eric Schulz. [doi]
- Generating compositional scenes via Text-to-image RGBA Instance GenerationAlessandro Fontanella, Petru-Daniel Tudosiu, Yongxin Yang, Shifeng Zhang, Sarah Parisot. [doi]
- An Autoencoder-Like Nonnegative Matrix Co-Factorization for Improved Student Cognitive ModelingShenbao Yu, Yinghui Pan, Yifeng Zeng, Prashant Doshi, Guoquan Liu, Kim-Leng Poh, Mingwei Lin. [doi]
- DynaMo: In-Domain Dynamics Pretraining for Visuo-Motor ControlZichen Jeff Cui, Hengkai Pan, Aadhithya Iyer, Siddhant Haldar, Lerrel Pinto. [doi]
- Faster Accelerated First-order Methods for Convex Optimization with Strongly Convex Function ConstraintsZhenwei Lin, Qi Deng. [doi]
- Are More LLM Calls All You Need? Towards the Scaling Properties of Compound AI SystemsLingjiao Chen, Jared Quincy Davis, Boris Hanin, Peter Bailis, Ion Stoica, Matei A. Zaharia, James Y. Zou. [doi]
- Evaluating Numerical Reasoning in Text-to-Image ModelsIvana Kajic, Olivia Wiles, Isabela Albuquerque, Matthias Bauer, Su Wang 0001, Jordi Pont-Tuset, Aida Nematzadeh. [doi]
- RoboMamba: Efficient Vision-Language-Action Model for Robotic Reasoning and ManipulationJiaming Liu, Mengzhen Liu, Zhenyu Wang, Pengju An, Xiaoqi Li, Kaichen Zhou, Senqiao Yang, Renrui Zhang, Yandong Guo, Shanghang Zhang. [doi]
- CLIPLoss and Norm-Based Data Selection Methods for Multimodal Contrastive LearningYiping Wang, Yifang Chen 0001, Wendan Yan, Alex Fang, Wenjing Zhou, Kevin G. Jamieson, Simon S. Du. [doi]
- BAN: Detecting Backdoors Activated by Adversarial Neuron NoiseXiaoyun Xu, Zhuoran Liu, Stefanos Koffas, Shujian Yu, Stjepan Picek. [doi]
- OpenDlign: Open-World Point Cloud Understanding with Depth-Aligned ImagesYe Mao, Junpeng Jing, Krystian Mikolajczyk. [doi]
- Learning Representations for Hierarchies with Minimal SupportBenjamin Rozonoyer, Michael Boratko, Dhruvesh Patel, Wenlong Zhao 0001, Shib Sankar Dasgupta, Hung Le, Andrew McCallum. [doi]
- ProtGO: Function-Guided Protein Modeling for Unified Representation LearningBozhen Hu, Cheng Tan 0012, Yongjie Xu, Zhangyang Gao, Jun Xia 0001, Lirong Wu, Stan Z. Li. [doi]
- OT4P: Unlocking Effective Orthogonal Group Path for Permutation RelaxationYaming Guo, Chen Zhu 0003, Hengshu Zhu, Tieru Wu. [doi]
- Multi-Winner ReconfigurationJiehua Chen 0001, Christian Hatschka, Sofia Simola. [doi]
- Accelerated Regularized Learning in Finite N-Person GamesKyriakos Lotidis, Angeliki Giannou, Panayotis Mertikopoulos, Nicholas Bambos. [doi]
- Any2Graph: Deep End-To-End Supervised Graph Prediction With An Optimal Transport LossPaul Krzakala, Junjie Yang, Rémi Flamary, Florence d'Alché-Buc, Charlotte Laclau, Matthieu Labeau. [doi]
- TPC: Test-time Procrustes Calibration for Diffusion-based Human Image AnimationSunjae Yoon, Gwanhyeong Koo, Younghwan Lee, Chang Dong Yoo. [doi]
- Diffusion Policy Attacker: Crafting Adversarial Attacks for Diffusion-based PoliciesYipu Chen, Haotian Xue 0002, Yongxin Chen. [doi]
- Improving Context-Aware Preference Modeling for Language ModelsSilviu Pitis, Ziang Xiao, Nicolas Le Roux, Alessandro Sordoni. [doi]
- Using Unity to Help Solve Reinforcement LearningConnor Brennan, Andrew Williams, Omar G. Younis, Vedant Vyas, Daria Yasafova, Irina Rish. [doi]
- Spec-Gaussian: Anisotropic View-Dependent Appearance for 3D Gaussian SplattingZiyi Yang, Xinyu Gao, Yang-Tian Sun, Yihua Huang 0002, Xiaoyang Lyu, Wen Zhou, Shaohui Jiao, Xiaojuan Qi 0001, Xiaogang Jin 0001. [doi]
- Controlling Multiple Errors Simultaneously with a PAC-Bayes BoundReuben Adams, John Shawe-Taylor, Benjamin Guedj. [doi]
- Proving Theorems RecursivelyHaiming Wang, Huajian Xin, Zhengying Liu, Wenda Li, Yinya Huang, Jianqiao Lu, Zhicheng Yang, Jing Tang 0004, Jian Yin 0001, Zhenguo Li, Xiaodan Liang. [doi]
- Conditional Controllable Image FusionBing Cao 0002, Xingxin Xu, Pengfei Zhu 0001, Qilong Wang 0001, Qinghua Hu. [doi]
- PACE: Pacing Operator Learning to Accurate Optical Field Simulation for Complicated Photonic DevicesHanqing Zhu, Wenyan Cong, Guojin Chen, Shupeng Ning, Ray Chen, Jiaqi Gu 0002, David Z. Pan. [doi]
- Fair Online Bilateral TradeFrançois Bachoc, Nicolò Cesa-Bianchi, Tommaso Cesari, Roberto Colomboni. [doi]
- Safe and Efficient: A Primal-Dual Method for Offline Convex CMDPs under Partial Data CoverageHaobo Zhang, Xiyue Peng, Honghao Wei, Xin Liu. [doi]
- MetaAligner: Towards Generalizable Multi-Objective Alignment of Language ModelsKailai Yang, Zhiwei Liu, Qianqian Xie, Jimin Huang, Tianlin Zhang, Sophia Ananiadou. [doi]
- PutnamBench: Evaluating Neural Theorem-Provers on the Putnam Mathematical CompetitionGeorge Tsoukalas, Jasper Lee, John Jennings, Jimmy Xin, Michelle Ding, Michael Jennings, Amitayush Thakur, Swarat Chaudhuri. [doi]
- Chimera: Effectively Modeling Multivariate Time Series with 2-Dimensional State Space ModelsAli Behrouz, Michele Santacatterina, Ramin Zabih. [doi]
- Large language model validity via enhanced conformal prediction methodsJohn J. Cherian, Isaac Gibbs, Emmanuel J. Candès. [doi]
- Scalable Neural Network Verification with Branch-and-bound Inferred Cutting PlanesDuo Zhou, Christopher Brix, Grani A. Hanasusanto, Huan Zhang 0001. [doi]
- LLM-based Skill Diffusion for Zero-shot Policy AdaptationWoo Kyung Kim, Youngseok Lee, Jooyoung Kim, Honguk Woo. [doi]
- The surprising efficiency of temporal difference learning for rare event predictionXiaoou Cheng, Jonathan Weare. [doi]
- Trace is the Next AutoDiff: Generative Optimization with Rich Feedback, Execution Traces, and LLMsChing-An Cheng, Allen Nie, Adith Swaminathan. [doi]
- Doing Experiments and Revising Rules with Natural Language and Probabilistic ReasoningTop Piriyakulkij, Cassidy Langenfeld, Tuan Anh Le 0001, Kevin Ellis. [doi]
- IPM-LSTM: A Learning-Based Interior Point Method for Solving Nonlinear ProgramsXi Gao, Jinxin Xiong, Akang Wang, Qihong Duan, Jiang Xue, Qingjiang Shi. [doi]
- The Selective G-Bispectrum and its Inversion: Applications to G-Invariant NetworksSimon Mataigne, Johan Mathe, Sophia Sanborn, Christopher Hillar, Nina Miolane. [doi]
- GSDF: 3DGS Meets SDF for Improved Neural Rendering and ReconstructionMulin Yu, Tao Lu 0005, Linning Xu, Lihan Jiang, Yuanbo Xiangli, Bo Dai 0002. [doi]
- UDC: A Unified Neural Divide-and-Conquer Framework for Large-Scale Combinatorial Optimization ProblemsZhi Zheng, Changliang Zhou, Xialiang Tong, Mingxuan Yuan, Zhenkun Wang 0001. [doi]
- CondTSF: One-line Plugin of Dataset Condensation for Time Series ForecastingJianrong Ding, Zhanyu Liu, Guanjie Zheng, Haiming Jin, Linghe Kong. [doi]
- MSA Generation with Seqs2Seqs Pretraining: Advancing Protein Structure PredictionsLe Zhang, Jiayang Chen, Tao Shen, Yu Li 0006, Siqi Sun. [doi]
- Learning Equilibria in Adversarial Team Markov Games: A Nonconvex-Hidden-Concave Min-Max Optimization ProblemFivos Kalogiannis, Jingming Yan, Ioannis Panageas. [doi]
- Most Influential Subset Selection: Challenges, Promises, and BeyondYuzheng Hu, Pingbang Hu, Han Zhao 0002, Jiaqi W. Ma. [doi]
- Voila-A: Aligning Vision-Language Models with User's Gaze AttentionKun Yan, Zeyu Wang, Lei Ji 0001, Yuntao Wang 0001, Nan Duan, Shuai Ma 0001. [doi]
- Learning Distinguishable Trajectory Representation with Contrastive LossTianxu Li, Kun Zhu 0001, Juan Li, Yang Zhang. [doi]
- Multi-Chain Graphs of Graphs: A New Approach to Analyzing Blockchain DatasetsBingqiao Luo, Zhen Zhang 0023, Qian Wang, Bingsheng He. [doi]
- Measuring Progress in Dictionary Learning for Language Model Interpretability with Board Game ModelsAdam Karvonen, Benjamin Wright, Can Rager, Rico Angell, Jannik Brinkmann, Logan Smith, Claudio Mayrink Verdun, David Bau, Samuel Marks. [doi]
- Make Your LLM Fully Utilize the ContextShengnan An, Zexiong Ma, Zeqi Lin, Nanning Zheng 0001, Jian-Guang Lou, Weizhu Chen. [doi]
- Identifying Equivalent Training DynamicsWilliam T. Redman, Juan M. Bello-Rivas, Maria Fonoberova, Ryan Mohr, Yannis G. Kevrekidis, Igor Mezic. [doi]
- DenseFormer: Enhancing Information Flow in Transformers via Depth Weighted AveragingMatteo Pagliardini, Amirkeivan Mohtashami, François Fleuret, Martin Jaggi. [doi]
- No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPOSkander Moalla, Andrea Miele, Daniil Pyatko, Razvan Pascanu, Caglar Gulcehre. [doi]
- Improving Sparse Decomposition of Language Model Activations with Gated Sparse AutoencodersSenthooran Rajamanoharan, Arthur Conmy, Lewis Smith, Tom Lieberum, Vikrant Varma, János Kramár, Rohin Shah, Neel Nanda. [doi]
- SAMPa: Sharpness-aware Minimization ParallelizedWanyun Xie, Thomas Pethick, Volkan Cevher. [doi]
- From Instance Training to Instruction Learning: Task Adapters Generation from InstructionsHuanxuan Liao, Shizhu He, Yao Xu, Yuanzhe Zhang, Yanchao Hao, Shengping Liu, Kang Liu, Jun Zhao. [doi]
- Navigable Graphs for High-Dimensional Nearest Neighbor Search: Constructions and LimitsHaya Diwan, Jinrui Gou, Cameron Musco, Christopher Musco, Torsten Suel. [doi]
- Accelerating Greedy Coordinate Gradient and General Prompt Optimization via Probe SamplingYiran Zhao 0006, Wenyue Zheng, Tianle Cai, Do Xuan Long, Kenji Kawaguchi, Anirudh Goyal, Michael Qizhe Shieh. [doi]
- A Neural Network Approach for Efficiently Answering Most Probable Explanation Queries in Probabilistic ModelsShivvrat Arya, Tahrima Rahman, Vibhav Gogate. [doi]
- Tolerant Algorithms for Learning with Arbitrary Covariate ShiftSurbhi Goel, Abhishek Shetty, Konstantinos Stavropoulos, Arsen Vasilyan. [doi]
- CE-NAS: An End-to-End Carbon-Efficient Neural Architecture Search FrameworkYiyang Zhao, Yunzhuo Liu, Bo Jiang 0003, Tian Guo 0001. [doi]
- What If the Input is Expanded in OOD Detection?Boxuan Zhang, Jianing Zhu, Zengmao Wang, Tongliang Liu, Bo Du 0001, Bo Han 0003. [doi]
- HENASY: Learning to Assemble Scene-Entities for Interpretable Egocentric Video-Language ModelKhoa Vo 0001, Thinh Phan, Kashu Yamazaki, Minh Tran, Ngan Le. [doi]
- Local and Adaptive Mirror Descents in Extensive-Form GamesCôme Fiegel, Pierre Ménard, Tadashi Kozuno, Rémi Munos, Vianney Perchet, Michal Valko. [doi]
- WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language ModelsLiwei Jiang, Kavel Rao, Seungju Han, Allyson Ettinger, Faeze Brahman, Sachin Kumar 0009, Niloofar Mireshghallah, Ximing Lu, Maarten Sap, Yejin Choi 0001, Nouha Dziri. [doi]
- Confidence Regulation Neurons in Language ModelsAlessandro Stolfo, Ben Wu, Wes Gurnee, Yonatan Belinkov, Xingyi Song, Mrinmaya Sachan, Neel Nanda. [doi]
- Grammar-Aligned DecodingKanghee Park, Jiayu Wang, Taylor Berg-Kirkpatrick, Nadia Polikarpova, Loris D'Antoni. [doi]
- SpatialPIN: Enhancing Spatial Reasoning Capabilities of Vision-Language Models through Prompting and Interacting 3D PriorsChenyang Ma, Kai Lu, Ta Ying Cheng, Niki Trigoni, Andrew Markham. [doi]
- Language Model as Visual ExplainerXingyi Yang, Xinchao Wang. [doi]
- An Information Theoretic Perspective on Conformal PredictionAlvaro H. C. Correia, Fabio Valerio Massoli, Christos Louizos, Arash Behboodi. [doi]
- Color-Oriented Redundancy Reduction in Dataset DistillationBowen Yuan, Zijian Wang 0009, Mahsa Baktashmotlagh, Yadan Luo, Zi Huang. [doi]
- Algorithmic progress in language modelsAnson Ho, Tamay Besiroglu, Ege Erdil, Zifan Carl Guo, David Owen 0001, Robi Rahman, David Atkinson, Neil Thompson, Jaime Sevilla. [doi]
- Reinforcing LLM Agents via Policy Optimization with Action DecompositionMuning Wen, Ziyu Wan, Jun Wang, Weinan Zhang, Ying Wen 0001. [doi]
- FlexCap: Describe Anything in Images in Controllable DetailDebidatta Dwibedi, Vidhi Jain, Jonathan Tompson, Andrew Zisserman, Yusuf Aytar. [doi]
- Video Token Merging for Long Video UnderstandingSeon-Ho Lee, Jue Wang, Zhikang Zhang, David Fan, Xinyu Li. [doi]
- Fast T2T: Optimization Consistency Speeds Up Diffusion-Based Training-to-Testing Solving for Combinatorial OptimizationYang Li, Jinpei Guo, Runzhong Wang, Hongyuan Zha, Junchi Yan. [doi]
- Foundations of Multivariate Distributional Reinforcement LearningHarley Wiltzer, Jesse Farebrother, Arthur Gretton, Mark Rowland 0001. [doi]
- Off-Policy Selection for Initiating Human-Centric Experimental DesignGe Gao, Xi Yang, Qitong Gao, Song Ju, Miroslav Pajic, Min Chi. [doi]
- Training-Free Open-Ended Object Detection and Segmentation via Attention as PromptsZhiwei Lin, Yongtao Wang, Zhi Tang 0001. [doi]
- Transductive Learning is CompactJulian Asilis, Siddartha Devic, Shaddin Dughmi, Vatsal Sharan, Shang-Hua Teng. [doi]
- Watermarking Makes Language Models RadioactiveTom Sander, Pierre Fernandez, Alain Durmus, Matthijs Douze, Teddy Furon. [doi]
- StreamFlow: Streamlined Multi-Frame Optical Flow Estimation for Video SequencesShangkun Sun, Jiaming Liu, Huaxia Li, Guoqing Liu, Thomas H. Li, Wei Gao 0003. [doi]
- Stronger Than You Think: Benchmarking Weak Supervision on Realistic TasksTianyi Zhang, Linrong Cai, Jeffrey Li, Nicholas Roberts, Neel Guha, Frederic Sala. [doi]
- Scaling Laws and Compute-Optimal Training Beyond Fixed Training DurationsAlexander Hägele, Elie Bakouch, Atli Kosson, Loubna Ben Allal, Leandro von Werra, Martin Jaggi. [doi]
- Learning rigid-body simulators over implicit shapes for large-scale scenes and visionYulia Rubanova, Tatiana Lopez-Guevara, Kelsey R. Allen, Will Whitney, Kimberly L. Stachenfeld, Tobias Pfaff. [doi]
- Byzantine Robustness and Partial Participation Can Be Achieved at Once: Just Clip Gradient DifferencesGrigory Malinovsky, Peter Richtárik, Samuel Horváth, Eduard Gorbunov. [doi]
- Alleviate Anchor-Shift: Explore Blind Spots with Cross-View Reconstruction for Incomplete Multi-View ClusteringSuyuan Liu, Siwei Wang 0001, Ke Liang 0006, Junpu Zhang, Zhibin Dong, Tianrui Liu, En Zhu, Xinwang Liu 0002, Kunlun He. [doi]
- Marrying Causal Representation Learning with Dynamical Systems for ScienceDingling Yao, Caroline Muller, Francesco Locatello. [doi]
- GRANOLA: Adaptive Normalization for Graph Neural NetworksMoshe Eliasof, Beatrice Bevilacqua, Carola-Bibiane Schönlieb, Haggai Maron. [doi]
- Bigger, Regularized, Optimistic: scaling for compute and sample efficient continuous controlMichal Nauman, Mateusz Ostaszewski, Krzysztof Jankowski, Piotr Milos, Marek Cygan. [doi]
- Learning from Pattern Completion: Self-supervised Controllable GenerationZhiqiang Chen, Guofan Fan, Jinying Gao, Lei Ma 0008, Bo Lei, Tiejun Huang, Shan Yu. [doi]
- Towards Harmless Rawlsian Fairness Regardless of Demographic PriorXuanqian Wang, Jing Li 0009, Ivor W. Tsang, Yew-Soon Ong. [doi]
- The Well: a Large-Scale Collection of Diverse Physics Simulations for Machine LearningRuben Ohana, Michael McCabe, Lucas Meyer, Rudy Morel, Fruzsina J. Agocs, Miguel Beneitez, Marsha Berger, Blakesley Burkhart, Stuart B. Dalziel, Drummond B. Fielding, Daniel Fortunato, Jared A. Goldberg, Keiya Hirashima, Yan-Fei Jiang, Rich R. Kerswell, Suryanarayana Maddu, Jonah Miller, Payel Mukhopadhyay, Stefan S. Nixon, Jeff Shen, Romain Watteaux, Bruno Régaldo-Saint Blancard, François Rozet, Liam Holden Parker, Miles D. Cranmer, Shirley Ho. [doi]
- S-MolSearch: 3D Semi-supervised Contrastive Learning for Bioactive Molecule SearchGengmo Zhou, Zhen Wang, Feng Yu, Guolin Ke, Zhewei Wei, Zhifeng Gao. [doi]
- WizardArena: Post-training Large Language Models via Simulated Offline Chatbot ArenaHaipeng Luo, Qingfeng Sun, Can Xu, Pu Zhao 0004, Qingwei Lin, Jian-Guang Lou, Shifeng Chen, Yansong Tang, Weizhu Chen. [doi]
- Online Learning of Delayed ChoicesRecep Yusuf Bekci. [doi]
- FUSU: A Multi-temporal-source Land Use Change Segmentation Dataset for Fine-grained Urban Semantic UnderstandingShuai Yuan, Guancong Lin, Lixian Zhang, Runmin Dong, Jinxiao Zhang, Shuang Chen, Juepeng Zheng, Jie Wang, Haohuan Fu. [doi]
- emg2qwerty: A Large Dataset with Baselines for Touch Typing using Surface ElectromyographyViswanath Sivakumar, Jeffrey Seely, Alan Du, Sean R. Bittner, Adam Berenzweig, Anuoluwapo Bolarinwa, Alexandre Gramfort, Michael I. Mandel. [doi]
- Reinforcement Learning with LTL and ω-Regular Objectives via Optimality-Preserving Translation to Average RewardsXuan Bach Le, Dominik Wagner 0001, Leon Witzman, Alexander Rabinovich, Luke Ong. [doi]
- TrackIME: Enhanced Video Point Tracking via Instance Motion EstimationSeong Hyeon Park, Huiwon Jang, Byungwoo Jeon, Sukmin Yun, Paul Hongsuck Seo, Jinwoo Shin. [doi]
- II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language ModelsZiqiang Liu, Feiteng Fang, Xi Feng, Xeron Du, Chenhao Zhang 0005, Noah Wang, Yuelin Bai, Qixuan Zhao, Liyang Fan, Chengguang Gan, Hongquan Lin, Jiaming Li, Yuansheng Ni, Haihong Wu, Yaswanth Narsupalli, Zhigang Zheng, Chengming Li, Xiping Hu 0001, Ruifeng Xu 0001, Xiaojun Chen 0006, Min Yang 0007, Jiaheng Liu, Ruibo Liu, Wenhao Huang, Ge Zhang, Shiwen Ni. [doi]
- Wild-GS: Real-Time Novel View Synthesis from Unconstrained Photo CollectionsJiacong Xu, Yiqun Mei, Vishal M. Patel. [doi]
- Differentiable Task Graph Learning: Procedural Activity Representation and Online Mistake Detection from Egocentric VideosLuigi Seminara, Giovanni Maria Farinella, Antonino Furnari. [doi]
- SE(3)-bi-equivariant Transformers for Point Cloud AssemblyZiming Wang, Rebecka Jörnsten. [doi]
- Measuring Per-Unit Interpretability at Scale Without HumansRoland S. Zimmermann, David A. Klindt, Wieland Brendel. [doi]
- Parameter Symmetry and Noise Equilibrium of Stochastic Gradient DescentZiyin Liu, Mingze Wang, Hongchao Li, Lei Wu. [doi]
- Typicalness-Aware Learning for Failure DetectionYijun Liu, Jiequan Cui, Zhuotao Tian, Senqiao Yang, Qingdong He, Xiaoling Wang, Jingyong Su. [doi]
- Time-Constrained Robust MDPsAdil Zouitine, David Bertoin, Pierre Clavier, Matthieu Geist, Emmanuel Rachelson. [doi]
- Real-time Core-Periphery Guided ViT with Smart Data Layout Selection on Mobile DevicesZhihao Shu, Xiaowei Yu, Zihao Wu 0001, Wenqi Jia 0003, Yinchen Shi, Miao Yin, Tianming Liu 0001, Dajiang Zhu, Wei Niu 0002. [doi]
- Efficient Lifelong Model Evaluation in an Era of Rapid ProgressAmeya Prabhu, Vishaal Udandarao, Philip Torr 0001, Matthias Bethge, Adel Bibi, Samuel Albanie. [doi]
- Resolving Discrepancies in Compute-Optimal Scaling of Language ModelsTomer Porian, Mitchell Wortsman, Jenia Jitsev, Ludwig Schmidt, Yair Carmon. [doi]
- Deterministic Uncertainty Propagation for Improved Model-Based Offline Reinforcement LearningAbdullah Akgül, Manuel Haussmann, Melih Kandemir. [doi]
- Amortizing intractable inference in diffusion models for vision, language, and controlSiddarth Venkatraman, Moksh Jain, Luca Scimeca, Minsu Kim, Marcin Sendera, Mohsin Hasan, Luke Rowe, Sarthak Mittal, Pablo Lemos, Emmanuel Bengio, Alexandre Adam, Jarrid Rector-Brooks, Yoshua Bengio, Glen Berseth, Nikolay Malkin. [doi]
- Mixed Dynamics In Linear Networks: Unifying the Lazy and Active RegimesZhenfeng Tu, Santiago Aranguri, Arthur Jacot. [doi]
- A Unified Confidence Sequence for Generalized Linear Models, with Applications to BanditsJunghyun Lee, Se-Young Yun, Kwang-Sung Jun. [doi]
- Constrained Diffusion with Trust SamplingWilliam Huang, Yifeng Jiang 0002, Tom Van Wouwe, C. Karen Liu. [doi]
- Toward Real Ultra Image Segmentation: Leveraging Surrounding Context to Cultivate General Segmentation ModelSai Wang, Yutian Lin, Yu Wu 0011, Bo Du 0001. [doi]
- Neural Krylov Iteration for Accelerating Linear System SolvingJian Luo, Jie Wang 0005, Hong Wang, Huanshuo Dong, Zijie Geng, Hanzhu Chen, Yufei Kuang. [doi]
- HuRef: HUman-REadable Fingerprint for Large Language ModelsBoyi Zeng, Lizheng Wang, Yuncong Hu, Yi Xu, Chenghu Zhou, Xinbing Wang, Yu Yu, Zhouhan Lin. [doi]
- LLM Evaluators Recognize and Favor Their Own GenerationsArjun Panickssery, Samuel R. Bowman, Shi Feng. [doi]
- Boosted Conformal Prediction IntervalsRan Xie, Rina Barber, Emmanuel J. Candès. [doi]
- HiCo: Hierarchical Controllable Diffusion Model for Layout-to-image GenerationBo Cheng, Yuhang Ma, wuliebucha, Shanyuan Liu, Ao Ma, Xiaoyu Wu, Dawei Leng, Yuhui Yin. [doi]
- Utilizing Image Transforms and Diffusion Models for Generative Modeling of Short and Long Time SeriesIlan Naiman, Nimrod Berman, Itai Pemper, Idan Arbiv, Gal Fadlon, Omri Azencot. [doi]
- Trans-LoRA: towards data-free Transferable Parameter Efficient FinetuningRunqian Wang, Soumya Ghosh, David D. Cox, Diego Antognini, Aude Oliva, Rogério Feris, Leonid Karlinsky. [doi]
- Fine-Tuning Personalization in Federated Learning to Mitigate Adversarial ClientsYoussef Allouah, Abdellah El Mrini, Rachid Guerraoui, Nirupam Gupta, Rafael Pinot. [doi]
- Binary Search with Distributional PredictionsMichael Dinitz, Sungjin Im, Thomas Lavastida, Benjamin Moseley, Aidin Niaparast, Sergei Vassilvitskii. [doi]
- MedSafetyBench: Evaluating and Improving the Medical Safety of Large Language ModelsTessa Han, Aounon Kumar, Chirag Agarwal, Himabindu Lakkaraju. [doi]
- Fully Unconstrained Online LearningAshok Cutkosky, Zakaria Mhammedi. [doi]
- IMDL-BenCo: A Comprehensive Benchmark and Codebase for Image Manipulation Detection & LocalizationXiaochen Ma 0001, Xuekang Zhu, Lei Su, Bo Du, Zhuohang Jiang, Bingkui Tong, Zeyu Lei, Xinyu Yang, Chi-Man Pun, Jiancheng Lv 0001, Jizhe Zhou. [doi]
- Integrating Deep Metric Learning with Coreset for Active Learning in 3D SegmentationArvind Vepa, Zukang Yang, Andrew Choi, Jungseock Joo, Fabien Scalzo, Yizhou Sun. [doi]
- Marginal Causal Flows for Validation and InferenceDaniel de Vassimon Manela, Laura Battaglia, Robin J. Evans. [doi]
- Extending Video Masked Autoencoders to 128 framesNitesh Bharadwaj Gundavarapu, Luke Friedman, Raghav Goyal, Chaitra Hegde, Eirikur Agustsson, Sagar Waghmare, Mikhail Sirotenko, Ming-Hsuan Yang 0001, Tobias Weyand, Boqing Gong, Leonid Sigal. [doi]
- Information-theoretic Generalization Analysis for Expected Calibration ErrorFutoshi Futami, Masahiro Fujisawa. [doi]
- Real-Time Recurrent Learning using Trace Units in Reinforcement LearningEsraa Elelimy, Adam White 0001, Michael Bowling, Martha White. [doi]
- Boosting Alignment for Post-Unlearning Text-to-Image Generative ModelsMyeongseob Ko, Henry Li, Zhun Wang, Jonathan Patsenker, Jiachen T. Wang, Qinbin Li, Ming Jin 0002, Dawn Song, Ruoxi Jia 0001. [doi]
- Local to Global: Learning Dynamics and Effect of Initialization for TransformersAshok Vardhan Makkuva, Marco Bondaschi, Adway Girish, Alliot Nagle, Hyeji Kim, Michael Gastpar, Chanakya Ekbote. [doi]
- Understanding Transformer Reasoning Capabilities via Graph AlgorithmsClayton Sanford, Bahare Fatemi, Ethan Hall, Anton Tsitsulin, Mehran Kazemi, Jonathan Halcrow, Bryan Perozzi, Vahab Mirrokni. [doi]
- Get rich quick: exact solutions reveal how unbalanced initializations promote rapid feature learningDaniel Kunin, Allan Raventós, Clémentine Dominé, Feng Chen, David A. Klindt, Andrew M. Saxe, Surya Ganguli. [doi]
- Few-Shot Task Learning through Inverse Generative ModelingAviv Netanyahu, Yilun Du, Antonia Bronars, Jyothish Pari, Josh Tenenbaum 0001, Tianmin Shu, Pulkit Agrawal 0001. [doi]
- Finding good policies in average-reward Markov Decision Processes without prior knowledgeAdrienne Tuynman, Rémy Degenne, Emilie Kaufmann. [doi]
- From Biased to Unbiased Dynamics: An Infinitesimal Generator ApproachTimothée Devergne, Vladimir Kostic, Michele Parrinello, Massimiliano Pontil. [doi]
- Gaussian Process Bandits for Top-k RecommendationsMohit Yadav, Cameron Musco, Daniel R. Sheldon. [doi]
- A Generative Model of Symmetry TransformationsJames Urquhart Allingham, Bruno Mlodozeniec, Shreyas Padhy, Javier Antorán, David Krueger 0001, Richard E. Turner, Eric T. Nalisnick, José Miguel Hernández-Lobato. [doi]
- Causal Deciphering and Inpainting in Spatio-Temporal Dynamics via Diffusion ModelYifan Duan, Jian Zhao, Pengcheng, Junyuan Mao, Hao Wu, Jingyu Xu, Shilong Wang, Caoyuan Ma, Kai Wang, Kun Wang, Xuelong Li. [doi]
- LIVE: Learnable In-Context Vector for Visual Question AnsweringYingzhe Peng, Chenduo Hao, Xinting Hu, Jiawei Peng, Xin Geng 0001, Xu Yang 0021. [doi]
- SIRIUS : Contexual Sparisty with Correction for Efficient LLMsYang Zhou, Zhuoming Chen, Zhaozhuo Xu, Victoria Lin 0002, Beidi Chen. [doi]
- Generalizing Weather Forecast to Fine-grained Temporal Scales via Physics-AI Hybrid ModelingWanghan Xu, Fenghua Ling, zhangwenlong, Tao Han 0002, Hao Chen 0045, Wanli Ouyang, Lei Bai 0001. [doi]
- Provable Tempered Overfitting of Minimal Nets and Typical NetsItamar Harel, William Hoza, Gal Vardi, Itay Evron, Nati Srebro, Daniel Soudry. [doi]
- Bridging The Gap between Low-rank and Orthogonal Adaptation via Householder Reflection AdaptationShen Yuan, Haotian Liu, Hongteng Xu. [doi]
- Adjust Pearson's $r$ to Measure Arbitrary Monotone DependenceXinbo Ai. [doi]
- BPQP: A Differentiable Convex Optimization Framework for Efficient End-to-End LearningJianming Pan, Zeqi Ye, Xiao Yang, Xu Yang, Weiqing Liu, Lewen Wang, Jiang Bian 0002. [doi]
- Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image SynthesisYuxi Ren, Xin Xia, Yanzuo Lu, Jiacheng Zhang, Jie Wu, Pan Xie, Xing Wang, XueFeng Xiao. [doi]
- xRAG: Extreme Context Compression for Retrieval-augmented Generation with One TokenXin Cheng 0002, Xun Wang, Xingxing Zhang 0002, Tao Ge 0001, Si-Qing Chen, Furu Wei, Huishuai Zhang, Dongyan Zhao 0001. [doi]
- Theoretical Characterisation of the Gauss Newton Conditioning in Neural NetworksJim Zhao, Sidak Pal Singh, Aurélien Lucchi. [doi]
- On the Identifiability of Hybrid Deep Generative Models: Meta-Learning as a SolutionYubo Ye, Maryam Toloubidokhti, Sumeet Vadhavkar, Xiajun Jiang, Huafeng Liu 0003, Linwei Wang. [doi]
- Agent Planning with World Knowledge ModelShuofei Qiao, Runnan Fang, Ningyu Zhang 0001, Yuqi Zhu, Xiang Chen 0016, Shumin Deng, Yong Jiang, Pengjun Xie, Fei Huang, Huajun Chen. [doi]
- Precipitation Downscaling with Spatiotemporal Video DiffusionPrakhar Srivastava 0003, Ruihan Yang, Gavin Kerrigan, Gideon Dresdner, Jeremy McGibbon, Christopher S. Bretherton, Stephan Mandt. [doi]
- Drago: Primal-Dual Coupled Variance Reduction for Faster Distributionally Robust OptimizationRonak Mehta, Jelena Diakonikolas, Zaïd Harchaoui. [doi]
- Minimizing UCB: a Better Local Search Strategy in Local Bayesian OptimizationZheyi Fan, Wenyu Wang, Szu-Hui Ng, Qingpei Hu. [doi]
- OPUS: Occupancy Prediction Using a Sparse SetJiabao Wang, Zhaojiang Liu, Qiang Meng, Liujiang Yan, Ke Wang, Jie Yang, Wei Liu, Qibin Hou, Ming-Ming Cheng. [doi]
- Ordered Momentum for Asynchronous SGDChang-Wei Shi, Yi-Rui Yang, Wu-Jun Li. [doi]
- Should We Really Edit Language Models? On the Evaluation of Edited Language ModelsQi Li, Xiang Liu, Zhenheng Tang, Peijie Dong, ZeYu Li, Xinglin Pan, Xiaowen Chu. [doi]
- Doubly Mild Generalization for Offline Reinforcement LearningYixiu Mao, Qi Wang, Yun Qu 0002, Yuhang Jiang, Xiangyang Ji. [doi]
- Applying Guidance in a Limited Interval Improves Sample and Distribution Quality in Diffusion ModelsTuomas Kynkäänniemi, Miika Aittala, Tero Karras, Samuli Laine, Timo Aila, Jaakko Lehtinen. [doi]
- KALM: Knowledgeable Agents by Offline Reinforcement Learning from Large Language Model RolloutsJing-Cheng Pang, Si-Hang Yang, Kaiyuan Li, Jiaji Zhang, Xiong-Hui Chen, Nan Tang, Yang Yu 0001. [doi]
- Scaling transformer neural networks for skillful and reliable medium-range weather forecastingTung Nguyen, Rohan Shah, Hritik Bansal, Troy Arcomano, Romit Maulik, Rao Kotamarthi, Ian T. Foster, Sandeep Madireddy, Aditya Grover. [doi]
- The State of Data Curation at NeurIPS: An Assessment of Dataset Development Practices in the Datasets and Benchmarks TrackEshta Bhardwaj, Harshit Gujral, Siyi Wu, Ciara Zogheib, Tegan Maharaj, Christoph Becker 0001. [doi]
- Differentially Private Reinforcement Learning with Self-PlayDan Qiao 0002, Yu-Xiang Wang 0003. [doi]
- Equivariant spatio-hemispherical networks for diffusion MRI deconvolutionAxel Elaldi, Guido Gerig, Neel Dey. [doi]
- Expectile Regularization for Fast and Accurate Training of Neural Optimal TransportNazar Buzun, Maksim Bobrin, Dmitry V. Dylov. [doi]
- Learning Frequency-Adapted Vision Foundation Model for Domain Generalized Semantic SegmentationQi Bi, Jingjun Yi, Hao Zheng 0008, Haolan Zhan, Yawen Huang, Wei Ji 0011, Yuexiang Li, Yefeng Zheng 0001. [doi]
- Skinned Motion Retargeting with Dense Geometric Interaction PerceptionZijie Ye, Jia-Wei Liu, Jia Jia, Shikun Sun, Mike Zheng Shou. [doi]
- Interpretable Image Classification with Adaptive Prototype-based Vision TransformersChiyu Ma, Jon Donnelly, Wenjun Liu, Soroush Vosoughi, Cynthia Rudin, Chaofan Chen. [doi]
- MG-Net: Learn to Customize QAOA with Circuit Depth AwarenessYang Qian, Xinbiao Wang, Yuxuan Du, Yong Luo 0002, Dacheng Tao. [doi]
- Confusion-Resistant Federated Learning via Diffusion-Based Data Harmonization on Non-IID DataXiaohong Chen, Canran Xiao, Yongmei Liu. [doi]
- Yo'LLaVA: Your Personalized Language and Vision AssistantThao Nguyen, Haotian Liu, Yuheng Li, Mu Cai, Utkarsh Ojha, Yong Jae Lee. [doi]
- Progressive Exploration-Conformal Learning for Sparsely Annotated Object Detection in Aerial ImagesZihan Lu, Chenxu Wang, Chunyan Xu, Xiangwei Zheng 0001, Zhen Cui 0001. [doi]
- LLM-ESR: Large Language Models Enhancement for Long-tailed Sequential RecommendationQidong Liu, Xian Wu 0001, Yejing Wang, Zijian Zhang 0009, Feng Tian 0002, Yefeng Zheng 0001, Xiangyu Zhao 0001. [doi]
- LINGOLY: A Benchmark of Olympiad-Level Linguistic Reasoning Puzzles in Low Resource and Extinct LanguagesAndrew M. Bean, Simi Hellsten, Harry Mayne, Jabez Magomere, Ethan Chi, Ryan Chi, Scott Hale, Hannah Rose Kirk. [doi]
- A Full-duplex Speech Dialogue Scheme Based On Large Language ModelPeng Wang, Songshuo Lu, Yaohua Tang, Sijie Yan, Wei Xia, Yuanjun Xiong. [doi]
- Human Expertise in Algorithmic PredictionRohan Alur, Manish Raghavan, Devavrat Shah. [doi]
- Layer-Adaptive State Pruning for Deep State Space ModelsMinseon Gwak, Seongrok Moon, Joohwan Ko, PooGyeon Park. [doi]
- Croissant: A Metadata Format for ML-Ready DatasetsMubashara Akhtar, Omar Benjelloun, Costanza Conforti, Luca Foschini, Joan Giner-Miguelez, Pieter Gijsbers, Sujata S. Goswami, Nitisha Jain, Michalis Karamousadakis, Michael Kuchnik, Satyapriya Krishna, Sylvain Lesage, Quentin Lhoest, Pierre Marcenac, Manil Maskey, Peter Mattson, Luis Oala, Hamidah Oderinwale, Pierre Ruyssen, Tim Santos, Rajat Shinde, Elena Simperl, Arjun Suresh, Goeffry Thomas, Slava Tykhonov, Joaquin Vanschoren, Susheel Varma, Jos van der Velde, Steffen Vogler, Carole-Jean Wu, Luyao Zhang. [doi]
- Sigmoid Gating is More Sample Efficient than Softmax Gating in Mixture of ExpertsHuy Nguyen, Nhat Ho, Alessandro Rinaldo. [doi]
- Learning from higher-order correlations, efficiently: hypothesis tests, random features, and neural networksEszter Székely, Lorenzo Bardone, Federica Gerace, Sebastian Goldt. [doi]
- RL on Incorrect Synthetic Data Scales the Efficiency of LLM Math Reasoning by Eight-FoldAmrith Setlur, Saurabh Garg, Xinyang Geng, Naman Garg, Virginia Smith, Aviral Kumar. [doi]
- PRODuctive bandits: Importance Weighting No MoreJulian Zimmert, Teodor Vanislavov Marinov. [doi]
- Adaptive Preference Scaling for Reinforcement Learning with Human FeedbackIlgee Hong, Zichong Li, Alexander Bukharin, Yixiao Li, Haoming Jiang, Tianbao Yang, Tuo Zhao. [doi]
- Consistency of Neural Causal Partial IdentificationJiyuan Tan, Jose H. Blanchet, Vasilis Syrgkanis. [doi]
- ArkVale: Efficient Generative LLM Inference with Recallable Key-Value EvictionRenze Chen, Zhuofeng Wang, Beiquan Cao, Tong Wu, Size Zheng 0001, Xiuhong Li, Xuechao Wei, Shengen Yan, Meng Li, Yun Liang 0001. [doi]
- Skill-aware Mutual Information Optimisation for Zero-shot Generalisation in Reinforcement LearningXuehui Yu, Mhairi Dunion, Xin Li, Stefano V. Albrecht. [doi]
- Provably and Practically Efficient Adversarial Imitation Learning with General Function ApproximationTian Xu, Zhilong Zhang, Ruishuo Chen, Yihao Sun, Yang Yu. [doi]
- DOFEN: Deep Oblivious Forest ENsembleKuan-Yu Chen, Ping-Han Chiang, Hsin-Rung Chou, Chih-Sheng Chen, Tien-Hao Chang. [doi]
- Understanding the Role of Equivariance in Self-supervised LearningYifei Wang 0001, Kaiwen Hu, Sharut Gupta, Ziyu Ye, Yisen Wang 0001, Stefanie Jegelka. [doi]
- ContextCite: Attributing Model Generation to ContextBenjamin Cohen-Wang, Harshay Shah, Kristian Georgiev, Aleksander Madry. [doi]
- Dual Critic Reinforcement Learning under Partial ObservabilityJinqiu Li, Enmin Zhao, Tong Wei, Junliang Xing, Shiming Xiang. [doi]
- Unlocking Tokens as Data Points for Generalization Bounds on Larger Language ModelsSanae Lotfi, Yilun Kuang, Marc Finzi, Brandon Amos, Micah Goldblum, Andrew Gordon Wilson. [doi]
- AlterMOMA: Fusion Redundancy Pruning for Camera-LiDAR Fusion Models with Alternative Modality MaskingShiqi Sun, Yantao Lu, Ning Liu, Bo Jiang, Jinchao Chen, Ying Zhang 0060. [doi]
- Dimension-free Private Mean Estimation for Anisotropic DistributionsYuval Dagan, Michael I. Jordan, Xuelin Yang, Lydia Zakynthinou, Nikita Zhivotovskiy. [doi]
- Me, Myself, and AI: The Situational Awareness Dataset (SAD) for LLMsRudolf Laine, Bilal Chughtai, Jan Betley, Kaivalya Hariharan, Mikita Balesni, Jérémy Scheurer, Marius Hobbhahn, Alexander Meinke, Owain Evans. [doi]
- Cloud Object Detector Adaptation by Integrating Different Source KnowledgeShuaifeng Li, Mao Ye 0001, Lihua Zhou, Nianxin Li, Siying Xiao, Song Tang 0001, Xiatian Zhu. [doi]
- Mechanism design augmented with output adviceGeorge Christodoulou 000, Alkmini Sgouritsa, Ioannis Vlachos. [doi]
- DPIC: Decoupling Prompt and Intrinsic Characteristics for LLM Generated Text DetectionXiao Yu, Yuang Qi, Kejiang Chen, Guoqiang Chen, Xi Yang, Pengyuan Zhu, Xiuwei Shang, Weiming Zhang, Nenghai Yu. [doi]
- Toward Dynamic Non-Line-of-Sight Imaging with Mamba Enforced Temporal ConsistencyYue Li, Yi Sun, Shida Sun, Juntian Ye, Yueyi Zhang, Feihu Xu, Zhiwei Xiong. [doi]
- Neural Network Reparametrization for Accelerated Optimization in Molecular SimulationsNima Dehmamy, Csaba Both, Jeet Mohapatra, Subhro Das, Tommi Jaakkola. [doi]
- The Many Faces of Optimal Weak-to-Strong LearningMikael Møller Høgsgaard, Kasper Green Larsen, Markus Engelund Mathiasen. [doi]
- Functional Bilevel Optimization for Machine LearningIeva Petrulionyte, Julien Mairal, Michael Arbel. [doi]
- Inverse M-Kernels for Linear Universal Approximators of Non-Negative FunctionsHideaki Kim. [doi]
- RGMDT: Return-Gap-Minimizing Decision Tree Extraction in Non-Euclidean Metric SpaceJingdi Chen, Hanhan Zhou, Yongsheng Mei, Carlee Joe-Wong, Gina C. Adam, Nathaniel D. Bastian, Tian Lan 0001. [doi]
- Predicting Ground State Properties: Constant Sample Complexity and Deep Learning AlgorithmsMarc Wanner, Laura Lewis, Chiranjib Bhattacharyya, Devdatt P. Dubhashi, Alexandru Gheorghiu. [doi]
- DN-4DGS: Denoised Deformable Network with Temporal-Spatial Aggregation for Dynamic Scene RenderingJiahao Lu, Jiacheng Deng 0002, Ruijie Zhu 0002, Yanzhe Liang, Wenfei Yang, Xu Zhou, Tianzhu Zhang. [doi]
- Toward Semantic Gaze Target DetectionSamy Tafasca, Anshul Gupta, Victor Bros, Jean-Marc Odobez. [doi]
- Diffusion-Reward Adversarial Imitation LearningChun-Mao Lai, Hsiang-Chun Wang, Ping-Chun Hsieh, Yu-Chiang Frank Wang, Min-Hung Chen, Shao-Hua Sun. [doi]
- Post-Hoc Reversal: Are We Selecting Models Prematurely?Rishabh Ranjan, Saurabh Garg, Mrigank Raman, Carlos Guestrin, Zachary C. Lipton. [doi]
- GarmentLab: A Unified Simulation and Benchmark for Garment ManipulationHaoran Lu, Ruihai Wu, Yitong Li, Sijie Li, Ziyu Zhu, Chuanruo Ning, Yan Zhao 0035, Longzan Luo, Yuanpei Chen, Hao Dong 0003. [doi]
- DCDepth: Progressive Monocular Depth Estimation in Discrete Cosine DomainKun Wang, Zhiqiang Yan, Junkai Fan, Wanlu Zhu, Xiang Li, Jun Li, Jian Yang. [doi]
- Emergence of Hidden Capabilities: Exploring Learning Dynamics in Concept SpaceCore Francisco Park, Maya Okawa, Andrew Lee, Ekdeep Singh Lubana, Hidenori Tanaka. [doi]
- 4DBInfer: A 4D Benchmarking Toolbox for Graph-Centric Predictive Modeling on RDBsMinjie Wang, Quan Gan, David Wipf, Zheng Zhang 0001, Christos Faloutsos, Weinan Zhang 0001, Muhan Zhang, Zhenkun Cai, Jiahang Li, Zunyao Mao, Yakun Song, Jianheng Tang, Yanlin Zhang, Guang Yang, Chuan Lei, Xiao Qin, Ning Li 0029, Han Zhang 0057, Yanbo Wang, Zizhao Zhang. [doi]
- Infer Induced Sentiment of Comment Response to Video: A New Task, Dataset and BaselineQi Jia 0004, Baoyu Fan, Cong Xu, Lu Liu, Liang Jin, Guoguang Du, Zhenhua Guo 0003, Yaqian Zhao, Xuanjing Huang 0001, RenGang Li. [doi]
- Graph Neural Networks Do Not Always OversmoothBastian Epping, Alexandre René, Moritz Helias, Michael T. Schaub. [doi]
- Value Imprint: A Technique for Auditing the Human Values Embedded in RLHF DatasetsIke Obi, Rohan Pant, Srishti Shekhar Agrawal, Maham Ghazanfar, Aaron Basiletti. [doi]
- Prompt Optimization with EASE? Efficient Ordering-aware Automated Selection of ExemplarsZhaoxuan Wu, Xiaoqiang Lin, Zhongxiang Dai, Wenyang Hu, Yao Shu, See-Kiong Ng, Patrick Jaillet, Bryan Kian Hsiang Low. [doi]
- Effective Rank Analysis and Regularization for Enhanced 3D Gaussian SplattingJunha Hyung, Susung Hong, Sungwon Hwang, Jaeseong Lee, Jaegul Choo, Jin-Hwa Kim. [doi]
- TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language NegativesMaitreya Patel, Abhiram Kusumba, Sheng Cheng, Changhoon Kim, Tejas Gokhale, Chitta Baral, Yezhou Yang. [doi]
- Carrot and Stick: Eliciting Comparison Data and BeyondYiling Chen, Shi Feng, Fang-Yi Yu. [doi]
- Rethinking Fourier Transform from A Basis Functions Perspective for Long-term Time Series ForecastingRunze Yang 0002, Longbing Cao, Jie Yang 0002, Jianxun Li. [doi]
- Online Learning with Sublinear Best-Action QueriesMatteo Russo 0002, Andrea Celli, Riccardo Colini-Baldeschi, Federico Fusco, Daniel Haimovich, Dima Karamshuk, Stefano Leonardi 0001, Niek Tax. [doi]
- EM Distillation for One-step Diffusion ModelsSirui Xie, Zhisheng Xiao, Diederik P. Kingma, Tingbo Hou, Ying Nian Wu, Kevin P. Murphy, Tim Salimans, Ben Poole, RuiQi Gao. [doi]
- Mixture of neural fields for heterogeneous reconstruction in cryo-EMAxel Levy, Rishwanth Raghu, David Shustin, Adele Rui-Yang Peng, Huan Li, Oliver Biggs Clarke, Gordon Wetzstein, Ellen D. Zhong. [doi]
- DAPE: Data-Adaptive Positional Encoding for Length ExtrapolationChuanyang Zheng, Yihang Gao, Han Shi, Minbin Huang, Jingyao Li, Jing Xiong, Xiaozhe Ren, Michael K. Ng 0001, Xin Jiang, Zhenguo Li, Yu Li. [doi]
- Vision Mamba MenderJiacong Hu, Anda Cao, Zunlei Feng, Shengxuming Zhang, Yi Wang, Lingxiang Jia, Mingli Song. [doi]
- Aligning Diffusion Behaviors with Q-functions for Efficient Continuous ControlHuayu Chen, Kaiwen Zheng, Hang Su, Jun Zhu. [doi]
- Lips Are Lying: Spotting the Temporal Inconsistency between Audio and Visual in Lip-Syncing DeepFakesWeifeng Liu, Tianyi She, Jiawei Liu, Boheng Li, Dongyu Yao, Ziyou Liang, Run Wang. [doi]
- MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly DetectionHaoyang He, Yuhu Bai, Jiangning Zhang, Qingdong He, Hongxu Chen, Zhenye Gan, Chengjie Wang, Xiangtai Li, Guanzhong Tian, Lei Xie 0007. [doi]
- GIC: Gaussian-Informed Continuum for Physical Property Identification and SimulationJunhao Cai, Yuji Yang, Weihao Yuan 0001, Yisheng He, Zilong Dong, Liefeng Bo, Hui Cheng, Qifeng Chen. [doi]
- Variational Flow Matching for Graph GenerationFloor Eijkelboom, Grigory Bartosh, Christian Andersson Naesseth, Max Welling, Jan-Willem van de Meent. [doi]
- Quantifying the Gain in Weak-to-Strong GeneralizationMoses Charikar, Chirag Pabbaraju, Kirankumar Shiragur. [doi]
- What Factors Affect Multi-Modal In-Context Learning? An In-Depth ExplorationLibo Qin 0001, Qiguang Chen, Hao Fei 0003, Zhi Chen 0006, Min Li 0007, Wanxiang Che. [doi]
- Efficient Centroid-Linkage ClusteringMohammad Hossein Bateni 0001, Laxman Dhulipala, Willem Fletcher, Kishen N. Gowda, D. Ellis Hershkowitz, Rajesh Jayaram, Jakub Lacki. [doi]
- MambaLRP: Explaining Selective State Space Sequence ModelsFarnoush Rezaei Jafari, Grégoire Montavon, Klaus-Robert Müller, Oliver Eberle. [doi]
- Newton Informed Neural Operator for Solving Nonlinear Partial Differential EquationsWenrui Hao, Xinliang Liu, Yahong Yang. [doi]
- Data Augmentation with Diffusion for Open-Set Semi-Supervised LearningSeonghyun Ban, Heesan Kong, Kee-Eung Kim. [doi]
- WildVision: Evaluating Vision-Language Models in the Wild with Human PreferencesYujie Lu, Dongfu Jiang, Wenhu Chen, William Yang Wang, Yejin Choi 0001, Bill Yuchen Lin. [doi]
- Solving Minimum-Cost Reach Avoid using Reinforcement LearningOswin So, Cheng Ge, Chuchu Fan. [doi]
- Monoculture in Matching MarketsKenny Peng, Nikhil Garg 0001. [doi]
- Historical Test-time Prompt Tuning for Vision Foundation ModelsJingyi Zhang 0005, Jiaxing Huang 0001, Xiaoqin Zhang 0002, Ling Shao 0001, Shijian Lu. [doi]
- Breaking the False Sense of Security in Backdoor Defense through Re-Activation AttackMingli Zhu, Siyuan Liang, Baoyuan Wu. [doi]
- Convergence Analysis of Split Federated Learning on Heterogeneous DataPengchao Han, Chao Huang, Geng Tian, Ming Tang, Xin Liu. [doi]
- Rule Based Rewards for Language Model SafetyTong Mu, Alec Helyar, Johannes Heidecke, Joshua Achiam, Andrea Vallone, Ian Kivlichan, Molly Lin, Alex Beutel, John Schulman, Lilian Weng. [doi]
- An In-depth Investigation of Sparse Rate Reduction in Transformer-like ModelsYunzhe Hu, Difan Zou, Dong Xu. [doi]
- A versatile informative diffusion model for single-cell ATAC-seq data generation and analysisLei Huang, Lei Xiong, Na Sun, Zunpeng Liu, Ka Chun Wong, Manolis Kellis. [doi]
- Selective Generation for Controllable Language ModelsMinJae Lee, Kyungmin Kim, Taesoo Kim, Sangdon Park 0001. [doi]
- A Simple Framework for Generalization in Visual RL under Dynamic Scene PerturbationsWonil Song, Hyesong Choi, Kwanghoon Sohn, Dongbo Min. [doi]
- Approximation Rate of the Transformer Architecture for Sequence ModelingHaotian Jiang, Qianxiao Li. [doi]
- Any2Policy: Learning Visuomotor Policy with Any-ModalityYichen Zhu, Zhicai Ou, Feifei Feng, Jian Tang 0008. [doi]
- SF-V: Single Forward Video Generation ModelZhixing Zhang, Yanyu Li, Yushu Wu, Yanwu Xu, Anil Kag, Ivan Skorokhodov, Willi Menapace, Aliaksandr Siarohin, Junli Cao, Dimitris N. Metaxas, Sergey Tulyakov, Jian Ren 0005. [doi]
- Causal Effect Identification in a Sub-Population with Latent VariablesAmir Mohammad Abouei, Ehsan Mokhtarian, Negar Kiyavash, Matthias Grossglauser. [doi]
- Muscles in Time: Learning to Understand Human Motion In-Depth by Simulating Muscle ActivationsDavid Schneider, Simon Reiß, Marco Kugler, Alexander Jaus, Kunyu Peng, Susanne Sutschet, M. Saquib Sarfraz, Sven Matthiesen, Rainer Stiefelhagen. [doi]
- A Kernel Perspective on Distillation-based Collaborative LearningSejun Park, Kihun Hong, Ganguk Hwang. [doi]
- Non-geodesically-convex optimization in the Wasserstein spaceHoang Phuc Hau Luu, Hanlin Yu, Bernardo Williams, Petrus Mikkola, Marcelo Hartmann, Kai Puolamäki, Arto Klami. [doi]
- Expected Probabilistic HierarchiesMarcel Kollovieh, Bertrand Charpentier, Daniel Zügner, Stephan Günnemann. [doi]
- Point-PRC: A Prompt Learning Based Regulation Framework for Generalizable Point Cloud AnalysisHongyu Sun 0006, Qiuhong Ke, Yongcai Wang, Wang Chen, Kang Yang, Deying Li 0001, Jianfei Cai 0001. [doi]
- Multi-modal Situated Reasoning in 3D ScenesXiongkun Linghu, Jiangyong Huang, Xuesong Niu, Xiaojian (Shawn) Ma, Baoxiong Jia, Siyuan Huang 0001. [doi]
- Self-playing Adversarial Language Game Enhances LLM ReasoningPengyu Cheng, Tianhao Hu, Han Xu, Zhisong Zhang, Yong Dai, Lei Han, Nan Du, Xiaolong Li. [doi]
- Complete Graphical Criterion for Sequential Covariate Adjustment in Causal InferenceYonghan Jung, Min Woo Park, Sanghack Lee. [doi]
- GREATS: Online Selection of High-Quality Data for LLM Training in Every IterationJiachen T. Wang, Tong Wu, Dawn Song, Prateek Mittal, Ruoxi Jia 0001. [doi]
- Compact Language Models via Pruning and Knowledge DistillationSaurav Muralidharan, Sharath Turuvekere Sreenivas, Raviraj Joshi, Marcin Chochowski, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro, Jan Kautz, Pavlo Molchanov 0001. [doi]
- UMFC: Unsupervised Multi-Domain Feature Calibration for Vision-Language ModelsJiachen Liang, Ruibing Hou, Minyang Hu, Hong Chang 0001, Shiguang Shan, Xilin Chen 0001. [doi]
- Who Evaluates the Evaluations? Objectively Scoring Text-to-Image Prompt Coherence Metrics with T2IScoreScore (TS2)Michael Saxon, Fatima Jahara, Mahsa Khoshnoodi, Yujie Lu, Aditya Sharma, William Yang Wang. [doi]
- Optimization Can Learn Johnson Lindenstrauss EmbeddingsNikos Tsikouras, Constantine Caramanis, Christos Tzamos. [doi]
- Cal-DPO: Calibrated Direct Preference Optimization for Language Model AlignmentTeng Xiao, Yige Yuan, Huaisheng Zhu, Mingxiao Li, Vasant G. Honavar. [doi]
- Amortized Fourier Neural OperatorsZipeng Xiao, Siqi Kou, Zhongkai Hao, Bokai Lin, Zhijie Deng. [doi]
- Pseudo-Private Data Guided Model Inversion AttacksXiong Peng, Bo Han 0003, Feng Liu 0003, Tongliang Liu, Mingyuan Zhou. [doi]
- Learning from Highly Sparse Spatio-temporal DataLeyan Deng, Chenwang Wu, Defu Lian, Enhong Chen. [doi]
- Aligning Diffusion Models by Optimizing Human UtilityShufan Li, Konstantinos Kallidromitis, Akash Gokul, Yusuke Kato, Kazuki Kozuka. [doi]
- HYDRA-FL: Hybrid Knowledge Distillation for Robust and Accurate Federated LearningMomin Ahmad Khan, Yasra Chandio, Fatima M. Anwar 0001. [doi]
- Enhancing Zero-Shot Vision Models by Label-Free Prompt Distribution Learning and Bias CorrectingXingyu Zhu, Beier Zhu, Yi Tan 0001, Shuo Wang 0008, Yanbin Hao, Hanwang Zhang. [doi]
- Robust Graph Neural Networks via Unbiased AggregationZhichao Hou, Ruiqi Feng, Tyler Derr, Xiaorui Liu. [doi]
- FreqBlender: Enhancing DeepFake Detection by Blending Frequency KnowledgeHanzhe Li, Jiaran Zhou, Yuezun Li, Baoyuan Wu, Bin Li 0011, Junyu Dong. [doi]
- Get Rid of Isolation: A Continuous Multi-task Spatio-Temporal Learning FrameworkZhongchao Yi, Zhengyang Zhou, Qihe Huang, Yanjiang Chen, Liheng Yu, Xu Wang, Yang Wang 0015. [doi]
- DOPPLER: Differentially Private Optimizers with Low-pass Filter for Privacy Noise ReductionXinwei Zhang 0001, Zhiqi Bu, Mingyi Hong 0001, Meisam Razaviyayn. [doi]
- UNIT: Unifying Image and Text Recognition in One Vision EncoderYi Zhu 0004, Yanpeng Zhou, Chunwei Wang, Yang Cao, Jianhua Han, Lu Hou, Hang Xu. [doi]
- One-Shot Safety Alignment for Large Language Models via Optimal DualizationXinmeng Huang, Shuo Li, Edgar Dobriban, Osbert Bastani, Hamed Hassani, Dongsheng Ding. [doi]
- Efficient LLM Scheduling by Learning to RankYichao Fu, Siqi Zhu, Runlong Su, Aurick Qiao, Ion Stoica, Hao Zhang 0108. [doi]
- eXponential FAmily Dynamical Systems (XFADS): Large-scale nonlinear Gaussian state-space modelingMatthew Dowling, Yuan Zhao 0004, Memming Park. [doi]
- Unveiling Encoder-Free Vision-Language ModelsHaiwen Diao, Yufeng Cui, Xiaotong Li, Yueze Wang, Huchuan Lu, Xinlong Wang. [doi]
- SwitchHead: Accelerating Transformers with Mixture-of-Experts AttentionRóbert Csordás, Piotr Piekos, Kazuki Irie, Jürgen Schmidhuber. [doi]
- Would I Lie To You? Inference Time Alignment of Language Models using Direct Preference HeadsAvelina Asada Hadji-Kyriacou, Ognjen Arandjelovic. [doi]
- VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion ModelsWenhao Wang, Yi Yang. [doi]
- GraphMETRO: Mitigating Complex Graph Distribution Shifts via Mixture of Aligned ExpertsShirley Wu, Kaidi Cao, Bruno Ribeiro 0001, James Y. Zou, Jure Leskovec. [doi]
- Neural Model CheckingMirco Giacobbe, Daniel Kroening, Abhinandan Pal, Michael Tautschnig. [doi]
- Listenable Maps for Zero-Shot Audio ClassifiersFrancesco Paissan, Luca Della Libera, Mirco Ravanelli, Cem Subakan. [doi]
- Learning-Augmented Priority QueuesZiyad Benomar, Christian Coester. [doi]
- Robust Mixture Learning when Outliers Overwhelm Small GroupsDaniil Dmitriev, Rares-Darius Buhai, Stefan Tiegel, Alexander Wolters, Gleb Novikov, Amartya Sanyal, David Steurer, Fanny Yang. [doi]
- Optimal Transport-based Labor-free Text Prompt Modeling for Sketch Re-identificationRui Li, Tingting Ren, Jie Wen 0001, Jinxing Li. [doi]
- Full-Distance Evasion of Pedestrian Detectors in the Physical WorldZhi-cheng, Zhanhao Hu, Yuqiu Liu, Jianmin Li 0001, Hang Su, Xiaolin Hu 0001. [doi]
- Generalizablity of Memorization Neural NetworkLijia Yu, Xiao-Shan Gao, Lijun Zhang, Yibo Miao. [doi]
- EPIC: Effective Prompting for Imbalanced-Class Data Synthesis in Tabular Data Classification via Large Language ModelsJinhee Kim, Taesung Kim, Jaegul Choo. [doi]
- Spiking Graph Neural Network on Riemannian ManifoldsLi Sun 0008, Zhenhao Huang, Qiqi Wan, Hao Peng 0001, Philip S. Yu. [doi]
- Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?Ruisheng Cao, Fangyu Lei, Haoyuan Wu, Jixuan Chen, Yeqiao Fu, Hongcheng Gao, Xinzhuang Xiong, Hanchong Zhang, Wenjing Hu, Yuchen Mao, Tianbao Xie, Hongshen Xu, Danyang Zhang, Sida I. Wang, Ruoxi Sun 0002, Pengcheng Yin, Caiming Xiong, Ansong Ni, Qian Liu, Victor Zhong, Lu Chen 0002, Kai Yu, Tao Yu 0009. [doi]
- Optimal Hypothesis Selection in (Almost) Linear TimeMaryam Aliakbarpour, Mark Bun, Adam Smith. [doi]
- EHRCon: Dataset for Checking Consistency between Unstructured Notes and Structured Tables in Electronic Health RecordsYeonsu Kwon, Jiho Kim, Gyubok Lee, Seongsu Bae, Daeun Kyung, Wonchul Cha, Tom Pollard, Alistair Johnson, Edward Choi. [doi]
- Target-Guided Adversarial Point Cloud Transformer Towards Recognition Against Real-world CorruptionsJie Wang, Tingfa Xu, Lihe Ding, Jianan Li 0001. [doi]
- Conditional Synthesis of 3D Molecules with Time Correction SamplerHojung Jung, Youngrok Park, Laura Schmid, Jaehyeong Jo, Dongkyu Lee, Bongsang Kim, Se-Young Yun, Jinwoo Shin. [doi]
- The Fragility of Fairness: Causal Sensitivity Analysis for Fair Machine LearningJake Fawkes, Nic Fishman, Mel Andrews, Zachary C. Lipton. [doi]
- Continuous Temporal Domain GeneralizationZekun Cai, Guangji Bai, Renhe Jiang, Xuan Song 0001, Liang Zhao 0002. [doi]
- PrivAuditor: Benchmarking Data Protection Vulnerabilities in LLM Adaptation TechniquesDerui Zhu, Dingfan Chen, Xiongfei Wu, Jiahui Geng, Zhuo Li, Jens Grossklags, Lei Ma 0003. [doi]
- Jointly Modeling Inter- & Intra-Modality Dependencies for Multi-modal LearningDivyam Madaan, Taro Makino, Sumit Chopra, KyungHyun Cho. [doi]
- Towards Heterogeneous Long-tailed Learning: Benchmarking, Metrics, and ToolboxHaohui Wang, Weijie Guan, Jianpeng Chen, Zi Wang, Dawei Zhou 0003. [doi]
- ZOPP: A Framework of Zero-shot Offboard Panoptic Perception for Autonomous DrivingTao Ma 0002, Hongbin Zhou, Qiusheng Huang, Xuemeng Yang, Jianfei Guo, Bo Zhang, Min Dou, Yu Qiao 0001, Botian Shi, Hongsheng Li 0001. [doi]
- Collaborative Refining for Learning from Inaccurate LabelsBin Han, Yi-Xuan Sun, Ya-Lin Zhang 0001, Libang Zhang, Haoran Hu, Longfei Li, Jun Zhou 0011, Guo Ye, Huimei He. [doi]
- HYDRA: Model Factorization Framework for Black-Box LLM PersonalizationYuchen Zhuang, Haotian Sun, Yue Yu, Rushi Qiang, Qifan Wang, Chao Zhang, Bo Dai 0001. [doi]
- Improving Visual Prompt Tuning by Gaussian Neighborhood Minimization for Long-Tailed Visual RecognitionMengke Li 0001, Ye Liu, Yang Lu 0009, Yiqun Zhang 0006, Yiu-ming Cheung, Hui Huang 0004. [doi]
- InfLLM: Training-Free Long-Context Extrapolation for LLMs with an Efficient Context MemoryChaojun Xiao, Pengle Zhang, Xu Han 0007, Guangxuan Xiao, Yankai Lin, Zhengyan Zhang, Zhiyuan Liu 0001, Maosong Sun 0001. [doi]
- Improving robustness to corruptions with multiplicative weight perturbationsTrung Q. Trinh, Markus Heinonen, Luigi Acerbi, Samuel Kaski. [doi]
- LoTLIP: Improving Language-Image Pre-training for Long Text UnderstandingWei Wu, Kecheng Zheng, Shuailei Ma, Fan Lu, Yuxin Guo, Yifei Zhang, Wei Chen 0001, Qingpei Guo, Yujun Shen, Zheng-Jun Zha. [doi]
- How JEPA Avoids Noisy Features: The Implicit Bias of Deep Linear Self Distillation NetworksEtai Littwin, Omid Saremi, Madhu Advani, Vimal Thilak, Preetum Nakkiran, Chen Huang 0001, Joshua M. Susskind. [doi]
- EGSST: Event-based Graph Spatiotemporal Sensitive Transformer for Object DetectionSheng Wu, Hang-sheng, Hui Feng 0001, Bo Hu 0002. [doi]
- GV-Rep: A Large-Scale Dataset for Genetic Variant Representation LearningZehui Li, Vallijah Subasri, Guy-Bart Stan, Yiren Zhao, Bo Wang. [doi]
- Conjugated Semantic Pool Improves OOD Detection with Pre-trained Vision-Language ModelsMengyuan Chen, Junyu Gao 0001, Changsheng Xu. [doi]
- ALI-Agent: Assessing LLMs' Alignment with Human Values via Agent-based EvaluationJingnan Zheng, Han Wang, An Zhang, Tai D. Nguyen, Jun Sun 0001, Tat-Seng Chua. [doi]
- Error Correction Output Codes for Robust Neural Networks against Weight-errors: A Neural Tangent Kernel Point of ViewAnlan Yu, Shusen Jing, Ning Lyu, Wujie Wen, Zhiyuan Yan. [doi]
- Rule Extrapolation in Language Modeling: A Study of Compositional Generalization on OOD PromptsAnna Mészáros, Szilvia Ujváry, Wieland Brendel, Patrik Reizinger, Ferenc Huszar. [doi]
- ChronoEpilogi: Scalable Time Series Selection with Multiple SolutionsEtienne Vareille, Michele Linardi, Ioannis Tsamardinos, Vassilis Christophides. [doi]
- I2EBench: A Comprehensive Benchmark for Instruction-based Image EditingYiwei Ma, Jiayi Ji, Ke Ye, Weihuang Lin, Zhibin Wang, Yonghan Zheng, Qiang Zhou, Xiaoshuai Sun, Rongrong Ji. [doi]
- Bridging Gaps: Federated Multi-View Clustering in Heterogeneous Hybrid ViewsXinyue Chen, Yazhou Ren 0001, Jie Xu 0044, Fangfei Lin, Xiaorong Pu, Yang Yang 0002. [doi]
- Revisiting Self-Supervised Heterogeneous Graph Learning from Spectral Clustering PerspectiveYujie Mo, Zhihe Lu, Runpeng Yu, Xiaofeng Zhu 0001, Xinchao Wang. [doi]
- Mitigating Object Hallucination via Concentric Causal AttentionYun Xing, Yiheng Li, Ivan Laptev, Shijian Lu. [doi]
- Leveraging Contrastive Learning for Enhanced Node Representations in Tokenized Graph TransformersJinsong Chen 0002, Hanpeng Liu, John E. Hopcroft, Kun He 0001. [doi]
- CIFD: Controlled Information Flow to Enhance Knowledge DistillationYashas Malur Saidutta, Rakshith Sharma Srinivasa, Jaejin Cho, Ching Hua Lee, Chouchang Yang, Yilin Shen, Hongxia Jin. [doi]
- 3-in-1: 2D Rotary Adaptation for Efficient Finetuning, Efficient Batching and ComposabilityBaohao Liao, Christof Monz. [doi]
- A survey and benchmark of high-dimensional Bayesian optimization of discrete sequencesMiguel González Duque, Richard Michael, Simon Bartels, Yevgen Zainchkovskyy, Søren Hauberg, Wouter Boomsma. [doi]
- BiDM: Pushing the Limit of Quantization for Diffusion ModelsXingyu Zheng, Xianglong Liu 0001, Yichen Bian, Xudong Ma, Yulun Zhang 0001, Jiakai Wang, Jinyang Guo, Haotong Qin. [doi]
- Parallelizing Model-based Reinforcement Learning Over the Sequence LengthZirui Wang, Yue Deng, Junfeng Long, Yin Zhang. [doi]
- What Makes CLIP More Robust to Long-Tailed Pre-Training Data? A Controlled Study for Transferable InsightsXin Wen 0004, Bingchen Zhao, Yilun Chen, Jiangmiao Pang, Xiaojuan Qi 0001. [doi]
- What Rotary Position Embedding Can Tell Us: Identifying Query and Key Weights Corresponding to Basic Syntactic or High-level Semantic InformationYiting Chen 0003, Junchi Yan. [doi]
- OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring ModelingLinhui Xiao, Xiaoshan Yang, Fang Peng, Yaowei Wang 0001, Changsheng Xu. [doi]
- Seeing Beyond the Crop: Using Language Priors for Out-of-Bounding Box Keypoint PredictionBavesh Balaji, Jerrin Bright, Yuhao Chen 0001, Sirisha Rambhatla, John S. Zelek, David A. Clausi. [doi]
- Understanding the Gains from Repeated Self-DistillationDivyansh Pareek, Simon S. Du, Sewoong Oh. [doi]
- QuaRot: Outlier-Free 4-Bit Inference in Rotated LLMsSaleh Ashkboos, Amirkeivan Mohtashami, Maximilian L. Croci, Bo Li, Pashmina Cameron, Martin Jaggi, Dan Alistarh, Torsten Hoefler, James Hensman. [doi]
- Fair Allocation in Dynamic Mechanism DesignAlireza Fallah 0001, Michael I. Jordan, Annie Ulichney. [doi]
- A Simple Remedy for Dataset Bias via Self-Influence: A Mislabeled Sample PerspectiveYeonsung Jung, Jaeyun Song, June Yong Yang, Jin-Hwa Kim, Sungyub Kim, Eunho Yang. [doi]
- DRACO: A Denoising-Reconstruction Autoencoder for Cryo-EMYingjun Shen, Haizhao Dai, Qihe Chen, Yan Zeng, Jiakai Zhang, Yuan Pei, Jingyi Yu. [doi]
- BricksRL: A Platform for Democratizing Robotics and Reinforcement Learning Research and Education with LEGOSebastian Dittert, Vincent Moens, Gianni De Fabritiis. [doi]
- Benchmarking the Attribution Quality of Vision ModelsRobin Hesse, Simone Schaub-Meyer, Stefan Roth 0001. [doi]
- Towards Effective Planning Strategies for Dynamic Opinion NetworksBharath Muppasani, Protik Nag, Vignesh Narayanan, Biplav Srivastava, Michael N. Huhns. [doi]
- Conformal Classification with Equalized Coverage for Adaptively Selected GroupsYanfei Zhou, Matteo Sesia. [doi]
- What type of inference is planning?Miguel Lázaro-Gredilla, Li Yang Ku, Kevin P. Murphy, Dileep George. [doi]
- Sketched Lanczos uncertainty score: a low-memory summary of the Fisher informationMarco Miani, Lorenzo Beretta 0001, Søren Hauberg. [doi]
- Learning Complete Protein Representation by Dynamically Coupling of Sequence and StructureBozhen Hu, Cheng Tan 0012, Jun Xia 0001, Yue Liu 0008, Lirong Wu, Jiangbin Zheng, Yongjie Xu, Yufei Huang 0002, Stan Z. Li. [doi]
- Jailbreaking Large Language Models Against Moderation Guardrails via Cipher CharactersHaibo Jin, Andy Zhou, Joe D. Menke, Haohan Wang. [doi]
- MVGamba: Unify 3D Content Generation as State Space Sequence ModelingXuanyu Yi, Zike Wu, Qiuhong Shen, Qingshan Xu 0001, Pan Zhou 0002, Joo-Hwee Lim, Shuicheng Yan, Xinchao Wang, Hanwang Zhang. [doi]
- CoBo: Collaborative Learning via Bilevel OptimizationDiba Hashemi, Lie He, Martin Jaggi. [doi]
- Bisimulation Metrics are Optimal Transport Distances, and Can be Computed EfficientlySergio Calo, Anders Jonsson 0001, Gergely Neu, Ludovic Schwartz, Javier Segovia Aguas. [doi]
- MUVERA: Multi-Vector Retrieval via Fixed Dimensional EncodingRajesh Jayaram, Laxman Dhulipala, Majid Hadian, Jason Lee, Vahab Mirrokni. [doi]
- Reasons and Solutions for the Decline in Model Performance after EditingXiusheng Huang, Jiaxiang Liu, Yequan Wang, Kang Liu 0001. [doi]
- Induced Model Matching: Restricted Models Help Train Full-Featured ModelsUsama Muneeb, Mesrob I. Ohannessian. [doi]
- Federated Online Prediction from Experts with Differential Privacy: Separations and Regret Speed-upsFengyu Gao, Ruiquan Huang, Jing Yang. [doi]
- Learning Group Actions on Latent RepresentationsYinzhu Jin, Aman Shrivastava, Tom Fletcher. [doi]
- On the Expressivity and Sample Complexity of Node-Individualized Graph Neural NetworksPaolo Pellizzoni, Till Hendrik Schulz, Dexiong Chen, Karsten M. Borgwardt. [doi]
- TSGM: A Flexible Framework for Generative Modeling of Synthetic Time SeriesAlexander V. Nikitin, Letizia Iannucci, Samuel Kaski. [doi]
- Identifiable Object-Centric Representation Learning via Probabilistic Slot AttentionAvinash Kori, Francesco Locatello, Ainkaran Santhirasekaram, Francesca Toni, Ben Glocker, Fabio De Sousa Ribeiro. [doi]
- Implicit Optimization Bias of Next-token Prediction in Linear ModelsChristos Thrampoulidis. [doi]
- Replicability in Learning: Geometric Partitions and KKM-Sperner LemmaJason Vander Woude, Peter Dixon 0002, Aduri Pavan, Jamie Radcliffe, N. V. Vinodchandran. [doi]
- Consent in Crisis: The Rapid Decline of the AI Data CommonsShayne Longpre, Robert Mahari, Ariel Lee, Campbell Lund, Hamidah Oderinwale, William Brannon, Nayan Saxena, Naana Obeng-Marnu, Tobin South, Cole-Hunter, Kevin Klyman, Christopher Klamm, Hailey Schoelkopf, Nikhil Singh 0003, Manuel Cherep, Ahmad Anis, An Dinh, Caroline Shamiso Chitongo, Da Yin, Damien Sileo, Deividas Mataciunas, Diganta Misra, Emad A. Alghamdi, Enrico Shippole, Jianguo Zhang 0005, Joanna Materzynska, Kun Qian 0016, Kushagra Tiwary, Lester James V. Miranda, Manan Dey, Minnie Liang, Mohammed Hamdy, Niklas Muennighoff, Seonghyeon Ye, Seungone Kim, Shrestha Mohanty, Vipul Gupta, Vivek Sharma 0001, Minh Chien Vu, Xuhui Zhou, Yizhi Li, Caiming Xiong, Luis Villa, Stella Biderman, Hanlin Li, Daphne Ippolito, Sara Hooker, Jad Kabbara, Alex Pentland. [doi]
- DarkSAM: Fooling Segment Anything Model to Segment NothingZiqi Zhou 0001, Yufei Song, Minghui Li, Shengshan Hu, Xianlong Wang 0001, Leo Yu Zhang, Dezhong Yao 0001, Hai Jin 0001. [doi]
- Introspective Planning: Aligning Robots' Uncertainty with Inherent Task AmbiguityKaiqu Liang, Zixu Zhang, Jaime F. Fisac. [doi]
- Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without GuidanceKuan Heng Lin, Sicheng Mo, Ben Klingher, Fangzhou Mu, Bolei Zhou. [doi]
- Diffusion-based Layer-wise Semantic Reconstruction for Unsupervised Out-of-Distribution DetectionYing Yang, De Cheng, Chaowei Fang, Yubiao Wang, Changzhe Jiao, Lechao Cheng, Nannan Wang 0001, Xinbo Gao 0001. [doi]
- Breaking the curse of dimensionality in structured density estimationRobert A. Vandermeulen, Wai Ming Tai, Bryon Aragam. [doi]
- DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal PerceptionXiaotong Li, Fan Zhang, Haiwen Diao, Yueze Wang, Xinlong Wang, Lingyu Duan. [doi]
- Dense Connector for MLLMsHuanjin Yao, Wenhao Wu, Taojiannan Yang, Yuxin Song, Mengxi Zhang, Haocheng Feng, Yifan Sun 0003, Zhiheng Li, Wanli Ouyang, Jingdong Wang 0001. [doi]
- Bayesian Domain Adaptation with Gaussian Mixture Domain-IndexingYanfang Ling, Jiyong Li, Lingbo Li, Shangsong Liang. [doi]
- Boosting Graph Pooling with Persistent HomologyChaolong Ying, Xinjian Zhao, Tianshu Yu. [doi]
- Similarity-Navigated Conformal Prediction for Graph Neural NetworksJianqing Song, Jianguo Huang, Wenyu Jiang, Baoming Zhang, Shuangjie Li, Chongjun Wang. [doi]
- SeTAR: Out-of-Distribution Detection with Selective Low-Rank ApproximationYixia Li, Boya Xiong, Guanhua Chen 0001, Yun Chen 0007. [doi]
- Linking In-context Learning in Transformers to Human Episodic MemoryJi-An Li, Corey Y. Zhou, Marcus K. Benna, Marcelo G. Mattar. [doi]
- Antigen-Specific Antibody Design via Direct Energy-based Preference OptimizationXiangxin Zhou, Dongyu Xue, Ruizhe Chen, Zaixiang Zheng, Liang Wang 0001, Quanquan Gu. [doi]
- Generating Code World Models with Large Language Models Guided by Monte Carlo Tree SearchNicola Dainese, Matteo Merler, Minttu Alakuijala, Pekka Marttinen. [doi]
- Visual Data Diagnosis and Debiasing with Concept GraphsRwiddhi Chakraborty, Yinong Wang, Jialu Gao, Runkai Zheng, Cheng Zhang 0014, Fernando De la Torre. [doi]
- Learning Elastic Costs to Shape Monge DisplacementsMichal Klein, Aram-Alexandre Pooladian, Pierre Ablin, Eugène Ndiaye, Jonathan Niles-Weed, Marco Cuturi. [doi]
- Approximated Orthogonal Projection Unit: Stabilizing Regression Network Training Using Natural GradientShaoqi Wang, Chunjie Yang, Siwei Lou. [doi]
- Graph Classification via Reference Distribution Learning: Theory and PracticeZixiao Wang, Jicong Fan. [doi]
- Accelerating Nash Equilibrium Convergence in Monte Carlo Settings Through Counterfactual Value Based Fictitious PlayQi Ju 0001, Falin Hei, Ting Feng, Dengbing Yi, Zhemei Fang, Yunfeng Luo. [doi]
- ST$_k$: A Scalable Module for Solving Top-k ProblemsHanchen Xia, Weidong Liu 0005, Xiaojun Mao. [doi]
- LG-CAV: Train Any Concept Activation Vector with Language GuidanceQihan Huang, Jie Song, Mengqi Xue, Haofei Zhang, Bingde Hu, Huiqiong Wang, Hao Jiang, Xingen Wang, Mingli Song. [doi]
- Vector Quantization Prompting for Continual LearningLi Jiao, Qiuxia Lai, Yu Li 0007, Qiang Xu 0001. [doi]
- DDGS-CT: Direction-Disentangled Gaussian Splatting for Realistic Volume RenderingZhongpai Gao, Benjamin Planche, Meng Zheng 0002, Xiao Chen, Terrence Chen, Ziyan Wu. [doi]
- VeXKD: The Versatile Integration of Cross-Modal Fusion and Knowledge Distillation for 3D PerceptionYuzhe Ji, Yijie Chen, Liuqing Yang 0001, Rui Ding, Meng Yang, Xinhu Zheng. [doi]
- Taming the Long Tail in Human Mobility PredictionXiaohang Xu 0002, Renhe Jiang, Chuang Yang, Zipei Fan, Kaoru Sezaki. [doi]
- Polynomial-Time Computation of Exact $\Phi$-Equilibria in Polyhedral GamesGabriele Farina, Charilaos Pipis. [doi]
- LibMOON: A Gradient-based MultiObjective OptimizatioN Library in PyTorchXiaoyuan Zhang, Liang Zhao, Yingying Yu, Xi Lin 0001, Yifan Chen 0001, Han Zhao 0002, Qingfu Zhang 0001. [doi]
- Estimating Ego-Body Pose from Doubly Sparse Egocentric Video DataSeunggeun Chi, Pin-Hao Huang, Enna Sachdeva, Hengbo Ma, Karthik Ramani, Kwonjoon Lee. [doi]
- emg2pose: A Large and Diverse Benchmark for Surface Electromyographic Hand Pose EstimationSasha Salter, Richard Warren, Collin Schlager, Adrian Spurr, Shangchen Han, Rohin Bhasin, Yujun Cai, Peter Walkington, Anuoluwapo Bolarinwa, Robert J. Wang, Nathan Danielson, Josh Merel, Eftychios A. Pnevmatikakis, Jesse Marshall. [doi]
- Low Precision Local Training is Enough for Federated LearningZhiwei Li, YiQiu Li, Binbin Lin, Zhongming Jin, Weizhong Zhang. [doi]
- AgentBoard: An Analytical Evaluation Board of Multi-turn LLM AgentsChang Ma, Junlei Zhang, Zhihao Zhu, Cheng Yang, Yujiu Yang, Yaohui Jin, Zhenzhong Lan, Lingpeng Kong, Junxian He. [doi]
- DiffuPac: Contextual Mimicry in Adversarial Packets Generation via Diffusion ModelAbdullah Bin Jasni, Akiko Manada, Kohei Watabe. [doi]
- Hybrid Generative AI for De Novo Design of Co-Crystals with Enhanced TabletabilityNina Gubina, Andrei Dmitrenko, Gleb V. Solovev, Lyubov Yamshchikova, Oleg Petrov, Ivan Lebedev, Nikita Serov, Grigorii Kirgizov, Nikolay O. Nikitin, Vladimir Vinogradov. [doi]
- Large Language Models Must Be Taught to Know What They Don't KnowSanyam Kapoor, Nate Gruver, Manley Roberts, Katie Collins, Arka Pal, Umang Bhatt, Adrian Weller, Samuel Dooley, Micah Goldblum, Andrew Gordon Wilson. [doi]
- Customizing Language Models with Instance-wise LoRA for Sequential RecommendationXiaoyu Kong, Jiancan Wu, An Zhang 0003, Leheng Sheng, Hui Lin, Xiang Wang, Xiangnan He 0001. [doi]
- Testably Learning Polynomial Threshold FunctionsLucas Slot, Stefan Tiegel, Manuel Wiedmer. [doi]
- Du-IN: Discrete units-guided mask modeling for decoding speech from Intracranial Neural signalsHui Zheng, Haiteng Wang, Wei-Bang Jiang, Zhongtao Chen, Li He, Pei-Yang Lin, Peng-Hu Wei, Guo-Guang Zhao, Yun-Zhe Liu. [doi]
- Who's asking? User personas and the mechanics of latent misalignmentAsma Ghandeharioun, Ann Yuan, Marius Guerard, Emily Reif, Michael A. Lepori, Lucas Dixon. [doi]
- On the Target-kernel Alignment: a Unified Analysis with Kernel ComplexityChao Wang, Xin He, Yuwen Wang, Junhui Wang. [doi]
- Bias in Motion: Theoretical Insights into the Dynamics of Bias in SGD TrainingAnchit Jain, Rozhin Nobahari, Aristide Baratin, Stefano Sarao Mannelli. [doi]
- Multi-Label Open Set RecognitionYibo Wang, Jun-Yi Hang, Min-Ling Zhang. [doi]
- Counter-Current Learning: A Biologically Plausible Dual Network Approach for Deep LearningChia-Hsiang Kao, Bharath Hariharan. [doi]
- Generalizable and Animatable Gaussian Head AvatarXuangeng Chu, Tatsuya Harada. [doi]
- Contextual Bilevel Reinforcement Learning for Incentive AlignmentVinzenz Thoma, Barna Pásztor, Andreas Krause 0001, Giorgia Ramponi, Yifan Hu. [doi]
- Convolutions and More as Einsum: A Tensor Network Perspective with Advances for Second-Order MethodsFelix Dangel. [doi]
- SHDocs: A dataset, benchmark, and method to efficiently generate high-quality, real-world specular highlight data with near-perfect alignmentJovin Leong, Koa Di, Benjamin Cham, Shaun Heng. [doi]
- Learning De-Biased Representations for Remote-Sensing ImageryZichen Tian, Zhaozheng Chen, Qianru Sun. [doi]
- WeiPer: OOD Detection using Weight Perturbations of Class ProjectionsMaximilian Granz, Manuel Heurich, Tim Landgraf. [doi]
- IMAGPose: A Unified Conditional Framework for Pose-Guided Person GenerationFei Shen, Jinhui Tang 0001. [doi]
- Perceiving Longer Sequences With Bi-Directional Cross-Attention TransformersMarkus Hiller, Krista A. Ehinger, Tom Drummond. [doi]
- Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement LearningAneesh Muppidi, Zhiyu Zhang, Heng Yang. [doi]
- Supervised Kernel ThinningAlbert Gong, Kyuseong Choi, Raaz Dwivedi. [doi]
- Addressing Spectral Bias of Deep Neural Networks by Multi-Grade Deep LearningRonglong Fang, Yuesheng Xu. [doi]
- Privacy Backdoors: Enhancing Membership Inference through Poisoning Pre-trained ModelsYuxin Wen, Leo Marchyok, Sanghyun Hong 0001, Jonas Geiping, Tom Goldstein, Nicholas Carlini. [doi]
- Stochastic Newton Proximal Extragradient MethodRuichen Jiang, Michal Derezinski, Aryan Mokhtari. [doi]
- CARE: a Benchmark Suite for the Classification and Retrieval of EnzymesJason Yang, Ariane Mora, Shengchao Liu, Bruce J. Wittmann, Animashree Anandkumar, Frances H. Arnold, Yisong Yue. [doi]
- Great Minds Think Alike: The Universal Convergence Trend of Input SalienceYipei Wang, Jeffrey Siskind, Xiaoqian Wang. [doi]
- General Detection-based Text Line RecognitionRaphaël Baena, Syrine Kalleli, Mathieu Aubry. [doi]
- Weisfeiler and Leman Go Loopy: A New Hierarchy for Graph Representational LearningRaffaele Paolino, Sohir Maskey, Pascal Welke, Gitta Kutyniok. [doi]
- LLMCBench: Benchmarking Large Language Model Compression for Efficient DeploymentGe Yang, Changyi He, Jinyang Guo, Jianyu Wu, Yifu Ding, Aishan Liu, Haotong Qin, Pengliang Ji, Xianglong Liu 0001. [doi]
- SceneCraft: Layout-Guided 3D Scene GenerationXiuyu Yang, Yunze Man, Junkun Chen, Yu-Xiong Wang. [doi]
- Debiasing Synthetic Data Generated by Deep Generative ModelsAlexander Decruyenaere, Heidelinde Dehaene, Paloma Rabaey, Johan Decruyenaere, Christiaan Polet, Thomas Demeester, Stijn Vansteelandt. [doi]
- Hints-In-Browser: Benchmarking Language Models for Programming Feedback GenerationNachiket Kotalwar, Alkis Gotovos, Adish Singla. [doi]
- DiP-GO: A Diffusion Pruner via Few-step Gradient OptimizationHaowei Zhu, Dehua Tang, Ji Liu, Mingjie Lu, Jintu Zheng, Jinzhang Peng, Dong Li 0025, Yu Wang 0002, Fan Jiang, Lu Tian, Spandan Tiwari, Ashish Sirasao, Jun-Hai Yong, Bin Wang 0034, Emad Barsoum. [doi]
- Understanding Generalizability of Diffusion Models Requires Rethinking the Hidden Gaussian StructureXiang Li, Yixiang Dai, Qing Qu 0001. [doi]
- UV-free Texture Generation with Denoising and Geodesic Heat DiffusionSimone Foti, Stefanos Zafeiriou, Tolga Birdal. [doi]
- Geodesic Optimization for Predictive Shift Adaptation on EEG dataApolline Mellot, Antoine Collas, Sylvain Chevallier, Alexandre Gramfort, Denis A. Engemann. [doi]
- Improved Analysis for Bandit Learning in Matching MarketsFang Kong 0002, Zilong Wang, Shuai Li 0010. [doi]
- RETR: Multi-View Radar Detection Transformer for Indoor PerceptionRyoma Yataka, Adriano Cardace, Perry Wang 0004, Petros Boufounos, Ryuhei Takahashi. [doi]
- LAM3D: Large Image-Point Clouds Alignment Model for 3D Reconstruction from Single ImageRuikai Cui, Xibin Song, Weixuan Sun, Senbo Wang, Weizhe Liu, Shenzhou Chen, Taizhang Shang, Yang Li 0193, Nick Barnes, Hongdong Li, Pan Ji. [doi]
- A Surprisingly Simple Approach to Generalized Few-Shot Semantic SegmentationTomoya Sakai, Haoxiang Qiu, Takayuki Katsuki, Daiki Kimura, Takayuki Osogami, Tadanobu Inoue. [doi]
- Synthesize, Partition, then Adapt: Eliciting Diverse Samples from Foundation ModelsYeming Wen, Swarat Chaudhuri. [doi]
- Observational Scaling Laws and the Predictability of Langauge Model PerformanceYangjun Ruan, Chris J. Maddison, Tatsunori B. Hashimoto. [doi]
- Efficient Adaptation of Pre-trained Vision Transformer via Householder TransformationWei Dong, Yuan Sun, Yiting Yang, Xing Zhang, Zhijun Lin, Qingsen Yan, Haokui Zhang, Peng Wang, Yang Yang, Hengtao Shen. [doi]
- π-realizable Constrained MDPsTian Tian, Lin Yang 0011, Csaba Szepesvári. [doi]
- State Chrono Representation for Enhancing Generalization in Reinforcement LearningJianda Chen, Wen Zheng Terence Ng, Zichen Chen, Sinno Jialin Pan, Tianwei Zhang 0004. [doi]
- When is Multicalibration Post-Processing Necessary?Dutch Hansen, Siddartha Devic, Preetum Nakkiran, Vatsal Sharan. [doi]
- Adapting to Unknown Low-Dimensional Structures in Score-Based Diffusion ModelsGen Li 0005, Yuling Yan. [doi]
- A Recipe for Charge Density PredictionXiang Fu 0005, Andrew S. Rosen, Kyle Bystrom, Rui Wang 0086, Albert Musaelian, Boris Kozinsky, Tess E. Smidt, Tommi S. Jaakkola. [doi]
- Causal Imitation for Markov Decision Processes: a Partial Identification ApproachKangrui Ruan, Junzhe Zhang 0001, Xuan Di, Elias Bareinboim. [doi]
- CALE: Continuous Arcade Learning EnvironmentJesse Farebrother, Pablo Samuel Castro. [doi]
- Linear Regression using Heterogeneous Data BatchesAyush Jain 0001, Rajat Sen, Weihao Kong, Abhimanyu Das, Alon Orlitsky. [doi]
- Tensor-Based Synchronization and the Low-Rankness of the Block Trifocal TensorDaniel Miao, Gilad Lerman, Joe Kileel 0001. [doi]
- Graph-enhanced Optimizers for Structure-aware Recommendation Embedding EvolutionCong Xu, Jun Wang 0006, Jianyong Wang 0001, Wei Zhang 0056. [doi]
- Uncovering the Redundancy in Graph Self-supervised Learning ModelsZhibiao Wang, Xiao Wang, Haoyue Deng, Nian Liu, Shirui Pan, Chunming Hu. [doi]
- Towards Robust Multimodal Sentiment Analysis with Incomplete DataHaoyu Zhang, Wenbin Wang, Tianshu Yu. [doi]
- Text-Guided Attention is All You Need for Zero-Shot Robustness in Vision-Language ModelsLu Yu 0004, Haiyang Zhang, Changsheng Xu. [doi]
- Hybrid Top-Down Global Causal Discovery with Local Search for Linear and Nonlinear Additive Noise ModelsSujai Hiremath, Jacqueline R. M. A. Maasch, Mengxiao Gao, Promit Ghosal, Kyra Gan. [doi]
- On the Optimality of Dilated Entropy and Lower Bounds for Online Learning in Extensive-Form GamesZhiyuan Fan, Christian Kroer, Gabriele Farina. [doi]
- DropBP: Accelerating Fine-Tuning of Large Language Models by Dropping Backward PropagationSunghyeon Woo, Baeseong Park, Byeongwook Kim, Minjung Jo, Se Jung Kwon, Dongsuk Jeon, Dongsoo Lee. [doi]
- Blind Image Restoration via Fast Diffusion InversionHamadi Chihaoui, Abdelhak Lemkhenter, Paolo Favaro. [doi]
- Smoothed Energy Guidance: Guiding Diffusion Models with Reduced Energy Curvature of AttentionSusung Hong. [doi]
- You Don't Need Domain-Specific Data Augmentations When Scaling Self-Supervised LearningThéo Moutakanni, Maxime Oquab, Marc Szafraniec, Maria Vakalopoulou, Piotr Bojanowski. [doi]
- Data-Driven Discovery of Dynamical Systems in Pharmacology using Large Language ModelsSamuel Holt, Zhaozhi Qian, Tennison Liu, James Weatherall, Mihaela van der Schaar. [doi]
- DiffPO: A causal diffusion model for learning distributions of potential outcomesYuchen Ma 0005, Valentyn Melnychuk, Jonas Schweisthal, Stefan Feuerriegel. [doi]
- On Convergence of Adam for Stochastic Optimization under Relaxed AssumptionsYusu Hong, Junhong Lin. [doi]
- SA3DIP: Segment Any 3D Instance with Potential 3D PriorsXi Yang 0011, Xu Gu, Xingyilang Yin, Xinbo Gao 0001. [doi]
- Last-Iterate Convergence for Generalized Frank-Wolfe in Monotone Variational InequalitiesZaiwei Chen, Eric Mazumdar. [doi]
- Improving Generalization of Dynamic Graph Learning via Environment PromptKuo Yang, Zhengyang Zhou, Qihe Huang, Limin Li, Yuxuan Liang, Yang Wang. [doi]
- Transcendence: Generative Models Can Outperform The Experts That Train ThemEdwin Zhang, Vincent Zhu, Naomi Saphra, Anat Kleiman, Benjamin L. Edelman, Milind Tambe, Sham M. Kakade, Eran Malach. [doi]
- SmallToLarge (S2L): Scalable Data Selection for Fine-tuning Large Language Models by Summarizing Training Trajectories of Small ModelsYu Yang 0007, Siddhartha Mishra, Jeffrey N. Chiang, Baharan Mirzasoleiman. [doi]
- Advancing Spiking Neural Networks for Sequential Modeling with Central Pattern GeneratorsChangze Lv, Dongqi Han, Yansen Wang, Xiaoqing Zheng, Xuanjing Huang 0001, Dongsheng Li 0002. [doi]
- Are Language Models Actually Useful for Time Series Forecasting?Mingtian Tan, Mike A. Merrill, Vinayak Gupta, Tim Althoff, Tom Hartvigsen. [doi]
- Sample-Efficient Constrained Reinforcement Learning with General ParameterizationWashim Uddin Mondal, Vaneet Aggarwal. [doi]
- SGD vs GD: Rank Deficiency in Linear NetworksAditya Vardhan Varre, Margarita Sagitova, Nicolas Flammarion. [doi]
- Towards Open Respiratory Acoustic Foundation Models: Pretraining and BenchmarkingYuwei Zhang, Tong Xia, Jing Han 0010, Yu Wu, Georgios Rizos, Yang Liu 0101, Mohammed Mosuily, Jagmohan Chauhan, Cecilia Mascolo. [doi]
- Estimating Epistemic and Aleatoric Uncertainty with a Single ModelMatthew Chan, Maria Molina, Chris Metzler. [doi]
- Decomposed Prompt Decision Transformer for Efficient Unseen Task GeneralizationHongling Zheng, Li Shen 0008, Yong Luo 0002, Tongliang Liu, Jialie Shen 0001, Dacheng Tao. [doi]
- ImageNet3D: Towards General-Purpose Object-Level 3D UnderstandingWufei Ma, Guofeng Zhang 0020, Qihao Liu, Guanning Zeng, Adam Kortylewski, Yaoyao Liu 0001, Alan L. Yuille. [doi]
- LCGen: Mining in Low-Certainty Generation for View-consistent Text-to-3DZeng Tao, Tong Yang, Junxiong Lin, Xinji Mai, Haoran Wang, Beining Wang, Enyu Zhou, Yan Wang, Wenqiang Zhang. [doi]
- Reinforcement Learning with Adaptive Regularization for Safe Control of Critical SystemsHaozhe Tian, Homayoun Hamedmoghadam, Robert Shorten, Pietro Ferraro. [doi]
- OpenDebateEvidence: A Massive-Scale Argument Mining and Summarization DatasetAllen Roush, Yusuf Shabazz, Arvind Balaji, Peter Zhang, Stefano Mezza, Markus Zhang, Sanjay Basu, Sriram Vishwanath, Ravid Shwartz-Ziv. [doi]
- Multi-Scale Representation Learning for Protein Fitness PredictionZuobai Zhang, Pascal Notin, Yining Huang, Aurélie C. Lozano, Vijil Chenthamarakshan, Debora S. Marks, Payel Das, Jian Tang 0005. [doi]
- Decision Mamba: A Multi-Grained State Space Model with Self-Evolution Regularization for Offline RLQi Lv, Xiang Deng, Gongwei Chen, Michael Yu Wang, Liqiang Nie. [doi]
- Learning Truncated Causal History Model for Video RestorationAmirhosein Ghasemabadi, Muhammad Kamran Janjua, Mohammad Salameh, Di Niu. [doi]
- SOI: Scaling Down Computational Complexity by Estimating Partial States of the ModelGrzegorz Stefanski, Pawel Daniluk, Artur Szumaczuk, Jakub Tkaczuk. [doi]
- Active Classification with Few Queries under MisspecificationVasilis Kontonis, Mingchen Ma, Christos Tzamos. [doi]
- No Free Lunch in LLM Watermarking: Trade-offs in Watermarking Design ChoicesQi Pang, Shengyuan Hu 0001, Wenting Zheng, Virginia Smith. [doi]
- Few-shot Algorithms for Consistent Neural Decoding (FALCON) BenchmarkBrianna Karpowicz, Joel Ye, Chaofei Fan, Pablo Tostado-Marcos, Fabio Rizzoglio, Clayton Washington, Thiago Scodeler, Diogo de Lucena, Samuel R. Nason-Tomaszewski, Matthew Mender, Xuan Ma, Ezequiel M. Arneodo, Leigh R. Hochberg, Cynthia A. Chestek, Jaimie M. Henderson, Timothy Gentner, Vikash Gilja, Lee E. Miller, Adam Rouse, Robert Gaunt, Jennifer L. Collinger, Chethan Pandarinath. [doi]
- SAFE: Slow and Fast Parameter-Efficient Tuning for Continual Learning with Pre-Trained ModelsLinglan Zhao, Xuerui Zhang, Ke Yan, Shouhong Ding, Weiran Huang 0001. [doi]
- ACFun: Abstract-Concrete Fusion Facial StylizationJiapeng Ji, Kun Wei, Ziqi Zhang, Cheng Deng. [doi]
- MetaCURL: Non-stationary Concave Utility Reinforcement LearningBianca Marin Moreno, Margaux Brégère, Pierre Gaillard, Nadia Oudjane. [doi]
- Copycats: the many lives of a publicly available medical imaging datasetAmelia Jiménez-Sánchez, Natalia Rozalia Avlona, Dovile Juodelyte, Théo Sourget, Caroline Vang-Larsen, Anna Rogers, Hubert Dariusz Zajac, Veronika Cheplygina. [doi]
- Self-Distilled Depth Refinement with Noisy Poisson FusionJiaqi Li, Yiran Wang 0005, Jinghong Zheng 0002, Zihao Huang 0001, Ke Xian, Zhiguo Cao 0001, Jianming Zhang 0001. [doi]
- MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion TokensAnas Awadalla, Le Xue, Oscar Lo, Manli Shu, Hannah Lee, Etash Guha, Sheng Shen, Mohamed Awadalla, Silvio Savarese, Caiming Xiong, Ran Xu 0001, Yejin Choi 0001, Ludwig Schmidt. [doi]
- Learning the Infinitesimal Generator of Stochastic Diffusion ProcessesVladimir Kostic, Hélène Halconruy, Timothée Devergne, Karim Lounici, Massimiliano Pontil. [doi]
- Diffusion-Inspired Truncated Sampler for Text-Video RetrievalJiamian Wang, Pichao Wang, Dongfang Liu, Qiang Guan, Sohail A. Dianat, Majid Rabbani, Raghuveer Rao, Zhiqiang Tao. [doi]
- Reprogramming Pretrained Target-Specific Diffusion Models for Dual-Target Drug DesignXiangxin Zhou, Jiaqi Guan, Yijia Zhang, Xingang Peng, Liang Wang 0001, Jianzhu Ma. [doi]
- NeuroClips: Towards High-fidelity and Smooth fMRI-to-Video ReconstructionZixuan Gong, Guangyin Bao, Qi Zhang 0020, Zhongwei Wan, Duoqian Miao 0001, Shoujin Wang, Lei Zhu 0003, Changwei Wang 0001, Rongtao Xu, Liang Hu 0004, Ke Liu, Yu Zhang 0133. [doi]
- Interpret Your Decision: Logical Reasoning Regularization for Generalization in Visual ClassificationZhaorui Tan, Xi Yang, Qiufeng Wang, Anh Nguyen 0003, Kaizhu Huang. [doi]
- Lumen: Unleashing Versatile Vision-Centric Capabilities of Large Multimodal ModelsYang Jiao, Shaoxiang Chen 0001, Zequn Jie, Jingjing Chen 0001, Lin Ma 0002, Yu-Gang Jiang 0001. [doi]
- DataComp-LM: In search of the next generation of training sets for language modelsJeffrey Li, Alex Fang, Georgios Smyrnis, Maor Ivgi, Matt Jordan, Samir Yitzhak Gadre, Hritik Bansal, Etash Guha, Sedrick Scott Keh, Kushal Arora, Saurabh Garg, Rui Xin, Niklas Muennighoff, Reinhard Heckel, Jean Mercat, Mayee F. Chen, Suchin Gururangan, Mitchell Wortsman, Alon Albalak, Yonatan Bitton, Marianna Nezhurina, Amro Abbas, Cheng-Yu Hsieh, Dhruba Ghosh, Josh Gardner 0001, Maciej Kilian, Hanlin Zhang, Rulin Shao, Sarah M. Pratt, Sunny Sanyal, Gabriel Ilharco, Giannis Daras, Kalyani Marathe, Aaron Gokaslan, Jieyu Zhang, Khyathi Raghavi Chandu, Thao Nguyen, Igor Vasiljevic, Sham M. Kakade, Shuran Song, Sujay Sanghavi, Fartash Faghri, Sewoong Oh, Luke Zettlemoyer, Kyle Lo, Alaaeldin El-Nouby, Hadi Pouransari, Alexander Toshev, Stephanie Wang, Dirk Groeneveld, Luca Soldaini, Pang Wei Koh, Jenia Jitsev, Thomas Kollar, Alex Dimakis, Yair Carmon, Achal Dave, Ludwig Schmidt, Vaishaal Shankar. [doi]
- Scalable DBSCAN with Random ProjectionsHaochuan Xu, Ninh Pham. [doi]
- Secret Collusion among AI Agents: Multi-Agent Deception via SteganographySumeet Ramesh Motwani, Mikhail Baranchuk, Martin Strohmeier, Vijay Bolina, Philip Torr 0001, Lewis Hammond, Christian Schröder de Witt. [doi]
- The Limits of Differential Privacy in Online LearningBo Li 0001, Wei Wang 0030, Peng Ye. [doi]
- When Your AIs Deceive You: Challenges of Partial Observability in Reinforcement Learning from Human FeedbackLeon Lang, Davis Foote, Stuart J. Russell, Anca D. Dragan, Erik Jenner, Scott Emmons. [doi]
- Nearly Minimax Optimal Submodular Maximization with Bandit FeedbackArtin Tajdini, Lalit Jain, Kevin G. Jamieson. [doi]
- Your contrastive learning problem is secretly a distribution alignment problemZihao Chen, Chi-Heng Lin, Ran Liu, Jingyun Xiao, Eva L. Dyer. [doi]
- Long-Tailed Out-of-Distribution Detection via Normalized Outlier Distribution AdaptationWenjun Miao, Guansong Pang, Jin Zheng, Xiao Bai 0001. [doi]
- Cross-Scale Self-Supervised Blind Image Deblurring via Implicit Neural RepresentationTianjing Zhang, Yuhui Quan, Hui Ji. [doi]
- Causal Contrastive Learning for Counterfactual Regression Over TimeMouad El Bouchattaoui, Myriam Tami, Benoit Lepetit, Paul-Henry Cournède. [doi]
- Testing Semantic Importance via BettingJacopo Teneggi, Jeremias Sulam. [doi]
- Reward Machines for Deep RL in Noisy and Uncertain EnvironmentsAndrew C. Li, Zizhao Chen, Toryn Q. Klassen, Pashootan Vaezipoor, Rodrigo Toro Icarte, Sheila A. McIlraith. [doi]
- Dimension-free deterministic equivalents and scaling laws for random feature regressionLeonardo Defilippis, Bruno Loureiro, Theodor Misiakiewicz. [doi]
- CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept MatchingDongzhi Jiang, Guanglu Song, Xiaoshi Wu, Renrui Zhang, Dazhong Shen, Zhuofan Zong, Yu Liu 0015, Hongsheng Li 0001. [doi]
- On the Sparsity of the Strong Lottery Ticket HypothesisEmanuele Natale, Davide Ferré, Giordano Giambartolomei, Frédéric Giroire, Frederik Mallmann-Trenn. [doi]
- Derivative-enhanced Deep Operator NetworkYuan Qiu 0012, Nolan Bridges, Peng Chen. [doi]
- FOOGD: Federated Collaboration for Both Out-of-distribution Generalization and DetectionXinting Liao, Weiming Liu 0005, Pengyang Zhou, Fengyuan Yu, Jiahe Xu 0003, Jun Wang, Wenjie Wang, Chaochao Chen 0001, Xiaolin Zheng. [doi]
- DAGER: Exact Gradient Inversion for Large Language ModelsIvo Petrov, Dimitar I. Dimitrov, Maximilian Baader, Mark Niklas Müller, Martin T. Vechev. [doi]
- Understanding the Differences in Foundation Models: Attention, State Space Models, and Recurrent Neural NetworksJerome Sieber, Carmen Amo Alonso, Alexandre Didier, Melanie N. Zeilinger, Antonio Orvieto. [doi]
- Guiding a Diffusion Model with a Bad Version of ItselfTero Karras, Miika Aittala, Tuomas Kynkäänniemi, Jaakko Lehtinen, Timo Aila, Samuli Laine. [doi]
- Vision Foundation Model Enables Generalizable Object Pose EstimationKai Chen 0028, Yiyao Ma, Xingyu Lin, Stephen James, Jianshu Zhou, Yun-Hui Liu, Pieter Abbeel, Dou Qi 0001. [doi]
- Once Read is Enough: Domain-specific Pretraining-free Language Models with Cluster-guided Sparse Experts for Long-tail Domain KnowledgeFang Dong, Mengyi Chen, Jixian Zhou, Yubin Shi, Yixuan Chen 0003, Mingzhi Dong, Yujiang Wang 0001, Dongsheng Li 0002, Xiaochen Yang, Rui Zhu 0006, Robert P. Dick, Qin Lv, Fan Yang 0001, Tun Lu, Ning Gu, Li Shang. [doi]
- FSP-Laplace: Function-Space Priors for the Laplace Approximation in Bayesian Deep LearningTristan Cinquin, Marvin Pförtner, Vincent Fortuin, Philipp Hennig, Robert Bamler. [doi]
- Towards Human-AI Complementarity with Prediction SetsGiovanni De Toni, Nastaran Okati, Suhas Thejaswi, Eleni Straitouri, Manuel Rodriguez. [doi]
- JaxMARL: Multi-Agent RL Environments and Algorithms in JAXAlexander Rutherford, Benjamin Ellis, Matteo Gallici, Jonathan Cook 0004, Andrei Lupu, Garðar Ingvarsson, Timon Willi, Ravi Hammond, Akbir Khan, Christian Schröder de Witt, Alexandra Souly, Saptarashmi Bandyopadhyay, Mikayel Samvelyan, Minqi Jiang, Robert T. Lange, Shimon Whiteson, Bruno Lacerda, Nick Hawes, Tim Rocktäschel, Chris Lu 0001, Jakob Foerster. [doi]
- Uncovering Safety Risks of Large Language Models through Concept Activation VectorZhihao Xu, Ruixuan Huang, Changyu Chen, Xiting Wang. [doi]
- Mind's Eye of LLMs: Visualization-of-Thought Elicits Spatial Reasoning in Large Language ModelsWenshan Wu, Shaoguang Mao, Yadong Zhang, Yan Xia 0005, Li Dong 0004, Lei Cui 0001, Furu Wei. [doi]
- Trajectory Diffusion for ObjectGoal NavigationXinyao Yu 0002, Sixian Zhang, Xinhang Song, Xiaorong Qin, Shuqiang Jiang. [doi]
- Efficient $\Phi$-Regret Minimization with Low-Degree Swap Deviations in Extensive-Form GamesBrian Hu Zhang, Ioannis Anagnostides, Gabriele Farina, Tuomas Sandholm. [doi]
- The Implicit Bias of Heterogeneity towards Invariance: A Study of Multi-Environment Matrix SensingYang Xu, Yihong Gu, Cong Fang 0001. [doi]
- Wasserstein Gradient Boosting: A Framework for Distribution-Valued Supervised LearningTakuo Matsubara. [doi]
- Deep linear networks for regression are implicitly regularized towards flat minimaPierre Marion, Lénaïc Chizat. [doi]
- Neural Pfaffians: Solving Many Many-Electron Schrödinger EquationsNicholas Gao, Stephan Günnemann. [doi]
- OccamLLM: Fast and Exact Language Model Arithmetic in a Single StepOwen Dugan, Donato Jiménez-Benetó, Charlotte Loh, Zhuo Chen, Rumen Dangovski, Marin Soljacic. [doi]
- IaC-Eval: A Code Generation Benchmark for Cloud Infrastructure-as-Code ProgramsPatrick Tser Jern Kon, Jiachen Liu, Yiming Qiu, Weijun Fan, Ting He, Lei Lin, Haoran Zhang, Owen Park, George Elengikal, Yuxin Kang, Ang Chen 0001, Mosharaf Chowdhury, Myungjin Lee, Xinyu Wang. [doi]
- Exploitation of a Latent Mechanism in Graph Contrastive Learning: Representation ScatteringDongxiao He, Lianze Shan, Jitao Zhao, Hengrui Zhang, Zhen Wang, Weixiong Zhang. [doi]
- MeLLoC: Lossless Compression with High-order Mechanism LearningXinyue Luo, Jin Cheng 0003, Yu Chen. [doi]
- Multi-language Diversity Benefits AutoformalizationAlbert Q. Jiang, Wenda Li, Mateja Jamnik. [doi]
- Stabilizing Zero-Shot Prediction: A Novel Antidote to Forgetting in Continual Vision-Language TasksZijian Gao, Xingxing Zhang, Kele Xu, XinJun Mao, Huaimin Wang. [doi]
- Star-Agents: Automatic Data Optimization with LLM Agents for Instruction TuningHang Zhou, Yehui Tang, Haochen Qin, Yujie Yang, Renren Jin, Deyi Xiong, Kai Han 0002, Yunhe Wang 0001. [doi]
- ProEdit: Simple Progression is All You Need for High-Quality 3D Scene EditingJunkun Chen, Yu-Xiong Wang. [doi]
- MVInpainter: Learning Multi-View Consistent Inpainting to Bridge 2D and 3D EditingChenjie Cao, Chaohui Yu, Fan Wang 0019, Xiangyang Xue 0001, Yanwei Fu 0001. [doi]
- SARDet-100K: Towards Open-Source Benchmark and ToolKit for Large-Scale SAR Object DetectionYuxuan Li, Xiang Li, Weijie Li, Qibin Hou, Li Liu, Ming-Ming Cheng, Jian Yang. [doi]
- Understanding Information Storage and Transfer in Multi-Modal Large Language ModelsSamyadeep Basu, Martin Grayson, Cecily Morrison, Besmira Nushi, Soheil Feizi, Daniela Massiceti. [doi]
- What does guidance do? A fine-grained analysis in a simple settingMuthu Chidambaram, Khashayar Gatmiry, Sitan Chen, Holden Lee, Jianfeng Lu 0001. [doi]
- Adaptive Passive-Aggressive Framework for Online Regression with Side InformationRunhao Shi, Jiaxi Ying, Daniel P. Palomar. [doi]
- LP-3DGS: Learning to Prune 3D Gaussian SplattingZhaoliang Zhang, Tianchen Song, Yongjae Lee, Li Yang 0009, Cheng Peng 0008, Rama Chellappa, Deliang Fan. [doi]
- RFLPA: A Robust Federated Learning Framework against Poisoning Attacks with Secure AggregationPeihua Mai, Ran Yan, Yan Pang. [doi]
- How many classifiers do we need?Hyunsuk Kim, Liam Hodgkinson, Ryan Theisen, Michael W. Mahoney. [doi]
- Reparameterized Multi-Resolution Convolutions for Long Sequence ModellingJake Cunningham, Giorgio Giannone, Mingtian Zhang, Marc Peter Deisenroth. [doi]
- GeoPlant: Spatial Plant Species Prediction DatasetLukás Picek, Christophe Botella, Maximilien Servajean, César Leblanc, Rémi Palard, Théo Larcher, Benjamin Deneu, Diego Marcos, Pierre Bonnet, Alexis Joly. [doi]
- Universal Physics Transformers: A Framework For Efficiently Scaling Neural OperatorsBenedikt Alkin, Andreas Fürst, Simon Schmid, Lukas Gruber, Markus Holzleitner, Johannes Brandstetter. [doi]
- Robust Fine-tuning of Zero-shot Models via Variance ReductionBeier Zhu, Jiequan Cui, Hanwang Zhang. [doi]
- Take A Shortcut Back: Mitigating the Gradient Vanishing for Training Spiking Neural NetworksYufei Guo, Yuanpei Chen, Zecheng Hao, Weihang Peng 0001, Zhou Jie, Yuhan Zhang, Xiaode Liu, Zhe Ma 0001. [doi]
- Multi-Agent Imitation Learning: Value is Easy, Regret is HardJingwu Tang, Gokul Swamy, Fei Fang 0001, Zhiwei Steven Wu. [doi]
- Learning to Discuss Strategically: A Case Study on One Night Ultimate WerewolfXuanfa Jin, Ziyan Wang, Yali Du 0001, Meng Fang, Haifeng Zhang, Jun Wang. [doi]
- SELF-DISCOVER: Large Language Models Self-Compose Reasoning StructuresPei Zhou, Jay Pujara, Xiang Ren 0001, Xinyun Chen, Heng Tze Cheng, Quoc V. Le, Ed H. Chi, Denny Zhou, Swaroop Mishra, Huaixiu Steven Zheng. [doi]
- A theoretical case-study of Scalable Oversight in Hierarchical Reinforcement LearningTom Yan, Zachary C. Lipton. [doi]
- GFlowNet Assisted Biological Sequence EditingPouya M. Ghari, Alex M. Tseng, Gökcen Eraslan, Romain Lopez, Tommaso Biancalani, Gabriele Scalia, Ehsan Hajiramezanali. [doi]
- GSGAN: Adversarial Learning for Hierarchical Generation of 3D Gaussian SplatsSangeek Hyun, Jae-Pil Heo. [doi]
- Embodied Agent Interface: Benchmarking LLMs for Embodied Decision MakingManling Li, Shiyu Zhao, Qineng Wang, Kangrui Wang, Yu Zhou, Sanjana Srivastava, Cem Gokmen, Tony Lee, Li Erran Li, Ruohan Zhang, Weiyu Liu, Percy Liang, Li Fei-Fei 0001, Jiayuan Mao, Jiajun Wu 0001. [doi]
- Dynamic Service Fee Pricing under Strategic Behavior: Actions as Instruments and Phase TransitionRui Ai, David Simchi-Levi, Feng Zhu. [doi]
- AttnDreamBooth: Towards Text-Aligned Personalized Text-to-Image GenerationLianyu Pang, Jian Yin 0001, Baoquan Zhao, Feize Wu, Fu Lee Wang, Qing Li 0001, Xudong Mao. [doi]
- When does perceptual alignment benefit vision representations?Shobhita Sundaram, Stephanie Fu, Lukas Muttenthaler, Netanel Tamir, Lucy Chai, Simon Kornblith, Trevor Darrell, Phillip Isola. [doi]
- A generalized neural tangent kernel for surrogate gradient learningLuke Eilers, Raoul-Martin Memmesheimer, Sven Goedeke. [doi]
- VideoLLM-MoD: Efficient Video-Language Streaming with Mixture-of-Depths Vision ComputationShiwei Wu, Joya Chen, Kevin Qinghong Lin, Qimeng Wang, Yan Gao, Qianli Xu, Tong Xu 0001, Yao Hu, Enhong Chen, Mike Zheng Shou. [doi]
- Instruction Embedding: Latent Representations of Instructions Towards Task IdentificationYiwei Li 0001, Jiayi Shi, Shaoxiong Feng, Peiwen Yuan, Xinglin Wang, Boyuan Pan, Heda Wang, Yao Hu, Prof. Kan. [doi]
- Delta-CoMe: Training-Free Delta-Compression with Mixed-Precision for Large Language ModelsBowen Ping, Shuo Wang, Hanqing Wang, Xu Han 0007, Yuzhuang Xu, Yukun Yan, Yun Chen 0007, Baobao Chang, Zhiyuan Liu 0001, Maosong Sun 0001. [doi]
- Automated Label Unification for Multi-Dataset Semantic Segmentation with GNNsRong Ma, Jie Chen, Xiangyang Xue 0001, Jian Pu. [doi]
- Periodic agent-state based Q-learning for POMDPsAmit Sinha, Matthieu Geist, Aditya Mahajan. [doi]
- Federated Natural Policy Gradient and Actor Critic Methods for Multi-task Reinforcement LearningTong Yang, Shicong Cen, Yuting Wei 0001, Yuxin Chen 0002, Yuejie Chi. [doi]
- Unified Mechanism-Specific Amplification by Subsampling and Group Privacy AmplificationJan Schuchardt, Mihail Stoian, Arthur Kosmala, Stephan Günnemann. [doi]
- Gradient-free Decoder Inversion in Latent Diffusion ModelsSeongmin Hong, Suh Yoon Jeon, Kyeonghyun Lee, Ernest K. Ryu, Se Young Chun. [doi]
- Inflationary Flows: Calibrated Bayesian Inference with Diffusion-Based ModelsDaniela de Albuquerque, John M. Pearson. [doi]
- CoSW: Conditional Sample Weighting for Smoke Segmentation with Label NoiseLujian Yao, Haitao Zhao 0002, Zhongze Wang, Kaijie Zhao, Jingchao Peng. [doi]
- Rethinking Optimal Transport in Offline Reinforcement LearningArip Asadulaev, Rostislav Korst, Aleksandr Korotin, Vage Egiazarian, Andrey Filchenkov, Evgeny Burnaev. [doi]
- ETO: Efficient Transformer-based Local Feature Matching by Organizing Multiple Homography HypothesesJunjie Ni, Guofeng Zhang 0001, Guanglin Li 0005, Yijin Li, Xinyang Liu, Zhaoyang Huang, Hujun Bao. [doi]
- Unraveling Molecular Structure: A Multimodal Spectroscopic Dataset for ChemistryMarvin Alberts, Oliver Schilter, Federico Zipoli, Nina Hartrampf, Teodoro Laino. [doi]
- TopoFR: A Closer Look at Topology Alignment on Face RecognitionJun Dan, Yang Liu 0155, Jiankang deng, Haoyu Xie, Siyuan Li 0002, Baigui Sun, Shan Luo 0001. [doi]
- Algorithmic Collective Action in Recommender Systems: Promoting Songs by Reordering PlaylistsJoachim Baumann 0002, Celestine Mendler-Dünner. [doi]
- Trajectory Flow Matching with Applications to Clinical Time Series ModellingXi Zhang, Yuan Pu, Yuki Kawamura, Andrew Loza, Yoshua Bengio, Dennis L. Shung, Alexander Tong 0001. [doi]
- Regret Minimization in Stackelberg Games with Side InformationKeegan Harris, Zhiwei Steven Wu, Maria-Florina Balcan. [doi]
- Expecting The Unexpected: Towards Broad Out-Of-Distribution DetectionCharles Guille-Escuret, Pierre-André Noël, Ioannis Mitliagkas, David Vázquez 0001, João Monteiro 0002. [doi]
- Stratified Prediction-Powered Inference for Effective Hybrid Evaluation of Language ModelsAdam Fisch, Joshua Maynez, R. Alex Hofer, Bhuwan Dhingra, Amir Globerson, William W. Cohen. [doi]
- Is Behavior Cloning All You Need? Understanding Horizon in Imitation LearningDylan J. Foster, Adam Block, Dipendra Misra. [doi]
- UltraPixel: Advancing Ultra High-Resolution Image Synthesis to New PeaksJingjing Ren, Wenbo Li, Haoyu Chen, Renjing Pei, Bin Shao, Yong Guo, Long Peng, Fenglong Song, Lei Zhu. [doi]
- A Simple yet Universal Framework for Depth CompletionJin-Hwi Park, Hae-Gon Jeon. [doi]
- Can neural operators always be continuously discretized?Takashi Furuya, Michael Puthawala, Matti Lassas, Maarten V. De Hoop. [doi]
- 3D Gaussian Splatting as Markov Chain Monte CarloShakiba Kheradmand, Daniel Rebain, Gopal Sharma, Weiwei Sun, Yang-Che Tseng, Hossam Isack, Abhishek Kar, Andrea Tagliasacchi, Kwang Moo Yi. [doi]
- Evaluate then Cooperate: Shapley-based View Cooperation Enhancement for Multi-view ClusteringFangdi Wang, Jiaqi Jin, Jingtao Hu, Suyuan Liu, Xihong Yang, Siwei Wang 0001, Xinwang Liu 0002, En Zhu. [doi]
- Cardinality-Aware Set Prediction and Top-$k$ ClassificationCorinna Cortes, Anqi Mao, Christopher Mohri, Mehryar Mohri, Yutao Zhong 0002. [doi]
- Towards Estimating Bounds on the Effect of Policies under Unobserved ConfoundingAlexis Bellot, Silvia Chiappa. [doi]
- The Feature Speed Formula: a flexible approach to scale hyper-parameters of deep neural networksLénaïc Chizat, Praneeth Netrapalli. [doi]
- TARSS-Net: Temporal-Aware Radar Semantic Segmentation NetworkYoucheng Zhang, Liwen Zhang 0001, ZijunHu, Pengcheng Pi, Teng Li, Yuanpei Chen, Shi Peng, Zhe Ma 0001. [doi]
- DataStealing: Steal Data from Diffusion Models in Federated Learning with Multiple TrojansYuan Gan, Jiaxu Miao, Yi Yang. [doi]
- Learning Where to Edit Vision TransformersYunqiao Yang, Long-Kai Huang, Shengzhuang Chen, Kede Ma, Ying Wei 0001. [doi]
- Estimating the Hallucination Rate of Generative AIAndrew Jesson, Nicolas Beltran-Velez, Quentin Chu, Sweta Karlekar, Jannik Kossen, Yarin Gal, John P. Cunningham, David M. Blei. [doi]
- Muharaf: Manuscripts of Handwritten Arabic Dataset for Cursive Text RecognitionMehreen Saeed, Adrian Chan, Anupam Mijar, Joseph Moukarzel, Georges Habchi, Carlos Younes, Amin Elias, Chau-Wai Wong, Akram Khater. [doi]
- DALD: Improving Logits-based Detector without Logits from Black-box LLMsCong Zeng, Shengkun Tang, Xianjun Yang, Yuanzhou Chen, Yiyou Sun, Zhiqiang Xu, Yao Li, Haifeng Chen, Wei Cheng 0002, Dongkuan Xu. [doi]
- A2PO: Towards Effective Offline Reinforcement Learning from an Advantage-aware PerspectiveYunpeng Qing, Shunyu Liu 0001, Jingyuan Cong, Kaixuan Chen 0004, Yihe Zhou, Mingli Song. [doi]
- A Synthetic Dataset for Personal Attribute InferenceHanna Yukhymenko, Robin Staab, Mark Vero, Martin T. Vechev. [doi]
- GO4Align: Group Optimization for Multi-Task AlignmentJiayi Shen, Qi Wang 0009, Zehao Xiao, Nanne van Noord, Marcel Worring. [doi]
- Single-Loop Stochastic Algorithms for Difference of Max-Structured Weakly Convex FunctionsQuanqi Hu, Qi Qi 0006, Zhaosong Lu, Tianbao Yang. [doi]
- Mixture of Nested Experts: Adaptive Processing of Visual TokensGagan Jain, Nidhi Hegde 0003, Aditya Kusupati, Arsha Nagrani, Shyamal Buch, Prateek Jain 0002, Anurag Arnab, Sujoy Paul. [doi]
- Implicit Curriculum in Procgen Made ExplicitZhenxiong Tan, Kaixin Wang, Xinchao Wang. [doi]
- Nonparametric Instrumental Variable Regression through Stochastic Approximate GradientsYuri R. Fonseca, Caio Peixoto, Yuri F. Saporito. [doi]
- Poseidon: Efficient Foundation Models for PDEsMaximilian Herde, Bogdan Raonic, Tobias Rohner, Roger Käppeli, Roberto Molinaro, Emmanuel de Bézenac, Siddhartha Mishra. [doi]
- Stochastic Kernel Regularisation Improves Generalisation in Deep Kernel MachinesEdward Milsom, Ben Anson, Laurence Aitchison. [doi]
- Beyond Efficiency: Molecular Data Pruning for Enhanced GeneralizationDingshuo Chen, Zhixun Li, Yuyan Ni, Guibin Zhang, Ding Wang, Qiang Liu 0006, Shu Wu, Jeffrey Xu Yu, Liang Wang. [doi]
- Is A Picture Worth A Thousand Words? Delving Into Spatial Reasoning for Vision Language ModelsJiayu Wang, Yifei Ming, Zhenmei Shi, Vibhav Vineet, Xin Wang 0066, Sharon Li 0001, Neel Joshi. [doi]
- Constructing Semantics-Aware Adversarial Examples with a Probabilistic PerspectiveAndi Zhang 0001, Mingtian Zhang, Damon Wischik. [doi]
- Geometry Awakening: Cross-Geometry Learning Exhibits Superiority over Individual StructuresYadong Sun, Xiaofeng Cao, Yu Wang, Wei Ye, Jingcai Guo, Qing Guo. [doi]
- Federated Behavioural Planes: Explaining the Evolution of Client Behaviour in Federated LearningDario Fenoglio, Gabriele Dominici, Pietro Barbiero, Alberto Tonda, Martin Gjoreski, Marc Langheinrich. [doi]
- Certified Machine Unlearning via Noisy Stochastic Gradient DescentEli Chien, Haoyu Wang 0004, Ziang Chen, Pan Li 0005. [doi]
- Hidden in Plain Sight: Evaluating Abstract Shape Recognition in Vision-Language ModelsArshia Hemmat, Adam Davies, Tom A. Lamb, Jianhao Yuan, Philip Torr 0001, Ashkan Khakzar, Francesco Pinto. [doi]
- Implicit Zoo: A Large-Scale Dataset of Neural Implicit Functions for 2D Images and 3D ScenesQi Ma, Danda Pani Paudel, Ender Konukoglu, Luc Van Gool. [doi]
- DeMo: Decoupling Motion Forecasting into Directional Intentions and Dynamic StatesBozhou Zhang, Nan Song, Li Zhang. [doi]
- The Sample-Communication Complexity Trade-off in Federated Q-LearningSudeep Salgia, Yuejie Chi. [doi]
- Transformers need glasses! Information over-squashing in language tasksFederico Barbero, Andrea Banino, Steven Kapturowski, Dharshan Kumaran, João Guilherme Madeira Araújo, Oleksandr Vitvitskyi, Razvan Pascanu, Petar Velickovic. [doi]
- Cracking the Code of Juxtaposition: Can AI Models Understand the Humorous ContradictionsZhe Hu, Tuo Liang, Jing Li, Yiren Lu 0002, Yunlai Zhou, Yiran Qiao, Jing Ma 0002, Yu Yin 0001. [doi]
- Don't Look Twice: Faster Video Transformers with Run-Length TokenizationRohan Choudhury, Guanglei Zhu, Sihan Liu, Koichiro Niinuma, Kris Kitani, László A. Jeni. [doi]
- Membership Inference Attacks against Large Vision-Language ModelsZhan Li, Yongtao Wu, Yihang Chen, Francesco Tonin, Elías Abad-Rocamora, Volkan Cevher. [doi]
- SlimSAM: 0.1% Data Makes Segment Anything SlimZigeng Chen, Gongfan Fang, Xinyin Ma, Xinchao Wang. [doi]
- How does Inverse RL Scale to Large State Spaces? A Provably Efficient ApproachFilippo Lazzati, Mirco Mutti, Alberto Maria Metelli. [doi]
- Controlling Continuous Relaxation for Combinatorial OptimizationYuma Ichikawa. [doi]
- DG-SLAM: Robust Dynamic Gaussian Splatting SLAM with Hybrid Pose OptimizationYueming Xu, Haochen Jiang, Zhongyang Xiao, Jianfeng Feng, Li Zhang 0040. [doi]
- Approaching Human-Level Forecasting with Language ModelsDanny Halawi, Fred Zhang, Chen Yueh-Han, Jacob Steinhardt. [doi]
- Non-convolutional graph neural networksYuanqing Wang, KyungHyun Cho. [doi]
- SaulLM-54B & SaulLM-141B: Scaling Up Domain Adaptation for the Legal DomainPierre Colombo, Telmo Pessoa Pires, Malik Boudiaf, Rui Melo, Gabriel Hautreux, Etienne Malaboeuf, Johanne Charpentier, Dominic Culver, Michael Desa. [doi]
- Randomized Exploration in Cooperative Multi-Agent Reinforcement LearningHao-Lun Hsu, Weixin Wang, Miroslav Pajic, Pan Xu 0002. [doi]
- First-Order Minimax Bilevel OptimizationYifan Yang, Zhaofeng Si, Siwei Lyu, Kaiyi Ji. [doi]
- SGLang: Efficient Execution of Structured Language Model ProgramsLianmin Zheng, Liangsheng Yin, Zhiqiang Xie, Chuyue Sun, Jeff Huang 0001, Cody Hao Yu, Shiyi Cao, Christos Kozyrakis, Ion Stoica, Joseph E. Gonzalez, Clark W. Barrett, Ying Sheng 0007. [doi]
- OTTER: Effortless Label Distribution Adaptation of Zero-shot ModelsChangho Shin, Jitian Zhao, Sonia Cromp, Harit Vishwakarma, Frederic Sala. [doi]
- Alias-Free Mamba Neural OperatorJianwei Zheng 0001, Wei Li, Ni Xu, Junwei Zhu, Xiaoxu Lin, Xiaoqin Zhang. [doi]
- Towards Open-Vocabulary Semantic Segmentation Without Semantic LabelsHeeseong Shin, Chaehyun Kim, Sunghwan Hong, Seokju Cho, Anurag Arnab, Paul Hongsuck Seo, Seungryong Kim. [doi]
- MultiOrg: A Multi-rater Organoid-detection DatasetChristina Bukas, Harshavardhan Subramanian, Fenja See, Carina Steinchen, Ivan Ezhov, Gowtham Boosarpu, Sara Asgharpour, Gerald Burgstaller, Mareike Lehmann, Florian Kofler, Marie Piraud. [doi]
- Task-oriented Time Series Imputation Evaluation via Generalized RepresentersZhixian Wang, Linxiao Yang, Liang Sun, Qingsong Wen, Yi Wang 0022. [doi]
- You Only Look Around: Learning Illumination-Invariant Feature for Low-light Object DetectionMingbo Hong, Shen Cheng, Haibin Huang, Haoqiang Fan, Shuaicheng Liu. [doi]
- On $f$-Divergence Principled Domain Adaptation: An Improved FrameworkZiqiao Wang, Yongyi Mao. [doi]
- KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache QuantizationColeman Hooper, Sehoon Kim, Hiva Mohammadzadeh, Michael W. Mahoney, Yakun Sophia Shao, Kurt Keutzer, Amir Gholami. [doi]
- A Gradient Accumulation Method for Dense Retriever under Memory ConstraintJaehee Kim, Yukyung Lee, Pilsung Kang 0001. [doi]
- Refusal in Language Models Is Mediated by a Single DirectionAndy Arditi, Oscar Obeso, Aaquib Syed, Daniel Paleka, Nina Panickssery, Wes Gurnee, Neel Nanda. [doi]
- Efficient LLM Jailbreak via Adaptive Dense-to-sparse Constrained OptimizationKai Hu, Weichen Yu, Yining Li, Tianjun Yao, Xiang Li, Wenhe Liu, Lijun Yu, Zhiqiang Shen, Kai Chen 0026, Matt Fredrikson. [doi]
- Interaction-Force Transport Gradient FlowsEgor Gladin, Pavel E. Dvurechenskii, Alexander Mielke, Jia-Jie Zhu. [doi]
- Unconditional stability of a recurrent neural circuit implementing divisive normalizationShivang Rawat, David J. Heeger, Stefano Martiniani. [doi]
- Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMsXuan Zhang, Chao Du, Tianyu Pang, Qian Liu, Wei Gao, Min Lin. [doi]
- Understanding Linear Probing then Fine-tuning Language Models from NTK PerspectiveAkiyoshi Tomihari, Issei Sato. [doi]
- Designs for Enabling Collaboration in Human-Machine Teaming via Interactive and Explainable SystemsRohan R. Paleja, Michael Munje, Kimberlee Chestnut Chang, Reed Jensen, Matthew C. Gombolay. [doi]
- Stability and Generalization of Asynchronous SGD: Sharper Bounds Beyond Lipschitz and SmoothnessXiaoge Deng, Tao Sun 0005, Shengwei Li, Dongsheng Li 0001, Xicheng Lu. [doi]
- ARC: A Generalist Graph Anomaly Detector with In-Context LearningYixin Liu 0001, Shiyuan Li, Yu Zheng 0013, Qingfeng Chen, Chengqi Zhang, Shirui Pan. [doi]
- 4M-21: An Any-to-Any Vision Model for Tens of Tasks and ModalitiesRoman Bachmann 0001, Oguzhan Fatih Kar, David Mizrahi, Ali Garjani, Mingfei Gao, David Griffiths, Jiaming Hu, Afshin Dehghan, Amir Zamir. [doi]
- Where Do Large Learning Rates Lead Us?Ildus Sadrtdinov, Maxim Kodryan, Eduard Pokonechny, Ekaterina Lobacheva, Dmitry P. Vetrov. [doi]
- GenWarp: Single Image to Novel Views with Semantic-Preserving Generative WarpingJunyoung Seo, Kazumi Fukuda, Takashi Shibuya 0001, Takuya Narihira, Naoki Murata, Shoukang Hu, Chieh-Hsin Lai, Seungryong Kim, Yuki Mitsufuji. [doi]
- LVD-2M: A Long-take Video Dataset with Temporally Dense CaptionsTianwei Xiong, Yuqing Wang, Daquan Zhou, Zhijie Lin, Jiashi Feng, Xihui Liu. [doi]
- Conditional Outcome Equivalence: A Quantile Alternative to CATEJosh Givens, Henry W. J. Reeve, Song Liu, Katarzyna Reluga. [doi]
- Synergistic Dual Spatial-aware Generation of Image-to-text and Text-to-imageYu Zhao, Hao Fei 0001, Xiangtai Li, Libo Qin 0004, Jiayi Ji, Hongyuan Zhu, Meishan Zhang, Min Zhang 0005, Jianguo Wei. [doi]
- Strategic Multi-Armed Bandit Problems Under Debt-Free ReportingAhmed Ben Yahmed, Clément Calauzènes, Vianney Perchet. [doi]
- Distribution-Aware Data Expansion with Diffusion ModelsHaowei Zhu, Ling Yang, Jun-Hai Yong, Hongzhi Yin, Jiawei Jiang, Meng Xiao, Wentao Zhang, Bin Wang. [doi]
- Finding Transformer Circuits With Edge PruningAdithya Bhaskar, Alexander Wettig, Dan Friedman, Danqi Chen 0001. [doi]
- Fetch and Forge: Efficient Dataset Condensation for Object DetectionDing Qi, Jian Li 0062, Jinlong Peng, Bo Zhao, Shuguang Dou, Jialin Li, Jiangning Zhang, Yabiao Wang, Chengjie Wang, Cairong Zhao. [doi]
- SkiLD: Unsupervised Skill Discovery Guided by Factor InteractionsZizhao Wang, Jiaheng Hu, Caleb Chuck, Stephen Chen, Roberto Martín-Martín, Amy Zhang 0001, Scott Niekum, Peter Stone 0001. [doi]
- π-Realizability and ConcentrabilityVolodymyr Tkachuk, Gellért Weisz, Csaba Szepesvári. [doi]
- AdaNeg: Adaptive Negative Proxy Guided OOD Detection with Vision-Language ModelsYabin Zhang, Lei Zhang. [doi]
- Decision-Focused Learning with Directional GradientsMichael Huang, Vishal Gupta 0004. [doi]
- Semi-Random Matrix Completion via Flow-Based Adaptive ReweightingJonathan A. Kelner, Jerry Li 0001, Allen Liu, Aaron Sidford, Kevin Tian. [doi]
- RegExplainer: Generating Explanations for Graph Neural Networks in Regression TasksJiaxing Zhang 0002, Zhuomin Chen, Hao Mei, Longchao Da, Dongsheng Luo, Hua Wei 0001. [doi]
- InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object InteractionSirui Xu 0002, Ziyin Wang, Yu-Xiong Wang, Liangyan Gui. [doi]
- Key-Grid: Unsupervised 3D Keypoints Detection using Grid Heatmap FeaturesChengkai Hou, Zhengrong Xue, Bingyang Zhou, JingHan Ke, Lin Shao 0002, Huazhe Xu. [doi]
- ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLMsZhaochen Su, Jun Zhang, Xiaoye Qu, Tong Zhu 0002, Yanshu Li, Jiashuo Sun, Juntao Li, Min Zhang, Yu Cheng. [doi]
- Flex-MoE: Modeling Arbitrary Modality Combination via the Flexible Mixture-of-ExpertsSukwon Yun, Inyoung Choi, Jie Peng, Yangfan Wu, Jingxuan Bao, Qiyiwen Zhang, Jiayi Xin, Qi Long, Tianlong Chen. [doi]
- Iterative Reasoning Preference OptimizationRichard Yuanzhe Pang, Weizhe Yuan, He He, KyungHyun Cho, Sainbayar Sukhbaatar, Jason Weston. [doi]
- Meta 3D AssetGen: Text-to-Mesh Generation with High-Quality Geometry, Texture, and PBR MaterialsYawar Siddiqui, Tom Monnier, Filippos Kokkinos, Mahendra Kariya, Yanir Kleiman, Emilien Garreau, Oran Gafni, Natalia Neverova, Andrea Vedaldi, Roman Shapovalov, David Novotný. [doi]
- ContextGS : Compact 3D Gaussian Splatting with Anchor Level Context ModelYufei Wang, Zhihao Li, Lanqing Guo, Wenhan Yang, Alex C. Kot, Bihan Wen. [doi]
- Set-based Neural Network Encoding Without Weight TyingBruno Andreis, Bedionita Soro, Philip H. S. Torr, Sung Ju Hwang. [doi]
- A Canonicalization Perspective on Invariant and Equivariant LearningGeorge Ma, Yifei Wang 0001, Derek Lim, Stefanie Jegelka, Yisen Wang 0001. [doi]
- Slicing Vision Transformer for Flexible InferenceYitian Zhang, Huseyin Coskun, Xu Ma 0005, Huan Wang, Ke Ma, Xi Stephen Chen, Derek Hao Hu, Yun Fu 0001. [doi]
- A Layer-Wise Natural Gradient Optimizer for Training Deep Neural NetworksXiaolei Liu, Shaoshuai Li, Kaixin Gao, Binfeng Wang. [doi]
- Reflective Multi-Agent Collaboration based on Large Language ModelsXiaohe Bo, Zeyu Zhang 0007, Quanyu Dai, Xueyang Feng, Lei Wang, Rui Li 0086, Xu Chen 0017, Ji-Rong Wen. [doi]
- Sharpness-Aware Minimization Activates the Interactive Teaching's Understanding and OptimizationMingwei Xu, Xiaofeng Cao 0002, Ivor W. Tsang. [doi]
- DMPlug: A Plug-in Method for Solving Inverse Problems with Diffusion ModelsHengkang Wang, Xu Zhang, Taihui Li, Yuxiang Wan, Tiancong Chen, Ju Sun. [doi]
- Approximating mutual information of high-dimensional variables using learned representationsGokul Gowri, Xiao-Kang Lun, Allon M. Klein, Peng Yin. [doi]
- Constant Acceleration FlowDogyun Park, Sojin Lee, Sihyeon Kim, Taehoon Lee, Youngjoon Hong, Hyunwoo J. Kim. [doi]
- GAIA: Rethinking Action Quality Assessment for AI-Generated VideosZijian Chen 0001, Wei Sun 0029, Yuan Tian 0017, Jun Jia, Zicheng Zhang, Jiarui Wang, Ru Huang 0002, Xiongkuo Min, Guangtao Zhai, Wen-Jun Zhang 0005. [doi]
- Accelerating Transformers with Spectrum-Preserving Token MergingChau Tran, Duy M. H. Nguyen, Manh-Duy Nguyen, TrungTin Nguyen, Ngan Le, Pengtao Xie, Daniel Sonntag, James Y. Zou, Binh Nguyen, Mathias Niepert. [doi]
- Can LLMs Learn by Teaching for Better Reasoning? A Preliminary StudyXuefei Ning, Zifu Wang, Shiyao Li, Zinan Lin 0001, Peiran Yao, Tianyu Fu 0004, Matthew B. Blaschko, Guohao Dai, Huazhong Yang, Yu Wang 0002. [doi]
- Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMsSukmin Yun, Haokun Lin, Rusiru Thushara, Mohammad Qazim Bhat, Yongxin Wang, Zutao Jiang, Mingkai Deng, Jinhong Wang, Tianhua Tao, Junbo Li, Haonan Li 0002, Preslav Nakov, Timothy Baldwin, Zhengzhong Liu 0001, Eric P. Xing, Xiaodan Liang, Zhiqiang Shen. [doi]
- An Expectation-Maximization Algorithm for Training Clean Diffusion Models from Corrupted ObservationsWeimin Bai, Yifei Wang, Wenzheng Chen, He Sun. [doi]
- Modeling Latent Neural Dynamics with Gaussian Process Switching Linear Dynamical SystemsAmber Hu, David M. Zoltowski, Aditya Nair, David Anderson, Lea Duncker, Scott W. Linderman. [doi]
- Novel Object Synthesis via Adaptive Text-Image HarmonyZeren Xiong, Zedong Zhang, Zikun Chen, Shuo Chen 0003, Xiang Li, Gan Sun, Jian Yang, Jun Li. [doi]
- QVAE-Mole: The Quantum VAE with Spherical Latent Variable Learning for 3-D Molecule GenerationHuaijin Wu, Xinyu Ye, Junchi Yan. [doi]
- Boosting Generalization in Parametric PDE Neural Solvers through Adaptive ConditioningArmand Kassaï Koupaï, Jorge Mifsut Benet, Yuan Yin, Jean-Noël Vittaut, Patrick Gallinari. [doi]
- Variational Delayed Policy OptimizationQingyuan Wu, Simon Sinong Zhan, Yixuan Wang, Yuhui Wang, Chung-Wei Lin, Chen Lv, Qi Zhu, Chao Huang. [doi]
- Real-time Stereo-based 3D Object Detection for Streaming PerceptionChangcai Li, Zonghua Gu 0001, Gang Chen, Libo Huang, Wei Zhang 0092, Huihui Zhou. [doi]
- Coherence-free Entrywise Estimation of Eigenvectors in Low-rank Signal-plus-noise Matrix ModelsHao Yan, Keith Levin. [doi]
- ♮-Concave Function Maximization: Stochastic Bandit Algorithms and NP-Hardness of Adversarial Full-Information SettingTaihei Oki, Shinsaku Sakaue. [doi]
- FlexPlanner: Flexible 3D Floorplanning via Deep Reinforcement Learning in Hybrid Action Space with Multi-Modality RepresentationRuizhe Zhong, Xingbo Du, Shixiong Kai, Zhentao Tang, Siyuan Xu, Jianye Hao, Mingxuan Yuan, Junchi Yan. [doi]
- One Token to Seg Them All: Language Instructed Reasoning Segmentation in VideosZechen Bai, Tong He 0002, Haiyang Mei, Pichao Wang, Ziteng Gao, Joya Chen, liulei, Zheng Zhang 0001, Mike Zheng Shou. [doi]
- Multi-Group Proportional Representation in RetrievalAlex Oesterling, Claudio Mayrink Verdun, Alex Glynn, Carol Xuan Long, Lucas Monteiro Paes, Sajani Vithana, Martina Cardone, Flávio P. Calmon. [doi]
- DiPEx: Dispersing Prompt Expansion for Class-Agnostic Object DetectionJia Syuen Lim, Zhuoxiao Chen, Zhi Chen 0010, Mahsa Baktashmotlagh, Xin Yu 0002, Zi Huang, Yadan Luo. [doi]
- Relational Verification Leaps Forward with RABBitTarun Suresh, Debangshu Banerjee, Gagandeep Singh 0001. [doi]
- LogiCity: Advancing Neuro-Symbolic AI with Abstract Urban SimulationBowen Li, Zhaoyu Li, Qiwei Du, Jinqi Luo, Wenshan Wang, Yaqi Xie, Simon Stepputtis, Chen Wang, Katia P. Sycara, Pradeep Ravikumar, Alexander G. Gray, Xujie Si, Sebastian A. Scherer. [doi]
- Gated Slot Attention for Efficient Linear-Time Sequence ModelingYu Zhang 0092, Songlin Yang, Rui-Jie Zhu 0003, Yue Zhang 0004, Leyang Cui, Yiqiao Wang 0005, Bolun Wang, Freda Shi, Bailin Wang, Wei Bi, Peng Zhou, Guohong Fu. [doi]
- Bandits with Preference Feedback: A Stackelberg Game PerspectiveBarna Pásztor, Parnian Kassraie, Andreas Krause 0001. [doi]
- On the Scalability of GNNs for Molecular GraphsMaciej Sypetkowski, Frederik Wenkel, Farimah Poursafaei, Nia Dickson, Karush Suri, Philip Fradkin, Dominique Beaini. [doi]
- On the Impacts of the Random Initialization in the Neural Tangent Kernel TheoryGuhan Chen, Yicheng Li 0002, Qian Lin. [doi]
- SkipPredict: When to Invest in Predictions for SchedulingRana Shahout, Michael Mitzenmacher. [doi]
- Offline Multitask Representation Learning for Reinforcement LearningHaque Ishfaq, Thanh Nguyen-Tang, Songtao Feng, Raman Arora, Mengdi Wang, Ming Yin 0003, Doina Precup. [doi]
- Non-Stationary Learning of Neural Networks with Automatic Soft Parameter ResetAlexandre Galashov, Michalis K. Titsias, András György 0001, Clare Lyle, Razvan Pascanu, Yee Whye Teh, Maneesh Sahani. [doi]
- I Don't Know: Explicit Modeling of Uncertainty with an [IDK] TokenRoi Cohen, Konstantin Dobler, Eden Biran, Gerard de Melo. [doi]
- Can Simple Averaging Defeat Modern Watermarks?Pei Yang, Hai Ci, Yiren Song, Mike Zheng Shou. [doi]
- Statistical Estimation in the Spiked Tensor Model via the Quantum Approximate Optimization AlgorithmLeo Zhou, Joao Basso, Song Mei. [doi]
- Replicable Uniformity TestingSihan Liu, Christopher Ye 0001. [doi]
- LESS: Label-Efficient and Single-Stage Referring 3D SegmentationXuexun Liu, Xiaoxu Xu, Jinlong Li, Qiudan Zhang, Xu Wang 0006, Nicu Sebe, Lin Ma 0002. [doi]
- WelQrate: Defining the Gold Standard in Small Molecule Drug Discovery BenchmarkingYunchao Liu, Ha Dong, Xin Wang, Rocco Moretti, Yu Wang 0160, Zhaoqian Su, Jiawei Gu, Bobby Bodenheimer, Charles David Weaver, Jens Meiler, Tyler Derr. [doi]
- Provable Posterior Sampling with Denoising Oracles via Tilted TransportJoan Bruna, Jiequn Han. [doi]
- Overcoming Brittleness in Pareto-Optimal Learning Augmented AlgorithmsAlex Elenter, Spyros Angelopoulos 0001, Christoph Dürr, Yanni Lefki. [doi]
- X-Ray: A Sequential 3D Representation For GenerationTao Hu 0011, Wenhang Ge, Yuyang Zhao, Gim Hee Lee. [doi]
- BitsFusion: 1.99 bits Weight Quantization of Diffusion ModelYang Sui, Yanyu Li, Anil Kag, Yerlan Idelbayev, Junli Cao, Ju Hu, Dhritiman Sagar, Bo Yuan 0001, Sergey Tulyakov, Jian Ren 0005. [doi]
- Generative Hierarchical Materials SearchSherry Yang 0001, Simon L. Batzner, RuiQi Gao, Muratahan Aykol, Alexander L. Gaunt, Brendan McMorrow, Danilo Jimenez Rezende, Dale Schuurmans, Igor Mordatch, Ekin Dogus Cubuk. [doi]
- First-Order Methods for Linearly Constrained Bilevel OptimizationGuy Kornowski, Swati Padmanabhan, Kai Wang, Zhe Zhang, Suvrit Sra. [doi]
- Reproducibility of predictive networks for mouse visual cortexPolina Turishcheva, Max F. Burg, Fabian H. Sinz, Alexander S. Ecker. [doi]
- Edit Distance Robust Watermarks via Indexing Pseudorandom CodesNoah Golowich, Ankur Moitra. [doi]
- QueST: Self-Supervised Skill Abstractions for Learning Continuous ControlAtharva Mete, Haotian Xue 0002, Albert Wilcox, Yongxin Chen, Animesh Garg. [doi]
- Incorporating Surrogate Gradient Norm to Improve Offline Optimization TechniquesCuong Dao, Phi-Le Nguyen, Truong Thao Nguyen, Nghia Hoang. [doi]
- Language Without Borders: A Dataset and Benchmark for Code-Switching Lip ReadingXueyi Zhang, Mingrui Lao, Peng Zhao, Jun Tang 0001, Yanming Guo, Siqi Cai, Xianghu Yue, Haizhou Li 0001. [doi]
- Thompson Sampling For Combinatorial Bandits: Polynomial Regret and Mismatched Sampling ParadoxRaymond Zhang, Richard Combes. [doi]
- Diffusion Imitation from ObservationBo-Ruei Huang, Chun-Kai Yang, Chun-Mao Lai, Dai-Jie Wu, Shao-Hua Sun. [doi]
- Learning to Merge Tokens via Decoupled Embedding for Efficient Vision TransformersDong-Hoon Lee, Seunghoon Hong. [doi]
- Approximately Equivariant Neural ProcessesMatthew Ashman, Cristiana Diaconu, Adrian Weller, Wessel P. Bruinsma, Richard E. Turner. [doi]
- FIDE: Frequency-Inflated Conditional Diffusion Model for Extreme-Aware Time Series GenerationAsadullah Hill Galib, Pang-Ning Tan, Lifeng Luo. [doi]
- Active learning of neural population dynamics using two-photon holographic optogeneticsAndrew Wagenmaker, Lu Mi, Marton Rozsa, Matthew S. Bull, Karel Svoboda, Kayvon Daie, Matthew D. Golub, Kevin G. Jamieson. [doi]
- Enhancing Efficiency of Safe Reinforcement Learning via Sample ManipulationShangding Gu, Laixi Shi, Yuhao Ding, Alois Knoll, Costas J. Spanos, Adam Wierman, Ming Jin 0002. [doi]
- DiffLight: A Partial Rewards Conditioned Diffusion Model for Traffic Signal Control with Missing DataHanyang Chen, Yang Jiang, Shengnan Guo 0001, Xiaowei Mao, Youfang Lin, Huaiyu Wan. [doi]
- Model Sensitivity Aware Continual LearningZhenyi Wang 0001, Heng Huang. [doi]
- Identifying Latent State-Transition Processes for Individualized Reinforcement LearningYuewen Sun, Biwei Huang, Yu Yao, Donghuo Zeng, Xinshuai Dong, Songyao Jin, Boyang Sun, Roberto Legaspi, Kazushi Ikeda, Peter Spirtes, Kun Zhang. [doi]
- CultureLLM: Incorporating Cultural Differences into Large Language ModelsCheng Li, Mengzhuo Chen, Jindong Wang 0001, Sunayana Sitaram, Xing Xie 0001. [doi]
- Efficient Leverage Score Sampling for Tensor Train DecompositionVivek Bharadwaj, Beheshteh T. Rakhshan, Osman Asif Malik, Guillaume Rabusseau. [doi]
- Cost-efficient Knowledge-based Question Answering with Large Language ModelsJunnan Dong, Qinggang Zhang, Chuang Zhou 0002, Hao Chen, Daochen Zha, Xiao Huang 0002. [doi]
- Physics-informed Neural Networks for Functional Differential Equations: Cylindrical Approximation and Its Convergence GuaranteesTaiki Miyagawa, Takeru Yokota. [doi]
- 3: Identity-Preserving-yet-Diversified Diffusion Models for Synthetic Face RecognitionJianqing Xu, Shen Li 0004, Jiaying Wu, Miao Xiong, Ailin Deng, Jiazhen Ji, Yuge Huang, Guodong Mu, Wenjie Feng 0001, Shouhong Ding, Bryan Hooi. [doi]
- UPS: Unified Projection Sharing for Lightweight Single-Image Super-resolution and BeyondKun Zhou 0001, Xinyu Lin, Zhonghang Liu, Xiaoguang Han 0001, Jiangbo Lu. [doi]
- Near-Minimax-Optimal Distributional Reinforcement Learning with a Generative ModelMark Rowland 0001, Kevin Kevin Li, Rémi Munos, Clare Lyle, Yunhao Tang, Will Dabney. [doi]
- Unifying Generation and Prediction on Graphs with Latent Graph DiffusionCai Zhou, Xiyuan Wang, Muhan Zhang. [doi]
- Assemblage: Automatic Binary Dataset Construction for Machine LearningChang Liu, Rebecca Saul, Yihao Sun, Edward Raff, Maya Fuchs, Townsend Southard Pantano, James Holt, Kristopher K. Micinski. [doi]
- Latent Neural Operator for Solving Forward and Inverse PDE ProblemsTian Wang, Chuang Wang. [doi]
- United We Stand, Divided We Fall: Fingerprinting Deep Neural Networks via Adversarial TrajectoriesTianlong Xu, Chen Wang, Gaoyang Liu, Yang Yang, Kai Peng 0001, Wei Liu. [doi]
- Spectral Learning of Shared Dynamics Between Generalized-Linear ProcessesLucine L. Oganesian, Omid G. Sani, Maryam Shanechi. [doi]
- FactorizePhys: Matrix Factorization for Multidimensional Attention in Remote Physiological SensingJitesh Joshi, Sos S. Agaian, Youngjun Cho. [doi]
- Online Composite Optimization Between Stochastic and Adversarial EnvironmentsYibo Wang 0005, Sijia Chen, Wei Jiang, Wenhao Yang, Yuanyu Wan, Lijun Zhang 0005. [doi]
- Beyond Prompts: Dynamic Conversational Benchmarking of Large Language ModelsDavid Castillo-Bolado, Joseph Davidson, Finlay Gray, Marek Rosa. [doi]
- Autonomous Agents for Collaborative Task under Information AsymmetryWei Liu, Chenxi Wang, Yifei Wang, Zihao Xie, Rennai Qiu, Yufan Dang, Zhuoyun Du, Weize Chen, Cheng Yang, Chen Qian. [doi]
- TopoLogic: An Interpretable Pipeline for Lane Topology Reasoning on Driving ScenesYanping Fu, Wenbin Liao, Xinyuan Liu 0003, Hang Xu, Yike Ma, Yucheng Zhang, Feng Dai. [doi]
- EMR-Merging: Tuning-Free High-Performance Model MergingChenyu Huang, Peng Ye, Tao Chen 0003, Tong He 0001, Xiangyu Yue 0001, Wanli Ouyang. [doi]
- Web-Scale Visual Entity Recognition: An LLM-Driven Data ApproachMathilde Caron, Alireza Fathi, Cordelia Schmid, Ahmet Iscen. [doi]
- Identity Decoupling for Multi-Subject Personalization of Text-to-Image ModelsSangwon Jang, Jaehyeong Jo, Kimin Lee, Sung Ju Hwang. [doi]
- Decomposable Transformer Point ProcessesAristeidis Panos. [doi]
- Simplifying Latent Dynamics with Softly State-Invariant World ModelsTankred Saanum, Peter Dayan, Eric Schulz. [doi]
- Mind the Gap Between Prototypes and Images in Cross-domain FinetuningHongduan Tian, Feng Liu 0003, Zhanke Zhou, Tongliang Liu, Chengqi Zhang, Bo Han 0003. [doi]
- B'MOJO: Hybrid State Space Realizations of Foundation Models with Eidetic and Fading MemoryLuca Zancato, Arjun Seshadri, Yonatan Dukler, Aditya Golatkar, Yantao Shen 0002, Benjamin Bowman, Matthew Trager, Alessandro Achille, Stefano Soatto. [doi]
- Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal InputsMustafa Shukor, Matthieu Cord. [doi]
- Generalized Eigenvalue Problems with Generative PriorsZhaoqiang Liu, Wen Li, Junren Chen. [doi]
- Diffusion Tuning: Transferring Diffusion Models via Chain of ForgettingJincheng Zhong, Xingzhuo Guo, Jiaxiang Dong, Mingsheng Long. [doi]
- Partial Transportability for Domain GeneralizationKasra Jalaldoust, Alexis Bellot, Elias Bareinboim. [doi]
- Harmonizing Visual Text Comprehension and GenerationZhen Zhao, Jingqun Tang, Binghong Wu, Chunhui Lin, Shu Wei, Hao Liu, Xin Tan 0002, Zhizhong Zhang 0001, Can Huang, Yuan Xie 0006. [doi]
- Neuro-Symbolic Data Generation for Math ReasoningZenan Li, Zhi Zhou, Yuan Yao 0001, Xian Zhang, Yu-Feng Li, Chun Cao, Fan Yang, Xiaoxing Ma. [doi]
- Sketching for Distributed Deep Learning: A Sharper AnalysisMayank Shrivastava, Berivan Isik, Qiaobo Li, Sanmi Koyejo, Arindam Banerjee. [doi]
- Looks Too Good To Be True: An Information-Theoretic Analysis of Hallucinations in Generative Restoration ModelsRegev Cohen, Idan Kligvasser, Ehud Rivlin, Daniel Freedman. [doi]
- Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context LengthXuezhe Ma, Xiaomeng Yang, Wenhan Xiong, Beidi Chen, Lili Yu, Hao Zhang 0108, Jonathan May, Luke Zettlemoyer, Omer Levy, Chunting Zhou. [doi]
- Robust Conformal Prediction Using Privileged InformationShai Feldman, Yaniv Romano. [doi]
- Identifiability Analysis of Linear ODE Systems with Hidden ConfoundersYuanyuan Wang, Biwei Huang, Wei Huang, Xi Geng, Mingming Gong. [doi]
- Touchstone Benchmark: Are We on the Right Way for Evaluating AI Algorithms for Medical Segmentation?Pedro R. A. S. Bassi, Wenxuan Li, Yucheng Tang, Fabian Isensee, Zifu Wang, Jieneng Chen, Yu-Cheng Chou, Yannick Kirchhoff, Maximilian R. Rokuss, Ziyan Huang, Jin Ye, Junjun He, Tassilo Wald, Constantin Ulrich, Michael Baumgartner 0001, Saikat Roy, Klaus H. Maier-Hein, Paul F. Jaeger, Yiwen Ye, Yutong Xie, Jianpeng Zhang, Ziyang Chen, Yong Xia 0001, Zhaohu Xing, Lei Zhu, Yousef Sadegheih, Afshin Bozorgpour, Pratibha Kumari 0001, Reza Azad, Dorit Merhof, Pengcheng Shi, Ting Ma, Yuxin Du, Fan Bai 0008, Tiejun Huang 0001, Bo Zhao, Haonan Wang, Xiaomeng Li, Hanxue Gu, Haoyu Dong, Jichen Yang, Maciej A. Mazurowski, Saumya Gupta, Linshan Wu, Jia-Xin Zhuang, Hao Chen, Holger Roth, Daguang Xu, Matthew B. Blaschko, Sergio Decherchi, Andrea Cavalli, Alan L. Yuille, Zongwei Zhou. [doi]
- Bayesian Adaptive Calibration and Optimal DesignRafael Oliveira 0001, Dino Sejdinovic, David Howard, Edwin V. Bonilla. [doi]
- Perplexity-aware Correction for Robust Alignment with Noisy PreferencesKeyi Kong, Xilie Xu, Di Wang 0015, Jingfeng Zhang, Mohan S. Kankanhalli. [doi]
- Language Models as Zero-shot Lossless Gradient Compressors: Towards General Neural Parameter Prior ModelsHui-Po Wang, Mario Fritz. [doi]
- Learn more, but bother less: parameter efficient continual learningFuli Qiao, Mehrdad Mahdavi. [doi]
- Robot Policy Learning with Temporal Optimal Transport RewardYuwei Fu, Haichao Zhang, Di Wu, Wei Xu, Benoit Boulet. [doi]
- Is Score Matching Suitable for Estimating Point Processes?Haoqun Cao, Zizhuo Meng, Tianjun Ke, Feng Zhou. [doi]
- 2: Effective Sharpness Aware Minimization Requires Layerwise Perturbation ScalingMoritz Haas, Jin Xu, Volkan Cevher, Leena Chennuru Vankadara. [doi]
- VHELM: A Holistic Evaluation of Vision Language ModelsTony Lee, Haoqin Tu, Chi Heem Wong, Wenhao Zheng, Yiyang Zhou, Yifan Mai, Josselin Somerville Roberts, Michihiro Yasunaga, Huaxiu Yao, Cihang Xie, Percy Liang. [doi]
- Multivariate Probabilistic Time Series Forecasting with Correlated ErrorsVincent Zhihao Zheng, Lijun Sun. [doi]
- Trade-Offs of Diagonal Fisher Information Matrix EstimatorsAlexander Soen, Ke Sun 0001. [doi]
- Online Bayesian Persuasion Without a ClueFrancesco Bacchiocchi, Matteo Bollini, Matteo Castiglioni, Alberto Marchesi 0001, Nicola Gatti 0001. [doi]
- UniBias: Unveiling and Mitigating LLM Bias through Internal Attention and FFN ManipulationHanzhang Zhou, Zijian Feng, Zixiao Zhu, Junlang Qian, Kezhi Mao. [doi]
- SHED: Shapley-Based Automated Dataset Refinement for Instruction Fine-TuningYexiao He, Ziyao Wang, Zheyu Shen, Guoheng Sun, Yucong Dai, Yongkai Wu, Hongyi Wang 0001, Ang Li 0005. [doi]
- Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic ModelsAviv Bick, Kevin Y. Li, Eric P. Xing, J. Zico Kolter, Albert Gu. [doi]
- Why are Visually-Grounded Language Models Bad at Image Classification?Yuhui Zhang, Alyssa Unell, Xiaohan Wang, Dhruba Ghosh, Yuchang Su, Ludwig Schmidt, Serena Yeung. [doi]
- Revisiting Few-Shot Object Detection with Vision-Language ModelsAnish Madan, Neehar Peri, Shu Kong, Deva Ramanan. [doi]
- On Tractable Φ-Equilibria in Non-Concave GamesYang Cai 0001, Constantinos Daskalakis, Haipeng Luo, Chen-Yu Wei, Weiqiang Zheng. [doi]
- Abductive Reasoning in Logical Credal NetworksRadu Marinescu 0002, Junkyu Lee 0001, Debarun Bhattacharjya, Fábio Gagliardi Cozman, Alexander G. Gray. [doi]
- Understanding Bias in Large-Scale Visual DatasetsBoya Zeng, Yida Yin, Zhuang Liu 0003. [doi]
- Multi-Agent Coordination via Multi-Level CommunicationGang Ding, Zeyuan Liu, Zhirui Fang, Kefan Su, Liwen Zhu 0003, Zongqing Lu. [doi]
- STONE: A Submodular Optimization Framework for Active 3D Object DetectionRuiyu Mao, Sarthak Kumar Maharana, Rishabh K. Iyer, Yunhui Guo. [doi]
- Shared Autonomy with IDA: Interventional Diffusion AssistanceBrandon McMahan, Zhenghao Mark Peng, Bolei Zhou, Jonathan C. Kao. [doi]
- Retrieval & Fine-Tuning for In-Context Tabular ModelsValentin Thomas, Junwei Ma, Rasa Hosseinzadeh, Keyvan Golestan, Guangwei Yu, Maksims Volkovs, Anthony L. Caterini. [doi]
- KOALA: Empirical Lessons Toward Memory-Efficient and Fast Diffusion Models for Text-to-Image SynthesisYoungwan Lee, KwanYong Park, Yoorhim Cho, Yong-Ju Lee, Sung Ju Hwang. [doi]
- Safety through feedback in Constrained RLShashank Reddy Chirra, Pradeep Varakantham, Praveen Paruchuri. [doi]
- TFS-NeRF: Template-Free NeRF for Semantic 3D Reconstruction of Dynamic SceneSandika Biswas, Qianyi Wu, Biplab Banerjee, Hamid Rezatofighi. [doi]
- CTIBench: A Benchmark for Evaluating LLMs in Cyber Threat IntelligenceMd Tanvirul Alam, Dipkamal Bhusal, Le-Nguyen, Nidhi Rastogi. [doi]
- CV-VAE: A Compatible Video VAE for Latent Generative Video ModelsSijie Zhao, Yong Zhang 0034, Xiaodong Cun, Shaoshu Yang, Muyao Niu, Xiaoyu Li, Wenbo Hu 0002, Ying Shan. [doi]
- How Transformers Utilize Multi-Head Attention in In-Context Learning? A Case Study on Sparse Linear RegressionXingwu Chen, Lei Zhao, Difan Zou. [doi]
- Understanding the Expressivity and Trainability of Fourier Neural Operator: A Mean-Field PerspectiveTakeshi Koshizuka, Masahiro Fujisawa, Yusuke Tanaka, Issei Sato. [doi]
- Time-FFM: Towards LM-Empowered Federated Foundation Model for Time Series ForecastingQingxiang Liu, Xu Liu, Chenghao Liu, Qingsong Wen, Yuxuan Liang. [doi]
- Empowering Active Learning for 3D Molecular Graphs with Geometric Graph IsomorphismRonast Subedi, Lu Wei, Wenhan Gao 0002, Shayok Chakraborty, Yi Liu. [doi]
- Testing Calibration in Nearly-Linear TimeLunjia Hu, Arun Jambulapati, Kevin Tian, Chutong Yang. [doi]
- Extensive-Form Game Solving via Blackwell Approachability on TreeplexesDarshan Chakrabarti, Julien Grand-Clément, Christian Kroer. [doi]
- Controlled maximal variability along with reliable performance in recurrent neural networksChiara Mastrogiuseppe, Rubén Moreno-Bote. [doi]
- UDON: Universal Dynamic Online distillatioN for generic image representationsNikolaos-Antonios Ypsilantis, Kaifeng Chen, André Araújo 0001, Ondrej Chum. [doi]
- Geometry Cloak: Preventing TGS-based 3D Reconstruction from Copyrighted ImagesQi Song 0003, Ziyuan Luo, Ka-Chun Cheung, Simon See, Renjie Wan. [doi]
- Hierarchical Selective ClassificationShani Goren, Ido Galil, Ran El-Yaniv. [doi]
- SyncVIS: Synchronized Video Instance SegmentationRongkun Zheng, Lu Qi, Xi Chen 0072, Yi Wang 0074, Kun Wang, Yu Qiao 0001, Hengshuang Zhao. [doi]
- Optimal Rates for Vector-Valued Spectral Regularization Learning AlgorithmsDimitri Meunier, Zikai Shen, Mattes Mollenhauer, Arthur Gretton, Zhu Li. [doi]
- GREAT Score: Global Robustness Evaluation of Adversarial Perturbation using Generative ModelsZaitang Li, Pin-Yu Chen, Tsung-Yi Ho. [doi]
- A Label is Worth A Thousand Images in Dataset DistillationTian Qin, Zhiwei Deng, David Alvarez-Melis. [doi]
- Smoothed Online Classification can be Harder than Batch ClassificationVinod Raman, Unique Subedi, Ambuj Tewari. [doi]
- Accelerating Pre-training of Multimodal LLMs via Chain-of-SightZiyuan Huang, Kaixiang Ji, Biao Gong, Zhiwu Qing, Qinglong Zhang, Kecheng Zheng, Jian Wang, Jingdong Chen, Ming Yang. [doi]
- GTBench: Uncovering the Strategic Reasoning Capabilities of LLMs via Game-Theoretic EvaluationsJinhao Duan, Renming Zhang, James Diffenderfer, Bhavya Kailkhura, Lichao Sun 0001, Elias Stengel-Eskin, Mohit Bansal, Tianlong Chen, Kaidi Xu. [doi]
- HEST-1k: A Dataset For Spatial Transcriptomics and Histology Image AnalysisGuillaume Jaume, Paul Doucet, Andrew H. Song, Ming-Yang Lu, Cristina Almagro-Pérez, Sophia J. Wagner, Anurag Vaidya, Richard J. Chen, Drew F. K. Williamson, Ahrong Kim, Faisal Mahmood. [doi]
- Group-wise oracle-efficient algorithms for online multi-group learningSamuel Deng, Jingwen Liu, Daniel J. Hsu. [doi]
- Interfacing Foundation Models' EmbeddingsXueyan Zou, Linjie Li, Jianfeng Wang, Jianwei Yang, Mingyu Ding, Junyi Wei, Zhengyuan Yang, Feng Li 0040, Hao Zhang 0097, Shilong Liu, Arul Aravinthan, Yong Jae Lee, Lijuan Wang. [doi]
- Visual Anchors Are Strong Information Aggregators For Multimodal Large Language ModelHaogeng Liu, Quanzeng You, Xiaotian Han, Yongfei Liu, Huaibo Huang, Ran He 0001, Hongxia Yang. [doi]
- GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AIPengcheng Chen, Jin Ye, Guoan Wang, Yanjun Li, Zhongying Deng, Wei Li 0044, Tianbin Li, Haodong Duan, Ziyan Huang, Yanzhou Su, Benyou Wang, Shaoting Zhang 0001, Bin Fu, Jianfei Cai 0001, Bohan Zhuang, Eric J. Seibel, Junjun He, Yu Qiao 0001. [doi]
- Improving Adaptivity via Over-Parameterization in Sequence ModelsYicheng Li, Qian Lin. [doi]
- GraphTrail: Translating GNN Predictions into Human-Interpretable Logical RulesBurouj Armgaan, Manthan Dalmia, Sourav Medya, Sayan Ranu. [doi]
- Language Generation in the LimitJon M. Kleinberg, Sendhil Mullainathan. [doi]
- AHA: Human-Assisted Out-of-Distribution Generalization and DetectionHaoyue Bai 0001, Jifan Zhang, Robert D. Nowak. [doi]
- Universal Rates of Empirical Risk MinimizationSteve Hanneke, Mingyue Xu. [doi]
- Large Pre-trained time series models for cross-domain Time series analysis tasksHarshavardhan Kamarthi, B. Aditya Prakash. [doi]
- Towards Understanding the Working Mechanism of Text-to-Image Diffusion ModelMingyang Yi, Aoxue Li, Yi Xin, Zhenguo Li. [doi]
- Model Decides How to Tokenize: Adaptive DNA Sequence Tokenization with MxDNALifeng Qiao, Peng Ye, Yuchen Ren, Weiqiang Bai, Chaoqi Liang, Xinzhu Ma, Nanqing Dong, Wanli Ouyang. [doi]
- TALoS: Enhancing Semantic Scene Completion via Test-time Adaptation on the Line of SightHyun-Kurl Jang, Jihun Kim, Hyeokjun Kweon, Kuk-Jin Yoon. [doi]
- Performative Control for Linear Dynamical SystemsSongfu Cai, Fei Han, Xuanyu Cao. [doi]
- Transformers Can Do Arithmetic with the Right EmbeddingsSean McLeish, Arpit Bansal, Alex Stein, Neel Jain, John Kirchenbauer, Brian R. Bartoldson, Bhavya Kailkhura, Abhinav Bhatele, Jonas Geiping, Avi Schwarzschild, Tom Goldstein. [doi]
- Human-Object Interaction Detection Collaborated with Large Relation-driven Diffusion ModelsLiulei Li, Wenguan Wang, Yi Yang 0001. [doi]
- Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language ModelsYuancheng Xu, Jiarui Yao, Manli Shu, Yanchao Sun, Zichu Wu, Ning Yu 0006, Tom Goldstein, Furong Huang. [doi]
- ZeroMark: Towards Dataset Ownership Verification without Disclosing WatermarkJunfeng Guo, Yiming Li, Ruibo Chen, Yihan Wu, Chenxi Liu, Heng Huang. [doi]
- Non-parametric classification via expand-and-sparsify representationKaushik Sinha. [doi]
- EvoCodeBench: An Evolving Code Generation Benchmark with Domain-Specific EvaluationsJia Li 0011, Ge Li 0001, Xuanming Zhang, Yunfei Zhao, Yihong Dong, Zhi Jin, Binhua Li, Fei Huang, Yongbin Li. [doi]
- Topological obstruction to the training of shallow ReLU neural networksMarco Nurisso, Pierrick Leroy, Francesco Vaccarino. [doi]
- Communication Efficient Distributed Training with Distributed LionBo Liu, Lemeng Wu, Lizhang Chen, Kaizhao Liang, Jiaxu Zhu, Chen Liang, Raghuraman Krishnamoorthi, Qiang Liu. [doi]
- Annealed Multiple Choice Learning: Overcoming limitations of Winner-takes-all with annealingDavid Perera, Victor Letzelter, Théo Mariotte, Adrien Cortés, Mickaël Chen, Slim Essid, Gaël Richard. [doi]
- Wasserstein convergence of Cech persistence diagrams for samplings of submanifoldsCharles Arnal, David Cohen-Steiner, Vincent Divol. [doi]
- Task Me AnythingJieyu Zhang, WeiKai Huang, Zixian Ma, Oscar Michel, Dong He 0002, Tanmay Gupta, Wei-Chiu Ma, Ali Farhadi, Aniruddha Kembhavi, Ranjay Krishna. [doi]
- Initialization is Critical to Whether Transformers Fit Composite Functions by Reasoning or MemorizingZhongwang Zhang, Pengxiao Lin, Zhiwei Wang, Yaoyu Zhang, Zhi-Qin John Xu. [doi]
- Single Image Reflection Separation via Dual-Stream Interactive TransformersQiming Hu, Hainuo Wang, Xiaojie Guo 0001. [doi]
- MMLONGBENCH-DOC: Benchmarking Long-context Document Understanding with VisualizationsYubo Ma, Yuhang Zang, Liangyu Chen, Meiqi Chen 0001, Yizhu Jiao, Xinze Li, Xinyuan Lu, Ziyu Liu, Yan Ma, Xiaoyi Dong, Pan Zhang 0001, Liangming Pan, Yu-Gang Jiang 0001, Jiaqi Wang 0003, Yixin Cao 0002, Aixin Sun. [doi]
- Paralinguistics-Aware Speech-Empowered Large Language Models for Natural ConversationHeeseung Kim, Soonshin Seo, Kyeongseok Jeong, Ohsung Kwon, Soyoon Kim, Jungwhan Kim, Jaehong Lee, Eunwoo Song, Myungwoo Oh, Jung-Woo Ha, Sungroh Yoon, Kang Min Yoo. [doi]
- Searching for Efficient Linear Layers over a Continuous Space of Structured MatricesAndres Potapczynski, Shikai Qiu, Marc Finzi, Christopher Ferri, Charlie Chen, Micah Goldblum, C. Bayan Bruss, Christopher De Sa, Andrew Gordon Wilson. [doi]
- Return of Unconditional Generation: A Self-supervised Representation Generation MethodTianhong Li, Dina Katabi, Kaiming He. [doi]
- MeshFormer : High-Quality Mesh Generation with 3D-Guided Reconstruction ModelMinghua Liu, Chong Zeng 0001, Xinyue Wei, Ruoxi Shi, Linghao Chen, Chao Xu 0016, Mengqi Zhang, Zhaoning Wang, Xiaoshuai Zhang, Isabella Liu, Hongzhi Wu, Hao Su 0001. [doi]
- AROMA: Preserving Spatial Structure for Latent PDE Modeling with Local Neural FieldsLouis Serrano, Thomas X. Wang, Etienne Le Naour, Jean-Noël Vittaut, Patrick Gallinari. [doi]
- Breaking Long-Tailed Learning Bottlenecks: A Controllable Paradigm with Hypernetwork-Generated Diverse ExpertsZhe Zhao 0008, Haibin Wen, Zikang Wang, Pengkun Wang, Fanfu Wang, Song Lai, Qingfu Zhang 0001, Yang Wang 0015. [doi]
- SCRREAM : SCan, Register, REnder And Map: A Framework for Annotating Accurate and Dense 3D Indoor Scenes with a BenchmarkHyunjun Jung, Weihang Li, Shun-Cheng Wu, William Bittner, Nikolas Brasch, Jifei Song, Eduardo Pérez-Pellitero, Zhensong Zhang, Arthur Moreau, Nassir Navab, Benjamin Busam. [doi]
- Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation ModelsShenghao Fu, Junkai Yan, Qize Yang, Xihan Wei, Xiaohua Xie, Wei-Shi Zheng 0001. [doi]
- Rethinking the Diffusion Models for Missing Data Imputation: A Gradient Flow PerspectiveZhichao Chen 0001, Haoxuan Li 0001, Fangyikang Wang, Odin Zhang, Hu Xu, Xiaoyu Jiang, Zhihuan Song, Hao Wang. [doi]
- Sub-optimal Experts mitigate Ambiguity in Inverse Reinforcement LearningRiccardo Poiani, Gabriele Curti, Alberto Maria Metelli, Marcello Restelli. [doi]
- GFT: Graph Foundation Model with Transferable Tree VocabularyZehong Wang, Zheyuan Zhang, Nitesh V. Chawla, Chuxu Zhang, Yanfang Ye 0002. [doi]
- BLAST: Block-Level Adaptive Structured Matrices for Efficient Deep Neural Network InferenceChangwoo Lee 0001, Soo Min Kwon, Qing Qu 0001, Hun-Seok Kim. [doi]
- Improved learning rates in multi-unit uniform price auctionsMarius Potfer, Dorian Baudry, Hugo Richard, Vianney Perchet, Cheng Wan. [doi]
- Stochastic contextual bandits with graph feedback: from independence number to MAS numberYuxiao Wen, Yanjun Han, Zhengyuan Zhou. [doi]
- Going Beyond Heuristics by Imposing Policy Improvement as a ConstraintChi-Chang Lee, Zhang-Wei Hong, Pulkit Agrawal 0001. [doi]
- Optimal Aggregation of Prediction Intervals under Unsupervised Domain ShiftJiawei Ge, Debarghya Mukherjee, Jianqing Fan. [doi]
- Multiple Physics Pretraining for Spatiotemporal Surrogate ModelsMichael McCabe, Bruno Régaldo-Saint Blancard, Liam Holden Parker, Ruben Ohana, Miles D. Cranmer, Alberto Bietti, Michael Eickenberg, Siavash Golkar, Géraud Krawezik, François Lanusse, Mariel Pettee, Tiberiu Tesileanu, KyungHyun Cho, Shirley Ho. [doi]
- Navigating the Effect of Parametrization for Dimensionality ReductionHaiyang Huang 0003, Yingfan Wang, Cynthia Rudin. [doi]
- $\nabla^2$DFT: A Universal Quantum Chemistry Dataset of Drug-Like Molecules and a Benchmark for Neural Network PotentialsKuzma Khrabrov, Anton Ber, Artem Tsypin, Konstantin Ushenin, Egor Rumiantsev, Alexander Telepov, Dmitry Protasov, Ilya Shenbin, Anton Alekseev 0001, Mikhail Shirokikh, Sergey I. Nikolenko, Elena Tutubalina, Artur Kadurin. [doi]
- Combining Statistical Depth and Fermat Distance for Uncertainty QuantificationHai-Vy Nguyen, Fabrice Gamboa, Reda Chhaibi, Sixin Zhang, Serge Gratton, Thierry Giaccone. [doi]
- LLMDFA: Analyzing Dataflow in Code with Large Language ModelsChengpeng Wang, Wuqi Zhang, Zian Su, Xiangzhe Xu, Xiaoheng Xie, Xiangyu Zhang 0001. [doi]
- In-N-Out: Lifting 2D Diffusion Prior for 3D Object Removal via Tuning-Free Latents AlignmentDongting Hu, Huan Fu, Jiaxian Guo, Liuhua Peng, Tingjin Chu, Feng Liu 0003, Tongliang Liu, Mingming Gong. [doi]
- Towards Scalable and Stable Parallelization of Nonlinear RNNsXavier Gonzalez, Andrew Warrington, Jimmy T. H. Smith, Scott W. Linderman. [doi]
- Learning Interaction-aware 3D Gaussian Splatting for One-shot Hand AvatarsXuan Huang, Hanhui Li, Wanquan Liu, Xiaodan Liang, Yiqiang Yan, Yuhao Cheng, Chenqiang Gao. [doi]
- AID: Attention Interpolation of Text-to-Image DiffusionQiyuan He, Jinghao Wang, Ziwei Liu 0002, Angela Yao. [doi]
- Alignment for HonestyYuqing Yang 0004, Ethan Chern, Xipeng Qiu, Graham Neubig, Pengfei Liu 0003. [doi]
- Exact Gradients for Stochastic Spiking Neural Networks Driven by Rough SignalsChristian Holberg, Cristopher Salvi. [doi]
- SlowFocus: Enhancing Fine-grained Temporal Understanding in Video LLMMing Nie, Dan Ding, Chunwei Wang, Yuanfan Guo, Jianhua Han, Hang Xu, Li Zhang. [doi]
- Towards a "Universal Translator" for Neural Dynamics at Single-Cell, Single-Spike ResolutionYizi Zhang, Yanchen Wang, Donato Jiménez-Benetó, Zixuan Wang, Mehdi Azabou, Blake A. Richards, Renee Tung, Olivier Winter, International Brain Laboratory, Eva L. Dyer, Liam Paninski, Cole L. Hurwitz. [doi]
- The High Line: Exact Risk and Learning Rate Curves of Stochastic Adaptive Learning Rate AlgorithmsElizabeth Collins-Woodfin, Inbar Seroussi, Begoña García Malaxechebarría, Andrew W. Mackenzie, Elliot Paquette, Courtney Paquette. [doi]
- VLKEB: A Large Vision-Language Model Knowledge Editing BenchmarkHan Huang, Haitian Zhong, Tao Yu, Qiang Liu 0006, Shu Wu, Liang Wang 0001, Tieniu Tan. [doi]
- Facilitating Multimodal Classification via Dynamically Learning Modality GapYang Yang 0074, Fengqiang Wan, Qing-Yuan Jiang, Yi Xu 0008. [doi]
- Multistep Distillation of Diffusion Models via Moment MatchingTim Salimans, Thomas Mensink, Jonathan Heek, Emiel Hoogeboom. [doi]
- Streaming Bayes GFlowNetsTiago da Silva, Daniel Augusto de Souza, Diego Mesquita. [doi]
- Depth Anywhere: Enhancing 360 Monocular Depth Estimation via Perspective Distillation and Unlabeled Data AugmentationNing-Hsu (Albert) Wang, Yu-Lun Liu. [doi]
- On the Saturation Effects of Spectral Algorithms in Large DimensionsWeihao Lu 0002, Haobo Zhang 0004, Yicheng Li, Qian Lin. [doi]
- Stylebreeder: Exploring and Democratizing Artistic Styles through Text-to-Image ModelsMatthew Zheng, Enis Simsar, Hidir Yesiltepe, Federico Tombari, Joel Simon, Pinar Yanardag Delul. [doi]
- Over-parameterized Student Model via Tensor Decomposition Boosted Knowledge DistillationYu-Liang Zhan, Zhong-Yi Lu, Hao Sun, Ze-Feng Gao. [doi]
- LoD-Loc: Aerial Visual Localization using LoD 3D Map with Neural Wireframe AlignmentJuelin Zhu, Shen Yan, Long Wang, Shengyue Zhang, Yu Liu 0008, Maojun Zhang. [doi]
- Safetywashing: Do AI Safety Benchmarks Actually Measure Safety Progress?Richard Ren, Steven Basart, Adam Khoja, Alice Gatti, Long Phan, Xuwang Yin, Mantas Mazeika, Alexander Pan, Gabriel Mukobi, Ryan H. Kim, Stephen Fitz, Dan Hendrycks. [doi]
- Conformal Prediction for Class-wise Coverage via Augmented Label Rank CalibrationYuanjie Shi, Subhankar Ghosh, Taha Belkhouja, Jana Doppa, Yan Yan 0006. [doi]
- ROBIN: Robust and Invisible Watermarks for Diffusion Models with Adversarial OptimizationHuayang Huang, Yu Wu, Qian Wang. [doi]
- Beyond Primal-Dual Methods in Bandits with Stochastic and Adversarial ConstraintsMartino Bernasconi, Matteo Castiglioni, Andrea Celli, Federico Fusco. [doi]
- Shaping the distribution of neural responses with interneurons in a recurrent circuit modelDavid Lipshutz, Eero P. Simoncelli. [doi]
- Nuclear Fusion Diamond Polishing DatasetAntonios Alexos, Junze Liu, Shashank Galla, Sean Hayes, Kshitij Bhardwaj, Alexander Schwartz, Monika Biener, Pierre Baldi, Satish T. S. Bukkapatnam, Suhas Bhandarkar. [doi]
- How Does Variance Shape the Regret in Contextual Bandits?Zeyu Jia, Jian Qian, Alexander Rakhlin, Chen-Yu Wei. [doi]
- Graphcode: Learning from multiparameter persistent homology using graph neural networksFlorian Russold, Michael Kerber. [doi]
- LuSh-NeRF: Lighting up and Sharpening NeRFs for Low-light ScenesZefan Qu, Ke Xu 0010, Gerhard P. Hancke 0002, Rynson W. H. Lau. [doi]
- SWT-Bench: Testing and Validating Real-World Bug-Fixes with Code AgentsNiels Mündler, Mark Niklas Müller, Jingxuan He, Martin T. Vechev. [doi]
- FedNE: Surrogate-Assisted Federated Neighbor Embedding for Dimensionality ReductionZiwei Li, Xiaoqi Wang, Hong-You Chen, Han-Wei Shen, Wei-Lun Chao. [doi]
- Exploiting the Replay Memory Before Exploring the Environment: Enhancing Reinforcement Learning Through Empirical MDP IterationHongming Zhang 0003, Chenjun Xiao, Chao Gao, Han Wang, Bo Xu 0002, Martin Müller 0003. [doi]
- Image Reconstruction Via Autoencoding Sequential Deep Image PriorIsmail Alkhouri, Shijun Liang, Evan Bell, Qing Qu 0001, Rongrong Wang, Saiprasad Ravishankar. [doi]
- FedGTST: Boosting Global Transferability of Federated Models via Statistics TuningEvelyn Ma, Chao Pan 0003, S. Rasoul Etesami 0001, Han Zhao, Olgica Milenkovic. [doi]
- Representation Noising: A Defence Mechanism Against Harmful FinetuningDomenic Rosati, Jan Wehner, Kai Williams, Lukasz Bartoszcze, Robie Gonzales, Carsten Maple, Subhabrata Majumdar, Hassan Sajjad 0001, Frank Rudzicz. [doi]
- Disentangling Linear Quadratic Control with Untrusted ML PredictionsTongxin Li, Hao Liu, Yisong Yue. [doi]
- Learning a Single Neuron Robustly to Distributional Shifts and Adversarial Label NoiseShuyao Li, Sushrut Karmalkar, Ilias Diakonikolas, Jelena Diakonikolas. [doi]
- GAVEL: Generating Games via Evolution and Language ModelsGraham Todd, Alexander Padula, Matthew Stephenson, Éric Piette, Dennis J. N. J. Soemers, Julian Togelius. [doi]
- GeSS: Benchmarking Geometric Deep Learning under Scientific Applications with Distribution ShiftsDeyu Zou, Shikun Liu, Siqi Miao 0001, Victor Fung, Shiyu Chang, Pan Li 0005. [doi]
- Generalizing Consistency Policy to Visual RL with Prioritized Proximal Experience RegularizationHaoran Li 0010, Zhennan Jiang, Yuhui Chen, Dongbin Zhao. [doi]
- Unveiling User Satisfaction and Creator Productivity Trade-Offs in Recommendation PlatformsFan Yao, Yiming Liao, Jingzhou Liu, Shaoliang Nie, Qifan Wang, Haifeng Xu, Hongning Wang. [doi]
- CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-ExpertsJiachen Li 0003, Xinyao Wang, Sijie Zhu, Chia-Wen Kuo, Lu Xu, Fan Chen, Jitesh Jain, Humphrey Shi, Longyin Wen. [doi]
- A Compositional Atlas for Algebraic CircuitsBenjie Wang 0001, Denis Deratani Mauá, Guy Van den Broeck, YooJung Choi 0001. [doi]
- Classic GNNs are Strong Baselines: Reassessing GNNs for Node ClassificationYuankai Luo, Lei Shi 0002, Xiao-Ming Wu 0003. [doi]
- Unveiling the Hidden Structure of Self-Attention via Kernel Principal Component AnalysisRachel S. Y. Teo, Tan Nguyen. [doi]
- StackEval: Benchmarking LLMs in Coding AssistanceNidhish Shah, Zulkuf Genc, Dogu Araci. [doi]
- Unrolled denoising networks provably learn to perform optimal Bayesian inferenceAayush Karan, Kulin Shah, Sitan Chen, Yonina C. Eldar. [doi]
- Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional EncodingZhenyu Zhang 0015, Runjin Chen, Shiwei Liu 0003, Zhewei Yao, Olatunji Ruwase, Beidi Chen, Xiaoxia Wu, Zhangyang Wang. [doi]
- Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation ModelsYuchen Hu, Chen Chen, Chao-Han Yang, Chengwei Qin, Pin-Yu Chen, Engsiong Chng, Chao Zhang. [doi]
- Is Function Similarity Over-Engineered? Building a BenchmarkRebecca Saul, Chang Liu, Noah Fleischmann, Richard Zak, Kristopher K. Micinski, Edward Raff, James Holt. [doi]
- Improving Deep Reinforcement Learning by Reducing the Chain Effect of Value and Policy ChurnHongyao Tang, Glen Berseth. [doi]
- No Regrets: Investigating and Improving Regret Approximations for Curriculum DiscoveryAlexander Rutherford, Michael Beukman, Timon Willi, Bruno Lacerda, Nick Hawes, Jakob N. Foerster. [doi]
- Flaws can be Applause: Unleashing Potential of Segmenting Ambiguous Objects in SAMChenxin Li, Yuzhi Huang, Wuyang Li, Hengyu Liu 0007, Xinyu Liu 0001, Qing Xu, Zhen Chen 0013, Yue Huang 0001, Yixuan Yuan. [doi]
- Position Coupling: Improving Length Generalization of Arithmetic Transformers Using Task StructureHanseul Cho 0002, Jaeyoung Cha, Pranjal Awasthi, Srinadh Bhojanapalli, Anupam Gupta 0001, Chulhee Yun. [doi]
- Rethinking No-reference Image Exposure Assessment from Holism to Pixel: Models, Datasets and BenchmarksShuai He, Shuntian Zheng, Anlong Ming, Banyu Wu, Huadong Ma. [doi]
- 2D-OOB: Attributing Data Contribution Through Joint Valuation FrameworkYifan Sun, Jingyan Shen, Yongchan Kwon. [doi]
- Implicitly Guided Design with PropEn: Match your Data to Follow the GradientNatasa Tagasovska, Vladimir Gligorijevic, KyungHyun Cho, Andreas Loukas. [doi]
- A Metalearned Neural Circuit for Nonparametric Bayesian InferenceJake Snell, Gianluca M. Bencomo, Tom Griffiths 0001. [doi]
- Universal Online Convex Optimization with 1 Projection per RoundWenhao Yang, Yibo Wang 0005, Peng Zhao 0006, Lijun Zhang 0005. [doi]
- Linear Uncertainty Quantification of Graphical Model InferenceChenghua Guo, Han Yu, Jiaxin Liu, Chao Chen, Qi Li, Sihong Xie, Xi Zhang. [doi]
- Bridge-IF: Learning Inverse Protein Folding with Markov BridgesYiheng Zhu, Jialu Wu, Qiuyi Li, Jiahuan Yan, Mingze Yin, Wei Wu, Mingyang Li, Jieping Ye, Zheng Wang, Jian Wu. [doi]
- Scale-invariant Optimal Sampling for Rare-events Data and Sparse ModelsJing Wang, HaiYing Wang 0004, Hao Zhang. [doi]
- SpeAr: A Spectral Approach for Zero-Shot Node ClassificationTing Guo, Da Wang, Jiye Liang, Kaihan Zhang, Jianchao Zeng 0001. [doi]
- Exploiting Activation Sparsity with Dense to Dynamic-k Mixture-of-Experts ConversionFilip Szatkowski, Bartosz Wójcik, Mikolaj Piórczynski, Simone Scardapane. [doi]
- Matryoshka Query Transformer for Large Vision-Language ModelsWenbo Hu 0006, Zi-Yi Dou, Liunian Harold Li, Amita Kamath, Nanyun Peng 0001, Kai-Wei Chang. [doi]
- Coevolving with the Other You: Fine-Tuning LLM with Sequential Cooperative Multi-Agent Reinforcement LearningHao Ma, Tianyi Hu, Zhiqiang Pu, Boyin Liu, Xiaolin Ai, Yanyan Liang 0001, Min Chen. [doi]
- The Surprising Effectiveness of SP Voting with Partial PreferencesHadi Hosseini, Debmalya Mandal, Amrit Puhan. [doi]
- Zipfian WhiteningSho Yokoi, Han Bao 0002, Hiroto Kurita, Hidetoshi Shimodaira. [doi]
- EfficientCAPER: An End-to-End Framework for Fast and Robust Category-Level Articulated Object Pose EstimationXinyi Yu, Haonan Jiang, Li Zhang, Lin Yuanbo Wu, Linlin Ou, Liu Liu. [doi]
- DreamSteerer: Enhancing Source Image Conditioned Editability using Personalized Diffusion ModelsZhengyang Yu, Zhaoyuan Yang, Jing Zhang. [doi]
- SceneDiffuser: Efficient and Controllable Driving Simulation Initialization and RolloutChiyu Max Jiang, Yijing Bai, Andre Cornman, Christopher Davis, Xiukun Huang, Hong Jeon, Sakshum Kulshrestha, John Lambert, Shuangyu Li, Xuanyu Zhou, Carlos Fuertes, Chang Yuan, Mingxing Tan, Yin Zhou, Dragomir Anguelov. [doi]
- Virtual Scanning: Unsupervised Non-line-of-sight Imaging from Irregularly Undersampled TransientsXingyu Cui, Huanjing Yue, Song Li, Xiangjun Yin, Yusen Hou, Yun Meng, Kai Zou, Xiaolong Hu, Jingyu Yang. [doi]
- Exploring Jacobian Inexactness in Second-Order Methods for Variational Inequalities: Lower Bounds, Optimal Algorithms and Quasi-Newton ApproximationsArtem Agafonov, Petr Ostroukhov, Roman Mozhaev, Konstantin Yakovlev, Eduard Gorbunov, Martin Takác, Alexander V. Gasnikov, Dmitry Kamzolov. [doi]
- Beyond the Doors of Perception: Vision Transformers Represent Relations Between ObjectsMichael A. Lepori, Alexa R. Tartaglini, Wai Keen Vong, Thomas Serre, Brenden M. Lake, Ellie Pavlick. [doi]
- Easy Regional Contrastive Learning of Expressive Fashion RepresentationsDaiqing Qi, Handong Zhao, Sheng Li 0001. [doi]
- TurboHopp: Accelerated Molecule Scaffold Hopping with Consistency ModelsKiwoong Yoo, Owen Oertell, Junhyun Lee, Sanghoon Lee, Jaewoo Kang. [doi]
- Predicting Future Actions of Reinforcement Learning AgentsStephen Chung, Scott Niekum, David Krueger 0001. [doi]
- The GAN is dead; long live the GAN! A Modern GAN BaselineNick Huang, Aaron Gokaslan, Volodymyr Kuleshov, James Tompkin 0001. [doi]
- DFA-GNN: Forward Learning of Graph Neural Networks by Direct Feedback AlignmentGongpei Zhao, Tao Wang 0011, Congyan Lang, Yi Jin 0001, Yidong Li, Haibin Ling. [doi]
- Principled Bayesian Optimization in Collaboration with Human ExpertsWenjie Xu, Masaki Adachi, Colin N. Jones, Michael A. Osborne. [doi]
- Thought of Search: Planning with Language Models Through The Lens of EfficiencyMichael Katz 0001, Harsha Kokel, Kavitha Srinivas, Shirin Sohrabi. [doi]
- Lookback Prophet InequalitiesZiyad Benomar, Dorian Baudry, Vianney Perchet. [doi]
- FuseAnyPart: Diffusion-Driven Facial Parts Swapping via Multiple Reference ImagesZheng Yu, Yaohua Wang, Siying Cui, Aixi Zhang, Wei-Long Zheng, Senzhang Wang. [doi]
- Sample Complexity of Interventional Causal Representation LearningEmre Acartürk, Burak Varici, Karthikeyan Shanmugam, Ali Tajer. [doi]
- Association of Objects May Engender Stereotypes: Mitigating Association-Engendered Stereotypes in Text-to-Image GenerationJunlei Zhou, Jiashi Gao, Xiangyu Zhao 0001, Xin Yao 0001, Xuetao Wei. [doi]
- Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space ModelYuheng Shi, Minjing Dong, Chang Xu. [doi]
- Unscrambling disease progression at scale: fast inference of event permutations with optimal transportPeter A. Wijeratne, Daniel C. Alexander. [doi]
- GUIDE: Real-Time Human-Shaped AgentsLingyu Zhang, Zhengran Ji, Nicholas R. Waytowich, Boyuan Chen 0001. [doi]
- ChatTracker: Enhancing Visual Tracking Performance via Chatting with Multimodal Large Language ModelYiming Sun, Fan Yu, Shaoxiang Chen, Yu Zhang, Junwei Huang, Yang Li, Chenhui Li, Changbo Wang. [doi]
- Private Online Learning via Lazy AlgorithmsHilal Asi, Tomer Koren, Daogao Liu, Kunal Talwar. [doi]
- Progressive Entropic Optimal Transport SolversParnian Kassraie, Aram-Alexandre Pooladian, Michal Klein, James Thornton, Jonathan Niles-Weed, Marco Cuturi. [doi]
- Building on Efficient Foundations: Effective Training of LLMs with Structured Feedforward LayersXiuying Wei, Skander Moalla, Razvan Pascanu, Caglar Gulcehre. [doi]
- Fast Sampling via Discrete Non-Markov Diffusion Models with Predetermined Transition TimeZixiang Chen, Huizhuo Yuan, Yongqian Li, Yiwen Kou, Junkai Zhang, Quanquan Gu. [doi]
- HW-GPT-Bench: Hardware-Aware Architecture Benchmark for Language ModelsRhea Sukthanker, Arber Zela, Benedikt Staffler, Aaron Klein, Lennart Purucker, Jörg K. H. Franke, Frank Hutter. [doi]
- RTify: Aligning Deep Neural Networks with Human Behavioral DecisionsYu-Ang Cheng, Ivan F. Rodriguez Rodriguez, Sixuan Chen, Kohitij Kar, Takeo Watanabe, Thomas Serre. [doi]
- BuckTales: A multi-UAV dataset for multi-object tracking and re-identification of wild antelopesHemal Naik, Junran Yang, Dipin Das, Margaret Crofoot, Akanksha Rathore, Vivek Hari Sridhar. [doi]
- Improved Guarantees for Fully Dynamic k-Center Clustering with Outliers in General Metric SpacesLeyla Biabani, Annika Hennes, Denise La Gordt Dillie, Morteza Monemizadeh, Melanie Schmidt 0001. [doi]
- A Framework for Bilevel Optimization on Riemannian ManifoldsAndi Han, Bamdev Mishra, Pratik Kumar Jawanpuria, Akiko Takeda. [doi]
- Coded Computing for Resilient Distributed Computing: A Learning-Theoretic FrameworkParsa Moradi, Behrooz Tahmasebi, Mohammad Ali Maddah-Ali. [doi]
- VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language TasksJiannan Wu, Muyan Zhong, Sen Xing, Zeqiang Lai, Zhaoyang Liu 0001, Zhe Chen 0017, Wenhai Wang, Xizhou Zhu, Lewei Lu, Tong Lu, Ping Luo 0002, Yu Qiao, Jifeng Dai. [doi]
- A Versatile Diffusion Transformer with Mixture of Noise Levels for Audiovisual GenerationGwanghyun Kim, Alonso-Martinez, Yu-Chuan Su, Brendan Jou, José Lezama, Agrim Gupta, Lijun Yu, Lu Jiang 0004, Aren Jansen, Jacob Walker, Krishna Somandepalli. [doi]
- NetworkGym: Reinforcement Learning Environments for Multi-Access Traffic Management in Network SimulationMomin Haider, Ming Yin, Menglei Zhang, Arpit Gupta, Jing Zhu, Yu-Xiang Wang. [doi]
- RCDN: Towards Robust Camera-Insensitivity Collaborative Perception via Dynamic Feature-based 3D Neural ModelingTianhang Wang, Fan Lu 0001, Zehan Zheng, Zhijun Li 0001, Guang Chen 0001, Changjun Jiang. [doi]
- Superposed Decoding: Multiple Generations from a Single Autoregressive Inference PassEthan Shen, Alan Fan, Sarah M. Pratt, Jae Sung Park, Matthew Wallingford, Sham M. Kakade, Ari Holtzman, Ranjay Krishna, Ali Farhadi, Aditya Kusupati. [doi]
- PERIA: Perceive, Reason, Imagine, Act via Holistic Language and Vision Planning for ManipulationFei Ni 0001, Jianye Hao, Shiguang Wu 0001, Longxin Kou, Yifu Yuan, Zibin Dong, Jinyi Liu 0002, Mingzhi Li, Yuzheng Zhuang, Yan Zheng 0002. [doi]
- Compact Proofs of Model Performance via Mechanistic InterpretabilityJason Gross, Rajashree Agrawal, Thomas Kwa, Euan Ong, Chun Hei Yip, Alex Gibson, Soufiane Noubir, Lawrence Chan. [doi]
- Recurrent Reinforcement Learning with MemoroidsSteven D. Morad, Chris Lu 0001, Ryan Kortvelesy, Stephan Liwicki, Jakob Foerster, Amanda Prorok. [doi]
- Approximately Pareto-optimal Solutions for Bi-Objective k-ClusteringAnna Arutyunova, Jan Eube, Heiko Röglin, Melanie Schmidt 0001, Sarah Sturm, Julian Wargalla. [doi]
- Dendritic Integration Inspired Artificial Neural Networks Capture Data CorrelationChongming Liu, Jingyang Ma, Songting Li, Douglas Zhou. [doi]
- Is Your HD Map Constructor Reliable under Sensor Corruptions?Xiaoshuai Hao, Mengchuan Wei, Yifan Yang, Haimei Zhao, Hui Zhang 0093, Yi Zhou 0020, Qiang Wang, Weiming Li, Lingdong Kong, Jing Zhang 0037. [doi]
- Concentrate Attention: Towards Domain-Generalizable Prompt Optimization for Language ModelsChengzhengxu Li, Xiaoming Liu, Zhaohan Zhang, Yichen Wang, Chen Liu, Yu Lan, Chao Shen. [doi]
- An Accelerated Gradient Method for Convex Smooth Simple Bilevel OptimizationJincheng Cao, Ruichen Jiang, Erfan Yazdandoost Hamedani, Aryan Mokhtari. [doi]
- UniSDF: Unifying Neural Representations for High-Fidelity 3D Reconstruction of Complex Scenes with ReflectionsFangjinhua Wang, Marie-Julie Rakotosaona, Michael Niemeyer, Richard Szeliski, Marc Pollefeys, Federico Tombari. [doi]
- CableInspect-AD: An Expert-Annotated Anomaly Detection DatasetAkshatha Arodi, Margaux Luck, Jean-Luc Bedwani, Aldo Zaimi, Ge Li, Nicolas Pouliot, Julien Beaudry, Gaétan Marceau-Caron. [doi]
- Benchmarking Complex Instruction-Following with Multiple Constraints CompositionBosi Wen, Pei Ke, Xiaotao Gu, Lindong Wu, Hao Huang, Jinfeng Zhou, Wenchuang Li, Binxin Hu, Wendy Gao, Jiaxing Xu, Yiming Liu, Jie Tang, Hongning Wang, Minlie Huang. [doi]
- Stealth edits to large language modelsOliver J. Sutton, Qinghua Zhou, Wei Wang 0357, Desmond J. Higham, Alexander N. Gorban, Alexander Bastounis, Ivan Tyukin. [doi]
- IKEA Manuals at Work: 4D Grounding of Assembly Instructions on Internet VideosYunong Liu, Cristóbal Eyzaguirre, Manling Li, Shubh Khanna, Juan Carlos Niebles, Vineeth Ravi, Saumitra Mishra, Weiyu Liu, Jiajun Wu 0001. [doi]
- SVFT: Parameter-Efficient Fine-Tuning with Singular VectorsVijay Lingam, Atula Neerkaje, Aditya Vavre, Aneesh Shetty, Gautham Krishna Gudur, Joydeep Ghosh, Eunsol Choi, Alex Dimakis, Aleksandar Bojchevski, Sujay Sanghavi. [doi]
- DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot ExecutionYang Yue, Yulin Wang, Bingyi Kang, Yizeng Han, Shenzhi Wang, Shiji Song, Jiashi Feng, Gao Huang 0001. [doi]
- Unveiling the Bias Impact on Symmetric Moral Consistency of Large Language ModelsZiyi Zhou, Xinwei Guo, Jiashi Gao, Xiangyu Zhao 0001, Shiyao Zhang, Xin Yao 0001, Xuetao Wei. [doi]
- KV Cache is 1 Bit Per Channel: Efficient Large Language Model Inference with Coupled QuantizationTianyi Zhang 0011, Jonah Yi, Zhaozhuo Xu, Anshumali Shrivastava. [doi]
- The Impact of Geometric Complexity on Neural Collapse in Transfer LearningMichael Munn, Benoit Dherin, Javier Gonzalvo. [doi]
- Robust group and simultaneous inferences for high-dimensional single index modelWeiChao Yang, Hongwei Shi, Xu Guo, Changliang Zou. [doi]
- MaNo: Exploiting Matrix Norm for Unsupervised Accuracy Estimation Under Distribution ShiftsRenchunzi Xie, Ambroise Odonnat, Vasilii Feofanov, Weijian Deng, Jianfeng Zhang, Bo An 0001. [doi]
- MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence ModelsZichun Yu, Spandan Das, Chenyan Xiong. [doi]
- CriticEval: Evaluating Large-scale Language Model as CriticTian Lan 0003, Wenwei Zhang, Chen Xu, Heyan Huang, Dahua Lin, Kai Chen, Xian-Ling Mao. [doi]
- Robust Prompt Optimization for Defending Language Models Against Jailbreaking AttacksAndy Zhou, Bo Li, Haohan Wang. [doi]
- AMBROSIA: A Benchmark for Parsing Ambiguous Questions into Database QueriesIrina Saparina, Mirella Lapata. [doi]
- From Causal to Concept-Based Representation LearningGoutham Rajendran, Simon Buchholz, Bryon Aragam, Bernhard Schölkopf, Pradeep Ravikumar. [doi]
- How Far Can Transformers Reason? The Globality Barrier and Inductive ScratchpadEmmanuel Abbe, Samy Bengio, Aryo Lotfi, Colin Sandon, Omid Saremi. [doi]
- LeDex: Training LLMs to Better Self-Debug and Explain CodeNan Jiang 0012, Xiaopeng Li 0002, Shiqi Wang 0002, Qiang Zhou, Soneya Binta Hossain, Baishakhi Ray, Varun Kumar, Xiaofei Ma 0001, Anoop Deoras. [doi]
- Enhancing Consistency-Based Image Generation via Adversarialy-Trained Classification and Energy-Based DiscriminationShelly Golan, Roy Ganz, Michael Elad. [doi]
- Leveraging Drift to Improve Sample Complexity of Variance Exploding Diffusion ModelsRuofeng Yang, Zhijie Wang, Bo Jiang, Shuai Li. [doi]
- Neural Flow Diffusion Models: Learnable Forward Process for Improved Diffusion ModellingGrigory Bartosh, Dmitry P. Vetrov, Christian Andersson Naesseth. [doi]
- Adaptive and Optimal Second-order Optimistic Methods for Minimax OptimizationRuichen Jiang, Ali Kavis, Qiujiang Jin, Sujay Sanghavi, Aryan Mokhtari. [doi]
- Global Convergence in Training Large-Scale TransformersCheng Gao, Yuan Cao, Zihao Li, Yihan He, Mengdi Wang, Han Liu, Jason M. Klusowski, Jianqing Fan. [doi]
- Decoupled Kullback-Leibler Divergence LossJiequan Cui, Zhuotao Tian, Zhisheng Zhong, Xiaojuan Qi 0001, Bei Yu 0001, Hanwang Zhang. [doi]
- Improving Neural Network Surface Processing with Principal CurvaturesJosquin Harrison, James Benn, Maxime Sermesant. [doi]
- Shaving Weights with Occam's Razor: Bayesian Sparsification for Neural Networks using the Marginal LikelihoodRayen Dhahri, Alexander Immer, Bertrand Charpentier, Stephan Günnemann, Vincent Fortuin. [doi]
- An effective framework for estimating individualized treatment rulesJoowon Lee, Jared D. Huling, Guanhua Chen. [doi]
- Faster Algorithms for User-Level Private Stochastic Convex OptimizationAndrew Lowy, Daogao Liu, Hilal Asi. [doi]
- Generalizable Person Re-identification via Balancing Alignment and UniformityYoonki Cho, Jaeyoon Kim, Woo-Jae Kim, Junsik Jung, Sung-Eui Yoon. [doi]
- SpikeReveal: Unlocking Temporal Sequences from Real Blurry Inputs with Spike StreamsKang Chen, Shiyan Chen, Jiyuan Zhang, Baoyue Zhang, Yajing Zheng, Tiejun Huang 0001, Zhaofei Yu. [doi]
- Topic-Conversation Relevance (TCR) Dataset and BenchmarksYaran Fan, Jamie Pool, Senja Filipi, Ross Cutler. [doi]
- Model Fusion through Bayesian Optimization in Language Model Fine-TuningChaeyun Jang, Hyungi Lee, Jungtaek Kim, Juho Lee. [doi]
- Do LLMs dream of elephants (when told not to)? Latent concept association and associative memory in transformersYibo Jiang, Goutham Rajendran, Pradeep Ravikumar, Bryon Aragam. [doi]
- UniIF: Unified Molecule Inverse FoldingZhangyang Gao, Jue Wang 0004, Cheng Tan 0012, Lirong Wu, Yufei Huang 0002, Siyuan Li 0002, Zhirui Ye, Stan Z. Li. [doi]
- Bridging Multicalibration and Out-of-distribution Generalization Beyond Covariate ShiftJiayun Wu, Jiashuo Liu, Peng Cui, Steven Wu 0001. [doi]
- MLLM-CompBench: A Comparative Reasoning Benchmark for Multimodal LLMsJihyung Kil, Zheda Mai, Justin Lee, Arpita Chowdhury, Zihe Wang, Kerrie Cheng, Lemeng Wang, Ye Liu, Wei-Lun Chao. [doi]
- T2VSafetyBench: Evaluating the Safety of Text-to-Video Generative ModelsYibo Miao, Yifan Zhu, Lijia Yu, Jun Zhu 0001, Xiao-Shan Gao, Yinpeng Dong. [doi]
- MoLE: Enhancing Human-centric Text-to-image Diffusion via Mixture of Low-rank ExpertsJie Zhu, Yixiong Chen, Mingyu Ding, Ping Luo 0002, Leye Wang, Jingdong Wang 0001. [doi]
- Context and Geometry Aware Voxel Transformer for Semantic Scene CompletionZhu Yu 0001, Runmin Zhang, Jiacheng Ying, Junchen Yu, Xiaohai Hu, Lun Luo, Si-Yuan Cao, Hui-Liang Shen. [doi]
- Multimodal Large Language Models Make Text-to-Image Generative Models Align BetterXun Wu, Shaohan Huang, Guolong Wang, Jing Xiong, Furu Wei. [doi]
- EZ-HOI: VLM Adaptation via Guided Prompt Learning for Zero-Shot HOI DetectionQinqian Lei, Bo Wang 0019, Robby T. Tan. [doi]
- Action Imitation in Common Action Space for Customized Action Image SynthesisWang Lin, Jingyuan Chen, Jiaxin Shi, Zirun Guo, Yichen Zhu, Zehan Wang 0001, Tao Jin 0004, Zhou Zhao, Fei Wu 0001, Shuicheng Yan, Hanwang Zhang. [doi]
- Retrieval-Retro: Retrieval-based Inorganic Retrosynthesis with Expert KnowledgeHeewoong Noh, Namkyeong Lee, Gyoung S. Na, Chanyoung Park 0001. [doi]
- START: A Generalized State Space Model with Saliency-Driven Token-Aware TransformationJintao Guo, Lei Qi 0001, Yinghuan Shi, Yang Gao 0001. [doi]
- ActionAtlas: A VideoQA Benchmark for Domain-specialized Action RecognitionMohammadreza Salehi, Jae Sung Park, Aditya Kusupati, Ranjay Krishna, Yejin Choi 0001, Hanna Hajishirzi, Ali Farhadi. [doi]
- Divergences between Language Models and Human BrainsYuchen Zhou, Emmy Liu, Graham Neubig, Michael J. Tarr, Leila Wehbe. [doi]
- Dual-Personalizing Adapter for Federated Foundation ModelsYiyuan Yang, Guodong Long, Tao Shen 0001, Jing Jiang 0002, Michael Blumenstein. [doi]
- VB-LoRA: Extreme Parameter Efficient Fine-Tuning with Vector BanksYang Li 0146, Shaobo Han, Jonathan Shihao Ji. [doi]
- ProG: A Graph Prompt Learning BenchmarkChenyi Zi, Haihong Zhao, Xiangguo Sun, Yiqing Lin, Hong Cheng 0001, Jia Li 0009. [doi]
- HiCoM: Hierarchical Coherent Motion for Dynamic Streamable Scenes with 3D Gaussian SplattingQiankun Gao, Jiarui Meng, Chengxiang Wen, Jie Chen, Jian Zhang. [doi]
- Deep Learning in Medical Image Registration: Magic or Mirage?Rohit Jena, Deeksha Sethi, Pratik Chaudhari, James C. Gee. [doi]
- The Best of Both Worlds: On the Dilemma of Out-of-distribution DetectionQingyang Zhang, Qiuxuan Feng, Joey Tianyi Zhou, Yatao Bian, Qinghua Hu, Changqing Zhang. [doi]
- Classifier-guided Gradient Modulation for Enhanced Multimodal LearningZirun Guo, Tao Jin, Jingyuan Chen, Zhou Zhao. [doi]
- Ad Auctions for LLMs via Retrieval Augmented GenerationMohammadTaghi Hajiaghayi, Sébastien Lahaie, Keivan Rezaei, Suho Shin. [doi]
- SPRINQL: Sub-optimal Demonstrations driven Offline Imitation LearningHuy Hoang, Tien Mai, Pradeep Varakantham. [doi]
- IR-CM: The Fast and General-purpose Image Restoration Method Based on Consistency ModelXiaoxuan Gong, Jie Ma. [doi]
- Vript: A Video Is Worth Thousands of WordsDongjie Yang, Suyuan Huang, Chengqiang Lu, Xiaodong Han, Haoxin Zhang, Yan Gao, Yao Hu, Hai Zhao 0001. [doi]
- Prototypical Hash Encoding for On-the-Fly Fine-Grained Category DiscoveryHaiyang Zheng, Nan Pu, Wenjing Li 0005, Nicu Sebe, Zhun Zhong. [doi]
- Neural Collapse To Multiple Centers For Imbalanced DataHongRen Yan, Yuhua Qian, Furong Peng, Jiachen Luo, Zheqing Zhu, Feijiang Li. [doi]
- A Topology-aware Graph Coarsening Framework for Continual Graph LearningXiaoxue Han, Zhuo Feng, Yue Ning 0001. [doi]
- Exploring the Edges of Latent State Clusters for Goal-Conditioned Reinforcement LearningYuanlin Duan, Guofeng Cui, He Zhu 0001. [doi]
- Graph Structure Inference with BAM: Neural Dependency Processing via Bilinear AttentionPhilipp Froehlich, Heinz Koeppl. [doi]
- CSPG: Crossing Sparse Proximity Graphs for Approximate Nearest Neighbor SearchMing Yang, Yuzheng Cai, Weiguo Zheng. [doi]
- StepbaQ: Stepping backward as Correction for Quantized Diffusion ModelsYi-Chung Chen, Zhi-Kai Huang, Jing-Ren Chen. [doi]
- Learning Optimal Tax Design in Nonatomic Congestion GamesQiwen Cui, Maryam Fazel, Simon S. Du. [doi]
- Particle Semi-Implicit Variational InferenceJen Ning Lim, Adam M. Johansen. [doi]
- Generative Modelling of Structurally Constrained GraphsManuel Madeira, Clément Vignac, Dorina Thanou, Pascal Frossard. [doi]
- Generalization Analysis for Label-Specific Representation LearningYi-Fan Zhang, Min-Ling Zhang. [doi]
- 2DQuant: Low-bit Post-Training Quantization for Image Super-ResolutionKai Liu, Haotong Qin, Yong Guo, Xin Yuan 0002, Linghe Kong, Guihai Chen, Yulun Zhang 0001. [doi]
- Fairness-Aware Estimation of Graphical ModelsZhuoping Zhou, Davoud Ataee Tarzanagh, Bojian Hou, Qi Long, Li Shen 0001. [doi]
- RAW: A Robust and Agile Plug-and-Play Watermark Framework for AI-Generated Images with Provable GuaranteesXun Xian, Ganghua Wang, Xuan Bi, Jayanth Srinivasa, Ashish Kundu, Mingyi Hong 0001, Jie Ding 0002. [doi]
- MAN TruckScenes: A multimodal dataset for autonomous trucking in diverse conditionsFelix Fent, Fabian Kuttenreich, Florian Ruch, Farija Rizwin, Stefan Juergens, Lorenz Lechermann, Christian Nissler, Andrea Perl, Ulrich Voll, Min Yan, Markus Lienkamp. [doi]
- ReXTime: A Benchmark Suite for Reasoning-Across-Time in VideosJr-Jen Chen, Yu-Chien Liao, Hsi-Che Lin, Yu-Chu Yu, Yen-Chun Chen 0001, Yu-Chiang Frank Wang. [doi]
- Vivid-ZOO: Multi-View Video Generation with Diffusion ModelBing Li, Cheng Zheng, Wenxuan Zhu, Jinjie Mai, Biao Zhang 0005, Peter Wonka, Bernard Ghanem. [doi]
- Physically Compatible 3D Object Modeling from a Single ImageMinghao Guo, Bohan Wang, Pingchuan Ma 0002, Tianyuan Zhang, Crystal Elaine Owens, Chuang Gan, Josh Tenenbaum 0001, Kaiming He, Wojciech Matusik. [doi]
- Cross-model Control: Improving Multiple Large Language Models in One-time TrainingJiayi Wu, Hao Sun 0015, Hengyi Cai, Lixin Su, Shuaiqiang Wang, Dawei Yin, Xiang Li 0067, Ming Gao 0001. [doi]
- ALPS: Improved Optimization for Highly Sparse One-Shot Pruning for Large Language ModelsXiang Meng, Kayhan Behdin, Haoyue Wang, Rahul Mazumder. [doi]
- DTGB: A Comprehensive Benchmark for Dynamic Text-Attributed GraphsJiasheng Zhang, Jialin Chen, Menglin Yang 0004, Aosong Feng, Shuang Liang 0002, Jie Shao 0001, Rex Ying. [doi]
- Consistency Models for Scalable and Fast Simulation-Based InferenceMarvin Schmitt, Valentin Pratz, Ullrich Köthe, Paul-Christian Bürkner, Stefan T. Radev. [doi]
- Towards Flexible 3D Perception: Object-Centric Occupancy Completion Augments 3D Object DetectionChaoda Zheng, Feng Wang 0018, Naiyan Wang, Shuguang Cui, Zhen Li 0026. [doi]
- Many-Shot In-Context LearningRishabh Agarwal, Avi Singh, Lei Zhang, Bernd Bohnet, Luis Rosias, Stephanie C. Y. Chan, Biao Zhang, Ankesh Anand, Zaheer Abbas, Azade Nova, John D. Co-Reyes, Eric Chu, Feryal M. P. Behbahani, Aleksandra Faust, Hugo Larochelle. [doi]
- Mixture of Demonstrations for In-Context LearningSong Wang, Zihan Chen 0002, Chengshuai Shi, Cong Shen, Jundong Li. [doi]
- Fundamental Convergence Analysis of Sharpness-Aware MinimizationPham Khanh, Hoang-Chau Luong, Boris S. Mordukhovich, Dat Tran. [doi]
- Efficient Contextual LLM Cascades through Budget-Constrained Policy LearningXuechen Zhang 0002, Zijian Huang 0015, Ege Onur Taga, Carlee Joe-Wong, Samet Oymak, Jiasi Chen. [doi]
- Reinforcement Learning Gradients as Vitamin for Online Finetuning Decision TransformersKai Yan, Alexander G. Schwing, Yu-Xiong Wang. [doi]
- SafeWorld: Geo-Diverse Safety AlignmentDa Yin, Haoyi Qiu, Kung-Hsiang Huang, Kai-Wei Chang, Nanyun Peng 0001. [doi]
- ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token IdentificationYefei He, Luoming Zhang, Weijia Wu 0001, Jing Liu 0048, Hong Zhou, Bohan Zhuang. [doi]
- Meteor: Mamba-based Traversal of Rationale for Large Language and Vision ModelsByung kwan Lee, Chae Won Kim, Beomchan Park, Yong Man Ro. [doi]
- Multi-Head Mixture-of-ExpertsXun Wu, Shaohan Huang, Wenhui Wang 0003, Shuming Ma, Li Dong 0004, Furu Wei. [doi]
- Warm-up Free Policy Optimization: Improved Regret in Linear Markov Decision ProcessesAsaf Cassel, Aviv Rosenberg 0002. [doi]
- OnlineTAS: An Online Baseline for Temporal Action SegmentationQing Zhong, Guodong Ding, Angela Yao. [doi]
- HGDL: Heterogeneous Graph Label Distribution LearningYufei Jin, Heng Lian, Yi He 0007, Xingquan Zhu 0001. [doi]
- VisMin: Visual Minimal-Change UnderstandingRabiul Awal, Saba Ahmadi, Le Zhang, Aishwarya Agrawal. [doi]
- Optimal Flow Matching: Learning Straight Trajectories in Just One StepNikita Kornilov, Petr Mokrov, Alexander V. Gasnikov, Alexander Korotin. [doi]
- Expressive Gaussian Human Avatars from Monocular RGB VideoHezhen Hu, Zhiwen Fan, Tianhao Wu, Yihan Xi, Seoyoung Lee, Georgios Pavlakos, Zhangyang Wang. [doi]
- Towards training digitally-tied analog blocks via hybrid gradient computationTimothy Nest, Maxence Ernoult. [doi]
- AUCSeg: AUC-oriented Pixel-level Long-tail Semantic SegmentationBoyu Han, Qianqian Xu, Zhiyong Yang 0001, Shilong Bao, Peisong Wen, Yangbangyan Jiang, Qingming Huang. [doi]
- Efficient Availability Attacks against Supervised and Contrastive Learning SimultaneouslyYihan Wang, Yifan Zhu, Xiao-Shan Gao. [doi]
- Geometric Exploitation for Indoor Panoramic Semantic SegmentationDinh Duc Cao, Seok-Joon Kim, Kyusung Cho. [doi]
- Unleashing Region Understanding in Intermediate Layers for MLLM-based Referring Expression GenerationYaoyuan Liang, Zhuojun Cai, Jian Xu, Guanbo Huang, Yiran Wang, Xiao Liang, Jiahao Liu, Ziran Li, Jingang Wang, Shao-Lun Huang. [doi]
- Group and Shuffle: Efficient Structured Orthogonal ParametrizationMikhail Gorbunov, Nikolay Yudin, Vera Soboleva, Aibek Alanov, Alexey Naumov, Maxim Rakhuba. [doi]
- p Perturbations for Universal RobustnessEnyi Jiang, Gagandeep Singh. [doi]
- REDUCR: Robust Data Downsampling using Class Priority ReweightingWilliam Bankes, George Hughes, Ilija Bogunovic, Zi Wang. [doi]
- Visual Pinwheel Centers Act as Geometric Saliency DetectorsHaixin Zhong, Mingyi Huang, Wei Dai, Haoyu Wang, Anna Roe, Yuguo Yu. [doi]
- Pre-training Differentially Private Models with Limited Public DataZhiqi Bu, Xinwei Zhang 0001, Sheng Zha, Mingyi Hong 0001, George Karypis. [doi]
- Chain-of-Thought Reasoning Without PromptingXuezhi Wang 0002, Denny Zhou. [doi]
- Infusing Self-Consistency into Density Functional Theory Hamiltonian Prediction via Deep Equilibrium ModelsZun Wang, Chang Liu, Nianlong Zou, He Zhang, Xinran Wei, Lin Huang, Lijun Wu, Bin Shao. [doi]
- Lumina-Next : Making Lumina-T2X Stronger and Faster with Next-DiTLe Zhuo, Ruoyi Du, Han Xiao, Yangguang Li, Dongyang Liu, Rongjie Huang, Wenze Liu, Xiangyang Zhu, Fu-Yun Wang, Zhanyu Ma, Xu Luo, Zehan Wang 0001, Kaipeng Zhang, Lirui Zhao, Si Liu 0001, Xiangyu Yue 0001, Wanli Ouyang, Yu Qiao 0001, Hongsheng Li 0001, Peng Gao 0007. [doi]
- Accurate and Steady Inertial Pose Estimation through Sequence Structure Learning and ModulationYinghao Wu, Chaoran Wang, Lu Yin, Shihui Guo, Yipeng Qin. [doi]
- Optimal deep learning of holomorphic operators between Banach spacesBen Adcock, Nick C. Dexter, Sebastian Moraga Scheuermann. [doi]
- KptLLM: Unveiling the Power of Large Language Model for Keypoint ComprehensionJie Yang, Wang Zeng, Sheng Jin 0007, Lumin Xu, Wentao Liu 0002, Chen Qian 0006, Ruimao Zhang. [doi]
- Even Sparser Graph TransformersHamed Shirzad, Honghao Lin, Balaji Venkatachalam, Ameya Velingker, David P. Woodruff, Danica J. Sutherland. [doi]
- Revive Re-weighting in Imbalanced Learning by Density Ratio EstimationJiaan Luo, Feng Hong 0004, Jiangchao Yao, Bo Han 0003, Ya Zhang 0002, Yanfeng Wang 0001. [doi]
- $\text{Di}^2\text{Pose}$: Discrete Diffusion Model for Occluded 3D Human Pose EstimationWeiquan Wang, Jun Xiao 0001, Chunping Wang, Wei Liu, Zhao Wang, Long Chen. [doi]
- In-Context Learning with Representations: Contextual Generalization of Trained TransformersTong Yang, Yu Huang 0023, Yingbin Liang, Yuejie Chi. [doi]
- DevBench: A multimodal developmental benchmark for language learningAlvin W. M. Tan, Chunhua Yu, Bria Long, Wanjing Ma, Tonya Murray, Rebecca D. Silverman, Jason D. Yeatman, Michael C. Frank. [doi]
- Vision-Language Models are Strong Noisy Label DetectorsTong Wei, Hao-tian Li, Chun-shu Li, Jiang-Xin Shi, Yu-Feng Li, Min-Ling Zhang. [doi]
- Solving Sparse \& High-Dimensional-Output Regression via CompressionRenyuan Li, Zhehui Chen, Guanyi Wang. [doi]
- INQUIRE: A Natural World Text-to-Image Retrieval BenchmarkEdward Vendrow, Omiros Pantazis, Alexander Shepard, Gabriel J. Brostow, Kate E. Jones, Oisin Mac Aodha, Sara Beery, Grant Van Horn. [doi]
- NeuralSteiner: Learning Steiner Tree for Overflow-avoiding Global Routing in Chip DesignRuizhi Liu, Zhisheng Zeng, Shizhe Ding, Jingyan Sui, Xingquan Li, Dongbo Bu. [doi]
- Adaptive Randomized Smoothing: Certified Adversarial Robustness for Multi-Step DefencesSaiyue Lyu, Shadab Shaikh, Frederick Shpilevskiy, Evan Shelhamer, Mathias Lécuyer. [doi]
- Streaming Long Video Understanding with Large Language ModelsRui Qian, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Shuangrui Ding, Dahua Lin, Jiaqi Wang. [doi]
- Learning Structure-Aware Representations of Dependent TypesKonstantinos Kogkalidis, Orestis Melkonian, Jean-Philippe Bernardy. [doi]
- Practical 0.385-Approximation for Submodular Maximization Subject to a Cardinality ConstraintMurad Tukan, Loay Mualem, Moran Feldman. [doi]
- Asymptotics of Alpha-Divergence Variational Inference Algorithms with Exponential FamiliesFrançois Bertholom, Randal Douc, François Roueff. [doi]
- The Edge-of-Reach Problem in Offline Model-Based Reinforcement LearningAnya Sims, Cong Lu, Jakob Foerster, Yee Whye Teh. [doi]
- Unveiling LoRA Intrinsic Ranks via Salience AnalysisWenjun Ke, Jiahao Wang, Peng Wang, Jiajun Liu, Dong Nie, Guozheng Li, Yining Li. [doi]
- Cross-Modality Perturbation Synergy Attack for Person Re-identificationYunpeng Gong, Zhun Zhong, Yansong Qu, Zhiming Luo, Rongrong Ji, Min Jiang 0005. [doi]
- Optimizing the coalition gain in Online Auctions with Greedy Structured BanditsDorian Baudry, Hugo Richard, Maria Cherifa, Vianney Perchet, Clément Calauzènes. [doi]
- Meta-Reinforcement Learning with Universal Policy Adaptation: Provable Near-Optimality under All-task Optimum ComparatorSiyuan Xu, Minghui Zhu. [doi]
- Open-Book Neural Algorithmic ReasoningHefei Li, Chao Peng 0004, Chenyang Xu, Zhengfeng Yang. [doi]
- RobIR: Robust Inverse Rendering for High-Illumination ScenesZiyi Yang, Chenyanzhen, Xinyu Gao, Yazhen Yuan, Yu Wu, Xiaowei Zhou, Xiaogang Jin 0001. [doi]
- AsCAN: Asymmetric Convolution-Attention Networks for Efficient Recognition and GenerationAnil Kag, n n, Jierun Chen, Junli Cao, Willi Menapace, Aliaksandr Siarohin, Sergey Tulyakov, Jian Ren 0005. [doi]
- A Swiss Army Knife for Heterogeneous Federated Learning: Flexible Coupling via Trace NormTianchi Liao, Lele Fu, Jialong Chen, Zhen Wang, Zibin Zheng, Chuan Chen 0001. [doi]
- Hypothesis Testing the Circuit Hypothesis in LLMsClaudia Shi, Nicolas Beltran-Velez, Achille Nazaret, Carolina Zheng, Adrià Garriga-Alonso, Andrew Jesson, Maggie Makar, David M. Blei. [doi]
- Temporal Graph Neural Tangent Kernel with Graphon-GuaranteedKatherine Tieu, Dongqi Fu, Yada Zhu, Hendrik F. Hamann, Jingrui He. [doi]
- ART: Automatic Red-teaming for Text-to-Image Models to Protect Benign UsersGuanlin Li, Kangjie Chen, Shudong Zhang, Jie Zhang, Tianwei Zhang. [doi]
- Continuous Contrastive Learning for Long-Tailed Semi-Supervised RecognitionZi-Hao Zhou, Siyuan Fang, Zi-Jing Zhou, Tong Wei 0001, Yuanyu Wan, Min-Ling Zhang. [doi]
- Improving Linear System Solvers for Hyperparameter Optimisation in Iterative Gaussian ProcessesJihao Andreas Lin, Shreyas Padhy, Bruno Mlodozeniec, Javier Antorán, José Miguel Hernández-Lobato. [doi]
- Fairness and Efficiency in Online Class MatchingMohammadTaghi Hajiaghayi, Shayan Chashm Jahan, Mohammad Sharifi, Suho Shin, Max Springer. [doi]
- VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real TimeSicheng Xu, Guojun Chen, Yu-Xiao Guo 0001, Jiaolong Yang, Chong Li, Zhenyu Zang, Yizhong Zhang, Xin Tong 0001, Baining Guo. [doi]
- Transfer Learning for Latent Variable Network ModelsAkhil Jalan, Arya Mazumdar, Soumendu Sundar Mukherjee, Purnamrita Sarkar. [doi]
- Direct3D: Scalable Image-to-3D Generation via 3D Latent Diffusion TransformerShuang Wu, Youtian Lin, Yifei Zeng, Feihu Zhang, Jingxi Xu 0001, Philip Torr 0001, Xun Cao, Yao Yao 0008. [doi]
- Efficient Recurrent Off-Policy RL Requires a Context-Encoder-Specific Learning RateFan-Ming Luo, Zuolin Tu, Zefang Huang, Yang Yu 0001. [doi]
- Does Video-Text Pretraining Help Open-Vocabulary Online Action Detection?Qingsong Zhao, Yi Wang 0074, Jilan Xu, Yinan He, Zifan Song, Limin Wang 0002, Yu Qiao 0001, Cairong Zhao. [doi]
- Medformer: A Multi-Granularity Patching Transformer for Medical Time-Series ClassificationYihe Wang, Nan Huang, Taida Li, Yujun Yan, Xiang Zhang 0012. [doi]
- IDGen: Item Discrimination Induced Prompt Generation for LLM EvaluationFan Lin, Shuyi Xie, Yong Dai, Wenlin Yao, Tianjiao Lang, Yu Zhang 0004. [doi]
- Semi-Open 3D Object Retrieval via Hierarchical Equilibrium on HypergraphYang Xu, Yifan Feng, Jun Zhang, Jun-Hai Yong, Yue Gao. [doi]
- Statistical and Geometrical properties of the Kernel Kullback-Leibler divergenceAnna Korba, Francis Bach, Clémentine Chazal. [doi]
- Elo Uncovered: Robustness and Best Practices in Language Model EvaluationMeriem Boubdir, Edward Kim, Beyza Ermis, Sara Hooker, Marzieh Fadaee. [doi]
- Adapting Diffusion Models for Improved Prompt Compliance and Controllable Image SynthesisDeepak Sridhar, Abhishek Peri, Rohith Rachala, Nuno Vasconcelos. [doi]
- Scaling Laws in Linear Regression: Compute, Parameters, and DataLicong Lin, Jingfeng Wu, Sham M. Kakade, Peter L. Bartlett, Jason D. Lee. [doi]
- Collaboration! Towards Robust Neural Methods for Routing ProblemsJianan Zhou 0002, Yaoxin Wu, Zhiguang Cao, Wen Song, Jie Zhang 0002, Zhiqi Shen 0001. [doi]
- Quantum algorithm for large-scale market equilibrium computationPo-Wei Huang, Patrick Rebentrost. [doi]
- Fundamental Limits of Prompt Compression: A Rate-Distortion Framework for Black-Box Language ModelsAlliot Nagle, Adway Girish, Marco Bondaschi, Michael Gastpar, Ashok Vardhan Makkuva, Hyeji Kim. [doi]
- When to Sense and Control? A Time-adaptive Approach for Continuous-Time RLLenart Treven, Bhavya Sukhija, Yarden As, Florian Dörfler, Andreas Krause 0001. [doi]
- NeuralFluid: Nueral Fluidic System Design and Control with Differentiable SimulationYifei Li 0002, YuChen Sun, Pingchuan Ma 0002, Eftychios Sifakis, Tao Du 0001, Bo Zhu 0002, Wojciech Matusik. [doi]
- $SE(3)$ Equivariant Ray Embeddings for Implicit Multi-View Depth EstimationYinshuang Xu, Dian Chen 0005, Katherine Liu, Sergey Zakharov, Rares Ambrus, Kostas Daniilidis, Vitor Guizilini. [doi]
- ReMoDetect: Reward Models Recognize Aligned LLM's GenerationsHyunseok Lee, Jihoon Tack, Jinwoo Shin. [doi]
- When to Act and When to Ask: Policy Learning With Deferral Under Hidden ConfoundingMarah Ghoummaid, Uri Shalit. [doi]
- InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HDXiaoyi Dong, Pan Zhang 0001, Yuhang Zang, Yuhang Cao, Bin Wang 0065, Linke Ouyang, Songyang Zhang, Haodong Duan, Wenwei Zhang, Yining Li, Hang Yan 0001, Yang Gao, Zhe Chen 0017, Xinyue Zhang 0005, Wei Li 0044, Jingwen Li, Wenhai Wang, Kai Chen 0026, Conghui He, Xingcheng Zhang, Jifeng Dai, Yu Qiao 0001, Dahua Lin, Jiaqi Wang 0003. [doi]
- Accelerating Diffusion Models with Parallel Sampling: Inference at Sub-Linear Time ComplexityHaoxuan Chen, Yinuo Ren, Lexing Ying, Grant M. Rotskoff. [doi]
- Reshuffling Resampling Splits Can Improve Generalization of Hyperparameter OptimizationThomas Nagler, Lennart Schneider, Bernd Bischl, Matthias Feurer. [doi]
- LoRANN: Low-Rank Matrix Factorization for Approximate Nearest Neighbor SearchElias Jääsaari, Ville Hyvönen, Teemu Roos. [doi]
- Uniform Last-Iterate Guarantee for Bandits and Reinforcement LearningJunyan Liu, Yunfan Li, Ruosong Wang, Lin Yang. [doi]
- Understanding and Improving Training-free Loss-based Diffusion GuidanceYifei Shen, Xinyang Jiang, Yifan Yang, Yezhen Wang, Dongqi Han, Dongsheng Li. [doi]
- Vocal Call Locator Benchmark (VCL) for localizing rodent vocalizations from multi-channel audioRalph Peterson, Aramis Tanelus, Christopher Ick, Bartul Mimica, Niegil Francis Muttath Joseph, Violet Ivan, Aman Choudhri, Annegret Falkner, Mala Murthy, David Schneider, Dan Sanes, Alex Williams. [doi]
- Task-Agnostic Machine-Learning-Assisted InferenceJiacheng Miao, Qiongshi Lu. [doi]
- Distributional regression: CRPS-error bounds for model fitting, model selection and convex aggregationDombry Clement, Ahmed Zaoui. [doi]
- Asynchronous Perception Machine for Efficient Test Time TrainingRajat Modi, Yogesh S. Rawat. [doi]
- VLM Agents Generate Their Own Memories: Distilling Experience into Embodied Programs of ThoughtGabriel Sarch, Lawrence Jang, Michael J. Tarr, William W. Cohen, Kenneth Marino, Katerina Fragkiadaki. [doi]
- NeuralClothSim: Neural Deformation Fields Meet the Thin Shell TheoryNavami Kairanda, Marc Habermann, Christian Theobalt, Vladislav Golyanik. [doi]
- Multistable Shape from Shading Emerges from Patch DiffusionXinran Nicole Han, Todd E. Zickler, Ko Nishino. [doi]
- Repurposing Language Models into Embedding Models: Finding the Compute-Optimal RecipeAlbert Q. Jiang, Alicja Ziarko, Bartosz Piotrowski, Wenda Li, Mateja Jamnik, Piotr Milos. [doi]
- PaCE: Parsimonious Concept Engineering for Large Language ModelsJinqi Luo, Tianjiao Ding, Kwan Ho Ryan Chan, Darshan Thaker, Aditya Chattopadhyay, Chris Callison-Burch, René Vidal. [doi]
- Deep Learning for Computing Convergence Rates of Markov ChainsYanlin Qu, Jose H. Blanchet, Peter W. Glynn. [doi]
- Building Timeseries Dataset: Empowering Large-Scale Building AnalyticsArian Prabowo, Xiachong Lin, Imran Razzak, Hao Xue 0001, Emily W. Yap, Matthew Amos, Flora D. Salim. [doi]
- A Bayesian Approach to Data Point SelectionXinnuo Xu, Minyoung Kim 0001, Royson Lee, Brais Martínez, Timothy M. Hospedales. [doi]
- HumanVid: Demystifying Training Data for Camera-controllable Human Image AnimationZhenzhi Wang 0001, Yixuan Li 0002, Yanhong Zeng, Youqing Fang, Yuwei Guo 0002, Wenran Liu, Jing Tan 0002, Kai Chen 0026, Tianfan Xue, Bo Dai 0002, Dahua Lin. [doi]
- Clustering then Propagation: Select Better Anchors for Knowledge Graph EmbeddingKe Liang 0006, Yue Liu 0008, Hao Li 0025, Lingyuan Meng, Suyuan Liu, Siwei Wang 0001, Sihang Zhou 0001, Xinwang Liu 0002. [doi]
- Markov Equivalence and Consistency in Differentiable Structure LearningChang Deng, Kevin Bello, Pradeep Ravikumar, Bryon Aragam. [doi]
- TimeXer: Empowering Transformers for Time Series Forecasting with Exogenous VariablesYuxuan Wang, Haixu Wu, Jiaxiang Dong, Guo Qin, Haoran Zhang, Yong Liu, Yunzhong Qiu, Jianmin Wang 0001, Mingsheng Long. [doi]
- Temporal Sentence Grounding with Relevance Feedback in VideosJianfeng Dong, Xiaoman Peng, Daizong Liu, Xiaoye Qu, Xun Yang 0001, Cuizhu Bao, Meng Wang 0001. [doi]
- CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language ModelsPeng Xia, Ze Chen, Juanxi Tian, Yangrui Gong, Ruibo Hou, Yue Xu, Zhenbang Wu, Zhiyuan Fan, Yiyang Zhou, Kangyu Zhu, Wenhao Zheng, Zhaoyang Wang, Xiao Wang, Xuchao Zhang, Chetan Bansal, Marc Niethammer, JunZhou Huang, Hongtu Zhu, Yun Li, Jimeng Sun 0001, ZongYuan Ge, Gang Li, James Y. Zou, Huaxiu Yao. [doi]
- Co-occurrence is not Factual Association in Language ModelsXiao Zhang, Miao Li, Ji Wu. [doi]
- PINNacle: A Comprehensive Benchmark of Physics-Informed Neural Networks for Solving PDEsZhongkai Hao, Jiachen Yao, Chang Su, Hang Su 0006, Ziao Wang, Fanzhi Lu, Zeyu Xia, Yichi Zhang, Songming Liu, Lu Lu, Jun Zhu 0001. [doi]
- Autoregressive Image Generation without Vector QuantizationTianhong Li, Yonglong Tian, He Li, Mingyang Deng, Kaiming He. [doi]
- UniDSeg: Unified Cross-Domain 3D Semantic Segmentation via Visual Foundation Models PriorYao Wu, Mingwei Xing, Yachao Zhang 0001, Xiaotong Luo, Yuan Xie 0006, Yanyun Qu. [doi]
- Fast Tree-Field Integrators: From Low Displacement Rank to Topological TransformersKrzysztof Marcin Choromanski, Arijit Sehanobish, Somnath Basu Roy Chowdhury, Han Lin, Kumar Avinava Dubey, Tamás Sarlós, Snigdha Chaturvedi. [doi]
- Dual Cone Gradient Descent for Training Physics-Informed Neural NetworksYoungsik Hwang, Dong Young Lim. [doi]
- OASIS: Conditional Distribution Shaping for Offline Safe Reinforcement LearningYihang Yao, Zhepeng Cen, Wenhao Ding, Haohong Lin, Shiqi Liu 0005, Tingnan Zhang, Wenhao Yu 0009, Ding Zhao. [doi]
- LCM: Locally Constrained Compact Point Cloud Model for Masked Point ModelingYaohua Zha, Naiqi Li, Yanzi Wang, Tao Dai 0001, Hang Guo, Bin Chen, Zhi Wang 0001, Zhihao Ouyang, Shu-Tao Xia. [doi]
- Voxel Mamba: Group-Free State Space Models for Point Cloud based 3D Object DetectionGuowen Zhang, Lue Fan, Chenhang He, Zhen Lei 0001, Zhaoxiang Zhang 0001, Lei Zhang 0006. [doi]
- Graph Neural Flows for Unveiling Systemic Interactions Among Irregularly Sampled Time SeriesGiangiacomo Mercatali, André Freitas, Jie Chen. [doi]
- Persistence Homology Distillation for Semi-supervised Continual LearningYan Fan, Yu Wang 0106, Pengfei Zhu 0001, Dongyue Chen, Qinghua Hu. [doi]
- A benchmark for prediction of transcriptomic responses to chemical perturbations across cell typesArtur Szalata, Andrew Benz, Robrecht Cannoodt, Mauricio Cortes, Jason Fong, Sunil Kuppasani, Richard Lieberman, Tianyu Liu, Javier Mas-Rosario, Rico Meinl, Jalil Nourisa, Jared Tumiel, Tin M. Tunjic, Mengbo Wang, Noah Weber, Hongyu Zhao, Benedict Anchang, Fabian J. Theis, Malte Luecken, Daniel Burkhardt. [doi]
- STL: Still Tricky Logic (for System Validation, Even When Showing Your Work)Isabelle Hurley, Rohan Paleja, Ashley Suh 0001, Jaime Daniel Peña, Ho Chit Siu. [doi]
- SciCode: A Research Coding Benchmark Curated by ScientistsMinyang Tian, Luyu Gao, Shizhuo Dylan Zhang, Xinan Chen, Cunwei Fan, Xuefei Guo, Roland Haas, Pan Ji, Kittithat Krongchon, Yao Li, Shengyan Liu, Di Luo, Yutao Ma, Hao Tong, Kha Trinh, Chenyu Tian, Zihan Wang, Bohao Wu, Shengzhu Yin, Minhui Zhu, Kilian Lieret, Yanxin Lu, Genglin Liu, Yufeng Du, Tianhua Tao, Ofir Press, Jamie Callan, Eliu A. Huerta, Hao Peng. [doi]
- Exocentric-to-Egocentric Video GenerationJia-Wei Liu, Weijia Mao, Zhongcong Xu, Jussi Keppo, Mike Zheng Shou. [doi]
- Fair Secretaries with Unfair PredictionsEric Balkanski, Will Ma, Andreas Maggiori. [doi]
- MixEval: Deriving Wisdom of the Crowd from LLM Benchmark MixturesJinjie Ni, Fuzhao Xue, Xiang Yue, Yuntian Deng, Mahir Shah, Kabir Jain, Graham Neubig, Yang You 0001. [doi]
- SegVol: Universal and Interactive Volumetric Medical Image SegmentationYuxin Du, Fan Bai 0008, Tiejun Huang 0001, Bo Zhao. [doi]
- UniBench: Visual Reasoning Requires Rethinking Vision-Language Beyond ScalingHaider Al-Tahan, Quentin Garrido, Randall Balestriero, Diane Bouchacourt, Caner Hazirbas, Mark Ibrahim. [doi]
- Shadowheart SGD: Distributed Asynchronous SGD with Optimal Time Complexity Under Arbitrary Computation and Communication HeterogeneityAlexander Tyurin, Marta Pozzi, Ivan Ilin, Peter Richtárik. [doi]
- Optimal-state Dynamics Estimation for Physics-based Human Motion Capture from VideosCuong Le, John Viktor Johansson, Manon Kok, Bastian Wandt. [doi]
- InfoRM: Mitigating Reward Hacking in RLHF via Information-Theoretic Reward ModelingYuchun Miao, Sen Zhang, Liang Ding 0006, Rong Bao, Lefei Zhang, Dacheng Tao. [doi]
- FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity RefinerWenliang Zhao, Minglei Shi, Xumin Yu, Jie Zhou 0001, Jiwen Lu. [doi]
- Provable and Efficient Dataset Distillation for Kernel Ridge RegressionYilan Chen 0002, Wei Huang 0034, Lily Weng. [doi]
- General Articulated Objects Manipulation in Real Images via Part-Aware Diffusion ProcessZhou Fang, Yong-Lu Li 0001, Lixin Yang 0001, Cewu Lu. [doi]
- Towards Unified Multimodal Editing with Enhanced Knowledge CollaborationKaihang Pan, Zhaoyu Fan, Juncheng Li 0006, Qifan Yu, Hao Fei 0001, Siliang Tang, Richang Hong, Hanwang Zhang, Qianru Sun. [doi]
- Binocular-Guided 3D Gaussian Splatting with View Consistency for Sparse View SynthesisLiang Han, Junsheng Zhou, Yu-Shen Liu, Zhizhong Han. [doi]
- CausalChaos! Dataset for Comprehensive Causal Action Question Answering Over Longer Causal Chains Grounded in Dynamic Visual ScenesParitosh Parmar, Eric Peh, Ruirui Chen 0002, Ting En Lam, Yuhan Chen, Elston Tan, Basura Fernando. [doi]
- Learning Infinitesimal Generators of Continuous Symmetries from DataGyeonghoon Ko, Hyunsu Kim, Juho Lee 0001. [doi]
- Unelicitable Backdoors via Cryptographic Transformer CircuitsAndis Draguns, Andrew Gritsevskiy, Sumeet Ramesh Motwani, Christian Schröder de Witt. [doi]
- Personalized Federated Learning with Mixture of Models for Adaptive Prediction and Model Fine-TuningPouya M. Ghari, Yanning Shen. [doi]
- Amortized Active Causal Induction with Deep Reinforcement LearningYashas Annadani, Panagiotis Tigas, Stefan Bauer, Adam Foster 0001. [doi]
- Active Learning of General Halfspaces: Label Queries vs Membership QueriesIlias Diakonikolas, Daniel M. Kane, Mingchen Ma. [doi]
- Safe Time-Varying Optimization based on Gaussian Processes with Spatio-Temporal KernelJialin Li, Marta Zagórowska, Giulia De Pasquale, Alisa Rupenyan, John Lygeros. [doi]
- Time-Reversal Provides Unsupervised Feedback to LLMsYerram Varun, Rahul Madhavan, Sravanti Addepalli, Arun Sai Suggala, Karthikeyan Shanmugam, Prateek Jain 0002. [doi]
- Kaleidoscope: Learnable Masks for Heterogeneous Multi-agent Reinforcement LearningXinran Li, Ling Pan, Jun Zhang. [doi]
- Local Curvature Smoothing with Stein's Identity for Efficient Score MatchingGenki Osada, Makoto Shing, Takashi Nishide. [doi]
- Offline Behavior DistillationShiye Lei, Sen Zhang 0006, Dacheng Tao. [doi]
- Fast Proxy Experiment Design for Causal Effect IdentificationSepehr Elahi, Sina Akbari, Jalal Etesami, Negar Kiyavash, Patrick Thiran. [doi]
- TableRAG: Million-Token Table Understanding with Language ModelsSi-An Chen, Lesly Miculicich, Julian Eisenschlos, Zifeng Wang 0002, Zilong Wang 0002, Yanfei Chen, Yasuhisa Fujii, Hsuan-Tien Lin, Chen-Yu Lee, Tomas Pfister. [doi]
- Learning to Shape In-distribution Feature Space for Out-of-distribution DetectionYonggang Zhang 0003, Jie Lu 0001, Bo Peng, Zhen Fang 0001, Yiu-ming Cheung. [doi]
- The Scandinavian Embedding Benchmarks: Comprehensive Assessment of Multilingual and Monolingual Text EmbeddingKenneth C. Enevoldsen, Márton Kardos, Niklas Muennighoff, Kristoffer L. Nielbo. [doi]
- OccFusion: Rendering Occluded Humans with Generative Diffusion PriorsAdam Sun, Tiange Xiang, Scott L. Delp, Li Fei-Fei 0001, Ehsan Adeli 0001. [doi]
- LM-HT SNN: Enhancing the Performance of SNN to ANN Counterpart through Learnable Multi-hierarchical Threshold ModelZecheng Hao, Xinyu Shi, Yujia Liu, Zhaofei Yu, Tiejun Huang 0001. [doi]
- CONTRAST: Continual Multi-source Adaptation to Dynamic DistributionsSk Miraj Ahmed, Fahim Faisal Niloy, Xiangyu Chang, Dripta S. Raychaudhuri, Samet Oymak, Amit K. Roy Chowdhury. [doi]
- FilterNet: Harnessing Frequency Filters for Time Series ForecastingKun Yi 0001, Jingru Fei, Qi Zhang 0020, Hui He, Shufeng Hao, Defu Lian, Wei Fan 0010. [doi]
- Differentiable Modal Synthesis for Physical Modeling of Planar String Sound and Motion SimulationJin Woo Lee, Jaehyun Park, Min-Jun Choi, Kyogu Lee. [doi]
- Bias and Volatility: A Statistical Framework for Evaluating Large Language Model's Stereotypes and the Associated Generation InconsistencyYiran Liu, Ke Yang, Zehan Qi, Xiao Liu, Yang Yu, Cheng Xiang Zhai. [doi]
- FLAME : Factuality-Aware Alignment for Large Language ModelsSheng-Chieh Lin, Luyu Gao, Barlas Oguz, Wenhan Xiong, Jimmy Lin, Scott Yih, Xilun Chen 0002. [doi]
- Learning Neural Contracting Dynamics: Extended Linearization and Global GuaranteesSean Jaffe, Alexander Davydov 0001, Deniz Lapsekili, Ambuj K. Singh, Francesco Bullo. [doi]
- Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous DrivingJianbiao Mei, Yukai Ma, Xuemeng Yang, Licheng Wen, Xinyu Cai, Xin Li, Daocheng Fu, Bo Zhang, Pinlong Cai, Min Dou, Botian Shi, Liang He, Yong Liu, Yu Qiao. [doi]
- Persistent Test-time Adaptation in Recurring Testing ScenariosTrung-Hieu Hoang, MinhDuc Vo, Minh Do. [doi]
- Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale PredictionKeyu Tian, Yi Jiang, Zehuan Yuan, Bingyue Peng, Liwei Wang. [doi]
- ACES: Generating a Diversity of Challenging Programming Puzzles with Autotelic Generative ModelsJulien Pourcel, Cédric Colas, Gaia Molinaro, Pierre-Yves Oudeyer, Laetitia Teodorescu. [doi]
- LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-TuningRui Pan, Xiang Liu, Shizhe Diao, Renjie Pi, Jipeng Zhang, Chi Han, Tong Zhang 0001. [doi]
- Contextual Linear Optimization with Bandit FeedbackYichun Hu, Nathan Kallus, Xiaojie Mao, Yanchen Wu. [doi]
- REBEL: Reinforcement Learning via Regressing Relative RewardsZhaolin Gao, Jonathan D. Chang, Wenhao Zhan, Owen Oertell, Gokul Swamy, Kianté Brantley, Thorsten Joachims, Drew Bagnell, Jason D. Lee, Wen Sun 0002. [doi]
- PediatricsGPT: Large Language Models as Chinese Medical Assistants for Pediatric ApplicationsDingkang Yang, Jinjie Wei, Dongling Xiao, Shunli Wang 0001, Tong Wu, Gang Li, Mingcheng Li, Shuaibing Wang, Jiawei Chen, Yue Jiang, Qingyao Xu, Ke Li, Peng Zhai, Lihua Zhang. [doi]
- The Ladder in Chaos: Improving Policy Learning by Harnessing the Parameter Evolving Path in A Low-dimensional SpaceHongyao Tang, Min Zhang, Chen Chen, Jianye Hao. [doi]
- Idiographic Personality Gaussian Process for Psychological AssessmentYehu Chen, Muchen Xi, Joshua Jackson, Jacob M. Montgomery, Roman Garnett. [doi]
- Ordering-Based Causal Discovery for Linear and Nonlinear RelationsZhuopeng Xu, Yujie Li, Cheng Liu, Ning Gui. [doi]
- LinNet: Linear Network for Efficient Point Cloud Representation LearningHao Deng, Kunlei Jing, Shengmei Chen, Cheng Liu, Jiawei Ru, Bo Jiang 0014, Lin Wang 0026. [doi]
- Splatter a Video: Video Gaussian Representation for Versatile ProcessingYang-Tian Sun, Yihua Huang 0002, Lin Ma, Xiaoyang Lyu, Yan-Pei Cao, Xiaojuan Qi 0001. [doi]
- Archaeoscape: Bringing Aerial Laser Scanning Archaeology to the Deep Learning EraYohann Perron, Vladyslav Sydorov, Adam P. Wijker, Damian Evans, Christophe Pottier, Loïc Landrieu. [doi]
- Barely Random Algorithms and Collective Metrical Task SystemsRomain Cosson, Laurent Massoulié. [doi]
- Sample Complexity of Posted Pricing for a Single ItemBilly Jin, Thomas Kesselheim, Will Ma, Sahil Singla 0001. [doi]
- Vista: A Generalizable Driving World Model with High Fidelity and Versatile ControllabilityShenyuan Gao, Jiazhi Yang, Li Chen 0008, Kashyap Chitta, Yihang Qiu, Andreas Geiger 0001, Jun Zhang, Hongyang Li 0001. [doi]
- Scaling Sign Language TranslationBiao Zhang 0006, Garrett Tanzer, Orhan Firat. [doi]
- Privacy without Noisy Gradients: Slicing Mechanism for Generative Model TrainingKristjan H. Greenewald, Yuancheng Yu, Hao Wang, Kai Xu. [doi]
- Sharing Key Semantics in Transformer Makes Efficient Image RestorationBin Ren, Yawei Li 0001, Jingyun Liang, Rakesh Ranjan, Mengyuan Liu, Rita Cucchiara, Luc Van Gool, Ming-Hsuan Yang 0001, Nicu Sebe. [doi]
- WikiDO: A New Benchmark Evaluating Cross-Modal Retrieval for Vision-Language ModelsTankala Pavan Kalyan, Piyush Singh Pasi, Sahil Dharod, Azeem Motiwala, Preethi Jyothi, Aditi Chaudhary, Krishna Srinivasan. [doi]
- Molecule Generation with Fragment Retrieval AugmentationSeul Lee, Karsten Kreis, Srimukh Prasad Veccham, Meng Liu 0015, Danny Reidenbach, Saee Paliwal, Arash Vahdat, Weili Nie. [doi]
- Statistical Efficiency of Distributional Temporal Difference LearningYang Peng, Liangyu Zhang, Zhihua Zhang. [doi]
- Cooperative Hardware-Prompt Learning for Snapshot Compressive ImagingJiamian Wang, Zongliang Wu, Yulun Zhang, Xin Yuan, Tao Lin, Zhiqiang Tao. [doi]
- Efficient Sign-Based Optimization: Accelerating Convergence via Variance ReductionWei Jiang, Sifan Yang, Wenhao Yang, Lijun Zhang. [doi]
- Gradient Methods for Online DR-Submodular Maximization with Stochastic Long-Term ConstraintsGuanyu Nie, Vaneet Aggarwal, Christopher J. Quinn. [doi]
- ODGEN: Domain-specific Object Detection Data Generation with Diffusion ModelsJingyuan Zhu, Shiyu Li, Yuxuan Liu, Jian Yuan, Ping Huang, Jiulong Shan, Huimin Ma 0001. [doi]
- The PRISM Alignment Dataset: What Participatory, Representative and Individualised Human Feedback Reveals About the Subjective and Multicultural Alignment of Large Language ModelsHannah Rose Kirk, Alexander Whitefield, Paul Röttger, Andrew M. Bean, Katerina Margatina, Rafael Mosquera Gómez, Juan Ciro, Max Bartolo, Adina Williams, He He 0001, Bertie Vidgen, Scott Hale. [doi]
- Remix-DiT: Mixing Diffusion Transformers for Multi-Expert DenoisingGongfan Fang, Xinyin Ma, Xinchao Wang. [doi]
- DETAIL: Task DEmonsTration Attribution for Interpretable In-context LearningZijian Zhou, Xiaoqiang Lin, Xinyi Xu, Alok Prakash, Daniela Rus, Bryan Kian Hsiang Low. [doi]
- A Simple and Optimal Approach for Universal Online Learning with Gradient VariationsYu-Hu Yan, Peng Zhao 0006, Zhi-Hua Zhou. [doi]
- Learning-Augmented Approximation Algorithms for Maximum Cut and Related ProblemsVincent Cohen-Addad, Tommaso d'Orsi, Anupam Gupta 0001, Euiwoong Lee, Debmalya Panigrahi. [doi]
- Diffusion Twigs with Loop Guidance for Conditional Graph GenerationGiangiacomo Mercatali, Yogesh Verma, André Freitas, Vikas Garg 0001. [doi]
- On Learning Multi-Modal Forgery Representation for Diffusion Generated Video DetectionXiufeng Song, Xiao Guo, Jiache Zhang, Qirui Li, Lei Bai 0001, Xiaoming Liu 0002, Guangtao Zhai, Xiaohong Liu 0001. [doi]
- 3D Gaussian Rendering Can Be Sparser: Efficient Rendering via Learned Fragment PruningZhifan Ye, Chenxi Wan, Chaojian Li, Jihoon Hong, Sixu Li, Leshu Li, Yongan Zhang, Yingyan (Celine) Lin. [doi]
- Learning to Predict Structural VibrationsJan van Delden, Julius Schultz, Christopher Blech, Sabine C. Langer, Timo Lüddecke. [doi]
- Large Language Models' Expert-level Global History Knowledge Benchmark (HiST-LLM)Jakob Hauser, Dániel Kondor, Jenny Reddish, Majid Benam, Enrico Cioni, Federica Villa, James Bennett, Daniel Hoyer, Pieter Francois, Peter Turchin, R. Maria Del Rio Chanona. [doi]
- FactorSim: Generative Simulation via Factorized RepresentationFan-Yun Sun, S. I. Harini, Angela Yi, Yihan Zhou, Alex Zook, Jonathan Tremblay, Logan Cross, Jiajun Wu 0001, Nick Haber. [doi]
- RMLR: Extending Multinomial Logistic Regression into General GeometriesZiheng Chen, Yue Song, Rui Wang, Xiaojun Wu, Nicu Sebe. [doi]
- Randomized Strategic Facility Location with PredictionsEric Balkanski, Vasilis Gkatzelis, Golnoosh Shahkarami. [doi]
- Connectivity Shapes Implicit Regularization in Matrix Factorization Models for Matrix CompletionZhiwei Bai, Jiajie Zhao, Yaoyu Zhang. [doi]
- Aligning Individual and Collective Objectives in Multi-Agent CooperationYang Li 0116, Wenhao Zhang, Jianhong Wang, Shao Zhang, Yali Du 0001, Ying Wen 0001, Wei Pan 0004. [doi]
- Constrained Latent Action Policies for Model-Based Offline Reinforcement LearningMarvin Alles, Philip Becker-Ehmck, Patrick van der Smagt, Maximilian Karl. [doi]
- Bayesian Strategic ClassificationLee Cohen 0001, Saeed Sharifi-Malvajerdi, Kevin Stangl, Ali Vakilian, Juba Ziani. [doi]
- How Control Information Influences Multilingual Text Image Generation and Editing?Boqiang Zhang, Zuan Gao, Yadong Qu, Hongtao Xie. [doi]
- Sample Efficient Bayesian Learning of Causal Graphs from InterventionsZihan Zhou, Muhammad Qasim Elahi, Murat Kocaoglu. [doi]
- Contextual Decision-Making with Knapsacks Beyond the Worst CaseZhaohua Chen 0001, Rui Ai 0002, Mingwei Yang 0002, Yuqi Pan, Chang Wang 0004, Xiaotie Deng. [doi]
- WaterMax: breaking the LLM watermark detectability-robustness-quality trade-offEva Giboulot, Teddy Furon. [doi]
- Zero-shot Image Editing with Reference ImitationXi Chen, Yutong Feng, Mengting Chen, Yiyang Wang, Shilong Zhang, Yu Liu 0063, Yujun Shen, Hengshuang Zhao. [doi]
- Scaling Continuous Latent Variable Models as Probabilistic Integral CircuitsGennaro Gala, Cassio P. de Campos, Antonio Vergari, Erik Quaeghebeur. [doi]
- RectifID: Personalizing Rectified Flow with Anchored Classifier GuidanceZhicheng Sun 0001, Zhenhao Yang, Yang Jin, Haozhe Chi, Kun Xu 0005, Liwei Chen, Hao Jiang, Yang Song 0008, Kun Gai, Yadong Mu. [doi]
- RefDrop: Controllable Consistency in Image or Video Generation via Reference Feature GuidanceJiaoJiao Fan, Haotian Xue 0002, Qinsheng Zhang, Yongxin Chen. [doi]
- Realizable H-Consistent and Bayes-Consistent Loss Functions for Learning to DeferAnqi Mao, Mehryar Mohri, Yutao Zhong 0002. [doi]
- Deep Graph Neural Networks via Posteriori-Sampling-based Node-Adaptative Residual ModuleJingbo Zhou, Yixuan Du, Ruqiong Zhang, Jun Xia 0001, Zhizhi Yu, Zelin Zang, Di Jin 0001, Carl Yang 0001, Rui Zhang, Stan Z. Li. [doi]
- Scalable Kernel Inverse OptimizationYouyuan Long, Tolga Ok, Pedro Zattoni Scroccaro, Peyman Mohajerin Esfahani. [doi]
- Toward Approaches to Scalability in 3D Human Pose EstimationJun-Hui Kim, Seong-Whan Lee. [doi]
- MAGNET: Improving the Multilingual Fairness of Language Models with Adaptive Gradient-Based TokenizationOrevaoghene Ahia, Sachin Kumar 0009, Hila Gonen, Valentin Hofmann, Tomasz Limisiewicz, Yulia Tsvetkov, Noah A. Smith. [doi]
- Understanding Multi-Granularity for Open-Vocabulary Part SegmentationJiho Choi, Seonho Lee, Seungho Lee, Minhyun Lee, Hyunjung Shim. [doi]
- TFG: Unified Training-Free Guidance for Diffusion ModelsHaotian Ye, Haowei Lin, Jiaqi Han, Minkai Xu, Sheng Liu, Yitao Liang, Jianzhu Ma, James Y. Zou, Stefano Ermon. [doi]
- Fractal Patterns May Illuminate the Success of Next-Token PredictionIbrahim M. Alabdulmohsin, Vinh Q. Tran 0002, Mostafa Dehghani 0001. [doi]
- EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language ModelsRui Zhao, Hangjie Yuan, Yujie Wei, Shiwei Zhang, Yuchao Gu, Lingmin Ran, Xiang Wang, Jay Zhangjie Wu, David Junhao Zhang, Yingya Zhang, Mike Zheng Shou. [doi]
- HyperPrism: An Adaptive Non-linear Aggregation Framework for Distributed Machine Learning over Non-IID Data and Time-varying Communication LinksHaizhou Du, Yijian Chen, Ryan Yang, Yuchen Li 0006, Linghe Kong. [doi]
- PointAD: Comprehending 3D Anomalies from Points and Pixels for Zero-shot 3D Anomaly DetectionQihang Zhou, Jiangtao Yan, Shibo He, Wenchao Meng, Jiming Chen 0001. [doi]
- Distributional Reinforcement Learning with Regularized Wasserstein LossKe Sun 0013, Yingnan Zhao, Wulong Liu, Bei Jiang, Linglong Kong. [doi]
- Is O(log N) practical? Near-Equivalence Between Delay Robustness and Bounded Regret in Bandits and RLEnoch H. Kang, P. R. Kumar. [doi]
- Stylus: Automatic Adapter Selection for Diffusion ModelsMichael Luo, Justin Wong, Brandon Trabucco, Yanping Huang, Joseph E. Gonzalez, Zhifeng Chen, Ruslan Salakhutdinov, Ion Stoica. [doi]
- NeuralSolver: Learning Algorithms For Consistent and Efficient Extrapolation Across General TasksBernardo Esteves, Miguel Vasco, Francisco S. Melo. [doi]
- Efficient Large Multi-modal Models via Visual Context CompressionJieneng Chen, Luoxin Ye, Ju He, Zhaoyang Wang, Daniel Khashabi, Alan L. Yuille. [doi]
- UniTS: A Unified Multi-Task Time Series ModelShanghua Gao, Teddy Koker, Owen Queen, Tom Hartvigsen, Theodoros Tsiligkaridis, Marinka Zitnik. [doi]
- MatrixNet: Learning over symmetry groups using learned group representationsLucas Laird, Circe Hsu, Asilata Bapat, Robin Walters 0001. [doi]
- 3DET-Mamba: Causal Sequence Modelling for End-to-End 3D Object DetectionMingSheng Li, Jiakang Yuan, Sijin Chen, Lin Zhang, Anyu Zhu, Xin Chen, Tao Chen. [doi]
- Local Superior Soups: A Catalyst for Model Merging in Cross-Silo Federated LearningMinghui Chen, Meirui Jiang, Xin Zhang, Dou Qi, Zehua Wang, Xiaoxiao Li. [doi]
- MindMerger: Efficiently Boosting LLM Reasoning in non-English LanguagesZixian Huang, Wenhao Zhu, Gong Cheng 0001, Lei Li, Fei Yuan. [doi]
- Zero-Shot Tokenizer TransferBenjamin Minixhofer, Edoardo Maria Ponti, Ivan Vulic. [doi]
- MultiTrust: A Comprehensive Benchmark Towards Trustworthy Multimodal Large Language ModelsYichi Zhang, Yao Huang, Yitong Sun, Chang Liu, Zhe Zhao, Zhengwei Fang, Yifan Wang, Huanran Chen, Xiao Yang, Xingxing Wei, Hang Su, Yinpeng Dong, Jun Zhu. [doi]
- WISE: Rethinking the Knowledge Memory for Lifelong Model Editing of Large Language ModelsPeng Wang 0104, Zexi Li, Ningyu Zhang 0001, Ziwen Xu, Yunzhi Yao, Yong Jiang 0001, Pengjun Xie, Fei Huang 0004, Huajun Chen. [doi]
- SymILO: A Symmetry-Aware Learning Framework for Integer Linear OptimizationQian Chen, Tianjian Zhang, Linxin Yang, Qingyu Han, Akang Wang, Ruoyu Sun 0001, Xiaodong Luo, Tsung-Hui Chang. [doi]
- ABCFair: an Adaptable Benchmark approach for Comparing Fairness MethodsMaryBeth Defrance, Maarten Buyl, Tijl De Bie. [doi]
- From Text to Trajectory: Exploring Complex Constraint Representation and Decomposition in Safe Reinforcement LearningPusen Dong, Tianchen Zhu, Yue Qiu, Haoyi Zhou, Jianxin Li 0002. [doi]
- Learning to Handle Complex Constraints for Vehicle Routing ProblemsJieyi Bi, Yining Ma 0001, Jianan Zhou 0002, Wen Song, Zhiguang Cao, Yaoxin Wu, Jie Zhang 0002. [doi]
- TorchSpatial: A Location Encoding Framework and Benchmark for Spatial Representation LearningNemin Wu, Qian Cao, Zhangyu Wang, Zeping Liu, Yanlin Qi, Jielu Zhang, Joshua Ni, Xiaobai Yao, Hongxu Ma, Lan Mu, Stefano Ermon, Tanuja Ganu, Akshay Nambi 0001, Ni Lao, Gengchen Mai. [doi]
- Improving the Training of Rectified FlowsSangyun Lee, Zinan Lin 0001, Giulia Fanti. [doi]
- Parameterized Approximation Schemes for Fair-Range ClusteringZhen Zhang, Xiaohong Chen, Limei Liu, Jie Chen, Junyu Huang, Qilong Feng. [doi]
- RanDumb: Random Representations Outperform Online Continually Learned RepresentationsAmeya Prabhu, Shiven Sinha, Ponnurangam Kumaraguru, Philip Torr 0001, Ozan Sener, Puneet K. Dokania. [doi]
- PLIP: Language-Image Pre-training for Person Representation LearningJialong Zuo, Jiahao Hong, Feng Zhang, Changqian Yu, Hanyu Zhou, Changxin Gao, Nong Sang, Jingdong Wang 0001. [doi]
- Lisa: Lazy Safety Alignment for Large Language Models against Harmful Fine-tuning AttackTiansheng Huang, Sihao Hu, Fatih Ilhan, Selim F. Tekin, Ling Liu 0001. [doi]
- Bileve: Securing Text Provenance in Large Language Models Against Spoofing with Bi-level SignatureTong Zhou 0002, Xuandong Zhao, Xiaolin Xu 0001, Shaolei Ren. [doi]
- TrajCLIP: Pedestrian trajectory prediction method using contrastive learning and idempotent networksPengfei Yao, Yinglong Zhu, Huikun Bi, Tianlu Mao, Zhaoqi Wang. [doi]
- EDT: An Efficient Diffusion Transformer Framework Inspired by Human-like SketchingXinwang Chen, Ning Liu 0007, Yichen Zhu, Feifei Feng, Jian Tang. [doi]
- Extending Multi-modal Contrastive RepresentationsZiang Zhang, Zehan Wang 0001, Luping Liu, Rongjie Huang, Xize Cheng, Zhenhui Ye, Wang Lin, Huadai Liu, Haifeng Huang, Yang Zhao, Tao Jin 0004, Siqi Zheng, Zhou Zhao 0001. [doi]
- PhoCoLens: Photorealistic and Consistent Reconstruction in Lensless ImagingXin Cai, Zhiyuan You, Hailong Zhang, Jinwei Gu, WenTao Liu, Tianfan Xue. [doi]
- Identification and Estimation of the Bi-Directional MR with Some Invalid InstrumentsFeng Xie, Zhen Yao, Lin Xie, Yan Zeng, Zhi Geng. [doi]
- DiffTORI: Differentiable Trajectory Optimization for Deep Reinforcement and Imitation LearningWeikang Wan, Ziyu Wang, Yufei Wang, Zackory Erickson, David Held. [doi]
- GlotCC: An Open Broad-Coverage CommonCrawl Corpus and Pipeline for Minority LanguagesAmir Hossein Kargaran, François Yvon, Hinrich Schütze. [doi]
- Procedure-Aware Surgical Video-language Pretraining with Hierarchical Knowledge AugmentationKun Yuan, Vinkle Srivastav, Nassir Navab, Nicolas Padoy. [doi]
- Image-aware Evaluation of Generated Medical ReportsGefen Dawidowicz, Elad Hirsch, Ayellet Tal. [doi]
- e-COP : Episodic Constrained Optimization of PoliciesAkhil Agnihotri, Rahul Jain 0002, Deepak Ramachandran, Sahil Singla 0005. [doi]
- 2Net: PDE-Preserved Coarse Correction Network for efficient prediction of spatiotemporal dynamicsQi Wang, Pu Ren, Hao Zhou, Xin-Yang Liu, Zhiwen Deng, Yi Zhang, Zeruizhi Cheng, Hongsheng Liu 0002, Zidong Wang 0010, Jian-Xun Wang 0001, Ji-Rong Wen, Hao Sun, Yang Liu. [doi]
- VRSBench: A Versatile Vision-Language Benchmark Dataset for Remote Sensing Image UnderstandingXiang Li, Jian Ding, Mohamed Elhoseiny. [doi]
- Non-asymptotic Global Convergence Analysis of BFGS with the Armijo-Wolfe Line SearchQiujiang Jin, Ruichen Jiang, Aryan Mokhtari. [doi]
- Unity by Diversity: Improved Representation Learning for Multimodal VAEsThomas M. Sutter, Yang Meng, Andrea Agostini, Daphné Chopard, Norbert Fortin, Julia E. Vogt, Babak Shahbaba, Stephan Mandt. [doi]
- MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse ViewsYuedong Chen, Chuanxia Zheng, Haofei Xu, Bohan Zhuang, Andrea Vedaldi, Tat-Jen Cham, Jianfei Cai 0001. [doi]
- FEEL-SNN: Robust Spiking Neural Networks with Frequency Encoding and Evolutionary Leak FactorMengting Xu, De Ma, Huajin Tang, Qian Zheng, Gang Pan 0001. [doi]
- Distributed-Order Fractional Graph Operating NetworkKai Zhao 0010, Xuhao Li, Qiyu Kang, Feng Ji, Qinxu Ding, Yanan Zhao, Wenfei Liang, Wee-Peng Tay. [doi]
- QWO: Speeding Up Permutation-Based Causal Discovery in LiGAMsMohammad ShahverdiKondori, Ehsan Mokhtarian, Negar Kiyavash. [doi]
- Structure Consistent Gaussian Splatting with Matching Prior for Few-shot Novel View SynthesisRui Peng, Wangze Xu, Luyang Tang, Levio Leo, Jianbo Jiao, Ronggang Wang. [doi]
- SlimGPT: Layer-wise Structured Pruning for Large Language ModelsGui-ling, Ziyang Wang, Yuliang Yan, Qingwen Liu 0002. [doi]
- Counterfactual Fairness by Combining Factual and Counterfactual PredictionsZeyu Zhou, Tianci Liu 0003, Ruqi Bai, Jing Gao 0004, Murat Kocaoglu, David I. Inouye. [doi]
- Contextual Multinomial Logit Bandits with General Value FunctionsMengxiao Zhang, Haipeng Luo. [doi]
- PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play AcceleratorHanshu Yan, Xingchao Liu, Jiachun Pan, Jun Hao Liew, Qiang Liu 0001, Jiashi Feng. [doi]
- Functional Gradient Flows for Constrained SamplingShiyue Zhang, Longlin Yu, Ziheng Cheng, Cheng Zhang. [doi]
- Offline Oracle-Efficient Learning for Contextual MDPs via Layerwise Exploration-Exploitation TradeoffJian Qian, Haichen Hu, David Simchi-Levi. [doi]
- Conformalized Credal Set PredictorsAlireza Javanmardi, David Stutz, Eyke Hüllermeier. [doi]
- OAM-TCD: A globally diverse dataset of high-resolution tree cover mapsJosh Veitch-Michaelis, Andrew Cottam, Daniella Schweizer, Eben N. Broadbent, David Dao, Ce Zhang 0001, Angelica Almeyda Zambrano, Simeon Max. [doi]
- On Differentially Private Subspace Estimation in a Distribution-Free SettingEliad Tsfadia. [doi]
- Decompose, Analyze and Rethink: Solving Intricate Problems with Human-like Reasoning CycleShangzi Xue, Zhenya Huang, Jiayu Liu 0001, Xin Lin 0005, Yuting Ning, Binbin Jin, Xin Li 0064, Qi Liu 0003. [doi]
- Constrained Adaptive Attack: Effective Adversarial Attack Against Deep Neural Networks for Tabular DataThibault Simonetto, Salah Ghamizi, Maxime Cordy. [doi]
- ElasTST: Towards Robust Varied-Horizon Forecasting with Elastic Time-Series TransformerJiawen Zhang 0001, Shun Zheng, Xumeng Wen, Xiaofang Zhou, Jiang Bian 0002, Jia Li 0009. [doi]
- Attention boosted Individualized RegressionGuang Yang, Yuan Cao, Long Feng. [doi]
- Leveraging Tumor Heterogeneity: Heterogeneous Graph Representation Learning for Cancer Survival Prediction in Whole Slide ImagesJunxian Wu, Xinyi Ke, Xiaoming Jiang, Huanwen Wu, Youyong Kong, Lizhi Shao. [doi]
- Map It Anywhere: Empowering BEV Map Prediction using Large-scale Public DatasetsCherie Ho, Jiaye Zou, Omar Alama, Sai Mitheran Jagadesh Kumar, Cheng-Yu Chiang, Taneesh Gupta, Chen Wang 0033, Nikhil Varma Keetha, Katia P. Sycara, Sebastian A. Scherer. [doi]
- AuctionNet: A Novel Benchmark for Decision-Making in Large-Scale GamesKefan Su, Yusen Huo, Zhilin Zhang, Shuai Dou, Chuan Yu, Jian Xu, Zongqing Lu, Bo Zheng. [doi]
- Unveil Benign Overfitting for Transformer in Vision: Training Dynamics, Convergence, and GeneralizationJiarui Jiang, Wei Huang 0034, Miao Zhang, Taiji Suzuki, Liqiang Nie. [doi]
- TreeVI: Reparameterizable Tree-structured Variational Inference for Instance-level Correlation CapturingJunxi Xiao, Qinliang Su. [doi]
- On provable privacy vulnerabilities of graph representationsRuofan Wu, Guanhua Fang, Mingyang Zhang, Qiying Pan, Tengfei Liu, Weiqiang Wang. [doi]
- Policy Improvement using Language Feedback ModelsVictor Zhong, Dipendra Misra, Xingdi Yuan, Marc-Alexandre Côté. [doi]
- Enhancing Chess Reinforcement Learning with Graph RepresentationTomas Rigaux, Hisashi Kashima. [doi]
- Flexible Context-Driven Sensory Processing in Dynamical Vision ModelsLakshmi Narasimhan Govindarajan, Abhiram Iyer, Valmiki Kothare, Ila Fiete. [doi]
- GenAI Arena: An Open Evaluation Platform for Generative ModelsDongfu Jiang, Max Ku, Tianle Li, Yuansheng Ni, Shizhuo Sun, Rongqi Fan, Wenhu Chen. [doi]
- Biomedical Visual Instruction Tuning with Clinician Preference AlignmentHejie Cui, Lingjun Mao, Xin Liang, Jieyu Zhang, Hui Ren 0001, Quanzheng Li, Xiang Li 0001, Carl Yang 0001. [doi]
- Excluding the Irrelevant: Focusing Reinforcement Learning through Continuous Action MaskingRoland Stolz, Hanna Krasowski, Jakob Thumm, Michael Eichelbeck, Philipp Gassert, Matthias Althoff. [doi]
- Intrinsic Robustness of Prophet Inequality to Strategic Reward SignalingWei Tang, Haifeng Xu, Ruimin Zhang, Derek Zhu. [doi]
- Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood DiscrepancyShengfang Zhai, Huanran Chen, Yinpeng Dong, Jiajun Li, Qingni Shen, Yansong Gao, Hang Su 0006, Yang Liu 0014. [doi]
- Accelerating ERM for data-driven algorithm design using output-sensitive techniquesMaria-Florina Balcan, Christopher Seiler, Dravyansh Sharma. [doi]
- HAWK: Learning to Understand Open-World Video AnomaliesJiaqi Tang 0005, Hao Lu 0009, Ruizheng Wu, Xiaogang Xu, Ke Ma, Cheng Fang, Bin Guo 0001, Jiangbo Lu, Qifeng Chen, Yingcong Chen. [doi]
- Challenges of Generating Structurally Diverse GraphsFedor Velikonivtsev, Mikhail Mironov, Liudmila Prokhorenkova. [doi]
- Sequential Signal Mixing Aggregation for Message Passing Graph Neural NetworksMitchell Keren Taraday, Almog David, Chaim Baskin. [doi]
- An Efficient Memory Module for Graph Few-Shot Class-Incremental LearningDong Li, Aijia Zhang, Junqi Gao, Biqing Qi. [doi]
- Be Confident in What You Know: Bayesian Parameter Efficient Fine-Tuning of Vision Foundation ModelsDeep Shankar Pandey, Spandan Pyakurel, Qi Yu 0001. [doi]
- Getting More Juice Out of the SFT Data: Reward Learning from Human Demonstration Improves SFT for LLM AlignmentJiaxiang Li, Siliang Zeng, Hoi-To Wai, Chenliang Li, Alfredo García, Mingyi Hong 0001. [doi]
- Active preference learning for ordering items in- and out-of-sampleHerman Bergström, Emil Carlsson, Devdatt P. Dubhashi, Fredrik D. Johansson. [doi]
- Many-shot JailbreakingCem Anil, Esin Durmus, Nina Panickssery, Mrinank Sharma, Joe Benton, Sandipan Kundu, Joshua Batson, Meg Tong, Jesse Mu, Daniel Ford, Francesco Mosconi, Rajashree Agrawal, Rylan Schaeffer, Naomi Bashkansky, Samuel Svenningsen, Mike Lambert, Ansh Radhakrishnan, Carson Denison, Evan Hubinger, Yuntao Bai, Trenton Bricken, Timothy Maxwell, Nicholas Schiefer, James Sully, Alex Tamkin, Tamera Lanham, Karina Nguyen, Tomek Korbak, Jared Kaplan, Deep Ganguli, Samuel R. Bowman, Ethan Perez, Roger B. Grosse, David Kristjanson Duvenaud. [doi]
- Pipeline Parallelism with Controllable MemoryPenghui Qi, Xinyi Wan, Nyamdavaa Amar, Min Lin. [doi]
- DeepLag: Discovering Deep Lagrangian Dynamics for Intuitive Fluid PredictionQilong Ma, Haixu Wu, Lanxiang Xing, Shangchen Miao, Mingsheng Long. [doi]
- Mirror and Preconditioned Gradient Descent in Wasserstein SpaceClément Bonet, Théo Uscidda, Adam David, Pierre-Cyril Aubin-Frankowski, Anna Korba. [doi]
- CA-SSLR: Condition-Aware Self-Supervised Learning Representation for Generalized Speech ProcessingYen-Ju Lu, Jing Liu, Thomas Thebaud, Laureano Moro-Velázquez, Ariya Rastrow, Najim Dehak, Jesús Villalba 0001. [doi]
- DiffuserLite: Towards Real-time Diffusion PlanningZibin Dong, Jianye Hao, Yifu Yuan, Fei Ni 0001, Yitian Wang, Pengyi Li, Yan Zheng 0002. [doi]
- MO-DDN: A Coarse-to-Fine Attribute-based Exploration Agent for Multi-Object Demand-driven NavigationHongcheng Wang, Peiqi Liu, Wenzhe Cai, Mingdong Wu, Zhengyu Qian, Hao Dong 0003. [doi]
- Activating Self-Attention for Multi-Scene Absolute Pose RegressionMiso Lee, Jihwan Kim, Jae-Pil Heo. [doi]
- Auditing Local Explanations is HardRobi Bhattacharjee, Ulrike von Luxburg. [doi]
- Computerized Adaptive Testing via Collaborative RankingZirui Liu 0010, Yan Zhuang, Qi Liu 0003, Jiatong Li 0002, Yuren Zhang, Zhenya Huang, Jinze Wu, Shijin Wang 0001. [doi]
- Deep Bayesian Active Learning for Preference Modeling in Large Language ModelsLuckeciano Carvalho Melo, Panagiotis Tigas, Alessandro Abate, Yarin Gal. [doi]
- MaVEn: An Effective Multi-granularity Hybrid Visual Encoding Framework for Multimodal Large Language ModelChaoya Jiang, Hongrui Jia, Haiyang Xu, Wei Ye, Mengfan Dong, Ming Yan, Ji Zhang 0011, Fei Huang 0004, Shikun Zhang. [doi]
- Reasoning Multi-Agent Behavioral Topology for Interactive Autonomous DrivingHaochen Liu, Li Chen, Yu Qiao, Chen Lv, Hongyang Li. [doi]
- WONDERBREAD: A Benchmark for Evaluating Multimodal Foundation Models on Business Process Management TasksMichael Wornow, Avanika Narayan, Ben Viggiano, Ishan S. Khare, Tathagat Verma, Tibor Thompson, Miguel Angel Fuentes Hernandez, Sudharsan Sundar, Chloe Trujillo, Krrish Chawla, Rongfei Lu, Justin Shen, Divya Nagaraj, Joshua Martinez, Vardhan Agrawal, Althea Hudson, Nigam Shah, Christopher Ré. [doi]
- The Secretary Problem with Predicted Additive GapAlexander Braun, Sherry Sarkar. [doi]
- VQ-Map: Bird's-Eye-View Map Layout Estimation in Tokenized Discrete Space via Vector QuantizationYiWei Zhang, Jin Gao, Fudong Ge, Guan Luo, Bing Li 0001, Zhao-Xiang Zhang 0001, Haibin Ling, Weiming Hu. [doi]
- AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic SynthesisSwapnil Bhosale, Haosen Yang, Diptesh Kanojia, Jiankang deng, Xiatian Zhu. [doi]
- RaVL: Discovering and Mitigating Spurious Correlations in Fine-Tuned Vision-Language ModelsMaya Varma, Jean-Benoit Delbrouck, Zhihong Chen, Akshay Chaudhari, Curtis P. Langlotz. [doi]
- Scaling Laws for Reward Model Overoptimization in Direct Alignment AlgorithmsRafael Rafailov, Yaswanth Chittepu, Ryan Park, Harshit Sikchi, Joey Hejna, W. Bradley Knox, Chelsea Finn, Scott Niekum. [doi]
- Saliency-driven Experience Replay for Continual LearningGiovanni Bellitto, Federica Proietto Salanitri, Matteo Pennisi, Matteo Boschini, Lorenzo Bonicelli, Angelo Porrello, Simone Calderara, Simone Palazzo, Concetto Spampinato. [doi]
- T2Vs Meet VLMs: A Scalable Multimodal Dataset for Visual Harmfulness RecognitionChen Yeh, You-Ming Chang, Wei-chen Chiu, Ning Yu. [doi]
- XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic SegmentationZiyi Wang 0007, Yanbo Wang, Xumin Yu, Jie Zhou 0001, Jiwen Lu. [doi]
- Optimized Feature Generation for Tabular Data via LLMs with Decision Tree ReasoningJaehyun Nam, Kyuyoung Kim, Seunghyuk Oh, Jihoon Tack, Jaehyung Kim 0001, Jinwoo Shin. [doi]
- FedGMKD: An Efficient Prototype Federated Learning Framework through Knowledge Distillation and Discrepancy-Aware AggregationJianqiao Zhang, Caifeng Shan, Jungong Han. [doi]
- Weight Diffusion for Future: Learn to Generalize in Non-Stationary EnvironmentsMixue Xie, Shuang Li 0008, Binhui Xie, Chi Harold Liu, Jian Liang 0002, Zixun Sun, Ke Feng, Chengwei Zhu. [doi]
- HORSE: Hierarchical Representation for Large-Scale Neural Subset SelectionBinghui Xie, Yixuan Wang, Yongqiang Chen 0002, Kaiwen Zhou, Yu Li, Wei Meng 0001, James Cheng. [doi]
- The Surprising Ineffectiveness of Pre-Trained Visual Representations for Model-Based Reinforcement LearningMoritz Schneider, Robert Krug 0002, Narunas Vaskevicius, Luigi Palmieri, Joschka Boedecker. [doi]
- EAI: Emotional Decision-Making of LLMs in Strategic Games and Ethical DilemmasMikhail Mozikov, Nikita Severin, Valeria Bodishtianu, Maria Glushanina, Ivan Nasonov, Daniil Orekhov, Pekhotin Vladislav, Ivan Makovetskiy, Mikhail Baklashkin, Vasily Lavrentyev, Akim Tsvigun, Denis Turdakov, Tatiana Shavrina, Andrey Savchenko, Ilya Makarov. [doi]
- Exploring the Role of Large Language Models in Prompt Encoding for Diffusion ModelsBingqi Ma, Zhuofan Zong, Guanglu Song, Hongsheng Li, Yu Liu. [doi]
- Meta-DiffuB: A Contextualized Sequence-to-Sequence Text Diffusion Model with Meta-ExplorationYun-Yen Chuang, Hung-Min Hsu, Kevin Lin, Chen-Sheng Gu, Ling Zhen Li, Ray-I Chang, Hung-yi Lee. [doi]
- Approximating the Top Eigenvector in Random Order StreamsPraneeth Kacham, David P. Woodruff. [doi]
- BIGOS V2 Benchmark for Polish ASR: Curated Datasets and Tools for Reproducible EvaluationMichal Junczyk. [doi]
- Robust Gaussian Processes via Relevance PursuitSebastian Ament, Elizabeth Santorella, David Eriksson, Ben Letham, Maximilian Balandat, Eytan Bakshy. [doi]
- Fast Channel Simulation via Error-Correcting CodesSharang M. Sriramu, Rochelle Barsz, Elizabeth Polito, Aaron B. Wagner. [doi]
- Identifying Functionally Important Features with End-to-End Sparse Dictionary LearningDan Braun, Jordan Taylor, Nicholas Goldowsky-Dill, Lee Sharkey. [doi]
- Learnability of high-dimensional targets by two-parameter models and gradient flowDmitry Yarotsky. [doi]
- UrbanDataLayer: A Unified Data Pipeline for Urban ScienceYiheng Wang, Tianyu Wang, Yuying Zhang, Hongji Zhang, Haoyu Zheng, Guanjie Zheng, Linghe Kong. [doi]
- Causal Discovery from Event Sequences by Local Cause-Effect AttributionJoscha Cüppers, Sascha Xu, Ahmed Musa, Jilles Vreeken. [doi]
- TinyTTA: Efficient Test-time Adaptation via Early-exit Ensembles on Edge DevicesHong Jia, Young Kwon, Alessio Orsino, Ting Dang, Domenico Talia, Cecilia Mascolo. [doi]
- A Non-parametric Direct Learning Approach to Heterogeneous Treatment Effect Estimation under Unmeasured ConfoundingXinhai Zhang, Xingye Qiao. [doi]
- Structured Matrix Basis for Multivariate Time Series Forecasting with Interpretable DynamicsXiaodan Chen, Xiucheng Li, Xinyang Chen 0001, Zhijun Li 0002. [doi]
- Large Language Models-guided Dynamic Adaptation for Temporal Knowledge Graph ReasoningJiapu Wang, Kai Sun, Linhao Luo, Wei Wei, Yongli Hu, Alan Wee-Chung Liew, Shirui Pan, Baocai Yin. [doi]
- PURE: Prompt Evolution with Graph ODE for Out-of-distribution Fluid Dynamics ModelingHao Wu, Changhu Wang, Fan Xu, Jinbao Xue, Chong Chen 0002, Xian-Sheng Hua, Xiao Luo. [doi]
- FLoRA: Federated Fine-Tuning Large Language Models with Heterogeneous Low-Rank AdaptationsZiyao Wang, Zheyu Shen, Yexiao He, Guoheng Sun, Hongyi Wang 0001, Lingjuan Lyu, Ang Li 0005. [doi]
- Aligning Large Language Models with Representation Editing: A Control PerspectiveLingkai Kong, Haorui Wang, Wenhao Mu, Yuanqi Du, Yuchen Zhuang, Yifei Zhou, Yue Song, Rongzhi Zhang, Kai Wang, Chao Zhang 0014. [doi]
- What to Say and When to Say it: Live Fitness Coaching as a Testbed for Situated InteractionSunny Panchal, Apratim Bhattacharyya, Guillaume Berger, Antoine Mercier 0005, Cornelius Böhm, Florian Dietrichkeit, Reza Pourreza 0002, Xuanlin Li, Pulkit Madan, Mingu Lee, Mark Todorovich, Ingo Bax, Roland Memisevic. [doi]
- Enhancing Domain Adaptation through Prompt Gradient AlignmentViet-Hoang Phan, Tung Lam Tran, Quyen Tran, Trung Le. [doi]
- An Efficient Recipe for Long Context Extension via Middle-Focused Positional EncodingTong Wu, Yanpeng Zhao, Zilong Zheng. [doi]
- Federated Fine-tuning of Large Language Models under Heterogeneous Tasks and Client ResourcesJiamu Bai, Daoyuan Chen, Bingchen Qian, Liuyi Yao, Yaliang Li. [doi]
- Reverse Transition Kernel: A Flexible Framework to Accelerate Diffusion InferenceXunpeng Huang, Difan Zou, Hanze Dong, Yi Zhang, Yian Ma, Tong Zhang 0001. [doi]
- HEMM: Holistic Evaluation of Multimodal Foundation ModelsPaul Pu Liang, Akshay Goindani, Talha Chafekar, Leena Mathur, Haofei Yu, Ruslan Salakhutdinov, Louis-Philippe Morency. [doi]
- RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language ModelsZhuoran Jin, Pengfei Cao, Chenhao Wang, Zhitao He, Hongbang Yuan, Jiachun Li, Yubo Chen 0001, Kang Liu 0001, Jun Zhao 0001. [doi]
- GACL: Exemplar-Free Generalized Analytic Continual LearningHuiping Zhuang, Yizhu Chen, Di Fang 0004, Run He, Kai Tong, Hongxin Wei, Ziqian Zeng, Cen Chen. [doi]
- A PID Controller Approach for Adaptive Probability-dependent Gradient Decay in Model CalibrationSiyuan Zhang, Linbo Xie. [doi]
- FVEL: Interactive Formal Verification Environment with Large Language Models via Theorem ProvingXiaohan Lin, Qingxing Cao, Yinya Huang, Haiming Wang, Jianqiao Lu, Zhengying Liu, Linqi Song, Xiaodan Liang. [doi]
- Prediction with Action: Visual Policy Learning via Joint Denoising ProcessYanjiang Guo, Yucheng Hu, Jianke Zhang, Yen-Jen Wang, Xiaoyu Chen, Chaochao Lu, Jianyu Chen. [doi]
- CorDA: Context-Oriented Decomposition Adaptation of Large Language Models for Task-Aware Parameter-Efficient Fine-tuningYibo Yang, Xiaojie Li, Zhongzhu Zhou, Shuaiwen Song, Jianlong Wu, Liqiang Nie, Bernard Ghanem. [doi]
- HYSYNTH: Context-Free LLM Approximation for Guiding Program SynthesisShraddha Barke, Emmanuel Anaya Gonzalez, Saketh Ram Kasibatla, Taylor Berg-Kirkpatrick, Nadia Polikarpova. [doi]
- UniFL: Improve Latent Diffusion Model via Unified Feedback LearningJiacheng Zhang, Jie Wu 0030, Yuxi Ren, Xin Xia, Huafeng Kuang, Pan Xie, Jiashi Li, XueFeng Xiao, Weilin Huang, Shilei Wen, Lean Fu, Guanbin Li. [doi]
- Attractor Memory for Long-Term Time Series Forecasting: A Chaos PerspectiveJiaxi Hu, Yuehong Hu, Wei Chen 0070, Ming Jin 0005, Shirui Pan, Qingsong Wen, Yuxuan Liang. [doi]
- UniAR: A Unified model for predicting human Attention and Responses on visual contentPeizhao Li, Junfeng He, Gang Li, Rachit Bhargava, Shaolei Shen, Nachiappan Valliappan, Youwei Liang, Hongxiang Gu, Venky Ramachandran, Golnaz Farhadi, Yang Li, Kai Kohlhoff, Vidhya Navalpakkam. [doi]
- FinCon: A Synthesized LLM Multi-Agent System with Conceptual Verbal Reinforcement for Enhanced Financial Decision MakingYangyang Yu, Zhiyuan Yao, Haohang Li, Zhiyang Deng, Yuechen Jiang, Yupeng Cao, Zhi Chen, Jordan W. Suchow, Zhenyu Cui, Rong Liu, Zhaozhuo Xu, Denghui Zhang, Koduvayur Subbalakshmi, Guojun Xiong, Yueru He, Jimin Huang, Dong Li, Qianqian Xie. [doi]
- VastTrack: Vast Category Visual Object TrackingLiang Peng, Junyuan Gao, Xinran Liu, Weihong Li, Shaohua Dong, Zhipeng Zhang, Heng Fan 0001, Libo Zhang 0001. [doi]
- Learning Versatile Skills with Curriculum MaskingYao Tang, Zhihui Xie 0002, Zichuan Lin, Deheng Ye, Shuai Li. [doi]
- Improving Viewpoint-Independent Object-Centric Representations through Active Viewpoint SelectionYinxuan Huang, Chengmin Gao, Bin Li 0015, Xiangyang Xue 0001. [doi]
- Hamiltonian Monte Carlo Inference of Marginalized Linear Mixed-Effects ModelsJinlin Lai, Justin Domke, Daniel R. Sheldon. [doi]
- HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language ModelsBernal Jimenez Gutierrez, Yiheng Shu, Yu Gu 0016, Michihiro Yasunaga, Yu Su 0001. [doi]
- PIVOT-R: Primitive-Driven Waypoint-Aware World Model for Robotic ManipulationKaidong Zhang, Pengzhen Ren, Bingqian Lin, Junfan Lin, Shikui Ma, Hang Xu, Xiaodan Liang. [doi]
- Neural Conditional Probability for Uncertainty QuantificationVladimir Kostic, Grégoire Pacreau, Giacomo Turri, Pietro Novelli, Karim Lounici, Massimiliano Pontil. [doi]
- Style Adaptation and Uncertainty Estimation for Multi-Source Blended-Target Domain AdaptationYuwu Lu, Haoyu Huang, Xue Hu. [doi]
- Bridging Geometric States via Geometric Diffusion BridgeShengjie Luo, Yixian Xu, Di He 0001, Shuxin Zheng, Tie-Yan Liu, Liwei Wang 0001. [doi]
- Towards Editing Time SeriesBaoyu Jing, Shuqi Gu, Tianyu Chen, Zhiyu Yang, Dongsheng Li 0002, Jingrui He, Kan Ren. [doi]
- Sharpness-diversity tradeoff: improving flat ensembles with SharpBalanceHaiquan Lu, Xiaotian Liu, Yefan Zhou, Qunli Li, Kurt Keutzer, Michael W. Mahoney, Yujun Yan, Huanrui Yang, Yaoqing Yang. [doi]
- Does Reasoning Emerge? Examining the Probabilities of Causation in Large Language ModelsJavier Gonzalez, Aditya Nori. [doi]
- Tight Rates for Bandit Control Beyond QuadraticsY. Jennifer Sun, Zhou Lu. [doi]
- Scene Graph Generation with Role-Playing Large Language ModelsGuikun Chen, Jin Li, Wenguan Wang. [doi]
- Understanding the Expressive Power and Mechanisms of Transformer for Sequence ModelingMingze Wang, Weinan E. [doi]
- Optimal and Approximate Adaptive Stochastic QuantizationRan Ben-Basat, Yaniv Ben-Itzhak, Michael Mitzenmacher, Shay Vargaftik. [doi]
- Taming Generative Diffusion Prior for Universal Blind Image RestorationSiwei Tu, Weidong Yang, Ben Fei. [doi]
- Stepwise Alignment for Constrained Language Model Policy OptimizationAkifumi Wachi, Thien Q. Tran, Rei Sato, Takumi Tanabe, Youhei Akimoto. [doi]
- WorldCoder, a Model-Based LLM Agent: Building World Models by Writing Code and Interacting with the EnvironmentHao Tang 0008, Darren Key, Kevin Ellis. [doi]
- Text to Blind MotionHee-Jae Kim, Kathakoli Sengupta, Masaki Kuribayashi, Hernisa Kacorri, Eshed Ohn-Bar. [doi]
- SeafloorAI: A Large-scale Vision-Language Dataset for Seafloor Geological SurveyKien X. Nguyen 0001, Fengchun Qiao, Arthur Trembanis, Xi Peng 0005. [doi]
- The Minimax Rate of HSIC Estimation for Translation-Invariant KernelsFlorian Kalinke, Zoltán Szabó. [doi]
- Questioning the Survey Responses of Large Language ModelsRicardo Dominguez-Olmedo, Moritz Hardt, Celestine Mendler-Dünner. [doi]
- Ensemble sampling for linear bandits: small ensembles sufficeDavid Janz, Alexander E. Litvak, Csaba Szepesvári. [doi]
- Accuracy is Not All You NeedAbhinav Dutta, Sanjeev Krishnan, Nipun Kwatra, Ramachandran Ramjee. [doi]
- 2-Gaussian: Rectifying Radiative Gaussian Splatting for Tomographic ReconstructionRuyi Zha, Tao Jun Lin, Yuanhao Cai, Jiwen Cao, Yanhao Zhang 0003, Hongdong Li. [doi]
- Disentangling and mitigating the impact of task similarity for continual learningNaoki Hiratani. [doi]
- Multi-hypotheses Conditioned Point Cloud Diffusion for 3D Human Reconstruction from Occluded ImagesDonghwan Kim, Tae-Kyun Kim 0001. [doi]
- Near-Optimality of Contrastive Divergence AlgorithmsPierre Glaser, Kevin Han Huang, Arthur Gretton. [doi]
- Deep Learning Through A Telescoping Lens: A Simple Model Provides Empirical Insights On Grokking, Gradient Boosting & BeyondAlan Jeffares, Alicia Curth, Mihaela van der Schaar. [doi]
- Gradient-Variation Online Learning under Generalized SmoothnessYan-Feng Xie, Peng Zhao 0006, Zhi-Hua Zhou. [doi]
- shapiq: Shapley Interactions for Machine LearningMaximilian Muschalik, Hubert Baniecki, Fabian Fumagalli, Patrick Kolpaczki, Barbara Hammer, Eyke Hüllermeier. [doi]
- From Unstructured Data to In-Context Learning: Exploring What Tasks Can Be Learned and WhenKevin Christian Wibisono, Yixin Wang. [doi]
- RoME: A Robust Mixed-Effects Bandit Algorithm for Optimizing Mobile Health InterventionsEaston K. Huch, Jieru Shi, Madeline R. Abbott, Jessica R. Golbus, Alexander Moreno, Walter Dempsey. [doi]
- Diversity-Driven Synthesis: Enhancing Dataset Distillation through Directed Weight AdjustmentJiawei Du, Xin Zhang 0092, Juncheng Hu, Wenxin Huang, Joey Tianyi Zhou. [doi]
- Immiscible Diffusion: Accelerating Diffusion Training with Noise AssignmentYiheng Li, Heyang Jiang, Akio Kodaira, Masayoshi Tomizuka, Kurt Keutzer, Chenfeng Xu. [doi]
- In-Context Learning State Vector with Inner and Momentum OptimizationDongfang Li 0002, Zhenyu Liu, Xinshuo Hu, Zetian Sun, Baotian Hu, Min Zhang 0005. [doi]
- Test-Time Dynamic Image FusionBing Cao 0002, Yinan Xia, Yi Ding, Changqing Zhang, Qinghua Hu. [doi]
- SUGARCREPE++ Dataset: Vision-Language Model Sensitivity to Semantic and Lexical AlterationsSri Harsha Dumpala, Aman Jaiswal, Chandramouli Shama Sastry, Evangelos E. Milios, Sageev Oore, Hassan Sajjad 0001. [doi]
- Transformers Learn to Achieve Second-Order Convergence Rates for In-Context Linear RegressionDeqing Fu, Tian Qi Chen, Robin Jia, Vatsal Sharan. [doi]
- iVideoGPT: Interactive VideoGPTs are Scalable World ModelsJialong Wu 0001, Shaofeng Yin, Ningya Feng, Xu He, Dong Li, Jianye Hao, Mingsheng Long. [doi]
- How do Large Language Models Handle Multilingualism?Yiran Zhao 0006, Wenxuan Zhang, Guizhen Chen, Kenji Kawaguchi, Lidong Bing. [doi]
- Generalizing CNNs to graphs with learnable neighborhood quantizationIsaac Osafo Nkansah, Neil Gallagher, Ruchi Sandilya, Conor Liston, Logan Grosenick. [doi]
- Pretraining with Random Noise for Fast and Robust Learning without Weight TransportJeonghwan Cheon, Sang Wan Lee, Se-Bum Paik. [doi]
- Semi-Supervised Sparse Gaussian Classification: Provable Benefits of Unlabeled DataEyar Azar, Boaz Nadler. [doi]
- Error Analysis of Spherically Constrained Least Squares Reformulation in Solving the Stackelberg Prediction GameXiyuan Li, Weiwei Liu. [doi]
- Robust Offline Active Learning on GraphsYuanchen Wu, Yubai Yuan. [doi]
- Text-space Graph Foundation Models: Comprehensive Benchmarks and New InsightsZhikai Chen, Haitao Mao, Jingzhe Liu, Yu Song, Bingheng Li, Wei Jin 0009, Bahare Fatemi, Anton Tsitsulin, Bryan Perozzi, Hui Liu 0031, Jiliang Tang. [doi]
- Towards Next-Level Post-Training Quantization of Hyper-Scale TransformersJunhan Kim, Chungman Lee, Eulrang Cho, Kyungphil Park, Ho Young Kim, Joonyoung Kim, Yongkweon Jeon. [doi]
- Finding NeMo: Localizing Neurons Responsible For Memorization in Diffusion ModelsDominik Hintersdorf, Lukas Struppek, Kristian Kersting, Adam Dziedzic, Franziska Boenisch. [doi]
- Decoding-Time Language Model Alignment with Multiple ObjectivesRuizhe Shi, Yifang Chen 0001, Yushi Hu, Alisa Liu, Hanna Hajishirzi, Noah A. Smith, Simon S. Du. [doi]
- DEFT: Efficient Fine-tuning of Diffusion Models by Learning the Generalised $h$-transformAlexander Denker, Francisco Vargas 0001, Shreyas Padhy, Kieran Didi, Simon V. Mathis, Riccardo Barbano, Vincent Dutordoir, Emile Mathieu, Urszula Julia Komorowska, Pietro Lió. [doi]
- Hierarchical Programmatic Option FrameworkYu-An Lin, Chen-Tao Lee, Chih-Han Yang, Guan-Ting Liu, Shao-Hua Sun. [doi]
- Exact, Tractable Gauss-Newton Optimization in Deep Reversible Architectures Reveal Poor GeneralizationDavide Buffelli, Jamie McGowan, Wangkun Xu, Alexandru Cioba, Da-shan Shiu, Guillaume Hennequin, Alberto Bernacchia. [doi]
- Generative Retrieval Meets Multi-Graded RelevanceYubao Tang, Ruqing Zhang 0001, Jiafeng Guo, Maarten de Rijke, Wei Chen 0034, Xueqi Cheng. [doi]
- ReVideo: Remake a Video with Motion and Content ControlChong Mou, Mingdeng Cao, Xintao Wang, Zhaoyang Zhang 0004, Ying Shan, Jian Zhang 0018. [doi]
- ActFusion: a Unified Diffusion Model for Action Segmentation and AnticipationDayoung Gong, Suha Kwak, Minsu Cho. [doi]
- SDP4Bit: Toward 4-bit Communication Quantization in Sharded Data Parallelism for LLM TrainingJinda Jia, Cong Xie, Hanlin Lu, Daoce Wang, Hao Feng, Chengming Zhang 0006, Baixi Sun, Haibin Lin, Zhi Zhang, Xin Liu, Dingwen Tao. [doi]
- Bandit-Feedback Online Multiclass Classification: Variants and TradeoffsYuval Filmus, Steve Hanneke, Idan Mehalel, Shay Moran. [doi]
- ANT: Adaptive Noise Schedule for Time Series Diffusion ModelsSeunghan Lee, Kibok Lee 0003, Taeyoung Park. [doi]
- Alignment at Pre-training! Towards Native Alignment for Arabic LLMsJuhao Liang, Zhenyang Cai, Jianqing Zhu, Huang Huang, Kewei Zong, Bang An, Mosen Alharthi, Juncai He, Lian Zhang, Haizhou Li 0001, Benyou Wang, Jinchao Xu. [doi]
- Temporally Consistent Atmospheric Turbulence Mitigation with Neural RepresentationsHaoming Cai, Jingxi Chen, Brandon Y. Feng, Weiyun Jiang, Mingyang Xie, Kevin Zhang 0003, Cornelia Fermüller, Yiannis Aloimonos, Ashok Veeraraghavan, Christopher A. Metzler. [doi]
- EgoChoir: Capturing 3D Human-Object Interaction Regions from Egocentric ViewsYuhang Yang, Wei Zhai, Chengfeng Wang, Chengjun Yu, Yang Cao 0010, Zheng-Jun Zha. [doi]
- Voxel Proposal Network via Multi-Frame Knowledge Distillation for Semantic Scene CompletionLubo Wang, Di Lin 0002, Kairui Yang, Ruonan Liu, Qing Guo 0005, Wuyuan Xie, Miaohui Wang, Lingyu Liang, Yi Wang, Ping Li. [doi]
- AdaPKC: PeakConv with Adaptive Peak Receptive Field for Radar Semantic SegmentationTeng Li, Liwen Zhang, Youcheng Zhang, ZijunHu, Pengcheng Pi, Zongqing Lu, Qingmin Liao, Zhe Ma. [doi]
- Understanding Scaling Laws with Statistical and Approximation Theory for Transformer Neural Networks on Intrinsically Low-dimensional DataAlexander Havrilla, Wenjing Liao. [doi]
- Using Surrogates in Covariate-adjusted Response-adaptive Randomization Experiments with Delayed OutcomesLei Shi, Waverly Wei, Jingshen Wang. [doi]
- Conditional Density Estimation with Histogram TreesLincen Yang, Matthijs van Leeuwen. [doi]
- A Unified Debiasing Approach for Vision-Language Models across Modalities and TasksHoin Jung, Taeuk Jang, Xiaoqian Wang 0001. [doi]
- Advection Augmented Convolutional Neural NetworksNiloufar Zakariaei, Siddharth Rout, Eldad Haber, Moshe Eliasof. [doi]
- Learning Transferable Features for Implicit Neural RepresentationsKushal Kardam Vyas, Ahmed Imtiaz Humayun, Aniket Dashpute, Richard G. Baraniuk, Ashok Veeraraghavan, Guha Balakrishnan. [doi]
- Does Egalitarian Fairness Lead to Instability? The Fairness Bounds in Stable Federated Learning Under Altruistic BehaviorsJiashi Gao, Ziwei Wang, Xiangyu Zhao 0001, Xin Yao 0001, Xuetao Wei. [doi]
- Proving Olympiad Algebraic Inequalities without Human DemonstrationsChenrui Wei, Mengzhou Sun, Wei Wang. [doi]
- Probabilistic size-and-shape functional mixed modelsFangyi Wang, Karthik Bharath, Oksana A. Chkrebtii, Sebastian Kurtek. [doi]
- HydraViT: Stacking Heads for a Scalable ViTJanek Haberer, Ali Hojjat, Olaf Landsiedel. [doi]
- bit2bit: 1-bit quanta video reconstruction via self-supervised photon predictionYehe Liu, Alexander Krull, Hector Basevi, Ales Leonardis, Michael W. Jenkins. [doi]
- Exploiting Descriptive Completeness Prior for Cross Modal Hashing with Incomplete LabelsHaoyang Luo, Zheng Zhang 0006, Yadan Luo. [doi]
- Can Graph Learning Improve Planning in LLM-based Agents?Xixi Wu, Yifei Shen, Caihua Shan, Kaitao Song, Siwei Wang, Bohang Zhang, Jiarui Feng, Hong Cheng, Wei Chen, Yun Xiong, Dongsheng Li. [doi]
- QTIP: Quantization with Trellises and Incoherence ProcessingAlbert Tseng, Qingyao Sun, David Hou, Christopher De Sa. [doi]
- Identifying and Solving Conditional Image Leakage in Image-to-Video Diffusion ModelMin Zhao 0013, Hongzhou Zhu, Chendong Xiang, Kaiwen Zheng, Chongxuan Li, Jun Zhu 0001. [doi]
- S2HPruner: Soft-to-Hard Distillation Bridges the Discretization Gap in PruningWeihao Lin 0002, Shengji Tang, Chong Yu, Peng Ye, Tao Chen 0003. [doi]
- Synthetic Programming Elicitation for Text-to-Code in Very Low-Resource Programming and Formal LanguagesFederico Mora 0002, Justin Wong, Haley Lepe, Sahil Bhatia, Karim Elmaaroufi, George Varghese, Joseph E. Gonzalez, Elizabeth Polgreen, Sanjit Seshia. [doi]
- Lightweight Frequency Masker for Cross-Domain Few-Shot Semantic SegmentationJintao Tong, Yixiong Zou, Yuhua Li 0003, Ruixuan Li 0001. [doi]
- Solving Inverse Problems via Diffusion Optimal ControlHenry Li, Marcus Pereira. [doi]
- Do Counterfactually Fair Image Classifiers Satisfy Group Fairness? - A Theoretical and Empirical StudySangwon Jung, Sumin Yu, Sanghyuk Chun, Taesup Moon. [doi]
- High-Resolution Image Harmonization with Adaptive-Interval Color TransformationQuanling Meng, Qinglin Liu, Zonglin Li, Xiangyuan Lan, Shengping Zhang, Liqiang Nie. [doi]
- Probabilistic Conformal Distillation for Enhancing Missing Modality RobustnessMengxi Chen, Fei Zhang, Zihua Zhao, Jiangchao Yao, Ya Zhang 0002, Yanfeng Wang 0001. [doi]
- LION: Linear Group RNN for 3D Object Detection in Point CloudsZhe Liu 0033, Jinghua Hou, Xinyu Wang 0024, Xiaoqing Ye, Jingdong Wang 0001, Hengshuang Zhao, Xiang Bai. [doi]
- Unified Generative and Discriminative Training for Multi-modal Large Language ModelsWei Chow, Juncheng Li 0006, Qifan Yu, Kaihang Pan, Hao Fei 0001, Zhiqi Ge, Shuai Yang, Siliang Tang, Hanwang Zhang, Qianru Sun. [doi]
- Data Mixture Inference Attack: BPE Tokenizers Reveal Training Data CompositionsJonathan Hayase, Alisa Liu, Yejin Choi 0001, Sewoong Oh, Noah A. Smith. [doi]
- Accelerating Relative Entropy Coding with Space PartitioningJiajun He 0003, Gergely Flamich, José Miguel Hernández-Lobato. [doi]
- Enhancing Robustness of Last Layer Two-Stage Fair Model CorrectionsNathan Stromberg, Rohan Ayyagari, Sanmi Koyejo, Richard Nock, Lalitha Sankar. [doi]
- The Iterative Optimal Brain Surgeon: Faster Sparse Recovery by Leveraging Second-Order InformationDiyuan Wu, Ionut-Vlad Modoranu, Mher Safaryan, Denis Kuznedelev, Dan Alistarh. [doi]
- NeuMA: Neural Material Adaptor for Visual Grounding of Intrinsic DynamicsJunyi Cao, Shanyan Guan, Yanhao Ge, Wei Li, Xiaokang Yang, Chao Ma. [doi]
- Enhancing Diversity in Bayesian Deep Learning via Hyperspherical Energy Minimization of CKADavid Smerkous, Qinxun Bai, Fuxin Li. [doi]
- WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge Conflicts from WikipediaYufang Hou 0001, Alessandra Pascale, Javier Carnerero-Cano, Tigran T. Tchrakian, Radu Marinescu 0002, Elizabeth Daly, Inkit Padhi, Prasanna Sattigeri. [doi]
- DA-Ada: Learning Domain-Aware Adapter for Domain Adaptive Object DetectionHaochen Li, Rui Zhang, Hantao Yao, Xin Zhang, Yifan Hao, Xinkai Song, Xiaqing Li, Yongwei Zhao, Yunji Chen, Ling Li. [doi]
- From Transparent to Opaque: Rethinking Neural Implicit Surfaces with $\alpha$-NeuSHaoran Zhang, Junkai Deng, Xuhui Chen, Fei Hou, Wencheng Wang, Hong Qin 0001, Chen Qian 0006, Ying He 0001. [doi]
- A New Multi-Source Light Detection Benchmark and Semi-Supervised Focal Light DetectionJae Yong Baek, Yong-Sang Yoo, Seung Hwan Bae. [doi]
- Deep Correlated Prompting for Visual Recognition with Missing ModalitiesLianyu Hu, Tongkai Shi, Wei Feng 0005, Fanhua Shang, Liang Wan. [doi]
- Road Network Representation Learning with the Third Law of GeographyHaicang Zhou, Weiming Huang 0001, Yile Chen 0001, Tiantian He 0001, Gao Cong, Yew-Soon Ong. [doi]
- ConMe: Rethinking Evaluation of Compositional Reasoning for Modern VLMsIrene Huang, Wei Lin 0019, Muhammad Jehanzeb Mirza, Jacob A. Hansen, Sivan Doveh, Victor Butoi, Roei Herzig, Assaf Arbelle, Hilde Kuehne, Trevor Darrell, Chuang Gan, Aude Oliva, Rogério Feris, Leonid Karlinsky. [doi]
- ClavaDDPM: Multi-relational Data Synthesis with Cluster-guided Diffusion ModelsWei Pang, Masoumeh Shafieinejad, Lucy Liu, Stephanie Hazlewood, Xi He. [doi]
- LLM Circuit Analyses Are Consistent Across Training and ScaleCurt Tigges, Michael Hanna 0001, Qinan Yu, Stella Biderman. [doi]
- The Mamba in the Llama: Distilling and Accelerating Hybrid ModelsJunxiong Wang, Daniele Paliotta, Avner May, Alexander M. Rush, Tri Dao. [doi]
- The Value of Reward Lookahead in Reinforcement LearningNadav Merlis, Dorian Baudry, Vianney Perchet. [doi]
- Segmenting Watermarked Texts From Language ModelsXingchi Li 0002, Guanxun Li, Xianyang Zhang. [doi]
- Expanding Sparse Tuning for Low Memory UsageShufan Shen, Junshu Sun, Xiangyang Ji, Qingming Huang, Shuhui Wang. [doi]
- Risk-sensitive control as inference with Rényi divergenceKaito Ito, Kenji Kashima. [doi]
- MambaTree: Tree Topology is All You Need in State Space ModelYicheng Xiao, Lin Song 0002, Shaoli Huang, Jiangshan Wang, Siyu Song, Yixiao Ge, Xiu Li 0001, Ying Shan. [doi]
- Reimagining Mutual Information for Enhanced Defense against Data Leakage in Collaborative InferenceLin Duan, Jingwei Sun 0002, Jinyuan Jia, Yiran Chen 0001, Maria Gorlatova. [doi]
- AsyncDiff: Parallelizing Diffusion Models by Asynchronous DenoisingZigeng Chen, Xinyin Ma, Gongfan Fang, Zhenxiong Tan, Xinchao Wang. [doi]
- Orchid: Flexible and Data-Dependent Convolution for Sequence ModelingMahdi Karami, Ali Ghodsi 0001. [doi]
- LOVA3: Learning to Visual Question Answering, Asking and AssessmentHenry Hengyuan Zhao, Pan Zhou 0002, Difei Gao, Zechen Bai, Mike Zheng Shou. [doi]
- DetectRL: Benchmarking LLM-Generated Text Detection in Real-World ScenariosJunchao Wu, Runzhe Zhan, Derek F. Wong, Shu Yang, Xinyi Yang 0008, Yulin Yuan, Lidia S. Chao. [doi]
- Benign overfitting in leaky ReLU networks with moderate input dimensionKedar Karhadkar, Erin George, Michael Murray, Guido F. Montúfar, Deanna Needell. [doi]
- QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space ModelFei Xie, Weijia Zhang, Zhongdao Wang, Chao Ma. [doi]
- Certified Adversarial Robustness via Randomized α-Smoothing for Regression ModelsAref Miri Rekavandi, Farhad Farokhi, Olga Ohrimenko, Benjamin I. P. Rubinstein. [doi]
- Distributionally Robust Performative PredictionSongkai Xue, Yuekai Sun. [doi]
- Accelerating Augmentation Invariance PretrainingJinhong Lin, Cheng-En Wu, Yibing Wei, Pedro Morgado 0001. [doi]
- Fast and Memory-Efficient Video Diffusion Using Streamlined InferenceZheng Zhan 0001, Yushu Wu, Yifan Gong 0004, Zichong Meng, Zhenglun Kong, Changdi Yang, Geng Yuan, Pu Zhao 0001, Wei Niu 0002, Yanzhi Wang. [doi]
- KG-FIT: Knowledge Graph Fine-Tuning Upon Open-World KnowledgePengcheng Jiang, Lang Cao, Cao (Danica) Xiao, Parminder Bhatia, Jimeng Sun 0001, Jiawei Han 0001. [doi]
- Towards Understanding Extrapolation: a Causal LensLingjing Kong, Guangyi Chen 0002, Petar Stojanov, Haoxuan Li, Eric P. Xing, Kun Zhang 0001. [doi]
- AllClear: A Comprehensive Dataset and Benchmark for Cloud Removal in Satellite ImageryHangyu Zhou, Chia-Hsiang Kao, Cheng Perng Phoo, Utkarsh Mall, Bharath Hariharan, Kavita Bala. [doi]
- Graph Convolutions Enrich the Self-Attention in Transformers!Jeongwhan Choi 0002, Hyowon Wi, Jayoung Kim 0002, Yehjin Shin, Kookjin Lee, Nathaniel Trask, Noseong Park. [doi]
- Efficient Policy Evaluation Across Multiple Different Experimental DatasetsYonghan Jung, Alexis Bellot. [doi]
- DeSparsify: Adversarial Attack Against Token Sparsification MechanismsOryan Yehezkel, Alon Zolfi, Amit Baras, Yuval Elovici, Asaf Shabtai. [doi]
- SocraticLM: Exploring Socratic Personalized Teaching with Large Language ModelsJiayu Liu 0001, Zhenya Huang, Tong Xiao, Jing Sha, Jinze Wu, Qi Liu 0003, Shijin Wang 0001, Enhong Chen. [doi]
- LAVIB: A Large-scale Video Interpolation BenchmarkAlex Stergiou. [doi]
- Can Language Models Learn to Skip Steps?Tengxiao Liu, Qipeng Guo, Xiangkun Hu, Cheng Jiayang, Yue Zhang 0004, Xipeng Qiu, Zheng Zhang 0001. [doi]
- Optimal Algorithms for Learning Partitions with Faulty OraclesAdela Frances DePavia, Olga Medrano Martín del Campo, Erasmo Tani. [doi]
- GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement LearningJaewoo Lee, Sujin Yun, Taeyoung Yun, Jinkyoo Park. [doi]
- einspace: Searching for Neural Architectures from Fundamental OperationsLinus Ericsson, Miguel Espinosa, Chenhongyi Yang, Antreas Antoniou, Amos J. Storkey, Shay B. Cohen, Steven McDonagh, Elliot J. Crowley. [doi]
- Revisiting the Integration of Convolution and Attention for Vision BackboneLei Zhu, Xinjiang Wang, Wayne Zhang 0001, Rynson W. H. Lau. [doi]
- Large Language Model Unlearning via Embedding-Corrupted PromptsChris Yuhao Liu, Yaxuan Wang, Jeffrey Flanigan, Yang Liu. [doi]
- U-DiTs: Downsample Tokens in U-Shaped Diffusion TransformersYuchuan Tian, Zhijun Tu, Hanting Chen, Jie Hu 0021, Chao Xu 0006, Yunhe Wang 0001. [doi]
- Unveiling the Tapestry of Consistency in Large Vision-Language ModelsYuan Zhang 0020, Fei Xiao, Tao Huang 0020, Chun-Kai Fan, Hongyuan Dong, Jiawen Li, Jiacong Wang, Kuan Cheng, Shanghang Zhang, Haoyuan Guo. [doi]
- Hierarchical Object-Aware Dual-Level Contrastive Learning for Domain Generalized Stereo MatchingYikun Miao, Meiqing Wu, Siew Kei Lam, Changsheng Li, Thambipillai Srikanthan. [doi]
- DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware DiffusionWeicai Ye, Chenhao Ji, Zheng Chen 0016, Junyao Gao 0002, Xiaoshui Huang, Song-Hai Zhang, Wanli Ouyang, Tong He 0001, Cairong Zhao, Guofeng Zhang 0001. [doi]
- Gaussian Approximation and Multiplier Bootstrap for Polyak-Ruppert Averaged Linear Stochastic Approximation with Applications to TD LearningSergey Samsonov, Eric Moulines, Qi-Man Shao, Zhuo-Song Zhang, Alexey Naumov. [doi]
- The ALCHEmist: Automated Labeling 500x CHEaper than LLM Data AnnotatorsTzu-Heng Huang, Catherine Cao, Vaishnavi Bhargava, Frederic Sala. [doi]
- xMIL: Insightful Explanations for Multiple Instance Learning in HistopathologyJulius Hense, Mina Jamshidi Idaji, Oliver Eberle, Thomas Schnake, Jonas Dippel, Laure Ciernik, Oliver Buchstab, Andreas Mock, Frederick Klauschen, Klaus-Robert Müller. [doi]
- Small steps no more: Global convergence of stochastic gradient bandits for arbitrary learning ratesJincheng Mei, Bo Dai 0001, Alekh Agarwal, Sharan Vaswani, Anant Raj, Csaba Szepesvári, Dale Schuurmans. [doi]
- Vision-Language Navigation with Energy-Based PolicyRui Liu, Wenguan Wang, Yi Yang. [doi]
- The Prevalence of Neural Collapse in Neural Multivariate RegressionGeorge Andriopoulos, Zixuan Dong, Li Guo, Zifan Zhao, Keith W. Ross. [doi]
- RankUp: Boosting Semi-Supervised Regression with an Auxiliary Ranking ClassifierPin-Yen Huang, Szu-Wei Fu, Yu Tsao 0001. [doi]
- Policy AggregationParand A. Alamdari, Soroush Ebadian, Ariel D. Procaccia. [doi]
- Universality of AdaGrad Stepsizes for Stochastic Optimization: Inexact Oracle, Acceleration and Variance ReductionAnton Rodomanov, Xiaowen Jiang, Sebastian U. Stich. [doi]
- Drones Help Drones: A Collaborative Framework for Multi-Drone Object Trajectory Prediction and BeyondZhechao Wang, Peirui Cheng, Minxing Chen, Pengju Tian, Zhirui Wang, Xinming Li, Xue Yang 0005, Xian Sun. [doi]
- BEACON: Benchmark for Comprehensive RNA Tasks and Language ModelsYuchen Ren, Zhiyuan Chen, Lifeng Qiao, Hongtai Jing, Yuchen Cai, Sheng Xu, Peng Ye, Xinzhu Ma, Siqi Sun, Hongliang Yan, Dong Yuan, Wanli Ouyang, Xihui Liu. [doi]
- ResAD: A Simple Framework for Class Generalizable Anomaly DetectionXincheng Yao, Zixin Chen, Chao Gao, Guangtao Zhai, Chongyang Zhang. [doi]
- Iteratively Refined Behavior Regularization for Offline Reinforcement LearningYi Ma 0005, Jianye Hao, Xiaohan Hu, Yan Zheng 0002, Chenjun Xiao. [doi]
- MMSite: A Multi-modal Framework for the Identification of Active Sites in ProteinsSong Ouyang, Huiyu Cai, Yong Luo 0002, Kehua Su, Lefei Zhang, Bo Du 0001. [doi]
- WFCRL: A Multi-Agent Reinforcement Learning Benchmark for Wind Farm ControlClaire Bizon Monroc, Ana Busic, Donatien Dubuc, Jiamin Zhu. [doi]
- ShowMaker: Creating High-Fidelity 2D Human Video via Fine-Grained Diffusion ModelingQuanwei Yang, Jiazhi Guan, Kaisiyuan Wang, Lingyun Yu 0002, Wenqing Chu, Hang Zhou 0009, Zhiqiang Feng, Haocheng Feng, Errui Ding, Jingdong Wang 0001, Hongtao Xie. [doi]
- BetterBench: Assessing AI Benchmarks, Uncovering Issues, and Establishing Best PracticesAnka Reuel-Lamparth, Amelia F. Hardy, Chandler Smith, Max Lamparth, Malcolm Hardy, Mykel J. Kochenderfer. [doi]
- Amortized Eigendecomposition for Neural NetworksTianbo Li, Zekun Shi, Jiaxi Zhao, Min Lin. [doi]
- Gaussian Graph Network: Learning Efficient and Generalizable Gaussian Representations from Multi-view ImagesShengjun Zhang, Xin Fei, Fangfu Liu, Haixu Song, Yueqi Duan. [doi]
- BetterDepth: Plug-and-Play Diffusion Refiner for Zero-Shot Monocular Depth EstimationXiang Zhang, Bingxin Ke, Hayko Riemenschneider, Nando Metzger, Anton Obukhov, Markus Gross 0001, Konrad Schindler, Christopher Schroers. [doi]
- CHASE: Learning Convex Hull Adaptive Shift for Skeleton-based Multi-Entity Action RecognitionYuhang Wen 0001, Mengyuan Liu, Songtao Wu, Beichen Ding. [doi]
- Chat-Scene: Bridging 3D Scene and Large Language Models with Object IdentifiersHaifeng Huang, Yilun Chen, Zehan Wang 0001, Rongjie Huang, Runsen Xu, Tai Wang, Luping Liu, Xize Cheng, Yang Zhao, Jiangmiao Pang, Zhou Zhao 0001. [doi]
- Private Attribute Inference from Images with Vision-Language ModelsBatuhan Tömekçe, Mark Vero, Robin Staab, Martin T. Vechev. [doi]
- Avoiding Undesired Future with Minimal Cost in Non-Stationary EnvironmentsWen-Bo Du 0002, Tian Qin, Tian-Zuo Wang, Zhi-Hua Zhou. [doi]
- A Unifying Post-Processing Framework for Multi-Objective Learn-to-Defer ProblemsMohammad-Amin Charusaie, Samira Samadi. [doi]
- Communication-Efficient Federated Group Distributionally Robust OptimizationZhishuai Guo, Tianbao Yang. [doi]
- On the Comparison between Multi-modal and Single-modal Contrastive LearningWei Huang, Andi Han, Yongqiang Chen, Yuan Cao, Zhiqiang Xu, Taiji Suzuki. [doi]
- Overfitting Behaviour of Gaussian Kernel Ridgeless Regression: Varying Bandwidth or DimensionalityMarko Medvedev, Gal Vardi, Nati Srebro. [doi]
- Boosting Semi-Supervised Scene Text Recognition via Viewing and SummarizingYadong Qu, Yuxin Wang, Bangbang Zhou, Zixiao Wang, Hongtao Xie, Yongdong Zhang 0001. [doi]
- Interpretable Concept Bottlenecks to Align Reinforcement Learning AgentsQuentin Delfosse, Sebastian Sztwiertnia, Mark Rothermel, Wolfgang Stammer, Kristian Kersting. [doi]
- Parallelizing Linear Transformers with the Delta Rule over Sequence LengthSonglin Yang, Bailin Wang, Yu Zhang, Yikang Shen, Yoon Kim. [doi]
- Universal Rates for Active LearningSteve Hanneke, Amin Karbasi, Shay Moran, Grigoris Velegkas. [doi]
- Continual Counting with Gradual Privacy ExpirationJoel Daniel Andersson, Monika Henzinger, Rasmus Pagh, Teresa Anna Steiner, Jalaj Upadhyay. [doi]
- Assembly Fuzzy Representation on Hypergraph for Open-Set 3D Object RetrievalYang Xu, Yifan Feng, Jun Zhang, Jun-Hai Yong, Yue Gao. [doi]
- DeTikZify: Synthesizing Graphics Programs for Scientific Figures and Sketches with TikZJonas Belouadi, Simone Paolo Ponzetto, Steffen Eger. [doi]
- Federated Graph Learning for Cross-Domain RecommendationZiqi Yang, Zhaopeng Peng, Zihui Wang, Jianzhong Qi 0001, Chaochao Chen 0001, Weike Pan, Chenglu Wen, Cheng Wang 0003, Xiaoliang Fan. [doi]
- Improving Environment Novelty Quantification for Effective Unsupervised Environment DesignJayden Teoh Jing Teoh, Wenjun Li, Pradeep Varakantham. [doi]
- CODE: Contrasting Self-generated Description to Combat Hallucination in Large Multi-modal ModelsJunho Kim, Hyunjun Kim, Yeonju Kim, Yong Man Ro. [doi]
- Towards Combating Frequency Simplicity-biased Learning for Domain GeneralizationXilin He, Jingyu Hu, Qinliang Lin, Cheng Luo, Weicheng Xie 0001, Siyang Song, Muhammad Haris Khan, LinLin Shen. [doi]
- Algebraic Positional EncodingsKonstantinos Kogkalidis, Jean-Philippe Bernardy, Vikas Garg 0001. [doi]
- VideoGUI: A Benchmark for GUI Automation from Instructional VideosKevin Qinghong Lin, Linjie Li, Difei Gao, Qinchen Wu, Mingyi Yan, Zhengyuan Yang, Lijuan Wang, Mike Zheng Shou. [doi]
- Learning Commonality, Divergence and Variety for Unsupervised Visible-Infrared Person Re-identificationJiangming Shi, Xiangbo Yin, Yachao Zhang 0001, Zhizhong Zhang 0001, Yuan Xie 0001, Yanyun Qu. [doi]
- Combining Observational Data and Language for Species Range EstimationMax Hamilton, Christian Lange 0004, Elijah Cole, Alexander Shepard, Samuel Heinrich, Oisin Mac Aodha, Grant Van Horn, Subhransu Maji. [doi]
- Erasing Undesirable Concepts in Diffusion Models with Adversarial PreservationAnh Bui, Tung Long Vuong, Khanh Doan, Trung Le, Paul Montague, Tamas Abraham, Dinh Q. Phung. [doi]
- Self-Play Fine-tuning of Diffusion Models for Text-to-image GenerationHuizhuo Yuan, Zixiang Chen, Kaixuan Ji, Quanquan Gu. [doi]
- Kernel-Based Function Approximation for Average Reward Reinforcement Learning: An Optimist No-Regret AlgorithmSattar Vakili, Julia Olkhovskaya. [doi]
- The Power of Extrapolation in Federated LearningHanmin Li, Kirill Acharya, Peter Richtárik. [doi]
- NoMAD-Attention: Efficient LLM Inference on CPUs Through Multiply-add-free AttentionTianyi Zhang 0011, Jonah Yi, Bowen Yao, Zhaozhuo Xu, Anshumali Shrivastava. [doi]
- Unlock the Intermittent Control Ability of Model Free Reinforcement LearningJiashun Liu, Jianye Hao, Xiaotian Hao, Yi Ma 0005, Yan Zheng 0002, Yujing Hu, Tangjie Lv. [doi]
- DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular VideosWen-Hsuan Chu, Lei Ke, Katerina Fragkiadaki. [doi]
- Data-Efficient Learning with Neural ProgramsAlaia Solko-Breslin, Seewon Choi, Ziyang Li, Neelay Velingker, Rajeev Alur, Mayur Naik, Eric Wong 0001. [doi]
- Dual-Diffusion for Binocular 3D Human Pose EstimationXiaoyue Wan, Zhuo Chen, Bingzhi Duan, Xu Zhao. [doi]
- Large Scale Transfer Learning for Tabular Data via Language ModelingJosh Gardner 0001, Juan C. Perdomo, Ludwig Schmidt. [doi]
- Learning Superconductivity from Ordered and Disordered Material StructuresPin Chen, Luoxuan Peng, Rui Jiao, Qing Mo, Zhen Wang, Wenbing Huang 0001, Yang Liu 0005, Yutong Lu. [doi]
- Multilinear Mixture of Experts: Scalable Expert Specialization through FactorizationJames Oldfield 0001, Markos Georgopoulos, Grigorios Chrysos 0002, Christos Tzelepis, Yannis Panagakis, Mihalis Nicolaou, Jiankang deng, Ioannis Patras. [doi]
- ReplaceAnything3D: Text-Guided Object Replacement in 3D Scenes with Compositional Scene RepresentationsEdward Bartrum, Thu Nguyen-Phuoc, Christopher Xie, Zhengqin Li, Numair Khan, Armen Avetisyan, Douglas Lanman, Lei Xiao. [doi]
- Online Control in Population DynamicsNoah Golowich, Elad Hazan, Zhou Lu, Dhruv Rohatgi, Y. Jennifer Sun. [doi]
- LiteVAE: Lightweight and Efficient Variational Autoencoders for Latent Diffusion ModelsSeyedmorteza Sadat, Jakob Buhmann, Derek Bradley, Otmar Hilliges, Romann M. Weber. [doi]
- Masked Pre-training Enables Universal Zero-shot DenoiserXiaoxiao Ma 0006, Zhixiang Wei, Yi Jin 0002, Pengyang Ling, Tianle Liu, Ben Wang, Junkang Dai, Huaian Chen. [doi]
- Toward Self-Improvement of LLMs via Imagination, Searching, and CriticizingYe Tian, Baolin Peng, Linfeng Song, Lifeng Jin, Dian Yu 0001, Lei Han, Haitao Mi, Dong Yu 0001. [doi]
- Enhancing Large Vision Language Models with Self-Training on Image ComprehensionYihe Deng, Pan Lu, Fan Yin, Ziniu Hu, Sheng Shen, Quanquan Gu, James Y. Zou, Kai-Wei Chang, Wei Wang 0010. [doi]
- DiTFastAttn: Attention Compression for Diffusion Transformer ModelsZhihang Yuan, Hanling Zhang, Lu Pu, Xuefei Ning, Linfeng Zhang, Tianchen Zhao, Shengen Yan, Guohao Dai, Yu Wang 0002. [doi]
- Replay-and-Forget-Free Graph Class-Incremental Learning: A Task Profiling and Prompting ApproachChaoxi Niu, Guansong Pang, Ling Chen 0006, Bing Liu. [doi]
- Causal Dependence PlotsJoshua R. Loftus, Lucius Bynum, Sakina Hansen. [doi]
- realSEUDO for real-time calcium imaging analysisIuliia Dmitrieva, Sergey Babkin, Adam S. Charles. [doi]
- Multi-LLM Debate: Framework, Principals, and InterventionsAndrew Estornell, Yang Liu 0018. [doi]
- Slight Corruption in Pre-training Data Makes Better Diffusion ModelsHao Chen, Yujin Han, Diganta Misra, Xiang Li, Kai Hu 0010, Difan Zou, Masashi Sugiyama, Jindong Wang, Bhiksha Raj. [doi]
- Online Estimation via Offline Estimation: An Information-Theoretic FrameworkDylan J. Foster, Yanjun Han, Jian Qian, Alexander Rakhlin. [doi]
- Extracting Training Data from Molecular Pre-trained ModelsRenhong Huang, Jiarong Xu, Zhiming Yang, Xiang Si, Xin Jiang, Hanyang Yuan, Chunping Wang, Yang Yang 0009. [doi]
- Lorentz-Equivariant Geometric Algebra Transformers for High-Energy PhysicsJonas Spinner, Victor Bresó, Pim de Haan, Tilman Plehn, Jesse Thaler, Johann Brehmer. [doi]
- GameTraversalBenchmark: Evaluating Planning Abilities Of Large Language Models Through Traversing 2D Game MapsMuhammad Umair Nasir, Steven James 0001, Julian Togelius. [doi]
- Invertible Consistency Distillation for Text-Guided Image Editing in Around 7 StepsNikita Starodubcev, Mikhail Khoroshikh, Artem Babenko, Dmitry Baranchuk. [doi]
- Exploratory Retrieval-Augmented Planning For Continual Embodied Instruction FollowingMinjong Yoo, Jinwoo Jang, Wei-Jin Park, Honguk Woo. [doi]
- 4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion ModelsHeng Yu, Chaoyang Wang, Peiye Zhuang, Willi Menapace, Aliaksandr Siarohin, Junli Cao, László A. Jeni, Sergey Tulyakov, Hsin-Ying Lee 0001. [doi]
- SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal FusionMing Dai, Lingfeng Yang, Yihao Xu, Zhenhua Feng 0001, Wankou Yang. [doi]
- Unchosen Experts Can Contribute Too: Unleashing MoE Models' Power by Self-ContrastChufan Shi, Cheng Yang 0002, Xinyu Zhu, Jiahao Wang, Taiqiang Wu, Siheng Li, Deng Cai 0002, Yujiu Yang, Yu Meng. [doi]
- MambaSCI: Efficient Mamba-UNet for Quad-Bayer Patterned Video Snapshot Compressive ImagingZhenghao Pan, Haijin Zeng, Jiezhang Cao, Yongyong Chen, Kai Zhang 0008, Yong Xu. [doi]
- Learning Identifiable Factorized Causal Representations of Cellular ResponsesHaiyi Mao, Romain Lopez, Kai Liu, Jan-Christian Huetter, David Richmond, Panayiotis V. Benos, Lin Qiu. [doi]
- Diffusion Spectral Representation for Reinforcement LearningDmitry Shribak, Chen-Xiao Gao, Yitong Li, Chenjun Xiao, Bo Dai 0001. [doi]
- DMC-VB: A Benchmark for Representation Learning for Control with Visual DistractorsJoseph Ortiz, Antoine Dedieu, Wolfgang Lehrach, J. Swaroop Guntupalli, Carter Wendelken, Ahmad Humayun, Sivaramakrishnan Swaminathan, Guangyao Zhou, Miguel Lázaro-Gredilla, Kevin P. Murphy. [doi]
- FairJob: A Real-World Dataset for Fairness in Online SystemsMariia Vladimirova, Federico Pavone, Eustache Diemert. [doi]
- Understanding and Minimising Outlier Features in Transformer TrainingBobby He, Lorenzo Noci, Daniele Paliotta, Imanol Schlag, Thomas Hofmann. [doi]
- SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond WordsJunyi Ao, Yuancheng Wang, Xiaohai Tian, Dekun Chen, Jun Zhang 0066, Lu Lu 0015, Yuxuan Wang 0002, Haizhou Li 0001, Zhizheng Wu 0001. [doi]
- Improved Generation of Adversarial Examples Against Safety-aligned LLMsQizhang Li, Yiwen Guo, Wangmeng Zuo, Hao Chen 0003. [doi]
- NeoRL: Efficient Exploration for Nonepisodic RLBhavya Sukhija, Lenart Treven, Florian Dörfler, Stelian Coros, Andreas Krause 0001. [doi]
- Improving Adversarial Robust Fairness via Anti-Bias Soft Label DistillationShiji Zhao, Ranjie Duan, Xizhe Wang, Xingxing Wei. [doi]
- On the Adversarial Robustness of Benjamini HochbergLouis Chen, Roberto Szechtman, Matan Seri. [doi]
- BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and LanguagesJunho Myung, Nayeon Lee, Yi Zhou 0019, Jiho Jin, Rifki Afina Putri, Dimosthenis Antypas, Hsuvas Borkakoty, Eunsu Kim, Carla Pérez-Almendros, Abinew Ali Ayele, Víctor Gutiérrez-Basulto, Yazmín Ibáñez García, Hwaran Lee, Shamsuddeen Hassan Muhammad, Ki-Woong Park, Anar Rzayev, Nina White, Seid Muhie Yimam, Mohammad Taher Pilehvar, Nedjma Ousidhoum, José Camacho-Collados, Alice Oh. [doi]
- Reconstruct and Match: Out-of-Distribution Robustness via Topological HomogeneityChaoqi Chen, Luyao Tang, Hui Huang. [doi]
- Probabilistic Graph Rewiring via Virtual NodesChendi Qian, Andrei Manolache, Christopher Morris 0001, Mathias Niepert. [doi]
- What Variables Affect Out-of-Distribution Generalization in Pretrained Models?Md Yousuf Harun, Kyungbok Lee, Gianmarco J. Gallardo, Giri Krishnan, Christopher Kanan. [doi]
- On the Worst Prompt Performance of Large Language ModelsBowen Cao, Deng Cai 0002, Zhisong Zhang, Yuexian Zou, Wai Lam. [doi]
- GeoLRM: Geometry-Aware Large Reconstruction Model for High-Quality 3D Gaussian GenerationChubin Zhang, Hongliang Song, Yi Wei 0003, Chen Yu, Jiwen Lu, Yansong Tang. [doi]
- Mixtures of Experts for Audio-Visual LearningYing Cheng 0005, Yang Li, Junjie He, Rui Feng. [doi]
- Towards a Theoretical Understanding of the 'Reversal Curse' via Training DynamicsHanlin Zhu, Baihe Huang, Shaolun Zhang, Michael I. Jordan, Jiantao Jiao, Yuandong Tian, Stuart J. Russell. [doi]
- MemVLT: Vision-Language Tracking with Adaptive Memory-based PromptsXiaokun Feng, Xuchen Li, Shiyu Hu, Dailing Zhang, Meiqi Wu, Jing Zhang, Xiaotang Chen, Kaiqi Huang. [doi]
- Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal LearningAlex Jinpeng Wang, Linjie Li, Yiqi Lin, Min Li, Lijuan Wang, Mike Zheng Shou. [doi]
- SPO: Sequential Monte Carlo Policy OptimisationMatthew Macfarlane, Edan Toledo, Donal Byrne, Paul Duckworth, Alexandre Laterre. [doi]
- 3D Focusing-and-Matching Network for Multi-Instance Point Cloud RegistrationLiyuan Zhang, Le Hui, Qi Liu 0054, Bo Li, Yuchao Dai. [doi]
- On the Complexity of Identification in Linear Structural Causal ModelsJulian Dörfler, Benito van der Zander, Markus Bläser, Maciej Liskiewicz. [doi]
- Detecting Brittle Decisions for Free: Leveraging Margin Consistency in Deep Robust ClassifiersJonas Ngnawé, Sabyasachi Sahoo, Yann Pequignot, Frédéric Precioso, Christian Gagné 0001. [doi]
- Classifier Clustering and Feature Alignment for Federated Learning under Distributed Concept DriftJunbao Chen, Jingfeng Xue, Yong Wang 0010, Zhenyan Liu, Lu Huang. [doi]
- Latent Paraphrasing: Perturbation on Layers Improves Knowledge Injection in Language ModelsMinki Kang, Sung Ju Hwang, Gibbeum Lee, Jaewoong Cho. [doi]
- USCILab3D: A Large-scale, Long-term, Semantically Annotated Outdoor DatasetKiran Lekkala, Henghui Bao, Peixu Cai, Wei Lim, Chen Liu, Laurent Itti. [doi]
- Adaptive Depth Networks with Skippable Sub-PathsWoochul Kang, Hyungseop Lee. [doi]
- Data subsampling for Poisson regression with pth-root-linkHan Cheng Lie, Alexander Munteanu. [doi]
- SEA: State-Exchange Attention for High-Fidelity Physics Based TransformersParsa Esmati, Amirhossein Dadashzadeh, Vahid Ardakani, Nicolas Larrosa, Nicolò Grilli. [doi]
- Mimicking To Dominate: Imitation Learning Strategies for Success in Multiagent GamesThe Viet Bui, Tien Mai, Thanh Hong Nguyen. [doi]
- Private Algorithms for Stochastic Saddle Points and Variational Inequalities: Beyond Euclidean GeometryRaef Bassily, Cristóbal Guzmán, Michael Menart. [doi]
- SLTrain: a sparse plus low rank approach for parameter and memory efficient pretrainingAndi Han, Jiaxiang Li, Wei Huang, Mingyi Hong 0001, Akiko Takeda, Pratik Kumar Jawanpuria, Bamdev Mishra. [doi]
- Stress-Testing Long-Context Language Models with Lifelong ICL and Task HaystackXiaoyue Xu, Qinyuan Ye, Xiang Ren 0001. [doi]
- Discovering plasticity rules that organize and maintain neural circuitsDavid Bell, Alison Duffy, Adrienne Fairhall. [doi]
- Are nuclear masks all you need for improved out-of-domain generalisation? A closer look at cancer classification in histopathologyDhananjay Tomar, Alexander Binder, Andreas Kleppe. [doi]
- COLD: Causal reasOning in cLosed Daily activitiesAbhinav Joshi, Areeb Ahmad, Ashutosh Modi. [doi]
- Data curation via joint example selection further accelerates multimodal learningTalfan Evans, Nikhil Parthasarathy, Hamza Merzic, Olivier J. Hénaff. [doi]
- A Nearly Optimal and Low-Switching Algorithm for Reinforcement Learning with General Function ApproximationHeyang Zhao, Jiafan He, Quanquan Gu. [doi]
- Curvature Clues: Decoding Deep Learning Privacy with Input Loss CurvatureDeepak Ravikumar, Efstathia Soufleri, Kaushik Roy 0001. [doi]
- Sequoia: Scalable and Robust Speculative DecodingZhuoming Chen, Avner May, Ruslan Svirschevski, Yuhsun Huang, Max Ryabinin, Zhihao Jia, Beidi Chen. [doi]
- Fine-Tuning is Fine, if CalibratedZheda Mai, Arpita Chowdhury, Ping Zhang 0016, Cheng-Hao Tu 0001, Hong-You Chen, Vardaan Pahuja, Tanya Y. Berger-Wolf, Song Gao 0001, Charles V. Stewart, Yu Su 0001, Wei-Lun Chao. [doi]
- Localizing Memorization in SSL Vision EncodersWenhao Wang, Adam Dziedzic, Michael Backes 0001, Franziska Boenisch. [doi]
- Compositional Generalization Across Distributional Shifts with Sparse Tree OperationsPaul Soulos, Henry Conklin, Mattia Opper, Paul Smolensky, Jianfeng Gao 0001, Roland Fernandez. [doi]
- Boosting the Potential of Large Language Models with an Intelligent Information AssistantYujia Zhou 0002, Zheng Liu 0011, Zhicheng Dou. [doi]
- Efficient Multi-task Reinforcement Learning with Cross-Task Policy GuidanceJinmin He, Kai Li, Yifan Zang 0001, Haobo Fu, Qiang Fu 0016, Junliang Xing, Jian Cheng. [doi]
- Learning Mixtures of Unknown Causal InterventionsAbhinav Kumar, Kirankumar Shiragur, Caroline Uhler. [doi]
- Learning symmetries via weight-sharing with doubly stochastic tensorsPutri A. van der Linden, Alejandro García-Castellanos, Sharvaree P. Vadgama, Thijs P. Kuipers, Erik J. Bekkers. [doi]
- Samba: Severity-aware Recurrent Modeling for Cross-domain Medical Image GradingQi Bi, Jingjun Yi, Hao Zheng 0008, Wei Ji 0011, Haolan Zhan, Yawen Huang, Yuexiang Li, Yefeng Zheng 0001. [doi]
- Benchmarking Out-of-Distribution Generalization Capabilities of DNN-based Encoding Models for the Ventral Visual CortexSpandan Madan, Will Xiao, Mingran Cao, Hanspeter Pfister, Margaret S. Livingstone, Gabriel Kreiman. [doi]
- Rethinking Imbalance in Image Super-Resolution for Efficient InferenceWei Yu 0004, Bowen Yang, Qinglin Liu, Jianing Li 0001, Shengping Zhang, Xiangyang Ji. [doi]
- Membership Inference Attacks against Fine-tuned Large Language Models via Self-prompt CalibrationWenjie Fu 0005, Huandong Wang, Chen Gao 0001, Guanghua Liu, Yong Li 0008, Tao Jiang. [doi]
- Reversing the Forget-Retain Objectives: An Efficient LLM Unlearning Framework from Logit DifferenceJiabao Ji, Yujian Liu, Yang Zhang 0001, Gaowen Liu, Ramana Kompella, Sijia Liu 0001, Shiyu Chang. [doi]
- Ferrari: Federated Feature Unlearning via Optimizing Feature SensitivityHanlin Gu, WinKent Ong, Chee Seng Chan, Lixin Fan. [doi]
- Latent Intrinsics Emerge from Training to RelightXiao Zhang, William Gao, Seemandhar Jain, Michael Maire, David A. Forsyth, Anand Bhattad. [doi]
- Provably Transformers Harness Multi-Concept Word Semantics for Efficient In-Context LearningDake Bu, Wei Huang, Andi Han, Atsushi Nitanda, Taiji Suzuki, Qingfu Zhang 0001, Hau-San Wong. [doi]
- Double-Ended Synthesis Planning with Goal-Constrained Bidirectional SearchKevin Yu, Jihye Roh, Ziang Li, Wenhao Gao 0001, Runzhong Wang, Connor W. Coley. [doi]
- GenRL: Multimodal-foundation world models for generalization in embodied agentsPietro Mazzaglia, Tim Verbelen, Bart Dhoedt, Aaron C. Courville, Sai Rajeswar Mudumba. [doi]
- Textual Training for the Hassle-Free Removal of Unwanted Visual Data: Case Studies on OOD and Hateful Image DetectionSaehyung Lee, Jisoo Mok, Sangha Park, Yongho Shin, Dahuin Jung, Sungroh Yoon. [doi]
- DEX: Data Channel Extension for Efficient CNN Inference on Tiny AI AcceleratorsTaesik Gong, Fahim Kawsar, Chulhong Min. [doi]
- Towards Dynamic Message Passing on GraphsJunshu Sun, Chenxue Yang, Xiangyang Ji, Qingming Huang, Shuhui Wang. [doi]
- Divide-and-Conquer Predictive Coding: a structured Bayesian inference algorithmEli Sennesh, Hao Wu, Tommaso Salvatori. [doi]
- Putting Gale & Shapley to Work: Guaranteeing Stability Through LearningHadi Hosseini, Sanjukta Roy 0001, Duohan Zhang. [doi]
- Bridging semantics and pragmatics in information-theoretic emergent communicationEleonora Gualdoni, Mycal Tucker, Roger Levy, Noga Zaslavsky. [doi]
- Towards a theory of how the structure of language is acquired by deep neural networksFrancesco Cagnetta, Matthieu Wyart. [doi]
- Data Distribution ValuationXinyi Xu, Shuaiqi Wang, Chuan-Sheng Foo, Bryan Kian Hsiang Low, Giulia Fanti. [doi]
- Suppress Content Shift: Better Diffusion Features via Off-the-Shelf Generation TechniquesBenyuan Meng, Qianqian Xu, Zitai Wang, Zhiyong Yang 0001, Xiaochun Cao, Qingming Huang. [doi]
- Learning diverse causally emergent representations from time series dataDavid McSharry, Christos Kaplanis, Fernando Rosas, Pedro A. M. Mediano. [doi]
- Crafting Interpretable Embeddings for Language Neuroscience by Asking LLMs QuestionsVinamra Benara, Chandan Singh, John X. Morris, Richard Antonello, Ion Stoica, Alexander Huth, Jianfeng Gao 0001. [doi]
- A Tractable Inference Perspective of Offline RLXuejie Liu, Anji Liu, Guy Van den Broeck, Yitao Liang. [doi]
- Untrained Neural Nets for Snapshot Compressive Imaging: Theory and AlgorithmsMengyu Zhao, Xi Chen, Xin Yuan, Shirin Jalali. [doi]
- Nonparametric Evaluation of Noisy ICA SolutionsSyamantak Kumar, Derek Bean, Peter J. Bickel, Purnamrita Sarkar. [doi]
- A Textbook Remedy for Domain Shifts: Knowledge Priors for Medical Image AnalysisYue Yang 0006, Mona Gandhi, Yufei Wang, Yifan Wu, Michael S. Yao, Chris Callison-Burch, James C. Gee, Mark Yatskar. [doi]
- Active Learning for Derivative-Based Global Sensitivity Analysis with Gaussian ProcessesSyrine Belakaria, Ben Letham, Jana Doppa, Barbara Engelhardt, Stefano Ermon, Eytan Bakshy. [doi]
- Learning Multimodal Behaviors from Scratch with Diffusion Policy GradientSteven Li, Rickmer Krohn, Tao Chen 0046, Anurag Ajay, Pulkit Agrawal 0001, Georgia Chalvatzaki. [doi]
- PAC-Bayes-Chernoff bounds for unbounded lossesIoar Casado, Luis A. Ortega Andrés, Aritz Pérez, Andrés R. Masegosa. [doi]
- Group Robust Preference Optimization in Reward-free RLHFShyam Sundhar Ramesh, Yifan Hu, Iason Chaimalas, Viraj Mehta, Pier Giuseppe Sessa, Haitham Bou-Ammar, Ilija Bogunovic. [doi]
- Animal-Bench: Benchmarking Multimodal Video Models for Animal-centric Video UnderstandingYinuo Jing, Ruxu Zhang, Kongming Liang, Yongxiang Li, Zhongjiang He, Zhanyu Ma, Jun Guo 0002. [doi]
- Adaptive Variance Reduction for Stochastic Optimization under Weaker AssumptionsWei Jiang, Sifan Yang, Yibo Wang 0005, Lijun Zhang 0005. [doi]
- Measuring Mutual Policy Divergence for Multi-Agent Sequential ExplorationHaowen Dou, Lujuan Dang, Zhirong Luan, Badong Chen. [doi]
- Learning from Noisy Labels via Conditional Distributionally Robust OptimizationHui Guo, Grace Yi, Boyu Wang. [doi]
- Free Lunch in Pathology Foundation Model: Task-specific Model Adaptation with Concept-Guided Feature EnhancementYanyan Huang, Weiqin Zhao, Yihang Chen, Yu Fu, Lequan Yu. [doi]
- SAM-Guided Masked Token Prediction for 3D Scene UnderstandingZhimin Chen, Liang Yang, Yingwei Li, Longlong Jing, Bing Li 0008. [doi]
- Task-recency bias strikes back: Adapting covariances in Exemplar-Free Class Incremental LearningGrzegorz Rypesc, Sebastian Cygert, Tomasz Trzcinski, Bartlomiej Twardowski. [doi]
- Time-Varying LoRA: Towards Effective Cross-Domain Fine-Tuning of Diffusion ModelsZhan Zhuang, Yulong Zhang, Xuehao Wang, Jiangang Lu, Ying Wei, Yu Zhang. [doi]
- GDeR: Safeguarding Efficiency, Balancing, and Robustness via Prototypical Graph PruningGuibin Zhang, Haonan Dong, Yuchen Zhang, Zhixun Li, Dingshuo Chen, Kai Wang 0036, Tianlong Chen, Yuxuan Liang, Dawei Cheng, Kun Wang. [doi]
- Contextual Active Model SelectionXuefeng Liu, Fangfang Xia, Rick Stevens, Yuxin Chen 0001. [doi]
- Scalable and Effective Arithmetic Tree Generation for Adder and Multiplier DesignsYao Lai, Jinxin Liu, David Z. Pan, Ping Luo. [doi]
- Text2CAD: Generating Sequential CAD Designs from Beginner-to-Expert Level Text PromptsMohammad Sadil Khan, Sankalp Sinha, Talha Uddin Sheikh, Didier Stricker, Sk Aziz Ali, Muhammad Zeshan Afzal. [doi]
- Mixture of In-Context Experts Enhance LLMs' Long Context AwarenessHongzhan Lin 0002, Ang Lv, Yuhan Chen 0001, Chen Zhu 0003, Yang Song 0021, Hengshu Zhu, Rui Yan 0001. [doi]
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda DiscrepancyCameron Allen, Aaron Kirtland, Ruo Yu Tao, Sam Lobel, Daniel Scott, Nicholas Petrocelli, Omer Gottesman, Ronald Parr, Michael L. Littman, George Konidaris 0001. [doi]
- Unlearnable 3D Point Clouds: Class-wise Transformation Is All You NeedXianlong Wang 0001, Minghui Li, Wei Liu, Hangtao Zhang, Shengshan Hu, Yechao Zhang, Ziqi Zhou 0001, Hai Jin 0001. [doi]
- Online Convex Optimisation: The Optimal Switching Regret for all Segmentations SimultaneouslyStephen Pasteris, Chris Hicks, Vasilios Mavroudis, Mark Herbster. [doi]
- No-Regret Learning for Fair Multi-Agent Social Welfare OptimizationMengxiao Zhang, Ramiro Deo-Campo Vuong, Haipeng Luo. [doi]
- Learning with Fitzpatrick LossesSeta Rakotomandimby, Jean-Philippe Chancelier, Michel De Lara, Mathieu Blondel. [doi]
- Slot State Space ModelsJindong Jiang, Fei Deng 0001, Gautam Singh, Minseung Lee, Sungjin Ahn. [doi]
- A Unified Principle of Pessimism for Offline Reinforcement Learning under Model MismatchYue Wang 0068, Zhongchang Sun, Shaofeng Zou. [doi]
- AMAGO-2: Breaking the Multi-Task Barrier in Meta-Reinforcement Learning with TransformersJake Grigsby, Justin Sasek, Samyak Parajuli, Daniel Adebi, Amy Zhang, Yuke Zhu. [doi]
- An eye for an ear: zero-shot audio description leveraging an image captioner with audio-visual token distribution matchingHugo Malard, Michel Olvera, Stéphane Lathuilière, Slim Essid. [doi]
- DOGS: Distributed-Oriented Gaussian Splatting for Large-Scale 3D Reconstruction Via Gaussian ConsensusYu Chen, Gim Hee Lee. [doi]
- Re-assembling the past: The RePAIR dataset and benchmark for real world 2D and 3D puzzle solvingTheodore Tsesmelis, Luca Palmieri 0002, Marina Khoroshiltseva, Adeela Islam, Gur Elkin, Ofir Itzhak Shahar, Gianluca Scarpellini, Stefano Fiorini, Yaniv Ohayon, Nadav Alali, Sinem Aslan, Pietro Morerio, Sebastiano Vascon, Elena Gravina, Maria Cristina Napolitano, Giuseppe Scarpati, Gabriel Zuchtriegel, Alexandra Spühler, Michel E. Fuchs, Stuart James, Ohad Ben-Shahar, Marcello Pelillo, Alessio Del Bue. [doi]
- Cryptographic Hardness of Score EstimationMin Jae Song. [doi]
- Random Cycle Coding: Lossless Compression of Cluster Assignments via Bits-Back CodingDaniel Severo 0001, Ashish Khisti, Alireza Makhzani. [doi]
- Efficient Discrepancy Testing for Learning with Distribution ShiftGautam Chandrasekaran, Adam R. Klivans, Vasilis Kontonis, Konstantinos Stavropoulos, Arsen Vasilyan. [doi]
- TGB 2.0: A Benchmark for Learning on Temporal Knowledge Graphs and Heterogeneous GraphsJulia Gastinger, Shenyang Huang, Michael Galkin, Erfan Loghmani, Ali Parviz, Farimah Poursafaei, Jacob Danovitch, Emanuele Rossi, Ioannis Koutis, Heiner Stuckenschmidt, Reihaneh Rabbany, Guillaume Rabusseau. [doi]
- Learning-to-Cache: Accelerating Diffusion Transformer via Layer CachingXinyin Ma, Gongfan Fang, Michael Bi Mi, Xinchao Wang. [doi]
- Spectral Editing of Activations for Large Language Model AlignmentYifu Qiu, Zheng Zhao 0005, Yftah Ziser, Anna Korhonen, Edoardo Maria Ponti, Shay B. Cohen. [doi]
- CLAVE: An Adaptive Framework for Evaluating Values of LLM Generated ResponsesJing Yao, Xiaoyuan Yi, Xing Xie. [doi]
- Fine-Grained Dynamic Framework for Bias-Variance Joint Optimization on Data Missing Not at RandomMingming Ha, Taoxuewen, Wenfang Lin, Qiongxu Ma, Wujiang Xu, Linxun Chen. [doi]
- Fixed Confidence Best Arm Identification in the Bayesian SettingKyoungseok Jang, Junpei Komiyama, Kazutoshi Yamazaki. [doi]
- AFBench: A Large-scale Benchmark for Airfoil DesignJian Liu, Jianyu Wu, Hairun Xie, Guoqing Zhang, Jing Wang, Wei Liu 0123, Wanli Ouyang, Junjun Jiang, Xianming Liu, Shixiang Tang, Miao Zhang. [doi]
- Multiclass Transductive Online LearningSteve Hanneke, Vinod Raman, Amirreza Shaeiri, Unique Subedi. [doi]
- The Star Geometry of Critic-Based Regularizer LearningOscar Leong, Eliza O'Reilly, Yong Sheng Soh. [doi]
- Divide-and-Conquer Meets Consensus: Unleashing the Power of Functions in Code GenerationJingchang Chen, Hongxuan Tang, Zheng Chu, Qianglong Chen, Zekun Wang, Ming Liu 0004, Bing Qin 0001. [doi]
- Sketchy Moment Matching: Toward Fast and Provable Data Selection for FinetuningYijun Dong, Viet-Hoang Phan, Xiang Pan, Qi Lei. [doi]
- Human-Aware Vision-and-Language Navigation: Bridging Simulation to Reality with Dynamic Human InteractionsHeng Li, Minghan Li, Zhi-Qi Cheng, Yifei Dong, Yuxuan Zhou 0004, Jun-Yan He, Qi Dai 0001, Teruko Mitamura, Alexander G. Hauptmann. [doi]
- DiGRAF: Diffeomorphic Graph-Adaptive Activation FunctionKrishna Sri Ipsit Mantri, Xinzhi Wang, Carola-Bibiane Schönlieb, Bruno Ribeiro 0001, Beatrice Bevilacqua, Moshe Eliasof. [doi]
- Energy-Based Modelling for Discrete and Mixed Data via Heat Equations on Structured SpacesTobias Schröder, Zijing Ou, Yingzhen Li, Andrew B. Duncan. [doi]
- Knowledge Graph Completion by Intermediate Variables RegularizationChangyi Xiao, Yixin Cao 0002. [doi]
- Solving Zero-Sum Markov Games with Continuous State via Spectral Dynamic EmbeddingChenhao Zhou, Zebang Shen, Zhang Chao, Hanbin Zhao, Hui Qian 0001. [doi]
- COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video EditingJiangshan Wang, Yue Ma, Jiayi Guo, Yicheng Xiao, Gao Huang 0001, Xiu Li 0001. [doi]
- Physics-Informed Regularization for Domain-Agnostic Dynamical System ModelingZijie Huang 0002, Wanjia Zhao, Jingdong Gao, Ziniu Hu, Xiao Luo 0001, Yadi Cao, Yuanzhou Chen, Yizhou Sun, Wei Wang 0010. [doi]
- SelectIT: Selective Instruction Tuning for LLMs via Uncertainty-Aware Self-ReflectionLiangxin Liu, Xuebo Liu 0002, Derek F. Wong, Dongfang Li 0002, Ziyi Wang, Baotian Hu, Min Zhang 0005. [doi]
- Adam with model exponential moving average is effective for nonconvex optimizationKwangjun Ahn, Ashok Cutkosky. [doi]
- Enhancing Large Language Models through Adaptive TokenizersMengyu Zheng, Hanting Chen, Tianyu Guo 0001, Chong Zhu, Binfan Zheng, Chang Xu, Yunhe Wang 0001. [doi]
- 4+3 Phases of Compute-Optimal Neural Scaling LawsElliot Paquette, Courtney Paquette, Lechao Xiao, Jeffrey Pennington. [doi]
- From Chaos to Clarity: 3DGS in the DarkZhihao Li, Yufei Wang, Alex C. Kot, Bihan Wen. [doi]
- On the Ability of Developers' Training Data Preservation of LearnwareHao-Yi Lei, Zhi-Hao Tan, Zhi-Hua Zhou. [doi]
- ECMamba: Consolidating Selective State Space Model with Retinex Guidance for Efficient Multiple Exposure CorrectionWei Dong 0010, Han Zhou 0003, Yulun Zhang 0001, Xiaohong Liu 0001, Jun Chen 0005. [doi]
- Aggregating Quantitative Relative Judgments: From Social Choice to Ranking PredictionYixuan Xu, Hanrui Zhang 0001, Yu Cheng 0002, Vincent Conitzer. [doi]
- StreamingDialogue: Prolonged Dialogue Learning via Long Context Compression with Minimal LossesJianan Li, Quan Tu, Cunli Mao, Zhengtao Yu 0001, Ji-Rong Wen, Rui Yan 0001. [doi]
- Knowledge Circuits in Pretrained TransformersYunzhi Yao, Ningyu Zhang 0001, Zekun Xi, Mengru Wang, Ziwen Xu, Shumin Deng, Huajun Chen. [doi]
- Enhancing Motion in Text-to-Video Generation with Decomposed Encoding and ConditioningPenghui Ruan, Pichao Wang, Divya Saxena, Jiannong Cao 0001, Yuhui Shi. [doi]
- MIDGArD: Modular Interpretable Diffusion over Graphs for Articulated DesignsQuentin Leboutet, Nina Wiedemann, Zhipeng Cai, Michael Paulitsch, Kai Yuan. [doi]
- ECLipsE: Efficient Compositional Lipschitz Constant Estimation for Deep Neural NetworksYuezhu Xu, S. Sivaranjani. [doi]
- Conjugate Bayesian Two-step Change Point Detection for Hawkes ProcessZeyue Zhang, Xiaoling Lu, Feng Zhou. [doi]
- Monte Carlo Tree Search based Space Transfer for Black Box OptimizationShukuan Wang, Ke Xue 0001, Song Lei, Xiaobin Huang, Chao Qian 0001. [doi]
- Selective Attention: Enhancing Transformer through Principled Context ControlXuechen Zhang 0002, Xiangyu Chang, Mingchen Li, Amit K. Roy Chowdhury, Jiasi Chen, Samet Oymak. [doi]
- When LLMs Meet Cunning Texts: A Fallacy Understanding Benchmark for Large Language ModelsYinghui Li, Qingyu Zhou, Yuanzhen Luo, Shirong Ma, Yangning Li, Hai-Tao Zheng 0002, Xuming Hu, Philip S. Yu. [doi]
- Enhancing Robustness in Deep Reinforcement Learning: A Lyapunov Exponent ApproachRory Young, Nicolas Pugeault. [doi]
- The Bayesian sampling in a canonical recurrent circuit with a diversity of inhibitory interneuronsEryn Sale, Wenhao Zhang. [doi]
- BiVLC: Extending Vision-Language Compositionality Evaluation with Text-to-Image RetrievalImanol Miranda, Ander Salaberria, Eneko Agirre, Gorka Azkune. [doi]
- Sparse-view Pose Estimation and Reconstruction via Analysis by Generative SynthesisQitao Zhao, Shubham Tulsiani. [doi]
- Wormhole Loss for Partial Shape MatchingAmit Bracha, Thomas Dagès, Ron Kimmel. [doi]
- Coherent 3D Scene Diffusion From a Single RGB ImageManuel Dahnert, Angela Dai, Norman Müller, Matthias Nießner. [doi]
- Learning Human-like Representations to Enable Learning Human ValuesAndrea Wynn, Ilia Sucholutsky, Tom Griffiths 0001. [doi]
- WildPPG: A Real-World PPG Dataset of Long Continuous RecordingsManuel Meier, Berken Utku Demirel, Christian Holz 0001. [doi]
- Test-Time Adaptation Induces Stronger Accuracy and Agreement-on-the-LineEungyeup Kim, Mingjie Sun, Christina Baek, Aditi Raghunathan, J. Zico Kolter. [doi]
- ReMI: A Dataset for Reasoning with Multiple ImagesMehran Kazemi, Nishanth Dikkala, Ankit Anand, Petar Devic, Ishita Dasgupta 0001, Fangyu Liu, Bahare Fatemi, Pranjal Awasthi, Sreenivas Gollapudi, Dee Guo, Ahmed Qureshi. [doi]
- A Theory of Optimistically Universal Online Learnability for General Concept ClassesSteve Hanneke, Hongao Wang. [doi]
- CogVLM: Visual Expert for Pretrained Language ModelsWeihan Wang, Qingsong Lv, Wenmeng Yu, Wenyi Hong, Ji Qi, Yan Wang, Junhui Ji, Zhuoyi Yang, Lei Zhao, Xixuan Song, Jiazheng Xu, Keqin Chen, Bin Xu 0001, Juanzi Li, Yuxiao Dong, Ming Ding, Jie Tang 0001. [doi]
- OSLO: One-Shot Label-Only Membership Inference AttacksYuefeng Peng, Jaechul Roh, Subhransu Maji, Amir Houmansadr. [doi]
- Learning Partitions from ContextSimon Buchholz. [doi]
- Interpretable Lightweight Transformer via Unrolling of Learned Graph Smoothness PriorsViet Ho Tam Thuc Do, Parham Eftekhar, Seyed Alireza Hosseini, Gene Cheung, Philip A. Chou. [doi]
- RSA: Resolving Scale Ambiguities in Monocular Depth Estimators through Language DescriptionsZiyao Zeng, Yangchao Wu, Hyoungseob Park, Daniel Wang 0005, Fengyu Yang, Stefano Soatto, Dong Lao, Byung-Woo Hong, Alex Wong 0001. [doi]
- Bayesian Optimization of Functions over Node Subsets in GraphsHuidong Liang, Xingchen Wan, Xiaowen Dong 0001. [doi]
- Analysing the Generalisation and Reliability of Steering VectorsDaniel Tan, David Chanin, Aengus Lynch, Brooks Paige, Dimitrios Kanoulas, Adrià Garriga-Alonso, Robert Kirk. [doi]
- MomentumSMoE: Integrating Momentum into Sparse Mixture of ExpertsRachel S. Y. Teo, Tan Nguyen. [doi]
- QUEST: Quality-Aware Metropolis-Hastings Sampling for Machine TranslationGonçalo Rui Alves Faria, Sweta Agrawal, António Farinhas, Ricardo Rei, José Guilherme Camargo de Souza, André Martins. [doi]
- Efficient Graph Matching for Correlated Stochastic Block ModelsShuwen Chai, Miklós Z. Rácz. [doi]
- Fine-grained Image-to-LiDAR Contrastive Distillation with Visual Foundation ModelsYifan Zhang, Junhui Hou. [doi]
- Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMsRui Yang, Ruomeng Ding, Yong Lin, Huan Zhang, Tong Zhang. [doi]
- Efficient Combinatorial Optimization via Heat DiffusionHengyuan Ma, Wenlian Lu, Jianfeng Feng. [doi]
- ReEvo: Large Language Models as Hyper-Heuristics with Reflective EvolutionHaoran Ye, Jiarui Wang 0002, Zhiguang Cao, Federico Berto, Chuanbo Hua, Haeyeon Kim, Jinkyoo Park, Guojie Song. [doi]
- Phased Consistency ModelsFu-Yun Wang, Zhaoyang Huang, Alexander William Bergman, Dazhong Shen, Peng Gao 0007, Michael Lingelbach, Keqiang Sun, Weikang Bian, Guanglu Song, Yu Liu, Xiaogang Wang, Hongsheng Li. [doi]
- Improving Alignment and Robustness with Circuit BreakersAndy Zou, Long Phan, Justin Wang, Derek Duenas, Maxwell Lin, Maksym Andriushchenko, J. Zico Kolter, Matt Fredrikson, Dan Hendrycks. [doi]
- Reciprocal Reward Influence Encourages Cooperation From Self-Interested AgentsJohn L. Zhou, Weizhe Hong, Jonathan C. Kao. [doi]
- SLIM: Style-Linguistics Mismatch Model for Generalized Audio Deepfake DetectionYi Zhu, Surya Koppisetti, Trang Tran 0008, Gaurav Bharaj. [doi]
- ControlSynth Neural ODEs: Modeling Dynamical Systems with Guaranteed ConvergenceWenjie Mei, Dongzhe Zheng, Shihua Li 0001. [doi]
- Verified Safe Reinforcement Learning for Neural Network Dynamic ModelsJunlin Wu 0001, Huan Zhang, Yevgeniy Vorobeychik. [doi]
- MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding BenchmarkYubo Wang, Xueguang Ma, Ge Zhang, Yuansheng Ni, Abhranil Chandra, Shiguang Guo, Weiming Ren, Aaran Arulraj, Xuan He, Ziyan Jiang, Tianle Li, Max Ku, Kai Wang, Alex Zhuang, Rongqi Fan, Xiang Yue, Wenhu Chen. [doi]
- From Similarity to Superiority: Channel Clustering for Time Series ForecastingJialin Chen, Jan Eric Lenssen, Aosong Feng, Weihua Hu, Matthias Fey, Leandros Tassiulas, Jure Leskovec, Rex Ying. [doi]
- PCP-MAE: Learning to Predict Centers for Point Masked AutoencodersXiangdong Zhang, Shaofeng Zhang, Junchi Yan. [doi]
- Handling Learnwares from Heterogeneous Feature Spaces with Explicit Label ExploitationPeng Tan, Hai-Tian Liu, Zhi-Hao Tan, Zhi-Hua Zhou. [doi]
- Generalization Bound and Learning Methods for Data-Driven Projections in Linear ProgrammingShinsaku Sakaue, Taihei Oki. [doi]
- Gradients of Functions of Large MatricesNicholas Krämer, Pablo Moreno-Muñoz, Hrittik Roy, Søren Hauberg. [doi]
- Achieving Constant Regret in Linear Markov Decision ProcessesWeitong Zhang, Zhiyuan Fan, Jiafan He, Quanquan Gu. [doi]
- Equivariant Machine Learning on Graphs with Nonlinear Spectral FiltersYa-Wei Eileen Lin, Ronen Talmon, Ron Levie. [doi]
- BAM! Just Like That: Simple and Efficient Parameter Upcycling for Mixture of ExpertsQizhen (Irene) Zhang, Nikolas Gritsch, Dwaraknath Gnaneshwar, Simon Guo 0003, David Cairuz, Bharat Venkitesh, Jakob N. Foerster, Phil Blunsom, Sebastian Ruder, Ahmet Üstün, Acyr Locatelli. [doi]
- Omnigrasp: Grasping Diverse Objects with Simulated HumanoidsZhengyi Luo 0002, Jinkun Cao, Sammy Christen, Alexander Winkler, Kris Kitani, WeiPeng Xu. [doi]
- Distributionally Robust Reinforcement Learning with Interactive Data Collection: Fundamental Hardness and Near-Optimal AlgorithmsMiao Lu, Han Zhong 0001, Tong Zhang 0001, Jose H. Blanchet. [doi]
- Team-Fictitious Play for Reaching Team-Nash Equilibrium in Multi-team GamesAhmed Said Donmez, Yuksel Arslantas, Muhammed Omer Sayin. [doi]
- NYU CTF Bench: A Scalable Open-Source Benchmark Dataset for Evaluating LLMs in Offensive SecurityMinghao Shao, Sofija Jancheska, Meet Udeshi, Brendan Dolan-Gavitt, Haoran Xi, Kimberly Milner, Boyuan Chen 0004, Max Yin, Siddharth Garg, Prashanth Krishnamurthy, Farshad Khorrami, Ramesh Karri, Muhammad Shafique 0001. [doi]
- Nature-Inspired Local PropagationAlessandro Betti, Marco Gori. [doi]
- Optimizing over Multiple Distributions under Generalized Quasar-Convexity ConditionShihong Ding, Long Yang 0004, Luo Luo, Cong Fang 0001. [doi]
- Understanding the Transferability of Representations via Task-RelatednessAkshay Mehra, Yunbei Zhang, Jihun Hamm. [doi]
- ProxyFusion: Face Feature Aggregation Through Sparse ExpertsBhavin Jawade, Alexander Stone, Deen Dayal Mohan, Xiao Wang, Srirangaraj Setlur, Venu Govindaraju. [doi]
- LoCo: Learning 3D Location-Consistent Image Features with a Memory-Efficient Ranking LossDominik A. Kloepfer, João F. Henriques, Dylan Campbell. [doi]
- Improving Equivariant Model Training via Constraint RelaxationStefanos Pertigkiozoglou, Evangelos Chatzipantazis, Shubhendu Trivedi, Kostas Daniilidis. [doi]
- On conditional diffusion models for PDE simulationsAliaksandra Shysheya, Cristiana Diaconu, Federico Bergamin, Paris Perdikaris, José Miguel Hernández-Lobato, Richard E. Turner, Emile Mathieu. [doi]
- A Structure-Aware Framework for Learning Device Placements on Computation GraphsShukai Duan 0002, Heng Ping, Nikos Kanakaris, Xiongye Xiao, Panagiotis Kyriakis, Nesreen K. Ahmed, Peiyu Zhang 0002, Guixiang Ma, Mihai Capota, Shahin Nazarian, Theodore L. Willke, Paul Bogdan. [doi]
- Feint Behaviors and Strategies: Formalization, Implementation and EvaluationJunyu Liu, Xiangjun Peng. [doi]
- Enhancing Protein Mutation Effect Prediction through a Retrieval-Augmented FrameworkRuihan Guo, Rui Wang, Ruidong Wu, Zhizhou Ren, Jiahan Li, Shitong Luo, Zuofan Wu, Qiang Liu 0001, Jian Peng 0001, Jianzhu Ma. [doi]
- CALANet: Cheap All-Layer Aggregation for Human Activity RecognitionJaegyun Park, Dae-Won Kim 0001, Jaesung Lee 0001. [doi]
- Initializing Services in Interactive ML Systems for Diverse UsersAvinandan Bose, Mihaela Curmei, Daniel L. Jiang, Jamie H. Morgenstern, Sarah Dean, Lillian J. Ratliff, Maryam Fazel. [doi]
- Mesa-Extrapolation: A Weave Position Encoding Method for Enhanced Extrapolation in LLMsXin Ma, Yang Liu, Jingjing Liu, Xiaoxu Ma. [doi]
- Frequency Adaptive Normalization For Non-stationary Time Series ForecastingWeiwei Ye, Songgaojun Deng, Qiaosha Zou, Ning Gui. [doi]
- HDR-GS: Efficient High Dynamic Range Novel View Synthesis at 1000x Speed via Gaussian SplattingYuanhao Cai, Zihao Xiao 0001, Yixun Liang, Minghan Qin, Yulun Zhang 0001, Xiaokang Yang, Yaoyao Liu 0001, Alan L. Yuille. [doi]
- Unveiling and Mitigating Backdoor Vulnerabilities based on Unlearning Weight Changes and Backdoor ActivenessWeilin Lin, Li Liu 0036, Shaokui Wei, Jianze Li, Hui Xiong. [doi]
- SnapKV: LLM Knows What You are Looking for Before GenerationYuhong Li, Yingbing Huang, Bowen Yang, Bharat Venkitesh, Acyr Locatelli, Hanchen Ye, Tianle Cai, Patrick Lewis, Deming Chen. [doi]
- On the Scalability of Certified Adversarial Robustness with Generated DataThomas Altstidl, David Dobre, Arthur Kosmala, Bjoern M. Eskofier, Gauthier Gidel, Leo Schwinn. [doi]
- Almost-Linear RNNs Yield Highly Interpretable Symbolic Codes in Dynamical Systems ReconstructionManuel Brenner, Christoph Jürgen Hemmer, Zahra Monfared, Daniel Durstewitz. [doi]
- GITA: Graph to Visual and Textual Integration for Vision-Language Graph ReasoningYanbin Wei, Shuai Fu, Weisen Jiang, Zejian Zhang, Zhixiong Zeng, Qi Wu, James T. Kwok, Yu Zhang 0006. [doi]
- Transcoders find interpretable LLM feature circuitsJacob Dunefsky, Philippe Chlenski, Neel Nanda. [doi]
- Differentially Private Stochastic Gradient Descent with Fixed-Size Minibatches: Tighter RDP Guarantees with or without ReplacementJeremiah Birrell, Reza Ebrahimi, Rouzbeh Behnia, Jason Pacheco. [doi]
- Model-Based Transfer Learning for Contextual Reinforcement LearningJung-Hoon Cho, Vindula Jayawardana, Sirui Li, Cathy Wu 0002. [doi]
- A Near-optimal Algorithm for Learning Margin Halfspaces with Massart NoiseIlias Diakonikolas, Nikos Zarifis. [doi]
- Architect: Generating Vivid and Interactive 3D Scenes with Hierarchical 2D InpaintingYian Wang, Xiaowen Qiu, Jiageng Liu, Zhehuan Chen, Jiting Cai, Yufei Wang, Tsun-Hsuan Johnson Wang, Zhou Xian, Chuang Gan. [doi]
- Deep Graph MatingYongcheng Jing, Seok-Hee Hong 0001, Dacheng Tao. [doi]
- Learning predictable and robust neural representations by straightening image sequencesXueyan Niu, Cristina Savin, Eero P. Simoncelli. [doi]
- Focus On What Matters: Separated Models For Visual-Based RL GeneralizationDi Zhang, Bowen Lv, Hai Zhang, Feifan Yang, Junqiao Zhao, Hang Yu, Chang Huang, Hongtu Zhou, Chen Ye 0002, Changjun Jiang. [doi]
- Scale Equivariant Graph MetanetworksIoannis Kalogeropoulos, Giorgos Bouritsas, Yannis Panagakis. [doi]
- The Challenges of the Nonlinear Regime for Physics-Informed Neural NetworksAndrea Bonfanti, Giuseppe Bruno, Cristina Cipriani. [doi]
- WaveAttack: Asymmetric Frequency Obfuscation-based Backdoor Attacks Against Deep Neural NetworksJun Xia 0003, Zhihao Yue, Yingbo Zhou, Zhiwei Ling, Yiyu Shi 0001, Xian Wei, Mingsong Chen 0001. [doi]
- DoFIT: Domain-aware Federated Instruction Tuning with Alleviated Catastrophic ForgettingBinqian Xu, Xiangbo Shu, Haiyang Mei, Zechen Bai, Basura Fernando, Mike Zheng Shou, Jinhui Tang 0001. [doi]
- OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer EnvironmentsTianbao Xie, Danyang Zhang, Jixuan Chen, Xiaochuan Li, Siheng Zhao, Ruisheng Cao, Toh Jing Hua, Zhoujun Cheng, Dongchan Shin, Fangyu Lei, Yitao Liu, Yiheng Xu, Shuyan Zhou, Silvio Savarese, Caiming Xiong, Victor Zhong, Tao Yu 0009. [doi]
- Long-form factuality in large language modelsJerry Wei, Chengrun Yang, Xinying Song, Yifeng Lu, Nathan Hu, Jie Huang 0009, Dustin Tran, Daiyi Peng, Ruibo Liu, Da Huang, Cosmo Du, Quoc V. Le. [doi]
- Linearly Decomposing and Recomposing Vision Transformers for Diverse-Scale ModelsShuxia Lin, Miaosen Zhang, Ruiming Chen, Xu Yang, Qiufeng Wang, Xin Geng 0001. [doi]
- Boosting Transferability and Discriminability for Time Series Domain AdaptationMingyang Liu, Xinyang Chen 0001, Yang Shu, Xiucheng Li, Weili Guan, Liqiang Nie. [doi]
- Guided Trajectory Generation with Diffusion Models for Offline Model-based OptimizationTaeyoung Yun, Sujin Yun, Jaewoo Lee, Jinkyoo Park. [doi]
- Micro-Bench: A Microscopy Benchmark for Vision-Language UnderstandingAlejandro Lozano, Jeffrey J. Nirschl, James Burgess, Sanket Rajan Gupte, Yuhui Zhang, Alyssa Unell, Serena Yeung. [doi]
- Loki: Low-rank Keys for Efficient Sparse AttentionPrajwal Singhania, Siddharth Singh, Shwai He, Soheil Feizi, Abhinav Bhatele. [doi]
- Addressing Asynchronicity in Clinical Multimodal Fusion via Individualized Chest X-ray GenerationWenfang Yao, Chen Liu, Kejing Yin, William Kwok-Wai Cheung, Jing Qin. [doi]
- The Fairness-Quality Tradeoff in ClusteringRashida Hakim, Ana-Andreea Stoica, Christos H. Papadimitriou, Mihalis Yannakakis. [doi]
- Plant-and-Steal: Truthful Fair Allocations via PredictionsIlan Reuven Cohen, Alon Eden, Talya Eden, Arsen Vasilyan. [doi]
- Understanding Representation of Deep Equilibrium Models from Neural Collapse PerspectiveHaixiang Sun, Ye Shi 0001. [doi]
- Latent Functional Maps: a spectral framework for representation alignmentMarco Fumero, Marco Pegoraro 0002, Valentino Maiorca, Francesco Locatello, Emanuele Rodolà. [doi]
- Moving Off-the-Grid: Scene-Grounded Video RepresentationsSjoerd van Steenkiste, Daniel Zoran, Yi Yang 0007, Yulia Rubanova, Rishabh Kabra, Carl Doersch, Dilara Gokay, Joseph Heyward, Etienne Pot, Klaus Greff, Drew A. Hudson, Thomas Keck, João Carreira 0001, Alexey Dosovitskiy, Mehdi S. M. Sajjadi, Thomas Kipf. [doi]
- Abrupt Learning in Transformers: A Case Study on Matrix CompletionPulkit Gopalani, Ekdeep Singh Lubana, Wei Hu. [doi]
- The Importance of Being Scalable: Improving the Speed and Accuracy of Neural Network Interatomic Potentials Across Chemical DomainsEric Qu, Aditi S. Krishnapriyan. [doi]
- Reinforcement Learning with Euclidean Data Augmentation for State-Based Continuous ControlJinzhu Luo, Dingyang Chen, Qi Zhang. [doi]
- Polyhedral Complex Derivation from Piecewise Trilinear NetworksJin-Hwa Kim. [doi]
- A Closer Look at the CLS Token for Cross-Domain Few-Shot LearningYixiong Zou, Shuai Yi, Yuhua Li 0003, Ruixuan Li 0001. [doi]
- EAGLE: Efficient Adaptive Geometry-based Learning in Cross-view UnderstandingThanh-Dat Truong, Utsav Prabhu, Dongyi Wang, Bhiksha Raj, Susan Gauch, Jeyamkondan Subbiah, Khoa Luu. [doi]
- Taming "data-hungry" reinforcement learning? Stability in continuous state-action spacesYaqi Duan, Martin J. Wainwright. [doi]
- Stabilizing Linear Passive-Aggressive Online Learning with Weighted Reservoir SamplingSkyler Wu, Fred Lu, Edward Raff, James Holt. [doi]
- Universal In-Context Approximation By Prompting Fully Recurrent ModelsAleksandar Petrov, Tom A. Lamb, Alasdair Paren, Philip Torr 0001, Adel Bibi. [doi]
- Topological Generalization Bounds for Discrete-Time Stochastic Optimization AlgorithmsRayna Andreeva, Benjamin Dupuis, Rik Sarkar, Tolga Birdal, Umut Simsekli. [doi]
- Autoregressive Image Diffusion: Generation of Image Sequence and Application in MRIGuanxiong Luo, Shoujin Huang, Martin Uecker. [doi]
- Online Posterior Sampling with a Diffusion PriorBranislav Kveton, Boris Oreshkin, Youngsuk Park, Aniket Anand Deshmukh, Rui Song 0006. [doi]
- Richelieu: Self-Evolving LLM-Based Agents for AI DiplomacyZhenyu Guan, Xiangyu Kong, Fangwei Zhong, Yizhou Wang 0001. [doi]
- Learning Macroscopic Dynamics from Partial Microscopic ObservationsMengyi Chen, Qianxiao Li. [doi]
- VFIMamba: Video Frame Interpolation with State Space ModelsGuozhen Zhang, Chunxu Liu, Yutao Cui, Xiaotong Zhao, Kai Ma 0002, Limin Wang 0002. [doi]
- Certified Robustness for Deep Equilibrium Models via Serialized Random SmoothingWeizhi Gao, Zhichao Hou, Han Xu, Xiaorui Liu. [doi]
- OneBit: Towards Extremely Low-bit Large Language ModelsYuzhuang Xu, Xu Han 0007, Zonghan Yang, Shuo Wang, Qingfu Zhu, Zhiyuan Liu, Weidong Liu, Wanxiang Che. [doi]
- InfiBench: Evaluating the Question-Answering Capabilities of Code Large Language ModelsLinyi Li 0001, Shijie Geng, Zhenwen Li, Yibo He, Hao Yu, Ziyue Hua, Guanghan Ning, Siwei Wang, Tao Xie 0001, Hongxia Yang. [doi]
- High-dimensional (Group) Adversarial Training in Linear RegressionYiling Xie, Xiaoming Huo. [doi]
- Inexact Augmented Lagrangian Methods for Conic Optimization: Quadratic Growth and Linear ConvergenceFeng-Yi Liao, Lijun Ding, Yang Zheng 0001. [doi]
- IRCAN: Mitigating Knowledge Conflicts in LLM Generation via Identifying and Reweighting Context-Aware NeuronsDan Shi, Renren Jin, Tianhao Shen, Weilong Dong, Xinwei Wu, Deyi Xiong. [doi]
- The Closeness of In-Context Learning and Weight Shifting for Softmax RegressionShuai Li, Zhao Song 0002, Yu Xia, Tong Yu 0001, Tianyi Zhou 0001. [doi]
- The Expressive Capacity of State Space Models: A Formal Language PerspectiveYash Sarrof, Yana Veitsman, Michael Hahn 0001. [doi]
- Noisy Label Learning with Instance-Dependent Outliers: Identifiability via Crowd WisdomTri Nguyen, Shahana Ibrahim, Xiao Fu 0001. [doi]
- Sample-efficient Bayesian Optimisation Using Known InvariancesTheodore Brown, Alexandru Cioba, Ilija Bogunovic. [doi]
- Uni-Med: A Unified Medical Generalist Foundation Model For Multi-Task Learning Via Connector-MoEXun Zhu, Ying Hu, Fanbin Mo, Miao Li, Ji Wu 0002. [doi]
- Robustly overfitting latents for flexible neural image compressionYura Perugachi-Diaz, Arwin Gansekoele, Sandjai Bhulai. [doi]
- MOTIVE: A Drug-Target Interaction Graph For Inductive Link PredictionJohn Arevalo, Ellen Su, Anne E. Carpenter, Shantanu Singh. [doi]
- Rethinking Transformer for Long Contextual Histopathology Whole Slide Image AnalysisHonglin Li 0001, YunLong Zhang, Pingyi Chen, Zhongyi Shui, Chenglu Zhu, Lin Yang 0002. [doi]
- CVQA: Culturally-diverse Multilingual Visual Question Answering BenchmarkDavid Romero, Chenyang Lyu, Haryo Akbarianto Wibowo, Santiago Góngora, Aishik Mandal, Sukannya Purkayastha, Jesús-Germán Ortiz-Barajas, Emilio Villa-Cueva, Jinheon Baek, Soyeong Jeong, Injy Hamed, Zheng Xin Yong, Zheng Wei Lim, Paula Mónica Silva, Jocelyn Dunstan, Mélanie Jouitteau, David Le Meur, Joan Nwatu, Ganzorig Batnasan, Munkh-Erdene Otgonbold, Munkhjargal Gochoo, Guido Ivetta, Luciana Benotti, Laura Alonso Alemany, Hernán Maina, Jiahui Geng, Tiago Timponi Torrent, Frederico Belcavello, Marcelo Viridiano, Jan Christian Blaise Cruz, Dan John Velasco, Oana Ignat, Zara Burzo, Chenxi Whitehouse, Artem Abzaliev, Teresa Clifford, Grainne Caulfield, Teresa Lynn, Christian Salamea Palacios, Vladimir Araujo, Yova Kementchedjhieva, Mihail Mihaylov, Israel Abebe Azime, Henok Biadglign Ademtew, Bontu Fufa Balcha, Naome A. Etori, David Ifeoluwa Adelani, Rada Mihalcea, Atnafu Lambebo Tonja, Maria Camila Buitrago Cabrera, Gisela Vallejo, Holy Lovenia, Ruochen Zhang, Marcos Estecha-Garitagoitia, Mario Rodríguez-Cantelar, Toqeer Ehsan, Rendi Chevi, Muhammad Farid Adilazuarda, Ryandito Diandaru, Samuel Cahyawijaya, Fajri Koto, Tatsuki Kuribayashi, Haiyue Song, Aditya Khandavally, Thanmay Jayakumar, Raj Dabre, Mohamed Fazli Mohamed Imam, Kumaranage Ravindu Yasas Nagasinghe, Alina Dragonetti, Luis Fernando D'Haro, Olivier Niyomugisha, Jay Gala, Pranjal A. Chitale, Fauzan Farooqui, Thamar Solorio, Alham Fikri Aji. [doi]
- LLaMo: Large Language Model-based Molecular Graph AssistantJinyoung Park, Minseong Bae, Dohwan Ko, Hyunwoo J. Kim. [doi]
- StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video GenerationYupeng Zhou, Daquan Zhou, Ming-Ming Cheng, Jiashi Feng, Qibin Hou. [doi]
- Fast Rates for Bandit PAC Multiclass ClassificationLiad Erez, Alon Peled-Cohen, Tomer Koren, Yishay Mansour, Shay Moran. [doi]
- Homology Consistency Constrained Efficient Tuning for Vision-Language ModelsHuatian Zhang 0001, Lei Zhang 0119, Yongdong Zhang 0001, Zhendong Mao. [doi]
- Near-Optimal Distributed Minimax Optimization under the Second-Order SimilarityQihao Zhou, Haishan Ye, Luo Luo. [doi]
- Towards Stable Representations for Protein Interface PredictionZiqi Gao, Zijing Liu, Yu Li, Jia Li. [doi]
- QUEEN: QUantized Efficient ENcoding of Dynamic Gaussians for Streaming Free-viewpoint VideosSharath Girish, Tianye Li, Amrita Mazumdar, Abhinav Shrivastava, David Luebke, Shalini De Mello. [doi]
- CosAE: Learnable Fourier Series for Image RestorationSifei Liu, Shalini De Mello, Jan Kautz. [doi]
- MMDU: A Multi-Turn Multi-Image Dialog Understanding Benchmark and Instruction-Tuning Dataset for LVLMsZiyu Liu, Tao Chu, Yuhang Zang, Xilin Wei, Xiaoyi Dong, Pan Zhang 0001, Zijian Liang, Yuanjun Xiong, Yu Qiao 0001, Dahua Lin, Jiaqi Wang 0003. [doi]
- Semantic Routing via Autoregressive ModelingEric Zhao 0003, Pranjal Awasthi, Zhengdao Chen, Sreenivas Gollapudi, Daniel Delling. [doi]
- Taming Heavy-Tailed Losses in Adversarial Bandits and the Best-of-Both-Worlds SettingDuo Cheng, Xingyu Zhou, Bo Ji 0001. [doi]
- Covariate Shift Corrected Conditional Randomization TestBowen Xu, Yiwen Huang, Chuan Hong, Shuangning Li, Molei Liu. [doi]
- Queueing Matching Bandits with Preference FeedbackJung Hun Kim, Min-hwan Oh. [doi]
- Banded Square Root Matrix Factorization for Differentially Private Model TrainingNikita Kalinin, Christoph H. Lampert. [doi]
- Unlocking the Capabilities of Thought: A Reasoning Boundary Framework to Quantify and Optimize Chain-of-ThoughtQiguang Chen, Libo Qin 0001, Jiaqi Wang, Jingxuan Zhou, Wanxiang Che. [doi]
- Gene-Gene Relationship Modeling Based on Genetic Evidence for Single-Cell RNA-Seq Data ImputationDaeho Um, Ji Won Yoon, Seong-Jin Ahn 0002, Yunha Yeo. [doi]
- Simplifying Constraint Inference with Inverse Reinforcement LearningAdriana Hugessen, Harley Wiltzer, Glen Berseth. [doi]
- DiReCT: Diagnostic Reasoning for Clinical Notes via Large Language ModelsBowen Wang, Jiuyang Chang, Yiming Qian, Guoxin Chen, Junhao Chen, Zhouqiang Jiang, Jiahao Zhang, Yuta Nakashima, Hajime Nagahara. [doi]
- Efficient and Sharp Off-Policy Evaluation in Robust Markov Decision ProcessesAndrew Bennett, Nathan Kallus, Miruna Oprescu, Wen Sun 0002, Kaiwen Wang. [doi]
- Spiking Transformer with Experts MixtureZhaokun Zhou, Yijie Lu, Yanhao Jia, Kaiwei Che, Jun Niu, Liwei Huang, Xinyu Shi, Yuesheng Zhu, Guoqi Li, Zhaofei Yu, Li Yuan 0007. [doi]
- Feature-Level Adversarial Attacks and Ranking Disruption for Visible-Infrared Person Re-identificationXi Yang 0011, Huanling Liu, De Cheng, Nannan Wang 0001, Xinbo Gao 0001. [doi]
- Prediction-Powered Ranking of Large Language ModelsIvi Chatzi, Eleni Straitouri, Suhas Thejaswi, Manuel Rodriguez. [doi]
- WhodunitBench: Evaluating Large Multimodal Agents via Murder Mystery GamesJunlin Xie, Ruifei Zhang, Zhihong Chen, Xiang Wan, Guanbin Li. [doi]
- CoMERA: Computing- and Memory-Efficient Training via Rank-Adaptive Tensor OptimizationZi Yang, Ziyue Liu, Samridhi Choudhary, Xinfeng Xie, Cao Gao, Siegfried Kunzmann, Zheng Zhang. [doi]
- FUSE: Fast Unified Simulation and Estimation for PDEsLevi E. Lingsch, Dana Grund, Siddhartha Mishra, Georgios Kissas. [doi]
- Using Noise to Infer Aspects of Simplicity Without LearningZachery Boner, Harry Chen, Lesia Semenova, Ronald Parr, Cynthia Rudin. [doi]
- Bandits with Abstention under Expert AdviceStephen Pasteris, Alberto Rumi, Maximilian Thiessen, Shota Saito, Atsushi Miyauchi 0001, Fabio Vitale, Mark Herbster. [doi]
- Do LLMs Build World Representations? Probing Through the Lens of State AbstractionZichao Li, Yanshuai Cao, Jackie CK Cheung. [doi]
- HEALNet: Multimodal Fusion for Heterogeneous Biomedical DataKonstantin Hemker, Nikola Simidjievski, Mateja Jamnik. [doi]
- Embedding Dimension of Contrastive Learning and k-Nearest NeighborsDmitrii Avdiukhin, Vaggos Chatziafratis, Orr Fischer, Grigory Yaroslavtsev. [doi]
- LLMs Can Evolve Continually on Modality for X-Modal ReasoningJiazuo Yu, Haomiao Xiong, Lu Zhang 0053, Haiwen Diao, Yunzhi Zhuge, Lanqing Hong, Dong Wang 0004, Huchuan Lu, You He, Long Chen 0016. [doi]
- AverNet: All-in-one Video Restoration for Time-varying Unknown DegradationsHaiyu Zhao, Lei Tian, Xinyan Xiao, Peng Hu, Yuanbiao Gou, Xi Peng. [doi]
- NeuRodin: A Two-stage Framework for High-Fidelity Neural Surface ReconstructionYifan Wang, Di Huang, Weicai Ye, Guofeng Zhang 0001, Wanli Ouyang, Tong He 0001. [doi]
- Cherry on Top: Parameter Heterogeneity and Quantization in Large Language ModelsWanyun Cui, Qianle Wang. [doi]
- Diff-eRank: A Novel Rank-Based Metric for Evaluating Large Language ModelsLai Wei 0005, Zhiquan Tan, Chenghai Li, Jindong Wang, Weiran Huang 0001. [doi]
- Learning Generalized Linear Programming Value FunctionsTu Anh Nguyen, Joey Huchette, Christian Tjandraatmadja. [doi]
- Mind the Gap: A Causal Perspective on Bias Amplification in Prediction & Decision-MakingDrago Plecko, Elias Bareinboim. [doi]
- IQA-EVAL: Automatic Evaluation of Human-Model Interactive Question AnsweringRuosen Li, Ruochen Li, Barry Wang, Xinya Du. [doi]
- MEQA: A Benchmark for Multi-hop Event-centric Question Answering with ExplanationsRuosen Li, Zimu Wang, Son Quoc Tran, Lei Xia, Xinya Du. [doi]
- Learning Cortico-Muscular Dependence through Orthonormal Decomposition of Density RatiosShihan Ma, Bo Hu, Tianyu Jia, Alexander Kenneth Clarke, Blanka Zicher, Arnault H. Caillet, Dario Farina, José C. Príncipe. [doi]
- Constrained Binary Decision MakingDaniel Prusa, Vojtech Franc. [doi]
- 2FT: Efficient, Scalable and Generalizable LLM Fine-tuning by Structured SparsityXinyu Yang, Jixuan Leng, Geyang Guo, Jiawei Zhao, Ryumei Nakada, Linjun Zhang, Huaxiu Yao, Beidi Chen. [doi]
- Categorical Flow Matching on Statistical ManifoldsChaoran Cheng, Jiahan Li, Jian Peng 0001, Ge Liu. [doi]
- Zipper: Addressing Degeneracy in Algorithm-Agnostic InferenceGeng Chen, Yinxu Jia, Guanghui Wang, Changliang Zou. [doi]
- Infinite Limits of Multi-head Transformer DynamicsBlake Bordelon, Hamza Tahir Chaudhry, Cengiz Pehlevan. [doi]
- Training Compute-Optimal Protein Language ModelsXingyi Cheng, Bo Chen 0026, Pan Li, Jing Gong, Jie Tang, Le Song. [doi]
- Bandits with Ranking FeedbackDavide Maran, Francesco Bacchiocchi, Francesco Emanuele Stradi, Matteo Castiglioni, Nicola Gatti 0001, Marcello Restelli. [doi]
- NAVSIM: Data-Driven Non-Reactive Autonomous Vehicle Simulation and BenchmarkingDaniel Dauner, Marcel Hallgarten, Tianyu Li, Xinshuo Weng, Zhiyu Huang, Zetong Yang, Hongyang Li 0001, Igor Gilitschenski, Boris Ivanovic, Marco Pavone 0001, Andreas Geiger 0001, Kashyap Chitta. [doi]
- Equivariant Neural Diffusion for Molecule GenerationFrançois Cornet, Grigory Bartosh, Mikkel Schmidt, Christian Andersson Naesseth. [doi]
- Model-free Low-Rank Reinforcement Learning via Leveraged Entry-wise Matrix EstimationStefan Stojanovic, Yassir Jedra, Alexandre Proutière. [doi]
- Reliable Learning of Halfspaces under Gaussian MarginalsIlias Diakonikolas, Lisheng Ren, Nikos Zarifis. [doi]
- Rapid Plug-in DefendersKai Wu, Yujian Betterest Li, Jian Lou, Xiaoyu Zhang, Handing Wang, Jing Liu. [doi]
- ALPINE: Unveiling The Planning Capability of Autoregressive Learning in Language ModelsSiwei Wang 0002, Yifei Shen, Shi Feng, Haoran Sun, Shang-Hua Teng, Wei Chen 0013. [doi]
- Offline Reinforcement Learning with OOD State Correction and OOD Action SuppressionYixiu Mao, Qi Wang, Chen Chen, Yun Qu 0002, Xiangyang Ji. [doi]
- Prompt-Agnostic Adversarial Perturbation for Customized Diffusion ModelsCong Wan, Yuhang He, Xiang Song 0005, Yihong Gong. [doi]
- All-in-One Image Coding for Joint Human-Machine Vision with Multi-Path AggregationXu Zhang, Peiyao Guo, Ming Lu, Zhan Ma. [doi]
- Visual Riddles: a Commonsense and World Knowledge Challenge for Large Vision and Language ModelsNitzan Bitton Guetta, Aviv Slobodkin, Aviya Maimon, Eliya Habba, Royi Rassin, Yonatan Bitton, Idan Szpektor, Amir Globerson, Yuval Elovici. [doi]
- 3M: A Long-Range Interaction Modeling Enhancer for Geometric GNNsYusong Wang, Chaoran Cheng, Shaoning Li, Yuxuan Ren, Bin Shao, Ge Liu, Pheng-Ann Heng, Nanning Zheng 0001. [doi]
- Inference of Neural Dynamics Using Switching Recurrent Neural NetworksYongxu Zhang, Shreya Saxena. [doi]
- Globally Convergent Variational InferenceDeclan McNamara, Jackson Loper, Jeffrey Regier. [doi]
- JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language ModelsPatrick Chao, Edoardo Debenedetti, Alexander Robey, Maksym Andriushchenko, Francesco Croce, Vikash Sehwag, Edgar Dobriban, Nicolas Flammarion, George J. Pappas, Florian Tramèr, Hamed Hassani, Eric Wong 0001. [doi]
- Paloma: A Benchmark for Evaluating Language Model FitIan Magnusson, Akshita Bhagia, Valentin Hofmann, Luca Soldaini, Ananya Harsh Jha, Oyvind Tafjord, Dustin Schwenk, Evan Pete Walsh, Yanai Elazar, Kyle Lo, Dirk Groeneveld, Iz Beltagy, Hanna Hajishirzi, Noah A. Smith, Kyle Richardson 0001, Jesse Dodge. [doi]
- Data-faithful Feature Attribution: Mitigating Unobservable Confounders via Instrumental VariablesQiheng Sun, Haocheng Xia, Jinfei Liu. [doi]
- The Road Less ScheduledAaron Defazio, Xingyu Yang, Ahmed Khaled 0001, Konstantin Mishchenko, Harsh Mehta, Ashok Cutkosky. [doi]
- Contrastive dimension reduction: when and how?Sam Hawke, Yueen Ma, Didong Li. [doi]
- IPO: Interpretable Prompt Optimization for Vision-Language ModelsYingjun Du, Wenfang Sun, Cees Snoek. [doi]
- Partial Structure Discovery is Sufficient for No-regret Learning in Causal BanditsMuhammad Qasim Elahi, Mahsa Ghasemi, Murat Kocaoglu. [doi]
- QGFN: Controllable Greediness with Action ValuesElaine Lau, Stephen Zhewen Lu, Ling Pan, Doina Precup, Emmanuel Bengio. [doi]
- Stochastic Optimization Schemes for Performative Prediction with Nonconvex LossQiang Li 0017, Hoi-To Wai. [doi]
- Personalized Adapter for Large Meteorology Model on Devices: Towards Weather Foundation ModelsShengchao Chen, Guodong Long, Jing Jiang 0002, Chengqi Zhang. [doi]
- Ada-MSHyper: Adaptive Multi-Scale Hypergraph Transformer for Time Series ForecastingZongjiang Shang, Ling Chen 0001, Binqing Wu, Dongliang Cui. [doi]
- Consistency Diffusion Bridge ModelsGuande He, Kaiwen Zheng, Jianfei Chen, Fan Bao, Jun Zhu. [doi]
- InterpBench: Semi-Synthetic Transformers for Evaluating Mechanistic Interpretability TechniquesRohan Gupta, Iván Arcuschin Moreno, Thomas Kwa, Adrià Garriga-Alonso. [doi]
- B-ary Tree Push-Pull Method is Provably Efficient for Distributed Learning on Heterogeneous DataRunze You, Shi Pu. [doi]
- Automatic Outlier Rectification via Optimal TransportJose H. Blanchet, Jiajin Li, Markus Pelger, Greg Zanotti. [doi]
- Generative Modeling of Molecular Dynamics TrajectoriesBowen Jing, Hannes Stärk, Tommi S. Jaakkola, Bonnie Berger. [doi]
- Job-SDF: A Multi-Granularity Dataset for Job Skill Demand Forecasting and BenchmarkingXi Chen, Chuan Qin 0002, Chuyu Fang, Chao Wang, Chen Zhu 0003, Fuzhen Zhuang, Hengshu Zhu, Hui Xiong 0001. [doi]
- Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based AgentsWenkai Yang, Xiaohan Bi, Yankai Lin, Sishuo Chen, Jie Zhou, Xu Sun 0001. [doi]
- One-Step Effective Diffusion Network for Real-World Image Super-ResolutionRongyuan Wu, Lingchen Sun, Zhiyuan Ma 0002, Lei Zhang 0006. [doi]
- Optical Diffusion Models for Image GenerationIlker Oguz, Niyazi Ulas Dinç, Mustafa Yildirim, Junjie Ke, Innfarn Yoo, Qifei Wang, Feng Yang, Christophe Moser, Demetri Psaltis. [doi]
- REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASRLiang-Hsuan Tseng, En-Pei Hu, Cheng-Han Chiang, Yuan Tseng, Hung-yi Lee, Lin-Shan Lee, Shao-Hua Sun. [doi]
- Verifiably Robust Conformal PredictionLinus Jeary, Tom Kuipers, Mehran Hosseini, Nicola Paoletti. [doi]
- The FineWeb Datasets: Decanting the Web for the Finest Text Data at ScaleGuilherme Penedo, Hynek Kydlícek, Loubna Ben Allal, Anton Lozhkov, Margaret Mitchell, Colin A. Raffel, Leandro von Werra, Thomas Wolf 0008. [doi]
- What Makes Partial-Label Learning Algorithms Effective?Jiaqi Lv, Yangfan Liu, Shiyu Xia, Ning Xu 0009, Miao Xu, Gang Niu 0001, Min-Ling Zhang, Masashi Sugiyama, Xin Geng 0001. [doi]
- A Novel Unified Architecture for Low-Shot Counting by Detection and SegmentationJer Pelhan, Alan Lukezic, Vitjan Zavrtanik, Matej Kristan. [doi]
- Back to the Continuous AttractorÁbel Ságodi, Guillermo Martín-Sánchez, Piotr A. Sokól, Memming Park. [doi]
- A Motion-aware Spatio-temporal Graph for Video Salient Object RankingHao Chen 0034, Yufei Zhu, Yongjian Deng. [doi]
- Interpreting the Weight Space of Customized Diffusion ModelsAmil Dravid, Yossi Gandelsman, Kuan-Chieh Wang, Rameen Abdal, Gordon Wetzstein, Alexei A. Efros, Kfir Aberman. [doi]
- On the Computational Complexity of Private High-dimensional Model SelectionSaptarshi Roy, Zehua Wang, Ambuj Tewari. [doi]
- Continuous Spatiotemporal Events Decoupling through Spike-based Bayesian ComputationYajing Zheng, Jiyuan Zhang, Zhaofei Yu, Tiejun Huang 0001. [doi]
- Defensive Unlearning with Adversarial Training for Robust Concept Erasure in Diffusion ModelsYimeng Zhang, Xin Chen, Jinghan Jia, Yihua Zhang, Chongyu Fan, Jiancheng Liu, Mingyi Hong 0001, Ke Ding, Sijia Liu 0001. [doi]
- Time Makes Space: Emergence of Place Fields in Networks Encoding Temporally Continuous Sensory ExperiencesZhaoze Wang, Ronald W. Di Tullio, Spencer Rooke, Vijay Balasubramanian. [doi]
- AR-Pro: Counterfactual Explanations for Anomaly Repair with Formal PropertiesXiayan Ji, Anton Xue, Eric Wong, Oleg Sokolsky, Insup Lee 0001. [doi]
- Reconstructing the Image Stitching Pipeline: Integrating Fusion and Rectangling into a Unified Inpainting ModelZiqi Xie, Weidong Zhao, Xianhui Liu, Jian Zhao, Ning Jia. [doi]
- Pin-Tuning: Parameter-Efficient In-Context Tuning for Few-Shot Molecular Property PredictionQiang Liu 0006, Shaozhen Liu, Xin Sun, Shu Wu, Liang Wang 0056. [doi]
- Tight Bounds for Learning RUMs from Small SlatesFlavio Chierichetti, Mirko Giacchini, Ravi Kumar 0001, Alessandro Panconesi, Andrew Tomkins. [doi]
- Adaptive Image Quality Assessment via Teaching Large Multimodal Model to CompareHanwei Zhu, Haoning Wu 0001, Yixuan Li, Zicheng Zhang, Baoliang Chen, Lingyu Zhu 0006, Yuming Fang, Guangtao Zhai, Weisi Lin, Shiqi Wang 0001. [doi]
- FewViewGS: Gaussian Splatting with Few View Matching and Multi-stage TrainingRuihong Yin, Vladimir Yugay, Yue Li 0036, Sezer Karaoglu, Theo Gevers. [doi]
- Right this way: Can VLMs Guide Us to See More to Answer Questions?Li Liu, Diji Yang, Sijia Zhong, Kalyana Suma Sree Tholeti, Lei Ding, Yi Zhang, Leilani Gilpin. [doi]
- Sm: enhanced localization in Multiple Instance Learning for medical imaging classificationFrancisco M. Castro-Macías, Pablo Morales-Alvarez, Yunan Wu, Rafael Molina 0001, Aggelos K. Katsaggelos. [doi]
- IncomeSCM: From tabular data set to time-series simulator and causal estimation benchmarkFredrik D. Johansson. [doi]
- EyeGraph: Modularity-aware Spatio Temporal Graph Clustering for Continuous Event-based Eye TrackingNuwan Sriyantha Bandara, Thivya Kandappu, Argha Sen, Ila Gokarn, Archan Misra. [doi]
- Learning 3D Equivariant Implicit Function with Patch-Level Pose-Invariant RepresentationXin Hu, Xiaole Tang, Ruixuan Yu, Jian Sun. [doi]
- ParallelEdits: Efficient Multi-Aspect Text-Driven Image Editing with Attention GroupingMingzhen Huang, Jialing Cai, Shan Jia, Vishnu Suresh Lokhande, Siwei Lyu. [doi]
- Towards Principled Graph TransformersLuis Müller, Daniel Kusuma, Blai Bonet, Christopher Morris 0001. [doi]
- LFME: A Simple Framework for Learning from Multiple Experts in Domain GeneralizationLiang Chen, Yong Zhang, Yibing Song, Zhiqiang Shen, Lingqiao Liu. [doi]
- Towards Neuron Attributions in Multi-Modal Large Language ModelsJunfeng Fang, Zac Bi, Ruipeng Wang, Houcheng Jiang, Yuan Gao, Kun Wang, An Zhang 0003, Jie Shi, Xiang Wang, Tat-Seng Chua. [doi]
- Bridging Model-Based Optimization and Generative Modeling via Conservative Fine-Tuning of Diffusion ModelsMasatoshi Uehara, Yulai Zhao 0002, Ehsan Hajiramezanali, Gabriele Scalia, Gökcen Eraslan, Avantika Lal, Sergey Levine, Tommaso Biancalani. [doi]
- MMScan: A Multi-Modal 3D Scene Dataset with Hierarchical Grounded Language AnnotationsRuiyuan Lyu, Jingli Lin, Tai Wang, Shuai Yang, Xiaohan Mao, Yilun Chen, Runsen Xu, Haifeng Huang, Chenming Zhu, Dahua Lin, Jiangmiao Pang. [doi]
- Linear Causal Representation Learning from Unknown Multi-node InterventionsBurak Varici, Emre Acartürk, Karthikeyan Shanmugam, Ali Tajer. [doi]
- Cascade of phase transitions in the training of energy-based modelsDimitrios Bachtis, Giulio Biroli, Aurélien Decelle, Beatriz Seoane. [doi]
- Sample Complexity Reduction via Policy Difference Estimation in Tabular Reinforcement LearningAdhyyan Narang, Andrew Wagenmaker, Lillian J. Ratliff, Kevin G. Jamieson. [doi]
- FlexSBDD: Structure-Based Drug Design with Flexible Protein ModelingZaixi Zhang, Mengdi Wang, Qi Liu. [doi]
- Kangaroo: Lossless Self-Speculative Decoding for Accelerating LLMs via Double Early ExitingFangcheng Liu, Yehui Tang, Zhenhua Liu 0003, Yunsheng Ni, Duyu Tang, Kai Han 0002, Yunhe Wang 0001. [doi]
- Sparse Bayesian Generative Modeling for Compressive SensingBenedikt Böck, Sadaf Syed, Wolfgang Utschick. [doi]
- Beyond Concept Bottleneck Models: How to Make Black Boxes Intervenable?Sonia Laguna, Ricards Marcinkevics, Moritz Vandenhirtz, Julia E. Vogt. [doi]
- Bayesian Optimisation with Unknown Hyperparameters: Regret Bounds Logarithmically Closer to OptimalJuliusz Ziomek, Masaki Adachi, Michael A. Osborne. [doi]
- Unitary Convolutions for Learning on Graphs and GroupsBobak T. Kiani, Lukas Fesser, Melanie Weber 0001. [doi]
- Aligning Audio-Visual Joint Representations with an Agentic WorkflowShentong Mo, Yibing Song. [doi]
- Input-to-State Stable Coupled Oscillator Networks for Closed-form Model-based Control in Latent SpaceMaximilian Stölzle, Cosimo Della Santina. [doi]
- On Softmax Direct Preference Optimization for RecommendationYuxin Chen, Junfei Tan, An Zhang 0003, Zhengyi Yang 0007, Leheng Sheng, Enzhi Zhang, Xiang Wang 0010, Tat-Seng Chua. [doi]
- Scaling Retrieval-Based Language Models with a Trillion-Token DatastoreRulin Shao, Jacqueline He, Akari Asai, Weijia Shi, Tim Dettmers, Sewon Min, Luke Zettlemoyer, Pang Wei Koh. [doi]
- Beware of Road Markings: A New Adversarial Patch Attack to Monocular Depth EstimationHangcheng Liu, Zhenhu Wu, Hao Wang, Xingshuo Han, Shangwei Guo, Tao Xiang, Tianwei Zhang 0004. [doi]
- MAC Advice for facility location mechanism designZohar Barak, Anupam Gupta 0001, Inbal Talgam-Cohen. [doi]
- pFedClub: Controllable Heterogeneous Model Aggregation for Personalized Federated LearningJiaqi Wang 0002, Qi Li 0012, Lingjuan Lyu, Fenglong Ma. [doi]
- Navigating Chemical Space with Latent FlowsGuanghao Wei, Yining Huang, Chenru Duan, Yue Song, Yuanqi Du. [doi]
- Aligning Target-Aware Molecule Diffusion Models with Exact Energy OptimizationSiyi Gu, Minkai Xu, Alexander S. Powers, Weili Nie, Tomas Geffner, Karsten Kreis, Jure Leskovec, Arash Vahdat, Stefano Ermon. [doi]
- When is an Embedding Model More Promising than Another?Maxime Darrin, Philippe Formont, Ismail Ayed, Jackie CK Cheung, Pablo Piantanida. [doi]
- EpiCare: A Reinforcement Learning Benchmark for Dynamic Treatment RegimesMason Hargrave, Alex Spaeth, Logan Grosenick. [doi]
- Improving Generalization and Convergence by Enhancing Implicit RegularizationMingze Wang, Jinbo Wang 0003, Haotian He, Zilin Wang, Guanhua Huang, Feiyu Xiong, Zhiyu Li, Weinan E, Lei Wu. [doi]
- 4D Gaussian Splatting in the Wild with Uncertainty-Aware RegularizationMijeong Kim 0002, Jongwoo Lim, Bohyung Han. [doi]
- Reference Trustable Decoding: A Training-Free Augmentation Paradigm for Large Language ModelsLuohe Shi, Yao Yao, Zuchao Li, Lefei Zhang, Hai Zhao 0001. [doi]
- Learning Formal Mathematics From Intrinsic MotivationGabriel Poesia, David Broman, Nick Haber, Noah D. Goodman. [doi]
- Cooperate or Collapse: Emergence of Sustainable Cooperation in a Society of LLM AgentsGiorgio Piatti, Zhijing Jin 0001, Max Kleiman-Weiner, Bernhard Schölkopf, Mrinmaya Sachan, Rada Mihalcea. [doi]
- High-probability complexity bounds for stochastic non-convex minimax optimizationYassine Laguel, Yasa Syed, Necdet Serhat Aybat, Mert Gürbüzbalaban. [doi]
- Flatten Anything: Unsupervised Neural Surface ParameterizationQijian Zhang, Junhui Hou, Wenping Wang, Ying He 0001. [doi]
- Maia-2: A Unified Model for Human-AI Alignment in ChessZhenwei Tang, Difan Jiao, Reid McIlroy-Young, Jon M. Kleinberg, Siddhartha Sen 0001, Ashton Anderson. [doi]
- Biologically Inspired Learning Model for Instructed VisionRoy Abel, Shimon Ullman. [doi]
- Neural network learns low-dimensional polynomials with SGD near the information-theoretic limitJason D. Lee, Kazusato Oko, Taiji Suzuki, Denny Wu. [doi]
- BeanCounter: A low-toxicity, large-scale, and open dataset of business-oriented textSiyan Wang, Bradford Levy. [doi]
- Large Spatial Model: End-to-end Unposed Images to Semantic 3DZhiwen Fan, Jian Zhang, Wenyan Cong, Peihao Wang, Renjie Li, Kairun Wen, Shijie Zhou 0003, Achuta Kadambi, Zhangyang Wang, Danfei Xu, Boris Ivanovic, Marco Pavone 0001. [doi]
- Gradient Rewiring for Editable Graph Neural Network TrainingZhimeng Jiang, Zirui Liu 0001, Xiaotian Han, Qizhang Feng, Hongye Jin, Qiaoyu Tan, Kaixiong Zhou, Na Zou 0001, Xia Ben Hu. [doi]
- Scanning Trojaned Models Using Out-of-Distribution SamplesHossein Mirzaei, Ali Ansari 0001, Bahar Dibaei Nia, Mojtaba Nafez, Moein Madadi, Sepehr Rezaee, Zeinab Taghavi 0001, Arad Maleki, Kian Shamsaie, Mahdi Hajialilue, Jafar Habibi, Mohammad Sabokrou, Mohammad Hossein Rohban. [doi]
- Neural Cover Selection for Image SteganographyKarl Chahine, Hyeji Kim. [doi]
- CODA: A Correlation-Oriented Disentanglement and Augmentation Modeling Scheme for Better Resisting Subpopulation ShiftsZiquan Ou, Zijun Zhang. [doi]
- Evaluating the design space of diffusion-based generative modelsYuqing Wang, Ye He 0003, Molei Tao. [doi]
- Fair GLASSO: Estimating Fair Graphical Models with Unbiased Statistical BehaviorMadeline Navarro, Samuel Rey, Andrei Buciulea, Antonio G. Marques, Santiago Segarra. [doi]
- Unsupervised Discovery of Formulas for Mathematical ConstantsMichael Shalyt, Uri Seligmann, Itay Beit Halachmi, Ofir David, Rotem Elimelech, Ido Kaminer. [doi]
- A Flexible, Equivariant Framework for Subgraph GNNs via Graph Products and Graph CoarseningGuy Bar-Shalom, Yam Eitan, Fabrizio Frasca, Haggai Maron. [doi]
- Mini-Sequence Transformers: Optimizing Intermediate Memory for Long Sequences TrainingCheng Luo, Jiawei Zhao, Zhuoming Chen, Beidi Chen, Animashree Anandkumar. [doi]
- Zeroth-Order Sampling Methods for Non-Log-Concave Distributions: Alleviating Metastability by Denoising DiffusionYe He 0003, Kevin Rojas, Molei Tao. [doi]
- FashionR2R: Texture-preserving Rendered-to-Real Image Translation with Diffusion ModelsRui Hu, Qian He, Gaofeng He, Jiedong Zhuang, Huang Chen, Huafeng Liu, Huamin Wang. [doi]
- Policy Mirror Descent with LookaheadKimon Protopapas, Anas Barakat. [doi]
- Towards Reliable Model Selection for Unsupervised Domain Adaptation: An Empirical Study and A Certified BaselineDapeng Hu, Romy Luo, Jian Liang 0001, Chuan-Sheng Foo. [doi]
- Localized Zeroth-Order Prompt OptimizationWenyang Hu, Yao Shu, Zongmin Yu, Zhaoxuan Wu, Xiaoqiang Lin, Zhongxiang Dai, See-Kiong Ng, Bryan Kian Hsiang Low. [doi]
- Low Degree Hardness for Broadcasting on TreesHan Huang, Elchanan Mossel. [doi]
- Integrating Suboptimal Human Knowledge with Hierarchical Reinforcement Learning for Large-Scale Multiagent SystemsDingbang Liu, Shohei Kato, Wen Gu, Fenghui Ren, Jun Yan 0005, Guoxin Su. [doi]
- On the Curses of Future and History in Future-dependent Value Functions for Off-policy EvaluationYuheng Zhang, Nan Jiang. [doi]
- Towards the Transferability of Rewards Recovered via Regularized Inverse Reinforcement LearningAndreas Schlaginhaufen, Maryam Kamgarpour. [doi]
- An Analytical Study of Utility Functions in Multi-Objective Reinforcement LearningManel Rodriguez-Soto, Juan A. Rodríguez-Aguilar, Maite López-Sánchez. [doi]
- Unified Speech Recognition: A Single Model for Auditory, Visual, and Audiovisual InputsAlexandros Haliassos, Rodrigo Mira, Honglie Chen, Zoe Landgraf, Stavros Petridis, Maja Pantic. [doi]
- Abstracted Shapes as Tokens - A Generalizable and Interpretable Model for Time-series ClassificationYunshi Wen, Tengfei Ma 0001, Lily Weng, Lam M. Nguyen, Anak Agung Julius. [doi]
- Balancing Context Length and Mixing Times for Reinforcement Learning at ScaleMatthew Riemer, Khimya Khetarpal, Janarthanan Rajendran, Sarath Chandar. [doi]
- GS-Blur: A 3D Scene-Based Dataset for Realistic Image DeblurringDongwoo Lee, Joonkyu Park, Kyoung Mu Lee. [doi]
- Adaptive Sampling for Efficient Softmax ApproximationTavor Z. Baharav, Ryan Kang, Colin Sullivan, Mo Tiwari, Eric Luxenberg, David Tse, Mert Pilanci. [doi]
- Grid4D: 4D Decomposed Hash Encoding for High-Fidelity Dynamic Gaussian SplattingJiawei Xu, Zexin Fan, Jian Yang, Jin Xie. [doi]
- AutoMix: Automatically Mixing Language ModelsPranjal Aggarwal, Aman Madaan, Ankit Anand, Srividya Pranavi Potharaju, Swaroop Mishra, Pei Zhou, Aditya Gupta, Dheeraj Rajagopal, Karthik Kappaganthu, Yiming Yang, Shyam Upadhyay, Manaal Faruqui, Mausam. [doi]
- Acoustic Volume Rendering for Neural Impulse Response FieldsZitong Lan, Chenhao Zheng, Zhiwei Zheng, Mingmin Zhao. [doi]
- GLinSAT: The General Linear Satisfiability Neural Network Layer By Accelerated Gradient DescentHongtai Zeng, Chao Yang, Yanzhen Zhou, Cheng Yang, Qinglai Guo. [doi]
- On Neural Networks as Infinite Tree-Structured Probabilistic Graphical ModelsBoyao Li, Alexander Thomson, Houssam Nassif, Matthew Engelhard, David Page. [doi]
- Dual Defense: Enhancing Privacy and Mitigating Poisoning Attacks in Federated LearningRunhua Xu, Shiqi Gao, Chao Li 0023, James Joshi, Jianxin Li. [doi]
- PV-Tuning: Beyond Straight-Through Estimation for Extreme LLM CompressionVladimir Malinovskii, Denis Mazur, Ivan Ilin, Denis Kuznedelev, Konstantin Burlachenko, Kai Yi, Dan Alistarh, Peter Richtárik. [doi]
- Truthfulness of Calibration MeasuresNika Haghtalab, Mingda Qiao, Kunhe Yang, Eric Zhao 0003. [doi]
- Self-Healing Machine Learning: A Framework for Autonomous Adaptation in Real-World EnvironmentsPaulius Rauba, Nabeel Seedat, Krzysztof Kacprzyk, Mihaela van der Schaar. [doi]
- Implicit Regularization of Decentralized Gradient Descent for Sparse RegressionTongle Wu, Ying Sun. [doi]
- Private Geometric MedianMahdi Haghifam, Thomas Steinke 0002, Jonathan R. Ullman. [doi]
- Paths to Equilibrium in GamesBora Yongacoglu, Gürdal Arslan, Lacra Pavel, Serdar Yüksel. [doi]
- Variance estimation in compound decision theory under boundednessSubhodh Kotekal. [doi]
- LRM-Zero: Training Large Reconstruction Models with Synthesized DataDesai Xie, Sai Bi, Zhixin Shu, Kai Zhang 0045, Zexiang Xu, Yi Zhou 0023, Sören Pirk, Arie E. Kaufman, Xin Sun 0014, Hao Tan 0002. [doi]
- Truncated Variance Reduced Value IterationYujia Jin, Ishani Karmarkar, Aaron Sidford, Jiayi Wang. [doi]
- E-Motion: Future Motion Simulation via Event Sequence DiffusionSong Wu, Zhiyu Zhu, Junhui Hou, Guangming Shi, Jinjian Wu. [doi]
- Measuring Goal-DirectednessMatt MacDermott, James Fox, Francesco Belardinelli, Tom Everitt. [doi]
- The motion planning neural circuit in goal-directed navigation as Lie group operator searchJunfeng Zuo, Ying Nian Wu, Si Wu 0001, Wenhao Zhang 0002. [doi]
- StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem SolvingChang Gao, Haiyun Jiang, Deng Cai 0002, Shuming Shi 0001, Wai Lam. [doi]
- Needle In A Multimodal HaystackWeiyun Wang, Shuibo Zhang, Yiming Ren, Yuchen Duan, Tiantong Li, Shuo Liu, Mengkang Hu, Zhe Chen 0017, Kaipeng Zhang, Lewei Lu, Xizhou Zhu, Ping Luo 0002, Yu Qiao 0001, Jifeng Dai, Wenqi Shao, Wenhai Wang. [doi]
- Preferential Normalizing FlowsPetrus Mikkola, Luigi Acerbi, Arto Klami. [doi]
- Wasserstein Distributionally Robust Optimization through the Lens of Structural Causal Models and Individual FairnessAhmad-Reza Ehyaei, Golnoosh Farnadi, Samira Samadi. [doi]
- Real-world Image Dehazing with Coherence-based Pseudo Labeling and Cooperative Unfolding NetworkChengyu Fang, Chunming He, Fengyang Xiao, Yulun Zhang 0001, Longxiang Tang, Yuelin Zhang, Kai Li, Xiu Li 0001. [doi]
- Grounded Answers for Multi-agent Decision-making Problem through Generative World ModelZeyang Liu, Xinrui Yang, Shiguang Sun, Long Qian, Lipeng Wan 0003, Xingyu Chen, Xuguang Lan. [doi]
- Second-order forward-mode optimization of recurrent neural networks for neuroscienceYoujing Yu, Rui Xia, Qingxi Ma, Máté Lengyel, Guillaume Hennequin. [doi]
- Dynamics of Supervised and Reinforcement Learning in the Non-Linear PerceptronChristian Schmid, James M. Murray. [doi]
- Exploring Context Window of Large Language Models via Decomposed Positional VectorsZican Dong, Junyi Li, Xin Men, Xin Zhao 0018, Bingning Wang, Zhen Tian, Weipeng Chen, Ji-Rong Wen. [doi]
- A Comprehensive Analysis on the Learning Curve in Kernel Ridge RegressionTin Sum Cheng, Aurélien Lucchi, Anastasis Kratsios, David Belius. [doi]
- GraphVis: Boosting LLMs with Visual Knowledge Graph IntegrationYihe Deng, Chenchen Ye 0001, Zijie Huang 0002, Mingyu Derek Ma, Yiwen Kou, Wei Wang 0010. [doi]
- TAPTRv2: Attention-based Position Update Improves Tracking Any PointHongyang Li 0003, Hao Zhang, Shilong Liu, Zhaoyang Zeng, Feng Li, Bohan Li, Tianhe Ren, Lei Zhang 0006. [doi]
- LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Control and RenderingDelin Qu, Qizhi Chen, Pingrui Zhang, Xianqiang Gao, Bin Zhao 0001, Zhigang Wang 0002, Dong Wang 0028, Xuelong Li 0001. [doi]
- Toward a Stable, Fair, and Comprehensive Evaluation of Object Hallucination in Large Vision-Language ModelsHongliang Wei, Xingtao Wang, Xianqi Zhang, Xiaopeng Fan, Debin Zhao. [doi]
- How to Solve Contextual Goal-Oriented Problems with Offline Datasets?Ying Fan, Jingling Li, Adith Swaminathan, Aditya Modi 0002, Ching-An Cheng. [doi]
- Structured Learning of Compositional Sequential InterventionsJialin Yu, Andreas Koukorinis, Nicolò Colombo, Yuchen Zhu, Ricardo Silva 0001. [doi]
- On the Robustness of Spectral Algorithms for Semirandom Stochastic Block ModelsAditya Bhaskara, Agastya Vibhuti Jha, Michael Kapralov, Naren Manoj, Davide Mazzali, Weronika Wrzos-Kaminska. [doi]
- Cell ontology guided transcriptome foundation modelXinyu Yuan, Zhihao Zhan, Zuobai Zhang, Manqi Zhou, Jianan Zhao 0002, Boyu Han, Yue Li, Jian Tang 0005. [doi]
- Open-Vocabulary Object Detection via Language HierarchyJiaxing Huang, Jingyi Zhang, Kai Jiang, Shijian Lu. [doi]
- Disentangled Unsupervised Skill Discovery for Efficient Hierarchical Reinforcement LearningJiaheng Hu, Zizhao Wang, Peter Stone 0001, Roberto Martín-Martín. [doi]
- Predicting the Performance of Foundation Models via Agreement-on-the-LineRahul Saxena, Taeyoun Kim, Aman Mehra, Christina Baek, J. Zico Kolter, Aditi Raghunathan. [doi]
- CRAYM: Neural Field Optimization via Camera RAY MatchingLiqiang Lin, Wenpeng Wu, Chi-Wing Fu, Hao Zhang 0002, Hui Huang 0004. [doi]
- Protein-Nucleic Acid Complex Modeling with Frame Averaging TransformerTinglin Huang, Zhenqiao Song, Rex Ying, Wengong Jin. [doi]
- Spiking Token Mixer: An event-driven friendly Former structure for spiking neural networksShikuang Deng, Yuhang Wu, Kangrui Du, Shi Gu. [doi]
- DECRL: A Deep Evolutionary Clustering Jointed Temporal Knowledge Graph Representation Learning ApproachQian Chen, Ling Chen. [doi]
- Streaming Detection of Queried Event StartCristóbal Eyzaguirre, Eric Tang, Shyamal Buch, Adrien Gaidon, Jiajun Wu 0001, Juan Carlos Niebles. [doi]
- The Intelligible and Effective Graph Neural Additive NetworkMaya Bechler-Speicher, Amir Globerson, Ran Gilad-Bachrach. [doi]
- Fair Bilevel Neural Network (FairBiNN): On Balancing fairness and accuracy via Stackelberg EquilibriumMehdi Yazdani-Jahromi, Ali Khodabandeh Yalabadi, Amirarsalan Rajabi, Aida Tayebi, Ivan Garibay, Ozlem O. Garibay. [doi]
- Learning Bregman Divergences with Application to RobustnessMohamed-Hicham Leghettas, Markus Püschel. [doi]
- Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference OptimizationYuanpu Cao, Tianrong Zhang, Bochuan Cao, Ziyi Yin 0003, Lu Lin 0001, Fenglong Ma, Jinghui Chen. [doi]
- AutoTimes: Autoregressive Time Series Forecasters via Large Language ModelsYong Liu, Guo Qin, Xiangdong Huang 0001, Jianmin Wang 0001, Mingsheng Long. [doi]
- Automated Multi-Task Learning for Joint Disease Prediction on Electronic Health RecordsSuhan Cui, Prasenjit Mitra. [doi]
- PACE: Marrying generalization in PArameter-efficient fine-tuning with Consistency rEgularizationYao Ni, Shan Zhang, Piotr Koniusz. [doi]
- Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance DistillationYuanhao Zhai 0001, Kevin Lin, Zhengyuan Yang, Linjie Li, Jianfeng Wang, Chung-Ching Lin, David S. Doermann, Junsong Yuan 0001, Lijuan Wang. [doi]
- QGym: Scalable Simulation and Benchmarking of Queuing Network ControllersHaozhe Chen, Ang Li, Ethan Che, Jing Dong, Tianyi Peng, Hongseok Namkoong. [doi]
- 3D Structure Prediction of Atomic Systems with Flow-based Direct Preference OptimizationRui Jiao, Xiangzhe Kong, Wenbing Huang 0001, Yang Liu 0005. [doi]
- SSDM: Scalable Speech Dysfluency ModelingJiachen Lian, Xuanru Zhou, Zoe Ezzes, Jet Vonk, Brittany Morin, David Baquirin, Zachary Miller, Maria Luisa Gorno-Tempini, Gopala Anumanchipalli. [doi]
- SpelsNet: Surface Primitive Elements Segmentation by B-Rep Graph Structure SupervisionKseniya Cherenkova, Elona Dupont, Anis Kacem 0001, Gleb Gusev, Djamila Aouada. [doi]
- MOTE-NAS: Multi-Objective Training-based Estimate for Efficient Neural Architecture SearchYuMing Zhang, Jun-Wei Hsieh, Xin Li, Ming-Ching Chang, Chun-Chieh Lee, Kuo-Chin Fan. [doi]
- Controlling Counterfactual Harm in Decision Support Systems Based on Prediction SetsEleni Straitouri, Suhas Thejaswi, Manuel Rodriguez. [doi]
- Dataset and Lessons Learned from the 2024 SaTML LLM Capture-the-Flag CompetitionEdoardo Debenedetti, Javier Rando, Daniel Paleka, Silaghi Fineas Florin, Dragos Albastroiu, Niv Cohen, Yuval Lemberg, Reshmi Ghosh, Rui Wen 0002, Ahmed Salem 0001, Giovanni Cherubin, Santiago Zanella Béguelin, Robin Schmid, Victor Klemm, Takahiro Miki, Chenhao Li, Stefan Kraft, Mario Fritz, Florian Tramèr, Sahar Abdelnabi, Lea Schönherr. [doi]
- FINALLY: fast and universal speech enhancement with studio-like qualityNicholas Babaev, Kirill Tamogashev, Azat Saginbaev, Ivan Shchekotov, Hanbin Bae, Hosang Sung, Won Jun Lee, Hoon-Young Cho, Pavel Andreev. [doi]
- Aligning LLM Agents by Learning Latent Preference from User EditsGe Gao, Alexey Taymanov, Eduardo Salinas, Paul Mineiro, Dipendra Misra. [doi]
- TabPedia: Towards Comprehensive Visual Table Understanding with Concept SynergyWeichao Zhao, Hao Feng, Qi Liu, Jingqun Tang, Binghong Wu, Lei Liao, Shu Wei, Yongjie Ye, Hao Liu 0003, Wengang Zhou, Houqiang Li, Can Huang. [doi]
- NaRCan: Natural Refined Canonical Image with Integration of Diffusion Prior for Video EditingTing-Hsuan Chen, Jiewen Chan, Hau-Shiang Shiu, Shih-Han Yen, Changhan Yeh, Yu-Lun Liu 0001. [doi]
- Provable Benefit of Cutout and CutMix for Feature LearningJunsoo Oh, Chulhee Yun. [doi]
- Masked Hard-Attention Transformers Recognize Exactly the Star-Free LanguagesAndy Yang, David Chiang 0001, Dana Angluin. [doi]
- Reinforcement Learning Guided Semi-Supervised LearningMarzi Heidari, Hanping Zhang, Yuhong Guo. [doi]
- Achieving Domain-Independent Certified Robustness via Knowledge ContinuityAlan Sun, Chiyu Ma, Kenneth Ge, Soroush Vosoughi. [doi]
- Computation-Aware Gaussian Processes: Model Selection And Linear-Time InferenceJonathan Wenger, Kaiwen Wu, Philipp Hennig, Jacob R. Gardner, Geoff Pleiss, John P. Cunningham. [doi]
- MoMu-Diffusion: On Learning Long-Term Motion-Music Synchronization and CorrespondenceFuming You, Minghui Fang 0002, Li Tang, Rongjie Huang, Yongqi Wang, Zhou Zhao. [doi]
- On Divergence Measures for Training GFlowNetsTiago da Silva, Eliezer de Souza da Silva, Diego Mesquita. [doi]
- CoIN: A Benchmark of Continual Instruction Tuning for Multimodel Large Language ModelsCheng Chen, Junchen Zhu, Xu Luo 0003, Hengtao Shen, Jingkuan Song, Lianli Gao. [doi]
- MotionGS: Exploring Explicit Motion Guidance for Deformable 3D Gaussian SplattingRuijie Zhu 0002, Yanzhe Liang, Hanzhi Chang, Jiacheng Deng 0002, Jiahao Lu, Wenfei Yang, Tianzhu Zhang, Yongdong Zhang 0001. [doi]
- WebUOT-1M: Advancing Deep Underwater Object Tracking with A Million-Scale BenchmarkChunhui Zhang, Li Liu 0036, Guanjie Huang, Hao Wen, Xi Zhou, Yanfeng Wang 0001. [doi]
- Scalable DP-SGD: Shuffling vs. Poisson SubsamplingLynn Chua, Badih Ghazi, Pritish Kamath, Ravi Kumar 0001, Pasin Manurangsi, Amer Sinha, Chiyuan Zhang. [doi]
- Lower Bounds of Uniform Stability in Gradient-Based Bilevel Algorithms for Hyperparameter OptimizationRongzhen Wang, Chenyu Zheng, Guoqiang Wu, Xu Min, Xiaolu Zhang, Jun Zhou, Chongxuan Li. [doi]
- A Unifying Normative Framework of Decision ConfidenceAmelia Johnson, Michael A. Buice, Koosha Khalvati. [doi]
- Linear Transformers are Versatile In-Context LearnersMax Vladymyrov, Johannes von Oswald, Mark Sandler 0002, Rong Ge. [doi]
- Towards Safe Concept Transfer of Multi-Modal Diffusion via Causal Representation EditingPeiran Dong, Bingjie Wang, Song Guo 0001, Junxiao Wang, Jie Zhang 0076, Zicong Hong. [doi]
- Zero-Shot Event-Intensity Asymmetric Stereo via Visual Prompting from Image DomainHanyue Lou, Jinxiu (Sherry) Liang, Minggui Teng, Bin Fan 0002, Yong Xu 0007, Boxin Shi. [doi]
- Autoformalize Mathematical Statements by Symbolic Equivalence and Semantic ConsistencyZenan Li, Yifan Wu, Zhaoyu Li, Xinming Wei, Xian Zhang, Fan Yang, Xiaoxing Ma. [doi]
- MutaPLM: Protein Language Modeling for Mutation Explanation and EngineeringYizhen Luo, Zikun Nie, Massimo Hong, Suyuan Zhao, Hao Zhou 0012, Zaiqing Nie. [doi]
- DC-Gaussian: Improving 3D Gaussian Splatting for Reflective Dash Cam VideosLinhan Wang, Kai Cheng, Shuo Lei, Shengkun Wang, Wei Yin 0006, Chenyang Lei, Xiaoxiao Long, Chang-Tien Lu. [doi]
- Hyper-opinion Evidential Deep Learning for Out-of-Distribution DetectionJingen Qu, Yufei Chen 0002, Xiaodong Yue, Wei Fu, Qiguang Huang. [doi]
- Higher-Rank Irreducible Cartesian Tensors for Equivariant Message PassingViktor Zaverkin, Francesco Alesiani, Takashi Maruyama, Federico Errica, Henrik Christiansen, Makoto Takamoto, Nicolas Weber, Mathias Niepert. [doi]
- Identify Then Recommend: Towards Unsupervised Group RecommendationYue Liu, Shihao Zhu, Tianyuan Yang, Jian Ma, Wenliang Zhong. [doi]
- Recursive PAC-Bayes: A Frequentist Approach to Sequential Prior Updates with No Information LossYi-Shan Wu 0003, Yijie Zhang, Badr-Eddine Chérief-Abdellatif, Yevgeny Seldin. [doi]
- Dissecting the Interplay of Attention Paths in a Statistical Mechanics Theory of TransformersLorenzo Tiberi, Francesca Mignacco, Kazuki Irie, Haim Sompolinsky. [doi]
- Achieving Linear Convergence with Parameter-Free Algorithms in Decentralized OptimizationIlya A. Kuruzov, Gesualdo Scutari, Alexander V. Gasnikov. [doi]
- Nonlinear dynamics of localization in neural receptive fieldsLeon Lufkin, Andrew M. Saxe, Erin Grant. [doi]
- MediQ: Question-Asking LLMs and a Benchmark for Reliable Interactive Clinical ReasoningShuyue Stella Li, Vidhisha Balachandran, Shangbin Feng, Jonathan Ilgen, Emma Pierson, Pang Wei W. Koh, Yulia Tsvetkov. [doi]
- Optimal Design for Human Preference ElicitationSubhojyoti Mukherjee, Anusha Lalitha, Kousha Kalantari, Aniket Anand Deshmukh, Ge Liu, Yifei Ma, Branislav Kveton. [doi]
- SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and Prompt TypesYutao Mou, Shikun Zhang, Wei Ye. [doi]
- Learning Structured Representations with Hyperbolic EmbeddingsAditya Sinha, Siqi Zeng, Makoto Yamada, Han Zhao 0002. [doi]
- Evaluating Copyright Takedown Methods for Language ModelsBoyi Wei, Weijia Shi, Yangsibo Huang, Noah A. Smith, Chiyuan Zhang, Luke Zettlemoyer, Kai Li, Peter Henderson 0002. [doi]
- UMB: Understanding Model Behavior for Open-World Object DetectionXing Xi, Yangyang Huang, Zhijie Zhong, Ronghua Luo. [doi]
- Pandora's Box: Towards Building Universal Attackers against Real-World Large Vision-Language ModelsDaizong Liu, Mingyu Yang, Xiaoye Qu, Pan Zhou 0001, Xiang Fang, Keke Tang, Yao Wan 0001, Lichao Sun 0001. [doi]
- Boosting Sample Efficiency and Generalization in Multi-agent Reinforcement Learning via EquivarianceJoshua McClellan, Naveed Haghani, John Winder, Furong Huang, Pratap Tokekar. [doi]
- YOLOv10: Real-Time End-to-End Object DetectionAo Wang, Hui Chen, Lihao Liu, Kai Chen, Zijia Lin, Jungong Han, Guiguang Ding. [doi]
- EHRNoteQA: An LLM Benchmark for Real-World Clinical Practice Using Discharge SummariesSunjun Kweon, Jiyoun Kim, Heeyoung Kwak, Dongchul Cha, Hangyul Yoon, Kwang Kim, Jeewon Yang, Seunghyun Won, Edward Choi. [doi]
- SETLEXSEM CHALLENGE: Using Set Operations to Evaluate the Lexical and Semantic Robustness of Language ModelsNicholas A. Dronen, Bardiya Akhbari, Manish Gawali. [doi]
- Image Copy Detection for Diffusion ModelsWenhao Wang, Yifan Sun, Zhentao Tan, Yi Yang. [doi]
- A Simulation Benchmark for Autonomous Racing with Large-Scale Human DataAdrian Remonda, Nicklas Hansen 0001, Ayoub Raji, Nicola Musiu, Marko Bertogna, Eduardo E. Veas, Xiaolong Wang 0004. [doi]
- Accelerating Blockwise Parallel Language Models with Draft RefinementTaehyeon Kim, Ananda Theertha Suresh, Kishore Papineni, Michael D. Riley, Sanjiv Kumar, Adrian Benton. [doi]
- Globally Q-linear Gauss-Newton Method for Overparameterized Non-convex Matrix SensingXixi Jia, Fangchen Feng, Deyu Meng, Defeng Sun. [doi]
- Towards Flexible Visual Relationship SegmentationFangrui Zhu, Jianwei Yang, Huaizu Jiang. [doi]
- Exponential Quantum Communication Advantage in Distributed Inference and LearningDar Gilboa, Hagay Michaeli, Daniel Soudry, Jarrod R. McClean. [doi]
- Microstructures and Accuracy of Graph Recall by Large Language ModelsYanbang Wang, Hejie Cui, Jon M. Kleinberg. [doi]
- Neural Persistence DynamicsSebastian Zeng, Florian Graf, Martin Uray, Stefan Huber 0001, Roland Kwitt. [doi]
- A Theoretical Understanding of Self-Correction through In-context AlignmentYifei Wang 0001, Yuyang Wu, Zeming Wei, Stefanie Jegelka, Yisen Wang 0001. [doi]
- Fourier Amplitude and Correlation Loss: Beyond Using L2 Loss for Skillful Precipitation NowcastingChiu Wai Yan, Shi Quan Foo, Van Hoan Trinh, Dit-Yan Yeung, Ka-Hing Wong, Wai-Kin Wong. [doi]
- Probabilistic Federated Prompt-Tuning with Non-IID and Imbalanced DataPei-Yau Weng, Minh Hoang, Lam Nguyen, My T. Thai, Lily Weng, Trong Nghia Hoang. [doi]
- Weight for Robustness: A Comprehensive Approach towards Optimal Fault-Tolerant Asynchronous MLTehila Dahan, Kfir Y. Levy. [doi]
- Stability and Generalizability in SDE Diffusion Models with Measure-Preserving DynamicsWeitong Zhang, Chengqi Zang, Liu Li, Sarah Cechnicka, Cheng Ouyang, Bernhard Kainz. [doi]
- Speculative Decoding with CTC-based Draft Model for LLM Inference AccelerationZhuofan Wen, Shangtong Gui, Yang Feng 0004. [doi]
- Mitigating Spurious Correlations via Disagreement ProbabilityHyeonggeun Han, Sehwan Kim, Hyungjun Joo, Sangwoo Hong, Jungwoo Lee 0001. [doi]
- Optimal Parallelization of BoostingArthur da Cunha, Mikael Møller Høgsgaard, Kasper Green Larsen. [doi]
- Simulation-Free Training of Neural ODEs on Paired DataSemin Kim, Jaehoon Yoo, Jinwoo Kim, Yeonwoo Cha, Saehoon Kim, Seunghoon Hong. [doi]
- HOPE: Shape Matching Via Aligning Different K-hop NeighbourhoodsBarakeel Fanseu Kamhoua, Huamin Qu. [doi]
- Dataset Decomposition: Faster LLM Training with Variable Sequence Length CurriculumHadi Pouransari, Chun-Liang Li, Jen-Hao Rick Chang, Pavan Kumar Anasosalu Vasu, Cem Koc, Vaishaal Shankar, Oncel Tuzel. [doi]
- TransVIP: Speech to Speech Translation System with Voice and Isochrony PreservationChenyang Le, Yao Qian, Dongmei Wang, Long Zhou, Shujie Liu 0001, Xiaofei Wang 0009, Midia Yousefi, Yanmin Qian, Jinyu Li 0001, Michael Zeng 0001. [doi]
- The Implicit Bias of Adam on Separable DataChenyang Zhang, Difan Zou, Yuan Cao 0006. [doi]
- Adversarial Representation Engineering: A General Model Editing Framework for Large Language ModelsYihao Zhang, Zeming Wei, Jun Sun 0001, Meng Sun 0002. [doi]
- D-MiSo: Editing Dynamic 3D Scenes using Multi-Gaussians SoupJoanna Waczynska, Piotr Borycki, Joanna Kaleta, Slawomir Konrad Tadeja, Przemyslaw Spurek. [doi]
- Neural Localizer Fields for Continuous 3D Human Pose and Shape EstimationIstván Sárándi, Gerard Pons-Moll. [doi]
- Neural Concept BinderWolfgang Stammer, Antonia Wüst, David Steinmann, Kristian Kersting. [doi]
- BMRS: Bayesian Model Reduction for Structured PruningDustin Wright 0001, Christian Igel, Raghavendra Selvan. [doi]
- dopanim: A Dataset of Doppelganger Animals with Noisy Annotations from Multiple HumansMarek Herde, Denis Huseljic, Lukas Rauch, Bernhard Sick. [doi]
- Heterogeneity-Guided Client Sampling: Towards Fast and Efficient Non-IID Federated LearningHuancheng Chen, Haris Vikalo. [doi]
- MaskLLM: Learnable Semi-Structured Sparsity for Large Language ModelsGongfan Fang, Hongxu Yin, Saurav Muralidharan, Greg Heinrich, Jeff Pool, Jan Kautz, Pavlo Molchanov 0001, Xinchao Wang. [doi]
- Adam on Local Time: Addressing Nonstationarity in RL with Relative Adam TimestepsBenjamin Ellis, Matthew Thomas Jackson, Andrei Lupu, Alexander David Goldie, Mattie Fellows, Shimon Whiteson, Jakob N. Foerster. [doi]
- AWT: Transferring Vision-Language Models via Augmentation, Weighting, and TransportationYuhan Zhu, Yuyang Ji, Zhiyu Zhao, Gangshan Wu, Limin Wang. [doi]
- Least Squares Regression Can Exhibit Under-Parameterized Double DescentXinyue Li, Rishi Sonthalia. [doi]
- Global Lyapunov functions: a long-standing open problem in mathematics, with symbolic transformersAlberto Alfarano, François Charton, Amaury Hayat. [doi]
- On the Identifiability of Poisson Branching Structural Causal Model Using Probability Generating FunctionYu Xiang, Jie Qiao, Zefeng Liang, Zihuai Zeng, Ruichu Cai, Zhifeng Hao. [doi]
- Spike-based Neuromorphic Model for Sound Source LocalizationDehao Zhang, Shuai Wang, Ammar Belatreche, Wenjie Wei, Yichen Xiao, Haorui Zheng, Zijian Zhou, Malu Zhang, Yang Yang 0060. [doi]
- Analysis of Corrected Graph ConvolutionsRobert J. Wang, Aseem Baranwal, Kimon Fountoulakis. [doi]
- Disentangled Representation Learning in Non-Markovian Causal SystemsAdam Li, Yushu Pan, Elias Bareinboim. [doi]
- Interpreting CLIP with Sparse Linear Concept Embeddings (SpLiCE)Usha Bhalla, Alex Oesterling, Suraj Srinivas, Flávio P. Calmon, Himabindu Lakkaraju. [doi]
- Decomposing and Interpreting Image Representations via Text in ViTs Beyond CLIPSriram Balasubramanian, Samyadeep Basu, Soheil Feizi. [doi]
- On-Road Object Importance Estimation: A New Dataset and A Model with Multi-Fold Top-Down GuidanceZhixiong Nan, Yilong Chen 0004, Tianfei Zhou, Tao Xiang 0001. [doi]
- No Free Lunch Theorem and Black-Box Complexity Analysis for Adversarial OptimisationPer Kristian Lehre, Shishen Lin. [doi]
- Diffusion-based Reinforcement Learning via Q-weighted Variational Policy OptimizationShutong Ding, Ke Hu, Zhenhao Zhang, Kan Ren, Weinan Zhang 0001, Jingyi Yu, Jingya Wang, Ye Shi 0001. [doi]
- Learning to Balance Altruism and Self-interest Based on Empathy in Mixed-Motive GamesFanqi Kong, Yizhe Huang, Song Chun Zhu, Siyuan Qi, Xue Feng. [doi]
- Vidu4D: Single Generated Video to High-Fidelity 4D Reconstruction with Dynamic Gaussian SurfelsYikai Wang 0001, Xinzhou Wang, Zilong Chen, Zhengyi Wang, Fuchun Sun 0001, Jun Zhu 0001. [doi]
- HARMONIC: Harnessing LLMs for Tabular Data Synthesis and Privacy ProtectionYuxin Wang, Duanyu Feng, Yongfu Dai, Zhengyu Chen, Jimin Huang, Sophia Ananiadou, Qianqian Xie, Hao Wang. [doi]
- Hierarchical Hybrid Sliced Wasserstein: A Scalable Metric for Heterogeneous Joint DistributionsKhai Nguyen, Nhat Ho. [doi]
- Dynamic Rescaling for Training GNNsNimrah Mustafa, Rebekka Burkholz. [doi]
- Bootstrapping Top-down Information for Self-modulating Slot AttentionDongwon Kim, Seoyeon Kim, Suha Kwak. [doi]
- Self-Supervised Alignment with Mutual Information: Learning to Follow Principles without Preference LabelsJan-Philipp Fränken, Eric Zelikman, Rafael Rafailov, Kanishk Gandhi, Tobias Gerstenberg, Noah D. Goodman. [doi]
- Only Strict Saddles in the Energy Landscape of Predictive Coding Networks?Francesco Innocenti, El Mehdi Achour, Ryan Singh, Christopher L. Buckley. [doi]
- Feedback control guides credit assignment in recurrent neural networksKlara Kaleb, Barbara Feulner, Juan Gallego, Claudia Clopath. [doi]
- Can Learned Optimization Make Reinforcement Learning Less Difficult?Alexander David Goldie, Chris Lu 0001, Matthew Thomas Jackson, Shimon Whiteson, Jakob Foerster. [doi]
- Provable Partially Observable Reinforcement Learning with Privileged InformationYang Cai 0001, Xiangyu Liu, Argyris Oikonomou, Kaiqing Zhang. [doi]
- Efficient Prompt Optimization Through the Lens of Best Arm IdentificationChengshuai Shi, Kun Yang, Zihan Chen, Jundong Li, Jing Yang, Cong Shen. [doi]
- A Fast Convoluted Story: Scaling Probabilistic Inference for Integer ArithmeticsLennert De Smet, Pedro Zuidberg Dos Martires. [doi]
- Heavy-Tailed Class Imbalance and Why Adam Outperforms Gradient Descent on Language ModelsFrederik Kunstner, Alan Milligan, Robin Yadav, Mark Schmidt 0001, Alberto Bietti. [doi]
- SR-CACO-2: A Dataset for Confocal Fluorescence Microscopy Image Super-ResolutionSoufiane Belharbi, Mara KM Whitford, Phuong Hoang, Shakeeb Murtaza, Luke McCaffrey, Eric Granger. [doi]
- Langevin Unlearning: A New Perspective of Noisy Gradient Descent for Machine UnlearningEli Chien, Haoyu Wang 0004, Ziang Chen, Pan Li 0005. [doi]
- OmniJARVIS: Unified Vision-Language-Action Tokenization Enables Open-World Instruction Following AgentsZihao Wang, Shaofei Cai, Zhancun Mu, Haowei Lin, Ceyao Zhang, Xuejie Liu, Qing Li 0003, Anji Liu, Xiaojian (Shawn) Ma, Yitao Liang. [doi]
- Learning the Latent Causal Structure for Modeling Label NoiseYexiong Lin, Yu Yao 0005, Tongliang Liu. [doi]
- Dissecting Query-Key Interaction in Vision TransformersXu Pan, Aaron Philip, Ziqian Xie, Odelia Schwartz. [doi]
- SelfCodeAlign: Self-Alignment for Code GenerationYuxiang Wei 0003, Federico Cassano, Jiawei Liu 0004, Yifeng Ding, Naman Jain, Zachary Mueller, Harm de Vries, Leandro von Werra, Arjun Guha, Lingming Zhang 0001. [doi]
- Simple and Fast Distillation of Diffusion ModelsZhenyu Zhou, Defang Chen 0001, Can Wang 0001, Chun Chen 0001, Siwei Lyu. [doi]
- Rethinking 3D Convolution in $\ell_p$-norm SpaceLi Zhang, Yan Zhong, Jianan Wang, Zhe Min, RujingWang, Liu Liu. [doi]
- NewTerm: Benchmarking Real-Time New Terms for Large Language Models with Annual UpdatesHexuan Deng, Wenxiang Jiao, Xuebo Liu 0002, Min Zhang 0005, Zhaopeng Tu. [doi]
- AirSketch: Generative Motion to SketchHui Xian Grace Lim, Xuanming Cui, Yogesh S. Rawat, Ser-Nam Lim. [doi]
- Efficient Minimum Bayes Risk Decoding using Low-Rank Matrix Completion AlgorithmsFiras Trabelsi, David Vilar, Mara Finkelstein, Markus Freitag. [doi]
- Test-time Adaptation in Non-stationary Environments via Adaptive Representation AlignmentZhen-yu Zhang, Zhiyu Xie, Huaxiu Yao, Masashi Sugiyama. [doi]
- CryoBench: Diverse and challenging datasets for the heterogeneity problem in cryo-EMMinkyu Jeon, Rishwanth Raghu, Miro Astore, Geoffrey Woollard, Ryan Feathers, Alkin Kaz, Sonya M. Hanson, Pilar Cossio, Ellen D. Zhong. [doi]
- A Hitchhiker's Guide to Fine-Grained Face Forgery Detection Using Common Sense ReasoningNiki Maria Foteinopoulou, Enjie Ghorbel, Djamila Aouada. [doi]
- Local Anti-Concentration Class: Logarithmic Regret for Greedy Linear Contextual BanditSeok-Jin Kim, Min-hwan Oh. [doi]
- D2R2: Diffusion-based Representation with Random Distance Matching for Tabular Few-shot LearningRuoxue Liu, Linjiajie Fang, Wenjia Wang, Bingyi Jing. [doi]
- From News to Forecast: Integrating Event Analysis in LLM-Based Time Series Forecasting with ReflectionXinlei Wang, Maike Feng, Jing Qiu 0001, Jinjin Gu, Junhua Zhao 0001. [doi]
- NeuroPath: A Neural Pathway Transformer for Joining the Dots of Human ConnectomesZiquan Wei, Tingting Dan, Jiaqi Ding, Guorong Wu 0001. [doi]
- Incentivizing Quality Text Generation via Statistical ContractsEden Saig, Ohad Einav, Inbal Talgam-Cohen. [doi]
- Toward Conditional Distribution Calibration in Survival PredictionShiang Qi, Yakun Yu, Russell Greiner. [doi]
- Compositional PAC-Bayes: Generalization of GNNs with persistence and beyondKirill Brilliantov, Amauri H. Souza, Vikas Garg 0001. [doi]
- IF-Font: Ideographic Description Sequence-Following Font GenerationXinping Chen, Xiao Ke, Wenzhong Guo. [doi]
- Panacea: Pareto Alignment via Preference Adaptation for LLMsYifan Zhong, Chengdong Ma, Xiaoyuan Zhang, Ziran Yang, Haojun Chen, Qingfu Zhang 0001, Siyuan Qi, Yaodong Yang 0001. [doi]
- Discrete Modeling via Boundary Conditional Diffusion ProcessesYuxuan Gu, Xiaocheng Feng, Lei Huang 0021, Yingsheng Wu, Zekun Zhou, Weihong Zhong, Kun Zhu 0025, Bing Qin 0001. [doi]
- Understanding Model Selection for Learning in Strategic EnvironmentsTinashe Handina, Eric Mazumdar. [doi]
- UDPM: Upsampling Diffusion Probabilistic ModelsShady Abu Hussein, Raja Giryes. [doi]
- Taming Diffusion Prior for Image Super-Resolution with Domain Shift SDEsQinpeng Cui, Yixuan Liu, Xinyi Zhang, Qiqi Bao, Qingmin Liao, liwang Amd, Lu Tian, Zicheng Liu, Zhongdao Wang, Emad Barsoum. [doi]
- FlexMol: A Flexible Toolkit for Benchmarking Molecular Relational LearningSizhe Liu, Jun Xia 0001, Lecheng Zhang, Yuchen Liu, Yue Liu 0008, Wenjie Du, Zhangyang Gao, Bozhen Hu, Cheng Tan 0012, Hongxin Xiang, Stan Z. Li. [doi]
- C-GAIL: Stabilizing Generative Adversarial Imitation Learning with Control TheoryTianjiao Luo, Tim Pearce, Huayu Chen, Jianfei Chen, Jun Zhu. [doi]
- User-Creator Feature Polarization in Recommender Systems with Dual InfluenceTao Lin, Kun Jin, Andrew Estornell, Xiaoying Zhang, Yiling Chen, Yang Liu. [doi]
- Rethinking Human Evaluation Protocol for Text-to-Video Models: Enhancing Reliability, Reproducibility, and PracticalityTianle Zhang, Langtian Ma, Yuchen Yan, Yuchen Zhang, Yue Yang, Ziyao Guo, Wenqi Shao, Kai Wang 0036, Yang You 0001, Yu Qiao 0001, Ping Luo 0002, Kaipeng Zhang. [doi]
- Precise asymptotics of reweighted least-squares algorithms for linear diagonal networksChiraag Kaushik, Justin Romberg, Vidya Muthukumar. [doi]
- Efficient Multi-task LLM Quantization and Serving for Multiple LoRA AdaptersYifei Xia, Fangcheng Fu, Wentao Zhang, Jiawei Jiang, Bin Cui 0001. [doi]
- Learning and Transferring Sparse Contextual Bigrams with Linear TransformersYunwei Ren, Zixuan Wang, Jason D. Lee. [doi]
- RAGChecker: A Fine-grained Framework for Diagnosing Retrieval-Augmented GenerationDongyu Ru, Lin Qiu, Xiangkun Hu, Tianhang Zhang, Peng Shi, Shuaichen Chang, Cheng Jiayang, Cunxiang Wang, Shichao Sun, Huanyu Li 0010, Zizhao Zhang, Binjie Wang, Jiarong Jiang, Tong He 0002, Zhiguo Wang, Pengfei Liu 0003, Yue Zhang 0004, Zheng Zhang 0001. [doi]
- Online Control with Adversarial Disturbance for Continuous-time Linear SystemsJingwei Li, Jing Dong 0008, Can Chang, Baoxiang Wang 0001, Jingzhao Zhang. [doi]
- SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified FlowChaoyang Wang, Xiangtai Li, Lu Qi, Henghui Ding, Yunhai Tong, Ming-Hsuan Yang 0001. [doi]
- Achievable distributional robustness when the robust risk is only partially identifiedJulia Kostin, Nicola Gnecco, Fanny Yang. [doi]
- A Critical Evaluation of AI Feedback for Aligning Large Language ModelsArchit Sharma, Sedrick Scott Keh, Eric Mitchell, Chelsea Finn, Kushal Arora, Thomas Kollar. [doi]
- Spatio-Spectral Graph Neural NetworksSimon Geisler, Arthur Kosmala, Daniel Herbst, Stephan Günnemann. [doi]
- Neur2BiLO: Neural Bilevel OptimizationJustin Dumouchelle, Esther Julien, Jannis Kurtz, Elias B. Khalil. [doi]
- Learning to Cooperate with Humans using Generative AgentsYancheng Liang, Daphne Chen, Abhishek Gupta 0004, Simon S. Du, Natasha Jaques. [doi]
- ConceptMix: A Compositional Image Generation Benchmark with Controllable DifficultyXindi Wu, Dingli Yu, Yangsibo Huang, Olga Russakovsky, Sanjeev Arora. [doi]
- Happy: A Debiased Learning Framework for Continual Generalized Category DiscoveryShijie Ma, Fei Zhu, Zhun Zhong, Wenzhuo Liu, Xu-Yao Zhang, Chenglin Liu 0001. [doi]
- Where's Waldo: Diffusion Features For Personalized Segmentation and RetrievalDvir Samuel, Rami Ben-Ari, Matan Levy, Nir Darshan, Gal Chechik. [doi]
- BERTs are Generative In-Context LearnersDavid Samuel. [doi]
- Optimal Top-Two Method for Best Arm Identification and Fluid AnalysisAgniv Bandyopadhyay, Sandeep Juneja 0001, Shubhada Agrawal. [doi]
- Not All Diffusion Model Activations Have Been Evaluated as Discriminative FeaturesBenyuan Meng, Qianqian Xu, Zitai Wang, Xiaochun Cao, Qingming Huang. [doi]
- Enabling Adaptive Agent Training in Open-Ended Simulators by Targeting DiversityRobby Costales, Stefanos Nikolaidis. [doi]
- Rethinking LLM Memorization through the Lens of Adversarial CompressionAvi Schwarzschild, Zhili Feng, Pratyush Maini, Zachary C. Lipton, J. Zico Kolter. [doi]
- A-FedPD: Aligning Dual-Drift is All Federated Primal-Dual Learning NeedsYan Sun, Li Shen 0008, Dacheng Tao. [doi]
- Make Continual Learning Stronger via C-FlatAng Bian, Wei Li, Hangjie Yuan, Chengrong Yu, Mang Wang, Zixiang Zhao, Aojun Lu, Pengliang Ji, Tao Feng 0014. [doi]
- DuQuant: Distributing Outliers via Dual Transformation Makes Stronger Quantized LLMsHaokun Lin, Haobo Xu, Yichen Wu, Jingzhi Cui, Yingtao Zhang, Linzhan Mou, Linqi Song, Zhenan Sun, Ying Wei 0001. [doi]
- A Taxonomy of Challenges to Curating Fair DatasetsDora Zhao, Morgan Klaus Scheuerman, Pooja Chitre, Jerone Theodore Alexander Andrews, Georgia Panagiotidou, Shawn Walker, Kathleen H. Pine, Alice Xiang. [doi]
- Advancing Open-Set Domain Generalization Using Evidential Bi-Level Hardest Domain SchedulerKunyu Peng, Di Wen 0006, Kailun Yang 0001, Ao Luo, Yufan Chen 0001, Jia Fu 0001, M. Saquib Sarfraz, Alina Roitberg, Rainer Stiefelhagen. [doi]
- VMamba: Visual State Space ModelYue Liu, Yunjie Tian, YuZhong Zhao, Hongtian Yu, Lingxi Xie, Yaowei Wang 0001, Qixiang Ye, Jianbin Jiao, Yunfan Liu 0001. [doi]
- Symbolic Regression with a Learned Concept LibraryArya Grayeli, Atharva Sehgal, Omar Costilla-Reyes, Miles Cranmer, Swarat Chaudhuri. [doi]
- CoFie: Learning Compact Neural Surface Representations with Coordinate FieldsHanwen Jiang, Haitao Yang 0005, Georgios Pavlakos, Qixing Huang. [doi]
- Data-Efficient Operator Learning via Unsupervised Pretraining and In-Context LearningWuyang Chen, Jialin Song, Pu Ren, Shashank Subramanian, Dmitriy Morozov, Michael W. Mahoney. [doi]
- Spectral Adapter: Fine-Tuning in Spectral SpaceFangzhao Zhang, Mert Pilanci. [doi]
- Active Set OrderingQuoc Phong Nguyen, Sunil Gupta 0001, Svetha Venkatesh, Bryan Kian Hsiang Low, Patrick Jaillet. [doi]
- FedLLM-Bench: Realistic Benchmarks for Federated Learning of Large Language ModelsRui Ye, Rui Ge 0008, Xinyu Zhu, Jingyi Chai, Yaxin Du, Yang Liu, Yanfeng Wang 0001, Siheng Chen. [doi]
- Visual Decoding and Reconstruction via EEG Embeddings with Guided DiffusionDongyang Li, Chen Wei 0006, Shiying Li, Jiachen Zou, Quanying Liu. [doi]
- Normalization and effective learning rates in reinforcement learningClare Lyle, Zeyu Zheng, Khimya Khetarpal, James Martens, Hado Philip van Hasselt, Razvan Pascanu, Will Dabney. [doi]
- Nimbus: Secure and Efficient Two-Party Inference for TransformersZhengyi Li, Kang Yang, Jin Tan, Wen-Jie Lu, Haoqi Wu, Xiao Wang, Yu Yu, Derun Zhao, Yancheng Zheng, Minyi Guo, Jingwen Leng. [doi]
- Diffusion Models With Learned Adaptive NoiseSubham S. Sahoo, Aaron Gokaslan, Christopher De Sa, Volodymyr Kuleshov. [doi]
- A Decision-Language Model (DLM) for Dynamic Restless Multi-Armed Bandit Tasks in Public HealthNikhil Behari, Edwin Zhang, Yunfan Zhao, Aparna Taneja, Dheeraj Nagaraj, Milind Tambe. [doi]
- Structured Unrestricted-Rank Matrices for Parameter Efficient FinetuningArijit Sehanobish, Kumar Avinava Dubey, Krzysztof Marcin Choromanski, Somnath Basu Roy Chowdhury, Deepali Jain, Vikas Sindhwani, Snigdha Chaturvedi. [doi]
- GTA: A Benchmark for General Tool AgentsJize Wang, Zerun Ma, Yining Li, Songyang Zhang, Cailian Chen, Kai Chen 0026, Xinyi Le. [doi]
- Differentially Private Optimization with Sparse GradientsBadih Ghazi, Cristóbal Guzmán, Pritish Kamath, Ravi Kumar 0001, Pasin Manurangsi. [doi]
- Meaningful Learning: Enhancing Abstract Reasoning in Large Language Models via Generic Fact GuidanceKai Xiong 0002, Xiao Ding, Ting Liu 0001, Bing Qin 0001, Dongliang Xu, Qing Yang, Hongtao Liu, Yixin Cao 0002. [doi]
- ContactField: Implicit Field Representation for Multi-Person Interaction GeometryHansol Lee, Tackgeun You, Hansoo Park, Woohyeon Shim, SangHyeon Kim, Hwasup Lim. [doi]
- Symmetric Linear Bandits with Hidden SymmetryNam Phuong Tran, The-Anh Ta, Debmalya Mandal, Long Tran-Thanh. [doi]
- Leveraging Catastrophic Forgetting to Develop Safe Diffusion Models against Malicious FinetuningJiadong Pan, Hongcheng Gao, Zongyu Wu, Taihang Hu, Li Su 0003, Qingming Huang, Liang Li 0003. [doi]
- GenArtist: Multimodal LLM as an Agent for Unified Image Generation and EditingZhenyu Wang, Aoxue Li, Zhenguo Li, Xihui Liu. [doi]
- On the Role of Attention Masks and LayerNorm in TransformersXinyi Wu, Amir Ajorlou, Yifei Wang 0001, Stefanie Jegelka, Ali Jadbabaie. [doi]
- Interpreting and Analysing CLIP's Zero-Shot Image Classification via Mutual KnowledgeFawaz Sammani, Nikos Deligiannis. [doi]
- Credal Deep Ensembles for Uncertainty QuantificationKaizheng Wang, Fabio Cuzzolin, Shireen Kudukkil Manchingal, Keivan Shariatmadar, David Moens, Hans Hallez. [doi]
- Causal Context Adjustment Loss for Learned Image CompressionMinghao Han, Shiyin Jiang, Shengxi Li, Xin Deng 0002, Mai Xu, Ce Zhu, Shuhang Gu. [doi]
- Improving Gloss-free Sign Language Translation by Reducing Representation DensityJinhui Ye, Xing Wang 0007, Wenxiang Jiao, Junwei Liang 0001, Hui Xiong. [doi]
- Query-Efficient Correlation Clustering with Noisy OracleYuko Kuroki, Atsushi Miyauchi 0001, Francesco Bonchi, Wei Chen 0013. [doi]
- Long-Horizon Planning for Multi-Agent Robots in Partially Observable EnvironmentsSiddharth Nayak, Adelmo Morrison Orozco, Marina Ten Have, Jackson Zhang, Vittal Thirumalai, Darren Chen, Aditya Kapoor, Eric Robinson, Karthik Gopalakrishnan 0002, James Harrison, Anuj Mahajan, Brian Ichter, Hamsa Balakrishnan. [doi]
- Instructor-inspired Machine Learning for Robust Molecular Property PredictionFang Wu 0002, Shuting Jin, Siyuan Li 0002, Stan Z. Li. [doi]
- Dealing with Synthetic Data Contamination in Online Continual LearningMaorong Wang, Nicolas Michel, Jiafeng Mao, Toshihiko Yamasaki. [doi]
- Self-Guided Masked AutoencoderJeongwoo Shin, Inseo Lee, Junho Lee, Joonseok Lee. [doi]
- Wings: Learning Multimodal LLMs without Text-only ForgettingYi-Kai Zhang, Shiyin Lu, Yang Li, Yanqing Ma, Qingguo Chen, Zhao Xu, Weihua Luo, Kaifu Zhang, De-Chuan Zhan, Han-Jia Ye. [doi]
- Infinite-Dimensional Feature InteractionChenhui Xu, Fuxun Yu, Maoliang Li, Zihao Zheng, Zirui Xu, Jinjun Xiong, Xiang Chen 0010. [doi]
- Interpretable Generalized Additive Models for Datasets with Missing ValuesHayden McTavish, Jon Donnelly, Margo I. Seltzer, Cynthia Rudin. [doi]
- Scalable Optimization in the Modular NormTim Large, Yang Liu, Jacob Huh, Hyojin Bahng, Phillip Isola, Jeremy Bernstein. [doi]
- Noisy Ostracods: A Fine-Grained, Imbalanced Real-World Dataset for Benchmarking Robust Machine Learning and Label Correction MethodsJiamian Hu, Yuanyuan Hong, Yihua Chen, He Wang, Moriaki Yasuhara. [doi]
- Frequency-aware Generative Models for Multivariate Time Series ImputationXinyu Yang, Yu Sun 0027, Xiaojie Yuan, Xinyang Chen. [doi]
- An engine not a camera: Measuring performative power of online searchCelestine Mendler-Dünner, Gabriele Carovano, Moritz Hardt. [doi]
- RoPINN: Region Optimized Physics-Informed Neural NetworksHaixu Wu, Huakun Luo, Yuezhou Ma, Jianmin Wang 0001, Mingsheng Long. [doi]
- Soft Prompt Threats: Attacking Safety Alignment and Unlearning in Open-Source LLMs through the Embedding SpaceLeo Schwinn, David Dobre, Sophie Xhonneux, Gauthier Gidel, Stephan Günnemann. [doi]
- Auditing Privacy Mechanisms via Label Inference AttacksRóbert Busa-Fekete, Travis Dick, Claudio Gentile, Andrés Muñoz Medina, Adam D. Smith 0001, Marika Swanberg. [doi]
- Clustering with Non-adaptive Subset QueriesHadley Black, Euiwoong Lee, Arya Mazumdar, Barna Saha. [doi]
- Adaptive Experimentation When You Can't ExperimentYao Zhao, Kwang-Sung Jun, Tanner Fiez, Lalit Jain. [doi]
- B-cosification: Transforming Deep Neural Networks to be Inherently InterpretableShreyash Arya, Sukrut Rao, Moritz Böhle, Bernt Schiele. [doi]
- Nonlocal Attention Operator: Materializing Hidden Knowledge Towards Interpretable Physics DiscoveryYue Yu, Ning Liu, Fei Lu, Tian Gao, Siavash Jafarzadeh, Stewart A. Silling. [doi]
- PhyloGen: Language Model-Enhanced Phylogenetic Inference via Graph Structure GenerationChenrui Duan, Zelin Zang, Siyuan Li 0002, Yongjie Xu, Stan Z. Li. [doi]
- How Does Black-Box Impact the Learning Guarantee of Stochastic Compositional Optimization?Jun Chen, Hong Chen 0004, Bin Gu 0001. [doi]
- Optimizing Automatic Differentiation with Deep Reinforcement LearningJamie Lohoff, Emre Neftci. [doi]
- Nuclear Norm Regularization for Deep LearningChristopher Scarvelis, Justin M. Solomon. [doi]
- Prism: A Framework for Decoupling and Assessing the Capabilities of VLMsYuxuan Qiao, Haodong Duan, XinYu Fang, Junming Yang, Lin Chen, Songyang Zhang, Jiaqi Wang, Dahua Lin, Kai Chen. [doi]
- Towards Accurate and Fair Cognitive Diagnosis via Monotonic Data AugmentationZheng Zhang 0048, Wei Song 0010, Qi Liu 0003, Qingyang Mao, Yiyan Wang, Weibo Gao, Zhenya Huang, Shijin Wang 0001, Enhong Chen. [doi]
- Unified Domain Generalization and Adaptation for Multi-View 3D Object DetectionGyusam Chang, Jiwon Lee, Donghyun Kim, Jinkyu Kim, Dongwook Lee, Daehyun Ji, Sujin Jang, Sangpil Kim. [doi]
- Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene UnderstandingYunze Man, Shuhong Zheng, Zhipeng Bao, Martial Hebert, Liangyan Gui, Yu-Xiong Wang. [doi]
- Grounding Multimodal Large Language Models in ActionsAndrew Szot, Bogdan Mazoure, Harsh Agrawal, R. Devon Hjelm, Zsolt Kira, Alexander Toshev. [doi]
- Dual-frame Fluid Motion Estimation with Test-time Optimization and Zero-divergence LossYifei Zhang, Huan-ang Gao, Zhou Jiang, Hao Zhao. [doi]
- Improving Neural ODE Training with Temporal Adaptive Batch NormalizationSu Zheng, Zhengqi Gao, Fan-Keng Sun, Duane S. Boning, Bei Yu 0001, Martin D. F. Wong. [doi]
- Robust Sleep Staging over Incomplete Multimodal Physiological Signals via Contrastive ImaginationQi Shen, Junchang Xin, Bing Tian Dai, Shudi Zhang, Zhiqiong Wang. [doi]
- Learning Discrete Concepts in Latent Hierarchical ModelsLingjing Kong, Guangyi Chen 0002, Biwei Huang, Eric P. Xing, Yuejie Chi, Kun Zhang 0001. [doi]
- On the Power of Small-size Graph Neural Networks for Linear ProgrammingQian Li, Tian Ding, Linxin Yang, Minghui Ouyang, Qingjiang Shi, Ruoyu Sun 0001. [doi]
- Trap-MID: Trapdoor-based Defense against Model Inversion AttacksZhenTing Liu, ShangTse Chen. [doi]
- InversionView: A General-Purpose Method for Reading Information from Neural ActivationsXinting Huang, Madhur Panwar, Navin Goyal, Michael Hahn 0001. [doi]
- PEAC: Unsupervised Pre-training for Cross-Embodiment Reinforcement LearningChengyang Ying, Zhongkai Hao, Xinning Zhou, Xuezhou Xu, Hang Su 0006, Xingxing Zhang 0001, Jun Zhu 0001. [doi]
- Learning Social Welfare FunctionsKanad Pardeshi, Itai Shapira, Ariel D. Procaccia, Aarti Singh. [doi]
- Proximal Causal Inference With Text DataJacob M. Chen, Rohit Bhattacharya, Katherine A. Keith. [doi]
- Deep Homomorphism NetworksTakanori Maehara, Hoang NT. [doi]
- Multi-modal Transfer Learning between Biological Foundation ModelsJuan Jose Garau Luis, Patrick Bordes, Liam Gonzalez, Masa Roller, Bernardo P. de Almeida, Christopher Blum, Lorenz Hexemer, Stefan Laurent, Maren Lang, Thomas Pierrot, Guillaume Richard. [doi]
- E2ENet: Dynamic Sparse Feature Fusion for Accurate and Efficient 3D Medical Image SegmentationBoqian Wu, Qiao Xiao, Shiwei Liu 0003, Lu Yin 0006, Mykola Pechenizkiy, Decebal Constantin Mocanu, Maurice van Keulen, Elena Mocanu. [doi]
- Eye-gaze Guided Multi-modal Alignment for Medical Representation LearningChong Ma, Hanqi Jiang, Wenting Chen, Yiwei Li 0002, Zihao Wu 0001, Xiaowei Yu, Zhengliang Liu, Lei Guo 0002, Dajiang Zhu, Tuo Zhang, Dinggang Shen, Tianming Liu 0001, Xiang Li 0001. [doi]
- SimGen: Simulator-conditioned Driving Scene GenerationYunsong Zhou, Michael Simon, Zhenghao Mark Peng, Sicheng Mo, Hongzi Zhu, Minyi Guo, Bolei Zhou. [doi]
- Understanding the Limits of Vision Language Models Through the Lens of the Binding ProblemDeclan Campbell, Sunayana Rane, Tyler Giallanza, Nicolò De Sabbata, Kia Ghods, Amogh Joshi 0004, Alexander Ku, Steven Frankland, Tom Griffiths 0001, Jonathan D. Cohen 0003, Taylor W. Webb. [doi]
- MotionCraft: Physics-Based Zero-Shot Video GenerationAntonio Montanaro, Luca Savant Aira, Emanuele Aiello, Diego Valsesia, Enrico Magli. [doi]
- RouterDC: Query-Based Router by Dual Contrastive Learning for Assembling Large Language ModelsShuhao Chen, Weisen Jiang, Baijiong Lin, James T. Kwok, Yu Zhang 0006. [doi]
- Learning an Actionable Discrete Diffusion Policy via Large-Scale Actionless Video Pre-TrainingHaoran He, Chenjia Bai, Ling Pan, Weinan Zhang 0001, Bin Zhao 0001, Xuelong Li 0001. [doi]
- Where does In-context Learning Happen in Large Language Models?Suzanna Sia, David Mueller, Kevin Duh. [doi]
- Diffusion Models are Certifiably Robust ClassifiersHuanran Chen, Yinpeng Dong, Shitong Shao, Zhongkai Hao, Xiao Yang, Hang Su, Jun Zhu. [doi]
- LoQT: Low-Rank Adapters for Quantized PretrainingSebastian Loeschcke, Mads Toftrup, Michael J. Kastoryano, Serge J. Belongie, Vésteinn Snæbjarnarson. [doi]
- DACO: Towards Application-Driven and Comprehensive Data Analysis via Code GenerationXueqing Wu 0001, Rui Zheng, Jingzhen Sha, Te-Lin Wu, Hanyu Zhou, Mohan Tang, Kai-Wei Chang, Nanyun Peng 0001, Haoran Huang. [doi]
- LaKD: Length-agnostic Knowledge Distillation for Trajectory Prediction with Any Length ObservationsYuhang Li, Changsheng Li, Ruilin Lv, Rongqing Li, Ye Yuan 0001, Guoren Wang. [doi]
- Association Pattern-aware Fusion for Biological Entity Relationship PredictionLingxiang Jia, Yuchen Ying, Zunlei Feng, Zipeng Zhong, Shaolun Yao, Jiacong Hu, Mingjiang Duan, Xingen Wang, Jie Song 0011, Mingli Song. [doi]
- Gorilla: Large Language Model Connected with Massive APIsShishir G. Patil, Tianjun Zhang, Xin Wang 0066, Joseph E. Gonzalez. [doi]
- Rethinking the Evaluation of Out-of-Distribution Detection: A Sorites ParadoxXingming Long, Jie Zhang 0071, Shiguang Shan, Xilin Chen 0001. [doi]
- Learning in Markov Games with Adaptive Adversaries: Policy Regret, Fundamental Barriers, and Efficient AlgorithmsThanh Nguyen-Tang, Raman Arora. [doi]
- F-OAL: Forward-only Online Analytic Learning with Fast Training and Low Memory Footprint in Class Incremental LearningHuiping Zhuang, Yuchen Liu 0001, Run He, Kai Tong, Ziqian Zeng, Cen Chen, Yi Wang 0068, Lap-Pui Chau. [doi]
- Large Language Models Play StarCraft II: Benchmarks and A Chain of Summarization ApproachWeiyu Ma, Qirui Mi, Yongcheng Zeng, Xue Yan, Runji Lin, Yuqiao Wu, Jun Wang, Haifeng Zhang 0002. [doi]
- Building a stable classifier with the inflated argmaxJake A. Soloff, Rina Barber, Rebecca Willett. [doi]
- Unlocking the Potential of Global Human ExpertiseElliot Meyerson, Olivier Francon, Darren Sargent, Babak Hodjat, Risto Miikkulainen. [doi]
- Hollowed Net for On-Device Personalization of Text-to-Image Diffusion ModelsWonguk Cho, Seokeon Choi, Debasmit Das, Matthias Reisser, Taesup Kim, Sungrack Yun, Fatih Porikli. [doi]
- L4GM: Large 4D Gaussian Reconstruction ModelJiawei Ren, Cheng Xie, Ashkan Mirzaei, Hanxue Liang, Xiaohui Zeng, Karsten Kreis, Ziwei Liu, Antonio Torralba 0001, Sanja Fidler, Seung Wook Kim 0001, Huan Ling. [doi]
- BiScope: AI-generated Text Detection by Checking Memorization of Preceding TokensHanxi Guo, Siyuan Cheng 0005, Xiaolong Jin, Zhuo Zhang 0002, Kaiyuan Zhang 0002, Guanhong Tao 0001, Guangyu Shen, Xiangyu Zhang 0001. [doi]
- Transferable Boltzmann GeneratorsLeon Klein, Frank Noé. [doi]
- Promoting Fairness Among Dynamic Agents in Online-Matching Markets under Known Stationary Arrival DistributionsWill Ma, Pan Xu 0001. [doi]
- BOLD: Boolean Logic Deep LearningVan Minh Nguyen, Cristian Ocampo-Blandon, Aymen Askri, Louis Leconte, Ba-Hien Tran. [doi]
- Towards Learning Group-Equivariant Features for Domain Adaptive 3D DetectionSangyun Shin, Yuhang He, Madhu Vankadari, Ta Ying Cheng, Qian Xie 0001, Andrew Markham, Niki Trigoni. [doi]
- MathPile: A Billion-Token-Scale Pretraining Corpus for MathZengzhi Wang, Xuefeng Li, Rui Xia, Pengfei Liu. [doi]
- Learning Cooperative Trajectory Representations for Motion ForecastingHongzhi Ruan, Haibao Yu, Wenxian Yang, Siqi Fan 0002, Zaiqing Nie. [doi]
- Reducing Transformer Key-Value Cache Size with Cross-Layer AttentionWilliam Brandon, Mayank Mishra, Aniruddha Nrusimha, Rameswar Panda, Jonathan Ragan-Kelley. [doi]
- Catastrophic Goodhart: regularizing RLHF with KL divergence does not mitigate heavy-tailed reward misspecificationThomas Kwa, Drake Thomas, Adrià Garriga-Alonso. [doi]
- Parseval Regularization for Continual Reinforcement LearningWesley Chung, Lynn Cherif, Doina Precup, David Meger. [doi]
- Schrodinger Bridge Flow for Unpaired Data TranslationValentin De Bortoli, Iryna Korshunova, Andriy Mnih, Arnaud Doucet. [doi]
- MSPE: Multi-Scale Patch Embedding Prompts Vision Transformers to Any ResolutionWenzhuo Liu, Fei Zhu, Shijie Ma, Cheng-Lin Liu 0001. [doi]
- Unravelling in Collaborative LearningAymeric Capitaine, Etienne Boursier, Antoine Scheid, Eric Moulines, Michael I. Jordan, El Mahdi El Mhamdi, Alain Durmus. [doi]
- Communication Bounds for the Distributed Experts ProblemZhihao Jia, Qi Pang, Trung Tran, David P. Woodruff, Zhihao Zhang, Wenting Zheng. [doi]
- UQE: A Query Engine for Unstructured DatabasesHanjun Dai, Bethany Wang, Xingchen Wan, Bo Dai 0001, Sherry Yang 0001, Azade Nova, Pengcheng Yin, Phitchaya Mangpo Phothilimthana, Charles Sutton, Dale Schuurmans. [doi]
- DeiSAM: Segment Anything with Deictic PromptingHikaru Shindo, Manuel Brack, Gopika Sudhakaran, Devendra Singh Dhami, Patrick Schramowski, Kristian Kersting. [doi]
- RelBench: A Benchmark for Deep Learning on Relational DatabasesJoshua Robinson 0001, Rishabh Ranjan, Weihua Hu, Kexin Huang, Jiaqi Han, Alejandro Dobles, Matthias Fey, Jan Eric Lenssen, Yiwen Yuan, Zecheng Zhang, Xinwei He, Jure Leskovec. [doi]
- Outlier-Robust Distributionally Robust Optimization via Unbalanced Optimal TransportZifan Wang 0002, Yi Shen 0011, Michael M. Zavlanos, Karl Henrik Johansson. [doi]
- HonestLLM: Toward an Honest and Helpful Large Language ModelChujie Gao, Siyuan Wu 0001, Yue Huang, Dongping Chen, Qihui Zhang, Zhengyan Fu, Yao Wan 0001, Lichao Sun 0001, Xiangliang Zhang 0001. [doi]
- SuperDeepFool: a new fast and accurate minimal adversarial attackAlireza Abdollahpour, Mahed Abroshan, Seyed-Mohsen Moosavi-Dezfooli. [doi]
- Aggregate-and-Adapt Natural Language Prompts for Downstream Generalization of CLIPChen Huang 0001, Skyler Seto, Samira Abnar, David Grangier, Navdeep Jaitly, Joshua M. Susskind. [doi]
- Space-Time Continuous PDE Forecasting using Equivariant Neural FieldsDavid M. Knigge, David R. Wessels, Riccardo Valperga, Samuele Papa, Jan-Jakob Sonke, Erik J. Bekkers, Efstratios Gavves. [doi]
- Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference FeedbackHamish Ivison, Yizhong Wang, Jiacheng Liu 0010, Zeqiu Wu, Valentina Pyatkin, Nathan Lambert 0001, Noah A. Smith, Yejin Choi 0001, Hanna Hajishirzi. [doi]
- Mutual Information Estimation via f-Divergence and Data DerangementsNunzio Alexandro Letizia, Nicola Novello, Andrea M. Tonello. [doi]
- Parallel Backpropagation for Shared-Feature VisualizationAlexander Lappe, Anna Bognár, Ghazaleh Ghamkhari Nejad, Albert Mukovskiy, Lucas Martini, Martin A. Giese, Rufin Vogels. [doi]
- Sample-Efficient Private Learning of Mixtures of GaussiansHassan Ashtiani, Mahbod Majid, Shyam Narayanan. [doi]
- PhyRecon: Physically Plausible Neural Scene ReconstructionJunfeng Ni, Yixin Chen 0003, Bohan Jing, Nan Jiang, Bin Wang, Bo Dai 0025, Puhao Li, Yixin Zhu 0001, Song Chun Zhu, Siyuan Huang 0001. [doi]
- Empowering and Assessing the Utility of Large Language Models in Crop ScienceHang Zhang, Jiawei Sun, Renqi Chen, Wei Liu 0123, Zhonghang Yuan, Xinzhe Zheng, Zhefan Wang, Zhiyuan Yang, Hang Yan 0001, Han-Sen Zhong, Xiqing Wang, Wanli Ouyang, Fan Yang, Nanqing Dong. [doi]
- Smoothie: Label Free Language Model RoutingNeel Guha, Mayee F. Chen, Trevor Chow, Ishan S. Khare, Christopher Ré. [doi]
- Near-Optimal Dynamic Regret for Adversarial Linear Mixture MDPsLong-Fei Li, Peng Zhao 0006, Zhi-Hua Zhou. [doi]
- IntraMix: Intra-Class Mixup Generation for Accurate Labels and NeighborsShenghe Zheng, Hongzhi Wang 0001, Xianglong Liu 0004. [doi]
- A StrongREJECT for Empty JailbreaksAlexandra Souly, Qingyuan Lu, Dillon Bowen, Tu Trinh, Elvis Hsieh, Sana Pandey, Pieter Abbeel, Justin Svegliato, Scott Emmons, Olivia Watkins, Sam Toyer. [doi]
- Provably Safe Neural Network Controllers via Differential Dynamic LogicSamuel Teuber, Stefan Mitsch, André Platzer. [doi]
- Loss Landscape Characterization of Neural Networks without Over-ParametrizationRustem Islamov, Niccolò Ajroldi, Antonio Orvieto, Aurélien Lucchi. [doi]
- Satformer: Accurate and Robust Traffic Data Estimation for Satellite NetworksLiang Qin, Xiyuan Liu, Wenting Wei, Liang Chengbin, Huaxi Gu. [doi]
- Graph Edit Distance with General Costs Using Neural Set DivergenceEeshaan Jain, Indradyumna Roy, Saswat Meher, Soumen Chakrabarti, Abir De. [doi]
- Director3D: Real-world Camera Trajectory and 3D Scene Generation from TextXinyang Li, Zhangyu Lai, Linning Xu, Yansong Qu, Liujuan Cao, Shengchuan Zhang, Bo Dai 0002, Rongrong Ji. [doi]
- Plan-on-Graph: Self-Correcting Adaptive Planning of Large Language Model on Knowledge GraphsLiyi Chen, Panrong Tong, Zhongming Jin, Ying Sun, Jieping Ye, Hui Xiong. [doi]
- On the Computational Landscape of Replicable LearningAlkis Kalavasis, Amin Karbasi, Grigoris Velegkas, Felix Zhou 0002. [doi]
- Predicting Label Distribution from Ternary LabelsYunan Lu, Xiuyi Jia. [doi]
- AlchemistCoder: Harmonizing and Eliciting Code Capability by Hindsight Tuning on Multi-source DataZifan Song, Yudong Wang, Wenwei Zhang, Kuikun Liu, Chengqi Lyu, Demin Song, Qipeng Guo, Hang Yan 0001, Dahua Lin, Kai Chen 0026, Cairong Zhao. [doi]
- Strategic Linear Contextual BanditsThomas Kleine Buening, Aadirupa Saha, Christos Dimitrakakis, Haifeng Xu. [doi]
- Data Acquisition via Experimental Design for Data MarketsCharles Lu 0001, Baihe Huang, Sai Praneeth Karimireddy, Praneeth Vepakomma, Michael I. Jordan, Ramesh Raskar. [doi]
- 2 with the Optimal RateShohei Taniguchi, Keno Harada, Gouki Minegishi, Yuta Oshima, Seong Cheol Jeong, Go Nagahara, Tomoshi Iiyama, Masahiro Suzuki, Yusuke Iwasawa, Yutaka Matsuo. [doi]
- Reconstruction of Manipulated Garment with Guided Deformation PriorRen Li, Corentin Dumery, Zhantao Deng, Pascal Fua. [doi]
- Diffusion Priors for Variational Likelihood Estimation and Image DenoisingJun Cheng, Shan Tan. [doi]
- Emotion-LLaMA: Multimodal Emotion Recognition and Reasoning with Instruction TuningZebang Cheng, Zhi-Qi Cheng, Jun-Yan He, Kai Wang 0036, Yuxiang Lin, Zheng Lian, Xiaojiang Peng, Alexander G. Hauptmann. [doi]
- One-to-Normal: Anomaly Personalization for Few-shot Anomaly DetectionYiyue Li, Shaoting Zhang 0001, Kang Li 0004, Qicheng Lao. [doi]
- Unifying Homophily and Heterophily for Spectral Graph Neural Networks via Triple Filter EnsemblesRui Duan, Mingjian Guang, Junli Wang, ChunGang Yan, Hongda Qi, Wenkang Su, Can Tian, Haoran Yang. [doi]
- Relational Concept Bottleneck ModelsPietro Barbiero, Francesco Giannini, Gabriele Ciravegna, Michelangelo Diligenti, Giuseppe Marra. [doi]
- Online Relational Inference for Evolving Multi-agent Interacting SystemsBeomseok Kang, Priyabrata Saha, Sudarshan Sharma, Biswadeep Chakraborty, Saibal Mukhopadhyay. [doi]
- Sparse maximal update parameterization: A holistic approach to sparse training dynamicsNolan Dey, Shane Bergsma, Joel Hestness. [doi]
- Natural Counterfactuals With Necessary BacktrackingGuang-Yuan Hao, Jiji Zhang, Biwei Huang, Hao Wang, Kun Zhang. [doi]
- Geometry of naturalistic object representations in recurrent neural network models of working memoryXiaoxuan Lei, Takuya Ito, Pouya Bashivan. [doi]
- Your Diffusion Model is Secretly a Noise Classifier and Benefits from Contrastive TrainingYunshu Wu, Yingtao Luo, Xianghao Kong, Vagelis Papalexakis, Greg Ver Steeg. [doi]
- EnsIR: An Ensemble Algorithm for Image Restoration via Gaussian Mixture ModelsShangquan Sun, Wenqi Ren, Zikun Liu, Hyunhee Park, Rui Wang 0032, Xiaochun Cao. [doi]
- Recurrent neural networks: vanishing and exploding gradients are not the end of the storyNicolas Zucchet, Antonio Orvieto. [doi]
- SemCoder: Training Code Language Models with Comprehensive Semantics ReasoningYangruibo Ding, Jinjun Peng, Marcus J. Min, Gail E. Kaiser, Junfeng Yang, Baishakhi Ray. [doi]
- One-Step Diffusion Distillation through Score Implicit MatchingWeijian Luo, Zemin Huang, Zhengyang Geng, J. Zico Kolter, Guo-Jun Qi. [doi]
- Deep Support VectorsJunhoo Lee, Hyunho Lee, Kyomin Hwang, Nojun Kwak. [doi]
- GraphMorph: Tubular Structure Extraction by Morphing Predicted GraphsZhao Zhang, Ziwei Zhao, Dong Wang, Liwei Wang. [doi]
- Dispelling the Mirage of Progress in Offline MARL through Standardised Baselines and EvaluationJuan Formanek, Callum Rhys Tilbury, Louise Beyers, Jonathan P. Shock, Arnu Pretorius. [doi]
- Source Code Foundation Models are Transferable Binary Analysis Knowledge BasesZian Su, Xiangzhe Xu, Ziyang Huang, Kaiyuan Zhang 0002, Xiangyu Zhang 0001. [doi]
- A Cross-Domain Benchmark for Active LearningThorben Werner, Johannes Burchert, Maximilian Stubbemann, Lars Schmidt-Thieme. [doi]
- OxonFair: A Flexible Toolkit for Algorithmic FairnessEoin Delaney, Zihao Fu, Sandra Wachter, Brent D. Mittelstadt 0002, Chris Russell 0001. [doi]
- A Prompt-Based Knowledge Graph Foundation Model for Universal In-Context ReasoningYuanning Cui, Zequn Sun, Wei Hu. [doi]
- Sample Complexity of Algorithm Selection Using Neural Networks and Its Applications to Branch-and-CutHongyu Cheng, Sammy Khalife, Barbara Fiedorowicz, Amitabh Basu. [doi]
- EigenVI: score-based variational inference with orthogonal function expansionsDiana Cai, Chirag Modi 0002, Charles Margossian, Robert M. Gower, David M. Blei, Lawrence K. Saul. [doi]
- Dynamic Neural Regeneration: Enhancing Deep Learning Generalization on Small DatasetsVijaya Raghavan T. Ramkumar, Elahe Arani, Bahram Zonooz. [doi]
- Rethinking the Membrane Dynamics and Optimization Objectives of Spiking Neural NetworksHangchi Shen, Qian Zheng, Huamin Wang, Gang Pan 0001. [doi]
- Intervention and Conditioning in Causal Bayesian NetworksSainyam Galhotra, Joseph Y. Halpern. [doi]
- Delving into the Reversal Curse: How Far Can Large Language Models Generalize?Zhengkai Lin, Zhihang Fu, Kai Liu, Liang Xie 0003, Binbin Lin, Wenxiao Wang 0001, Deng Cai 0001, Yue Wu, Jieping Ye. [doi]
- Initializing Variable-sized Vision Transformers from Learngene with Learnable TransformationShiyu Xia, Yuankun Zu, Xu Yang 0021, Xin Geng 0001. [doi]
- Pruning neural network models for gene regulatory dynamics using data and domain knowledgeIntekhab Hossain, Jonas Fischer, Rebekka Burkholz, John Quackenbush. [doi]
- Motion Forecasting in Continuous DrivingNan Song, Bozhou Zhang, Xiatian Zhu, Li Zhang. [doi]
- A Large-Scale Human-Centric Benchmark for Referring Expression Comprehension in the LMM EraFangyun Wei, Jinjing Zhao, Kun Yan, Hongyang Zhang, Chang Xu. [doi]
- GAMap: Zero-Shot Object Goal Navigation with Multi-Scale Geometric-Affordance GuidanceShuaihang Yuan, Hao Huang 0003, Yu Hao, Congcong Wen, Anthony Tzes, Yi Fang 0006. [doi]
- Vaccine: Perturbation-aware Alignment for Large Language Models against Harmful Fine-tuning AttackTiansheng Huang, Sihao Hu, Ling Liu 0001. [doi]
- Inferring Neural Signed Distance Functions by Overfitting on Single Noisy Point Clouds through Finetuning Data-Driven based PriorsChao Chen, Yu-Shen Liu, Zhizhong Han. [doi]
- BrainBits: How Much of the Brain are Generative Reconstruction Methods Using?David Mayo, Christopher Wang, Asa Harbin, Abdulrahman Alabdulkareem, Albert E. Shaw, Boris Katz, Andrei Barbu. [doi]
- Is Knowledge Power? On the (Im)possibility of Learning from Strategic InteractionsNivasini Ananthakrishnan, Nika Haghtalab, Chara Podimata, Kunhe Yang. [doi]
- DropEdge not Foolproof: Effective Augmentation Method for Signed Graph Neural NetworksZeyu Zhang 0004, Lu Li, Shuyan Wan, Sijie Wang, Zhiyi Wang, Zhiyuan Lu, Dong Hao, Wanli Li. [doi]
- WATT: Weight Average Test Time Adaptation of CLIPDavid Osowiechi, Mehrdad Noori, Gustavo Adolfo Vargas Hakim, Moslem Yazdanpanah, Ali Bahri, Milad Cheraghalikhani, Sahar Dastani, Farzad Beizaee, Ismail Ben Ayed, Christian Desrosiers. [doi]
- Theoretical guarantees in KL for Diffusion Flow MatchingMarta Gentiloni Silveri, Alain Durmus, Giovanni Conforti. [doi]
- Introducing Spectral Attention for Long-Range Dependency in Time Series ForecastingBong Gyun Kang, Dongjun Lee, Hyungi Kim, Dohyun Chung, Sungroh Yoon. [doi]
- Rethinking Inverse Reinforcement Learning: from Data Alignment to Task AlignmentWeichao Zhou, Wenchao Li 0001. [doi]
- Factorized Diffusion Architectures for Unsupervised Image Generation and SegmentationXin Yuan, Michael Maire. [doi]
- LongVideoBench: A Benchmark for Long-context Interleaved Video-Language UnderstandingHaoning Wu, Dongxu Li, Bei Chen, Junnan Li. [doi]
- Learning to Mitigate Externalities: the Coase Theorem with Hindsight RationalityAntoine Scheid, Aymeric Capitaine, Etienne Boursier, Eric Moulines, Michael I. Jordan, Alain Durmus. [doi]
- Thinking Forward: Memory-Efficient Federated Finetuning of Language ModelsKunjal Panchal, Nisarg Parikh, Sunav Choudhary, Lijun Zhang, Yuriy Brun, Hui Guan 0001. [doi]
- Provably Efficient Interactive-Grounded Learning with Personalized RewardMengxiao Zhang, Yuheng Zhang, Haipeng Luo, Paul Mineiro. [doi]
- Twin-Merging: Dynamic Integration of Modular Expertise in Model MergingZhenyi Lu, Chenghao Fan, Wei Wei 0002, Xiaoye Qu, Dangyang Chen, Yu Cheng 0001. [doi]
- DrivAerNet++: A Large-Scale Multimodal Car Dataset with Computational Fluid Dynamics Simulations and Deep Learning BenchmarksMohamed Elrefaie, Florin Morar, Angela Dai, Faez Ahmed. [doi]
- GL-NeRF: Gauss-Laguerre Quadrature Enables Training-Free NeRF AccelerationSilong Yong, Yaqi Xie, Simon Stepputtis, Katia P. Sycara. [doi]
- HyperLogic: Enhancing Diversity and Accuracy in Rule Learning with HyperNetsYang Yang, Wendi Ren, Shuang Li. [doi]
- Order-Independence Without Fine TuningReid McIlroy-Young, Katrina Brown, Conlan Olson, Linjun Zhang, Cynthia Dwork. [doi]
- Synatra: Turning Indirect Knowledge into Direct Demonstrations for Digital Agents at ScaleTianyue Ou, Frank F. Xu, Aman Madaan, Jiarui Liu, Robert Lo, Abishek Sridhar, Sudipta Sengupta, Dan Roth, Graham Neubig, Shuyan Zhou. [doi]
- Bayesian Nonparametrics Meets Data-Driven Distributionally Robust OptimizationNicola Bariletto, Nhat Ho. [doi]
- Reparameterization invariance in approximate Bayesian inferenceHrittik Roy, Marco Miani, Carl Henrik Ek, Philipp Hennig, Marvin Pförtner, Lukas Tatzel, Søren Hauberg. [doi]
- Metric Transforms and Low Rank Representations of Kernels for Fast AttentionTimothy Chu, Josh Alman, Gary L. Miller, Shyam Narayanan, Mark Sellke, Zhao Song 0002. [doi]
- Diffusion for World Modeling: Visual Details Matter in AtariEloi Alonso, Adam Jelley, Vincent Micheli, Anssi Kanervisto, Amos J. Storkey, Tim Pearce, François Fleuret. [doi]
- Posture-Informed Muscular Force Learning for Robust Hand Pressure EstimationKyung Jin Seo, Junghoon Seo, Hanseok Jeong, Sangpil Kim, Sang Ho Yoon. [doi]
- CooHOI: Learning Cooperative Human-Object Interaction with Manipulated Object DynamicsJiawei Gao 0004, Ziqin Wang, Zeqi Xiao, Jingbo Wang 0003, Tai Wang, Jinkun Cao, Xiaolin Hu 0001, Si Liu 0001, Jifeng Dai, Jiangmiao Pang. [doi]
- Newswire: A Large-Scale Structured Database of a Century of Historical NewsEmily Silcock, Abhishek Arora 0003, Luca D'Amico-Wong, Melissa Dell. [doi]
- Variational Multi-scale Representation for Estimating Uncertainty in 3D Gaussian SplattingRuiqi Li, Yiu-ming Cheung. [doi]
- Gliding over the Pareto Front with Uniform DesignsXiaoyuan Zhang, Genghui Li, Xi Lin 0001, Yichi Zhang, Yifan Chen 0001, Qingfu Zhang 0001. [doi]
- MARVEL: Multidimensional Abstraction and Reasoning through Visual Evaluation and LearningYifan Jiang 0001, Jiarui Zhang 0002, Kexuan Sun 0002, Zhivar Sourati, Kian Ahrabian, Kaixin Ma, Filip Ilievski, Jay Pujara. [doi]
- Diffusion of Thought: Chain-of-Thought Reasoning in Diffusion Language ModelsJiacheng Ye, Shansan Gong, Liheng Chen, Lin Zheng, Jiahui Gao, Han Shi, Chuan Wu, Xin Jiang, Zhenguo Li, Wei Bi, Lingpeng Kong. [doi]
- Matrix Denoising with Doubly Heteroscedastic Noise: Fundamental Limits and Optimal Spectral MethodsYihan Zhang 0001, Marco Mondelli. [doi]
- Neuronal Competition Groups with Supervised STDP for Spike-Based ClassificationGaspard Goupy, Pierre Tirilly, Ioan Marius Bilasco. [doi]
- Flipping-based Policy for Chance-Constrained Markov Decision ProcessesXun Shen, Shuo Jiang, Akifumi Wachi, Kazumune Hashimoto, Sebastien Gros. [doi]
- Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement LearningLiyuan Mao, Haoran Xu 0003, Xianyuan Zhan, Weinan Zhang 0001, Amy Zhang. [doi]
- Probablistic Emulation of a Global Climate Model with Spherical DYffusionSalva Rühling Cachay, Brian Henn, Oliver Watt-Meyer, Christopher S. Bretherton, Rose Yu. [doi]
- InterControl: Zero-shot Human Interaction Generation by Controlling Every JointZhenzhi Wang 0001, Jingbo Wang 0003, Yixuan Li 0002, Dahua Lin, Bo Dai 0002. [doi]
- Absorb & Escape: Overcoming Single Model Limitations in Generating Heterogeneous Genomic SequencesZehui Li, Yuhao Ni, Guoxuan Xia, William A. V. Beardall, Akashaditya Das, Guy-Bart Stan, Yiren Zhao. [doi]
- On Giant's Shoulders: Effortless Weak to Strong by Dynamic Logits FusionChenghao Fan, Zhenyi Lu, Wei Wei 0002, Jie Tian, Xiaoye Qu, Dangyang Chen, Yu Cheng 0001. [doi]
- Neural Characteristic Activation Analysis and Geometric Parameterization for ReLU NetworksWenlin Chen, Hong Ge. [doi]
- Perceptual Fairness in Image RestorationGuy Ohayon, Michael Elad, Tomer Michaeli. [doi]
- A Retrospective on the Robot Air Hockey Challenge: Benchmarking Robust, Reliable, and Safe Learning Techniques for Real-world RoboticsPuze Liu, Jonas Günster, Niklas Funk, Simon Gröger, Dong Chen, Haitham Bou-Ammar, Julius Jankowski, Ante Maric, Sylvain Calinon, Andrej Orsula, Miguel S. Olivares-Méndez, Hongyi Zhou, Rudolf Lioutikov, Gerhard Neumann, Amarildo Likmeta, Amirhossein Zhalehmehrabi, Thomas Bonenfant, Marcello Restelli, Davide Tateo, Ziyuan Liu, Jan R. Peters. [doi]
- Linear Time Approximation Algorithm for Column Subset Selection with Local SearchYuanbin Zou, Ziyun Huang, Jinhui Xu 0001, Jianxin Wang 0001, Qilong Feng. [doi]
- AGILE: A Novel Reinforcement Learning Framework of LLM AgentsPeiyuan Feng, Yichen He, Guanhua Huang, Yuan Lin, Hanchong Zhang, Yuchen Zhang, Hang Li. [doi]
- Self-Retrieval: End-to-End Information Retrieval with One Large Language ModelQiaoyu Tang, Jiawei Chen, Zhuoqun Li, Bowen Yu 0002, Yaojie Lu 0001, Cheng Fu, Haiyang Yu, Hongyu Lin, Fei Huang, Ben He, Xianpei Han, Le Sun 0001, Yongbin Li. [doi]
- LaSCal: Label-Shift Calibration without target labelsTeodora Popordanoska, Gorjan Radevski, Tinne Tuytelaars, Matthew B. Blaschko. [doi]
- Scaling Law for Time Series ForecastingJingzhe Shi, Qinwei Ma, Huan Ma, Lei Li. [doi]
- Enriching Disentanglement: From Logical Definitions to Quantitative MetricsYivan Zhang, Masashi Sugiyama. [doi]
- Data Free Backdoor AttacksBochuan Cao, Jinyuan Jia 0001, Chuxuan Hu, Wenbo Guo 0002, Zhen Xiang, Jinghui Chen, Bo Li 0026, Dawn Song. [doi]
- Embedding-Aligned Language ModelsGuy Tennenholtz, Yinlam Chow, Chih-Wei Hsu, Lior Shani, Yi Liang, Craig Boutilier. [doi]
- Visual Fourier Prompt TuningRunjia Zeng, Cheng Han 0001, Qifan Wang, Chunshu Wu, Tong Geng, Lifu Huang, Ying Nian Wu, Dongfang Liu. [doi]
- Enhancing Reasoning Capabilities of LLMs via Principled Synthetic Logic CorpusTerufumi Morishita, Gaku Morio, Atsuki Yamaguchi, Yasuhiro Sogawa. [doi]
- Why the Metric Backbone Preserves Community StructureMaximilien Dreveton, Charbel Chucri, Matthias Grossglauser, Patrick Thiran. [doi]
- Structural Inference of Dynamical Systems with Conjoined State Space ModelsAoran Wang, Jun Pang 0001. [doi]
- Beyond task diversity: provable representation transfer for sequential multitask linear banditsThang Duong, Zhi Wang 0013, Chicheng Zhang. [doi]
- Free-Rider and Conflict Aware Collaboration Formation for Cross-Silo Federated LearningMengmeng Chen, Xiaohu Wu, Xiaoli Tang, Tiantian He 0001, Yew-Soon Ong, Qiqi Liu, Qicheng Lao, Han Yu 0001. [doi]
- Evaluating Multiview Object Consistency in Humans and Image ModelsTyler Bonnen, Stephanie Fu, Yutong Bai, Thomas P. O'Connell, Yoni Friedman, Nancy Kanwisher, Josh Tenenbaum 0001, Alexei A. Efros. [doi]
- DDR: Exploiting Deep Degradation Response as Flexible Image DescriptorJuncheng Wu, Zhangkai Ni, Hanli Wang, Wenhan Yang, Yuyin Zhou, Shiqi Wang 0001. [doi]
- Exploiting LLM QuantizationKazuki Egashira, Mark Vero, Robin Staab, Jingxuan He, Martin T. Vechev. [doi]
- Global Distortions from Local Rewards: Neural Coding Strategies in Path-Integrating Neural SystemsFrancisco Acosta, Fatih Dinc, William Redman, Manu S. Madhav, David A. Klindt, Nina Miolane. [doi]
- AutoPSV: Automated Process-Supervised VerifierJianqiao Lu, Zhiyang Dou, Hongru Wang 0003, Zeyu Cao, Jianbo Dai, Yunlong Feng, Zhijiang Guo. [doi]
- DiffuLT: Diffusion for Long-tail Recognition Without External KnowledgeJie Shao 0001, Ke Zhu, Hanxiao Zhang, Jianxin Wu 0001. [doi]
- Transformers are Minimax Optimal Nonparametric In-Context LearnersJuno Kim, Tai Nakamaki, Taiji Suzuki. [doi]
- LLM Processes: Numerical Predictive Distributions Conditioned on Natural LanguageJames Requeima, John Bronskill, Dami Choi, Richard E. Turner, David Kristjanson Duvenaud. [doi]
- Graph Learning for Numeric PlanningDillon Z. Chen, Sylvie Thiébaux. [doi]
- Transformation-Invariant Learning and Theoretical Guarantees for OOD GeneralizationOmar Montasser, Han Shao, Emmanuel Abbe. [doi]
- Generalized Fast Exact ConformalizationDiyang Li. [doi]
- Energy-based Hopfield Boosting for Out-of-Distribution DetectionClaus Hofmann, Simon Schmid, Bernhard Lehner, Daniel Klotz, Sepp Hochreiter. [doi]
- On Sparse Canonical Correlation AnalysisYongchun Li, Santanu Dey, Weijun Xie 0001. [doi]
- Binarized Diffusion Model for Image Super-ResolutionZheng Chen 0014, Haotong Qin, Yong Guo, Xiongfei Su, Xin Yuan 0002, Linghe Kong, Yulun Zhang 0001. [doi]
- Fair Wasserstein CoresetsZikai Xiong, Niccolò Dalmasso, Shubham Sharma, Freddy Lécué, Daniele Magazzeni, Vamsi K. Potluru, Tucker Balch, Manuela Veloso. [doi]
- Grokking of Implicit Reasoning in Transformers: A Mechanistic Journey to the Edge of GeneralizationBoshi Wang, Xiang Yue, Yu Su 0001, Huan Sun 0001. [doi]
- Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMsPeter Tong, Ellis Brown, Penghao Wu, Sanghyun Woo, Adithya Iyer, Sai Charitha Akula, Shusheng Yang, Jihan Yang, Manoj Middepogu, Ziteng Wang, Xichen Pan, Rob Fergus, Yann LeCun, Saining Xie. [doi]
- ET-Flow: Equivariant Flow-Matching for Molecular Conformer GenerationMajdi Hassan, Nikhil Shenoy, Jungyoon Lee, Hannes Stärk, Stephan Thaler, Dominique Beaini. [doi]
- Open LLMs are Necessary for Current Private Adaptations and Outperform their Closed AlternativesVincent Hanke, Tom Blanchard, Franziska Boenisch, Iyiola E. Olatunji, Michael Backes 0001, Adam Dziedzic. [doi]
- Contracting with a Learning AgentGuru Guruganesh, Yoav Kolumbus, Jon Schneider, Inbal Talgam-Cohen, Emmanouil-Vasileios Vlatakis-Gkaragkounis, Joshua R. Wang, S. Matthew Weinberg. [doi]
- MeshXL: Neural Coordinate Field for Generative 3D Foundation ModelsSijin Chen, Xin Chen, Anqi Pang, Xianfang Zeng, Wei Cheng, Yijun Fu, Fukun Yin, Billzb Wang, Jingyi Yu, Gang Yu, Bin Fu, Tao Chen 0003. [doi]
- AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive ReasoningShirley Wu, Shiyu Zhao, Qian Huang, Kexin Huang, Michihiro Yasunaga, Kaidi Cao, Vassilis N. Ioannidis, Karthik Subbian, Jure Leskovec, James Y. Zou. [doi]
- V-PETL Bench: A Unified Visual Parameter-Efficient Transfer Learning BenchmarkYi Xin, Siqi Luo, Xuyang Liu, Yuntao Du 0001, Haodi Zhou, Xinyu Cheng, Christina E. Lee, Junlong Du, Haozhe Wang, Mingcai Chen, Ting Liu, Guimin Hu, Zhongwei Wan, Rongchao Zhang, Aoxue Li, Mingyang Yi, Xiaohong Liu 0001. [doi]
- To Learn or Not to Learn, That is the Question - A Feature-Task Dual Learning Model of Perceptual LearningXiao Liu, Muyang Lyu, Cong Yu, Si Wu 0001. [doi]
- Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement LearningRuoqi Zhang, Ziwei Luo, Jens Sjölund, Thomas B. Schön, Per Mattsson. [doi]
- From Dictionary to Tensor: A Scalable Multi-View Subspace Clustering Framework with Triple Information EnhancementZhibin Gu, Songhe Feng. [doi]
- QuanTA: Efficient High-Rank Fine-Tuning of LLMs with Quantum-Informed Tensor AdaptationZhuo Chen, Rumen Dangovski, Charlotte Loh, Owen Dugan, Di Luo, Marin Soljacic. [doi]
- Foundation Inference Models for Markov Jump ProcessesDavid Berghaus, Kostadin Cvejoski, Patrick Seifner, César Ali Marin Ojeda, Ramsés J. Sánchez. [doi]
- TrAct: Making First-layer Pre-Activations TrainableFelix Petersen, Christian Borgelt, Stefano Ermon. [doi]
- Meta-Controller: Few-Shot Imitation of Unseen Embodiments and Tasks in Continuous ControlSeongwoong Cho, Donggyun Kim, Jinwoo Lee, Seunghoon Hong. [doi]
- AsEP: Benchmarking Deep Learning Methods for Antibody-specific Epitope PredictionChunan Liu, Lilian Denzler, Yihong Chen, Andrew Martin, Brooks Paige. [doi]
- Identifying Causal Effects Under Functional DependenciesYizuo Chen, Adnan Darwiche. [doi]
- Rejection via Learning Density RatiosAlexander Soen, Hisham Husain, Philip Schulz, Vu Nguyen. [doi]
- Honor Among Bandits: No-Regret Learning for Online Fair DivisionAriel D. Procaccia, Ben Schiffer, Shirley Zhang 0001. [doi]
- An Improved Empirical Fisher Approximation for Natural Gradient DescentXiaodong Wu, Wenyi Yu, Chao Zhang, Philip C. Woodland. [doi]
- ODRL: A Benchmark for Off-Dynamics Reinforcement LearningJiafei Lyu, Kang Xu, Jiacheng Xu 0003, Mengbei Yan, Jingwen Yang, Zongzhang Zhang, Chenjia Bai, Zongqing Lu, Xiu Li 0001. [doi]
- Neural Isometries: Taming Transformations for Equivariant MLThomas W. Mitchel, Michael J. Taylor, Vincent Sitzmann. [doi]
- Codec Avatar Studio: Paired Human Captures for Complete, Driveable, and Generalizable AvatarsJulieta Martinez 0001, Emily Kim, Javier Romero 0002, Timur M. Bagautdinov, Shunsuke Saito, Shoou-I Yu, Stuart Anderson, Michael Zollhöfer, Te-Li Wang, Shaojie Bai, Chenghui Li, Shih-En Wei, Rohan Joshi, Wyatt Borsos, Tomas Simon, Jason M. Saragih, Paul Theodosis, Alexander Greene, Anjani Josyula, Silvio Maeta, Andrew Jewett, Simion Venshtain, Christopher Heilman, Yueh-Tung Chen, Sidi Fu, Mohamed Elshaer, Tingfang Du, Longhua Wu, Shen-Chi Chen, Kai Kang, Michael Wu, Youssef Emad, Steven Longay, Ashley Brewer, Hitesh Shah, James Booth, Taylor Koska, Kayla Haidle, Matthew Andromalos, Joanna Hsu, Thomas Dauer, Peter Selednik, Timothy Godisart, Scott Ardisson, Matthew Cipperly, Ben Humberston, Lon Farr, Bob Hansen, Peihong Guo, Dave Braun, Steven Krenn, He Wen, Lucas Evans, Natalia Fadeeva, Matthew Stewart, Gabriel Schwartz, Divam Gupta, Gyeongsik Moon, Kaiwen Guo, Yuan Dong, Yichen Xu, Takaaki Shiratori, Fabian Prada, Bernardo Pires, Bo Peng, Julia Buffalini, Autumn Trimble, Kevyn Mcphail, Melissa Schoeller, Yaser Sheikh. [doi]
- In-and-Out: Algorithmic Diffusion for Sampling Convex BodiesYunbum Kook, Santosh S. Vempala, Matthew Shunshi Zhang. [doi]
- State-free Reinforcement LearningMingyu Chen 0012, Aldo Pacchiano, Xuezhou Zhang. [doi]
- Score-based 3D molecule generation with neural fieldsMatthieu Kirchmeyer, Pedro O. Pinheiro, Saeed Saremi. [doi]
- PureGen: Universal Data Purification for Train-Time Poison Defense via Generative Model DynamicsOmead Pooladzandi, Sunay Bhat, Jeffrey Jiang, Alexander Branch, Gregory J. Pottie. [doi]
- A Unified Framework for 3D Scene UnderstandingWei Xu 0017, Chunsheng Shi, Sifan Tu, Xin Zhou 0013, Dingkang Liang, Xiang Bai. [doi]
- User-item fairness tradeoffs in recommendationsSophie Greenwood, Sudalakshmee Chiniah, Nikhil Garg. [doi]
- MagR: Weight Magnitude Reduction for Enhancing Post-Training QuantizationAozhong Zhang, Naigang Wang, Yanxia Deng, Xin Li, Zi Yang, Penghang Yin. [doi]
- HHD-GP: Incorporating Helmholtz-Hodge Decomposition into Gaussian Processes for Learning Dynamical SystemsHao Xu, Jia Pan. [doi]
- SDformer: Similarity-driven Discrete Transformer For Time Series GenerationZhicheng Chen, Shibo Feng, Zhong Zhang, Xi Xiao, Xingyu Gao 0001, Peilin Zhao. [doi]
- Temporal-Difference Learning Using Distributed Error SignalsJonas Guan, Shon Eduard Verch, Claas Voelcker, Ethan C. Jackson, Nicolas Papernot, William A. Cunningham. [doi]
- SynRS3D: A Synthetic Dataset for Global 3D Semantic Understanding from Monocular Remote Sensing ImageryJian Song, Hongruixuan Chen, Weihao Xuan, Junshi Xia, Naoto Yokoya. [doi]
- Goal Reduction with Loop-Removal Accelerates RL and Models Human Brain Activity in Goal-Directed LearningHuzi Cheng, Joshua W. Brown. [doi]
- BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language ModelsQijun Luo, Hengxu Yu, Xiao Li. [doi]
- Diversity Is Not All You Need: Training A Robust Cooperative Agent Needs Specialist PartnersRujikorn Charakorn, Poramate Manoonpong, Nat Dilokthanakul. [doi]
- AED: Adaptable Error Detection for Few-shot Imitation PolicyJia-Fong Yeh, Kuo-Han Hung, Pang-Chi Lo, Chi-Ming Chung, Tsung-Han Wu, Hung-Ting Su, Yi-Ting Chen, Winston H. Hsu. [doi]
- Slack-Free Spiking Neural Network Formulation for Hypergraph Minimum Vertex CoverTam Nguyen, Anh-Dzung Doan, Zhipeng Cai, Tat-Jun Chin. [doi]
- Sequential Harmful Shift Detection Without LabelsSalim I. Amoukou, Tom Bewley, Saumitra Mishra, Freddy Lécué, Daniele Magazzeni, Manuela Veloso. [doi]
- Optimal Algorithms for Augmented Testing of Discrete DistributionsMaryam Aliakbarpour, Piotr Indyk, Ronitt Rubinfeld, Sandeep Silwal. [doi]
- Stochastic Amortization: A Unified Approach to Accelerate Feature and Data AttributionIan Covert, Chanwoo Kim 0002, Su-In Lee, James Y. Zou, Tatsunori B. Hashimoto. [doi]
- Personalized Instance-based Navigation Toward User-Specific Objects in Realistic EnvironmentsLuca Barsellotti, Roberto Bigazzi, Marcella Cornia, Lorenzo Baraldi 0001, Rita Cucchiara. [doi]
- Make-it-Real: Unleashing Large Multimodal Model for Painting 3D Objects with Realistic MaterialsYe Fang, Zeyi Sun 0002, Tong Wu, Jiaqi Wang 0003, Ziwei Liu 0002, Gordon Wetzstein, Dahua Lin. [doi]
- UltraMedical: Building Specialized Generalists in BiomedicineKaiyan Zhang, Sihang Zeng, Ermo Hua, Ning Ding 0002, Zhang-Ren Chen, Zhiyuan Ma 0005, Haoxin Li, Ganqu Cui, Biqing Qi, Xuekai Zhu, Xingtai Lv, Jinfang Hu, Zhiyuan Liu 0001, Bowen Zhou 0002. [doi]
- Revisiting, Benchmarking and Understanding Unsupervised Graph Domain AdaptationMeihan Liu, Zhen Zhang 0023, Jiachen Tang, Jiajun Bu, Bingsheng He, Sheng Zhou 0004. [doi]
- Super Consistency of Neural Network Landscapes and Learning Rate TransferLorenzo Noci, Alexandru Meterez, Thomas Hofmann, Antonio Orvieto. [doi]
- A Phase Transition between Positional and Semantic Learning in a Solvable Model of Dot-Product AttentionHugo Cui, Freya Behrens, Florent Krzakala, Lenka Zdeborová. [doi]
- Neural Experts: Mixture of Experts for Implicit Neural RepresentationsYizhak Ben-Shabat, Chamin Hewa Koneputugodage, Sameera Ramasinghe, Stephen Gould. [doi]
- Federated Transformer: Multi-Party Vertical Federated Learning on Practical Fuzzily Linked DataZhaomin Wu, Junyi Hou, Yiqun Diao, Bingsheng He. [doi]
- RandNet-Parareal: a time-parallel PDE solver using Random Neural NetworksGuglielmo Gattiglio, Lyudmila Grigoryeva, Massimiliano Tamborrino. [doi]
- Efficient multi-prompt evaluation of LLMsFelipe Maia Polo, Ronald Xu, Lucas Weber, Mírian Silva, Onkar Bhardwaj, Leshem Choshen, Allysson Flavio Melo de Oliveira, Yuekai Sun, Mikhail Yurochkin. [doi]
- Recurrent Complex-Weighted Autoencoders for Unsupervised Object DiscoveryAnand Gopalakrishnan, Aleksandar Stanic, Jürgen Schmidhuber, Michael C. Mozer. [doi]
- CountGD: Multi-Modal Open-World CountingNiki Amini-Naieni, Tengda Han, Andrew Zisserman. [doi]
- PaGoDA: Progressive Growing of a One-Step Generator from a Low-Resolution Diffusion TeacherDongjun Kim, Chieh-Hsin Lai, Wei-Hsiang Liao, Yuhta Takida, Naoki Murata, Toshimitsu Uesaka, Yuki Mitsufuji, Stefano Ermon. [doi]
- SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object NavigationHang Yin, Xiuwei Xu, Zhenyu Wu, Jie Zhou 0001, Jiwen Lu. [doi]
- AdanCA: Neural Cellular Automata As Adaptors For More Robust Vision TransformerYitao Xu, Tong Zhang 0023, Sabine Süsstrunk. [doi]
- Not All Tokens Are What You Need for PretrainingZhenghao Lin, Zhibin Gou, Yeyun Gong, Xiao Liu 0029, Yelong Shen, Ruochen Xu, Chen Lin 0001, Yujiu Yang, Jian Jiao 0007, Nan Duan, Weizhu Chen. [doi]
- SSA-Seg: Semantic and Spatial Adaptive Pixel-level Classifier for Semantic SegmentationXiaowen Ma, Zhenliang Ni, Xinghao Chen 0001. [doi]
- Optimistic Verifiable Training by Controlling Hardware NondeterminismMegha Srivastava, Simran Arora, Dan Boneh. [doi]
- Graph Coarsening with Message-Passing GuaranteesAntonin Joly, Nicolas Keriven. [doi]
- Humor in AI: Massive Scale Crowd-Sourced Preferences and Benchmarks for Cartoon CaptioningJifan Zhang, Lalit Jain, Yang Guo, Jiayi Chen, Kuan Lok Zhou, Siddharth Suresh, Andrew Wagenmaker, Scott Sievert, Timothy T. Rogers, Kevin G. Jamieson, Bob Mankoff, Robert Nowak 0001. [doi]
- Almost Minimax Optimal Best Arm Identification in Piecewise Stationary Linear BanditsYunlong Hou 0001, Vincent Y. F. Tan, Zixin Zhong. [doi]
- Parameter-Inverted Image Pyramid NetworksXizhou Zhu, Xue Yang 0005, Zhaokai Wang, Hao Li 0069, Wenhan Dou, Junqi Ge, Lewei Lu, Yu Qiao 0001, Jifeng Dai. [doi]
- Label Delay in Online Continual LearningBotos Csaba, Wenxuan Zhang, Matthias Müller 0011, Ser-Nam Lim, Philip Torr 0001, Adel Bibi. [doi]
- MoEUT: Mixture-of-Experts Universal TransformersRóbert Csordás, Kazuki Irie, Jürgen Schmidhuber, Christopher Potts, Christopher D. Manning. [doi]
- Base of RoPE Bounds Context LengthMingyu Xu, Xin Men, Bingning Wang, Qingyu Zhang, Hongyu Lin, Xianpei Han, Weipeng Chen. [doi]
- FairQueue: Rethinking Prompt Learning for Fair Text-to-Image GenerationChristopher T. H. Teo, Milad Abdollahzadeh, Xinda Ma, Ngai-Man Cheung. [doi]
- Rethinking the Capacity of Graph Neural Networks for Branching StrategyZiang Chen, Jialin Liu 0003, Xiaohan Chen, Xinshang Wang, Wotao Yin. [doi]
- When LLM Meets DRL: Advancing Jailbreaking Efficiency via DRL-guided SearchXuan Chen, Yuzhou Nie, Wenbo Guo 0002, Xiangyu Zhang 0001. [doi]
- Bench2Drive: Towards Multi-Ability Benchmarking of Closed-Loop End-To-End Autonomous DrivingXiaosong Jia, Zhenjie Yang, Qifeng Li, Zhiyuan Zhang, Junchi Yan. [doi]
- Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and AlgorithmsMiaosen Zhang, Yixuan Wei, Zhen Xing, Yifei Ma, Zuxuan Wu, Ji Li 0006, Zheng Zhang 0022, Qi Dai 0001, Chong Luo, Xin Geng 0001, Baining Guo. [doi]
- DreamMesh4D: Video-to-4D Generation with Sparse-Controlled Gaussian-Mesh Hybrid RepresentationZhiqi Li, Yiming Chen, Peidong Liu. [doi]
- Online Feature Updates Improve Online (Generalized) Label Shift AdaptationRuihan Wu, Siddhartha Datta, Yi Su, Dheeraj Baby, Yu-Xiang Wang 0003, Kilian Q. Weinberger. [doi]
- Mind the Graph When Balancing Data for Fairness or RobustnessJessica Schrouff, Alexis Bellot, Amal Rannen Triki, Alan Malek, Isabela Albuquerque, Arthur Gretton, Alexander D'Amour, Silvia Chiappa. [doi]
- Knowledge Composition using Task Vectors with Learned Anisotropic ScalingFrederic Z. Zhang, Paul Albert, Cristian Rodriguez Opazo, Anton van den Hengel, Ehsan Abbasnejad. [doi]
- SMART: Towards Pre-trained Missing-Aware Model for Patient Health Status PredictionZhihao Yu, Chu Xu, Yujie Jin, Yasha Wang, Junfeng Zhao 0001. [doi]
- Harmony4D: A Video Dataset for In-The-Wild Close Human InteractionsRawal Khirodkar, Jyun-Ting Song, Jinkun Cao, Zhengyi Luo 0002, Kris Kitani. [doi]
- RestoreAgent: Autonomous Image Restoration Agent via Multimodal Large Language ModelsHaoyu Chen 0003, Wenbo Li 0002, Jinjin Gu, Jingjing Ren, Sixiang Chen, Tian Ye 0001, Renjing Pei, Kaiwen Zhou, Fenglong Song, Lei Zhu. [doi]
- Beyond Single Stationary Policies: Meta-Task Players as Naturally Superior CollaboratorsHaoming Wang, Zhaoming Tian, Yunpeng Song, Xiangliang Zhang, Zhongmin Cai. [doi]
- To Err Like Human: Affective Bias-Inspired Measures for Visual Emotion Recognition EvaluationChenxi Zhao, Jinglei Shi, Liqiang Nie, Jufeng Yang. [doi]
- On the Minimax Regret for Contextual Linear Bandits and Multi-Armed Bandits with Expert AdviceShinji Ito. [doi]
- MaskFactory: Towards High-quality Synthetic Data Generation for Dichotomous Image SegmentationHaotian Qian, Yinda Chen, Shengtao Lou, Fahad Shahbaz Khan, Xiaogang Jin, Deng-Ping Fan. [doi]
- Training for Stable Explanation for FreeChao Chen, Chenghua Guo, Rufeng Chen, Guixiang Ma, Ming Zeng, Xiangwen Liao, Xi Zhang 0008, Sihong Xie. [doi]
- Interpretable Concept-Based Memory ReasoningDavid Debot, Pietro Barbiero, Francesco Giannini, Gabriele Ciravegna, Michelangelo Diligenti, Giuseppe Marra. [doi]
- Customized Multiple Clustering via Multi-Modal Subspace Proxy LearningJiawei Yao, Qi Qian 0001, Juhua Hu. [doi]
- Single Image Unlearning: Efficient Machine Unlearning in Multimodal Large Language ModelsJiaqi Li, Qianshan Wei, Chuanyi Zhang, Guilin Qi, Miaozeng Du, Yongrui Chen 0002, Sheng Bi, Fan Liu. [doi]
- Preventing Model Collapse in Deep Canonical Correlation Analysis by Noise RegularizationJunlin He, Jinxiao Du, Susu Xu, Wei Ma. [doi]
- Nonparametric Classification on Low Dimensional Manifolds using Overparameterized Convolutional Residual NetworksZixuan Zhang, Kaiqi Zhang 0002, Minshuo Chen, Yuma Takeda, Mengdi Wang, Tuo Zhao, Yu-Xiang Wang 0003. [doi]
- Metric Flow Matching for Smooth Interpolations on the Data ManifoldKacper Kapusniak, Peter Potaptchik, Teodora Reu, Leo Zhang, Alexander Tong 0001, Michael M. Bronstein, Avishek Joey Bose, Francesco Di Giovanni. [doi]
- RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMsYue Yu, Wei Ping, Zihan Liu 0001, Boxin Wang, Jiaxuan You, Chao Zhang, Mohammad Shoeybi, Bryan Catanzaro. [doi]
- DiffNorm: Self-Supervised Normalization for Non-autoregressive Speech-to-speech TranslationWeiting Tan, Jingyu Zhang, Lingfeng Shen, Daniel Khashabi, Philipp Koehn. [doi]
- IODA: Instance-Guided One-shot Domain Adaptation for Super-ResolutionZaizuo Tang, Yu-Bin Yang. [doi]
- PuLID: Pure and Lightning ID Customization via Contrastive AlignmentZinan Guo, Yanze Wu, Zhuowei Chen, Lang Chen, Peng Zhang, Qian He. [doi]
- Suitable is the Best: Task-Oriented Knowledge Fusion in Vulnerability DetectionJingjing Wang, Minhuan Huang, Yuanping Nie, Xiang Li 0078, Qianjin Du, Wei Kong, Huan Deng, Xiaohui Kuang. [doi]
- Faster Repeated Evasion Attacks in Tree EnsemblesLorenzo Cascioli, Laurens Devos, Ondrej Kuzelka, Jesse Davis. [doi]
- CodeRosetta: Pushing the Boundaries of Unsupervised Code Translation for Parallel ProgrammingAli TehraniJamsaz, Arijit Bhattacharjee, Le Chen, Nesreen K. Ahmed, Amir Yazdanbakhsh, Ali Jannesari. [doi]
- Treeffuser: probabilistic prediction via conditional diffusions with gradient-boosted treesNicolas Beltran-Velez, Alessandro Antonio Grande, Achille Nazaret, Alp Kucukelbir, David M. Blei. [doi]
- Are Self-Attentions Effective for Time Series Forecasting?Dongbin Kim, Jinseong Park 0001, Jaewook Lee 0001, Hoki Kim. [doi]
- Geometric Trajectory Diffusion ModelsJiaqi Han, Minkai Xu, Aaron Lou, Haotian Ye, Stefano Ermon. [doi]
- Symmetry Discovery Beyond Affine TransformationsBen Shaw 0003, Abram Magner, Kevin R. Moon. [doi]
- Differentially Private Equivalence Testing for Continuous Distributions and ApplicationsOr Sheffet, Daniel Omer. [doi]
- Parameter Competition Balancing for Model MergingGuodong Du 0002, Junlin Lee, Jing Li 0047, Runhua Jiang, Yifei Guo, Shuyang Yu, Hanting Liu, Sim Kuan Goh, Ho-Kin Tang, Daojing He, Min Zhang 0005. [doi]
- Buffer of Thoughts: Thought-Augmented Reasoning with Large Language ModelsLing Yang, Zhaochen Yu, Tianjun Zhang, Shiyi Cao, Minkai Xu, Wentao Zhang, Joseph E. Gonzalez, Bin Cui 0001. [doi]
- Out-of-Distribution Detection with a Single Unconditional Diffusion ModelAlvin Heng, Alexandre H. Thiery, Harold Soh. [doi]
- Physical Consistency Bridges Heterogeneous Data in Molecular Multi-Task LearningYuxuan Ren, Dihan Zheng, Chang Liu 0030, Peiran Jin, Yu Shi, Lin Huang, Jiyan He, Shengjie Luo, Tao Qin 0001, Tie-Yan Liu. [doi]
- LibAMM: Empirical Insights into Approximate Computing for Accelerating Matrix MultiplicationXianzhi Zeng, Wenchao Jiang, Shuhao Zhang 0001. [doi]
- Genetic-guided GFlowNets for Sample Efficient Molecular OptimizationHyeonah Kim, Minsu Kim, Sanghyeok Choi, Jinkyoo Park. [doi]
- A Polar coordinate system represents syntax in large language modelsPablo Diego-Simón, Stéphane d'Ascoli, Emmanuel Chemla, Yair Lakretz, Jean-Remi King. [doi]
- Learning the Optimal Policy for Balancing Short-Term and Long-Term RewardsQinwei Yang, Xueqing Liu, Yan Zeng, Ruocheng Guo, Yang Liu 0018, Peng Wu. [doi]
- SolarCube: An Integrative Benchmark Dataset Harnessing Satellite and In-situ Observations for Large-scale Solar Energy ForecastingRuohan Li, Yiqun Xie, Xiaowei Jia, Dongdong Wang 0001, Yanhua Li, Yingxue Zhang 0002, ZhiHao Wang, Zhili Li. [doi]
- Pedestrian-Centric 3D Pre-collision Pose and Shape Estimation from Dashcam PerspectiveMeijun Wang, Yu Meng, Zhongwei Qiu, Chao Zheng, Yan Xu, Pengxiaorui, Jian Gao. [doi]
- Hybrid Reinforcement Learning Breaks Sample Size Barriers In Linear MDPsKevin Tan, Wei Fan, Yuting Wei 0001. [doi]
- Exploring Molecular Pretraining Model at ScaleXiaohong Ji, Zhen Wang, Zhifeng Gao, Hang Zheng, Linfeng Zhang, Guolin Ke, Weinan E. [doi]
- End-To-End Causal Effect Estimation from Unstructured Natural Language DataNikita Dhawan, Leonardo Cotta, Karen Ullrich, Rahul G. Krishnan, Chris J. Maddison. [doi]
- Explicit Eigenvalue Regularization Improves Sharpness-Aware MinimizationHaocheng Luo, Tuan Truong, Tung Pham 0001, Mehrtash Harandi, Dinh Q. Phung, Trung Le. [doi]
- Learning to Price Homogeneous DataKeran Chen, Joon Suk Huh, Kirthevasan Kandasamy. [doi]
- Multi-turn Reinforcement Learning with Preference Human FeedbackLior Shani, Aviv Rosenberg 0002, Asaf Cassel, Oran Lang, Daniele Calandriello, Avital Zipori, Hila Noga, Orgad Keller, Bilal Piot, Idan Szpektor, Avinatan Hassidim, Yossi Matias, Rémi Munos. [doi]
- Robust and Faster Zeroth-Order Minimax Optimization: Complexity and ApplicationsWeixin An, Yuanyuan Liu 0001, Fanhua Shang, Hongying Liu. [doi]
- One-to-Multiple: A Progressive Style Transfer Unsupervised Domain-Adaptive Framework for Kidney Tumor SegmentationKai Hu 0002, Jinhao Li 0009, Yuan Zhang 0022, Xiongjun Ye, Xieping Gao. [doi]
- Graph Diffusion Transformers for Multi-Conditional Molecular GenerationGang Liu, Jiaxin Xu, Tengfei Luo, Meng Jiang. [doi]
- WildGuard: Open One-stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMsSeungju Han, Kavel Rao, Allyson Ettinger, Liwei Jiang, Bill Yuchen Lin, Nathan Lambert 0001, Yejin Choi 0001, Nouha Dziri. [doi]
- Estimating Generalization Performance Along the Trajectory of Proximal SGD in Robust RegressionKai Tan, Pierre C. Bellec. [doi]
- Multi-Object 3D Grounding with Dynamic Modules and Language-Informed Spatial AttentionHaomeng Zhang, Chiao-An Yang, Raymond A. Yeh. [doi]
- Towards Next-Generation Logic Synthesis: A Scalable Neural Circuit Generation FrameworkZhihai Wang, Jie Wang, Qingyue Yang, Yinqi Bai, Xing Li, Lei Chen, Jianye Hao, Mingxuan Yuan, Bin Li, Yongdong Zhang 0001, Feng Wu 0001. [doi]
- Improving the Worst-Case Bidirectional Communication Complexity for Nonconvex Distributed Optimization under Function SimilarityKaja Gruntkowska, Alexander Tyurin, Peter Richtárik. [doi]
- PageRank Bandits for Link PredictionYikun Ban, Jiaru Zou, Zihao Li, Yunzhe Qi, Dongqi Fu, Jian Kang 0008, Hanghang Tong, Jingrui He. [doi]
- Self-Refining Diffusion Samplers: Enabling Parallelization via Parareal IterationsNikil Roashan Selvam, Amil Merchant, Stefano Ermon. [doi]
- CLIPCEIL: Domain Generalization through CLIP via Channel rEfinement and Image-text aLignmentXi Yu, Shinjae Yoo, Yuewei Lin. [doi]
- Attack-Resilient Image Watermarking Using Stable DiffusionLijun Zhang, Xiao Liu 0030, Antoni Viros Martin, Cindy Xiong Bearfield, Yuriy Brun, Hui Guan 0001. [doi]
- In-Context Symmetries: Self-Supervised Learning through Contextual World ModelsSharut Gupta, Chenyu Wang, Yifei Wang 0001, Tommi S. Jaakkola, Stefanie Jegelka. [doi]
- DARG: Dynamic Evaluation of Large Language Models via Adaptive Reasoning GraphZhehao Zhang, Jiaao Chen, Diyi Yang. [doi]
- DeNetDM: Debiasing by Network Depth ModulationSilpa Vadakkeeveetil Sreelatha, Adarsh Kappiyath, Abhra Chaudhuri, Anjan Dutta 0001. [doi]
- RLE: A Unified Perspective of Data Augmentation for Cross-Spectral Re-IdentificationLei Tan, Yukang Zhang, Keke Han, Pingyang Dai, Yan Zhang, Yongjian Wu, Rongrong Ji. [doi]
- Learning-Augmented Algorithms with Explicit PredictorsMarek Eliás 0001, Haim Kaplan, Yishay Mansour, Shay Moran. [doi]
- LookHere: Vision Transformers with Directed Attention Generalize and ExtrapolateAnthony Fuller, Daniel G. Kyrollos, Yousef Yassin, James R. Green. [doi]
- FuseFL: One-Shot Federated Learning through the Lens of Causality with Progressive Model FusionZhenheng Tang, Yonggang Zhang 0003, Peijie Dong, Yiu-ming Cheung, Amelie Chi Zhou, Bo Han 0003, Xiaowen Chu 0001. [doi]
- Optimal Scalarizations for Sublinear Hypervolume RegretQiuyi (Richard) Zhang. [doi]
- CausalDiff: Causality-Inspired Disentanglement via Diffusion Model for Adversarial DefenseMingkun Zhang, Keping Bi, Wei Chen 0034, Quanrun Chen, Jiafeng Guo, Xueqi Cheng. [doi]
- Transformers Represent Belief State Geometry in their Residual StreamAdam S. Shai, Lucas Teixeira, Alexander Gietelink Oldenziel, Sarah Marzen, Paul M. Riechers. [doi]
- Propensity Score Alignment of Unpaired Multimodal DataJohnny Xi, Jana Osea, Zuheng Xu, Jason S. Hartford. [doi]
- Physics-Constrained Comprehensive Optical Neural NetworksYanbing Liu, Jianwei Qin, Yan Liu, Xi Yue, Xun Liu, Guoqing Wang 0001, Tianyu Li, Fangwei Ye, Wei Li. [doi]
- OVT-B: A New Large-Scale Benchmark for Open-Vocabulary Multi-Object TrackingHaiji Liang, Ruize Han. [doi]
- Resfusion: Denoising Diffusion Probabilistic Models for Image Restoration Based on Prior Residual NoiseZhenning Shi, haoshuai zheng, Chen Xu, Changsheng Dong, Bin Pan, Xueshuo Xie, Along He, Tao Li 0002, Huazhu Fu. [doi]
- Constrained Sampling with Primal-Dual Langevin Monte CarloLuiz F. O. Chamon, Mohammad Reza Karimi Jaghargh, Anna Korba. [doi]
- Einsum Benchmark: Enabling the Development of Next-Generation Tensor Execution EnginesMark Blacher, Christoph Staudt, Julien Klaus, Maurice Wenig, Niklas Merk, Alexander Breuer, Max Engel, Sören Laue, Joachim Giesen. [doi]
- Benchmarking Estimators for Natural Experiments: A Novel Dataset and a Doubly Robust AlgorithmR. Teal Witter, Christopher Musco. [doi]
- Soft ascent-descent as a stable and flexible alternative to floodingMatthew J. Holland, Kosuke Nakatani. [doi]
- Decoupling Semantic Similarity from Spatial Alignment for Neural NetworksTassilo Wald, Constantin Ulrich, Priyank Jaini, Gregor Köhler, David Zimmerer, Stefan Denner, Fabian Isensee, Michael Baumgartner 0001, Klaus H. Maier-Hein. [doi]
- FineCLIP: Self-distilled Region-based CLIP for Better Fine-grained UnderstandingDong Jing, Xiaolong He, Yutian Luo, Nanyi Fei, Guoxing Yang, Wei Wei, Huiwen Zhao, Zhiwu Lu 0001. [doi]
- Parsimony or Capability? Decomposition Delivers Both in Long-term Time Series ForecastingJinliang Deng, Feiyang Ye, Du Yin, Xuan Song 0001, Ivor W. Tsang, Hui Xiong 0001. [doi]
- Stable-Pose: Leveraging Transformers for Pose-Guided Text-to-Image GenerationJiajun Wang, Morteza Ghahremani, Yitong Li, Björn Ommer, Christian Wachinger. [doi]
- Domain Adaptation for Large-Vocabulary Object DetectorsKai Jiang, Jiaxing Huang 0001, Weiying Xie, Jie Lei 0001, Yunsong Li, Ling Shao 0001, Shijian Lu. [doi]
- Small coresets via negative dependence: DPPs, linear statistics, and concentrationRémi Bardenet, Subhroshekhar Ghosh, Hugo Simon-Onfroy, Hoang Son Tran. [doi]
- On the Effects of Data Scale on UI Control AgentsWei Li, William E. Bishop, Alice Li, Christopher Rawles, Folawiyo Campbell-Ajala, Divya Tyamagundlu, Oriana Riva. [doi]
- Referencing Where to Focus: Improving Visual Grounding with Referential QueryYabing Wang, Zhuotao Tian, Qingpei Guo, Zheng Qin, Sanping Zhou, Ming Yang, Le Wang 0003. [doi]
- β-DPO: Direct Preference Optimization with Dynamic βJunkang Wu, Yuexiang Xie, Zhengyi Yang 0007, Jiancan Wu, Jinyang Gao, Bolin Ding, Xiang Wang, Xiangnan He 0001. [doi]
- An Efficient High-dimensional Gradient Estimator for Stochastic Differential EquationsShengbo Wang, Jose H. Blanchet, Peter W. Glynn. [doi]
- Kronecker-Factored Approximate Curvature for Physics-Informed Neural NetworksFelix Dangel, Johannes Müller, Marius Zeinhofer. [doi]
- kGym: A Platform and Dataset to Benchmark Large Language Models on Linux Kernel Crash ResolutionAlex Mathai, Chenxi Huang, Petros Maniatis, Aleksandr Nogikh, Franjo Ivancic, Junfeng Yang, Baishakhi Ray. [doi]
- Meta-Exploiting Frequency Prior for Cross-Domain Few-Shot LearningFei Zhou, Peng Wang, Lei Zhang, Zhenghua Chen, Wei Wei 0008, Chen Ding 0002, Guosheng Lin, Yanning Zhang. [doi]
- Fair Kernel K-Means: from Single Kernel to Multiple KernelPeng Zhou 0006, Rongwen Li, Liang Du 0003. [doi]
- Boosting Text-to-Video Generative Model with MLLMs FeedbackXun Wu, Shaohan Huang, Guolong Wang, Jing Xiong, Furu Wei. [doi]
- UDA: A Benchmark Suite for Retrieval Augmented Generation in Real-World Document AnalysisYulong Hui, Yao Lu, Huanchen Zhang. [doi]
- Evaluating Large Vision-and-Language Models on Children's Mathematical OlympiadsAnoop Cherian, Kuan-Chuan Peng, Suhas Lohit, Joanna Matthiesen, Kevin A. Smith, Josh Tenenbaum 0001. [doi]
- Recurrent neural network dynamical systems for biological visionWayne Soo, Aldo Battista, Puria Radmard, Xiao-Jing Wang. [doi]
- ConStat: Performance-Based Contamination Detection in Large Language ModelsJasper Dekoninck, Mark Niklas Müller, Martin T. Vechev. [doi]
- Elliptical AttentionStefan K. Nielsen, Laziz U. Abdullaev, Rachel S. Y. Teo, Tan Nguyen. [doi]
- Randomized Exploration for Reinforcement Learning with Multinomial Logistic Function ApproximationWooseong Cho, TaeHyun Hwang, Joongkyu Lee, Min-hwan Oh. [doi]
- CLAP4CLIP: Continual Learning with Probabilistic Finetuning for Vision-Language ModelsSaurav Jha, Dong Gong, Lina Yao 0001. [doi]
- RGFN: Synthesizable Molecular Generation Using GFlowNetsMichal Koziarski, Andrei Rekesh, Dmytro Shevchuk, Almer van der Sloot, Piotr Gainski, Yoshua Bengio, Cheng-Hao Liu, Mike Tyers, Robert A. Batey. [doi]
- Treatment of Statistical Estimation Problems in Randomized Smoothing for Adversarial RobustnessVáclav Vorácek. [doi]
- Interpreting Learned Feedback Patterns in Large Language ModelsLuke Marks, Amir Abdullah, Clement Neo, Rauno Arike, David Krueger 0001, Philip Torr 0001, Fazl Barez. [doi]
- Faster Local Solvers for Graph Diffusion EquationsJiahe Bai, Baojian Zhou, Deqing Yang, Yanghua Xiao. [doi]
- Amortized Bayesian Experimental Design for Decision-MakingDaolang Huang, Yujia Guo, Luigi Acerbi, Samuel Kaski. [doi]
- Navigating Extremes: Dynamic Sparsity in Large Output SpacesNasibullah Nasibullah, Erik Schultheis, Mike Lasby, Yani Ioannou, Rohit Babbar. [doi]
- Direct Unlearning Optimization for Robust and Safe Text-to-Image ModelsYong-Hyun Park, Sangdoo Yun, Jin-Hwa Kim, Junho Kim, Geonhui Jang, Yonghyun Jeong, Junghyo Jo, Gayoung Lee. [doi]
- SeeClear: Semantic Distillation Enhances Pixel Condensation for Video Super-ResolutionQi Tang, Yao Zhao, Meiqin Liu, Chao Yao. [doi]
- Towards Calibrated Robust Fine-Tuning of Vision-Language ModelsChangdae Oh, Hyesu Lim, Mijoo Kim, Dongyoon Han, Sangdoo Yun, Jaegul Choo, Alexander Hauptmann 0001, Zhi-Qi Cheng, Kyungwoo Song. [doi]
- Fairness-Aware Meta-Learning via Nash BargainingYi Zeng 0005, Xuelin Yang, Li Chen, Cristian Canton-Ferrer, Ming Jin 0002, Michael I. Jordan, Ruoxi Jia 0001. [doi]
- Towards Efficient and Optimal Covariance-Adaptive Algorithms for Combinatorial Semi-BanditsJulien Zhou, Pierre Gaillard, Thibaud Rahier, Houssam Zenati, Julyan Arbel. [doi]
- Community Detection Guarantees using Embeddings Learned by Node2VecAndrew Davison, S. Carlyle Morgan, Owen G. Ward. [doi]
- OptEx: Expediting First-Order Optimization with Approximately Parallelized IterationsYao Shu, Jiongfeng Fang, Ying He, Fei Yu. [doi]
- Bridge the Points: Graph-based Few-shot Segment Anything SemanticallyAnqi Zhang, Guangyu Gao, Jianbo Jiao, Chi Liu, Yunchao Wei. [doi]
- Learning via Surrogate PAC-BayesAntoine Picard-Weibel, Roman Moscoviz, Benjamin Guedj. [doi]
- FM-Delta: Lossless Compression for Storing Massive Fine-tuned Foundation ModelsWanyi Ning, Jingyu Wang, Qi Qi 0001, Mengde Zhu, Haifeng Sun 0001, Daixuan Cheng, Jianxin Liao, Ce Zhang. [doi]
- Artemis: Towards Referential Understanding in Complex VideosJihao Qiu, Yuan Zhang, Xi Tang, Lingxi Xie, TianRen Ma, Pengyu Yan, David S. Doermann, Qixiang Ye, Yunjie Tian. [doi]
- Last-Iterate Global Convergence of Policy Gradients for Constrained Reinforcement LearningAlessandro Montenegro, Marco Mussi, Matteo Papini, Alberto Maria Metelli. [doi]
- Evaluation of Text-to-Video Generation Models: A Dynamics PerspectiveMingxiang Liao, Hannan Lu, Qixiang Ye, Wangmeng Zuo, Fang Wan 0001, Tianyu Wang, YuZhong Zhao, Jingdong Wang 0001, Xinyu Zhang 0017. [doi]
- IWBVT: Instance Weighting-based Bias-Variance Trade-off for CrowdsourcingWenjun Zhang 0012, Liangxiao Jiang, Chaoqun Li 0001. [doi]
- Are Your Models Still Fair? Fairness Attacks on Graph Neural Networks via Node InjectionsZihan Luo 0001, Hong Huang 0001, Yongkang Zhou, Jiping Zhang, Nuo Chen, Hai Jin 0001. [doi]
- Can LLMs Implicitly Learn Numeric Parameter Constraints in Data Science APIs?Yinlin Deng, Chunqiu Steven Xia, Zhezhen Cao, Meiziniu Li, Lingming Zhang 0001. [doi]
- Conformal Inverse OptimizationBo Lin, Erick Delage, Timothy C. Y. Chan. [doi]
- Belief-State Query Policies for User-Aligned POMDPsDaniel Bramblett, Siddharth Srivastava 0001. [doi]
- Multi-Agent Domain Calibration with a Handful of Offline DataTao Jiang, Lei Yuan 0001, Lihe Li, Cong Guan, Zongzhang Zhang, Yang Yu 0001. [doi]
- AlphaMath Almost Zero: Process Supervision without ProcessGuoxin Chen, Minpeng Liao, Chengxi Li 0014, Kai Fan 0002. [doi]
- Warm-starting Push-RelabelSami Davies, Sergei Vassilvitskii, Yuyan Wang. [doi]
- SfPUEL: Shape from Polarization under Unknown Environment LightYouwei Lyu, Heng Guo 0003, Kailong Zhang, Si Li 0001, Boxin Shi. [doi]
- NoiseGPT: Label Noise Detection and Rectification through Probability CurvatureHaoyu Wang, Zhuo Huang, Zhiwei Lin, Tongliang Liu. [doi]
- Are Large Language Models Good Statisticians?Yizhang Zhu, Shiyin Du, Boyan Li, Yuyu Luo, Nan Tang 0001. [doi]
- AdjointDEIS: Efficient Gradients for Diffusion ModelsZander W. Blasingame, Chen Liu 0001. [doi]
- Limits of Transformer Language Models on Learning to Compose AlgorithmsJonathan Thomm, Giacomo Camposampiero, Aleksandar Terzic, Michael Hersche, Bernhard Schölkopf, Abbas Rahimi. [doi]
- Prospective Representation Learning for Non-Exemplar Class-Incremental LearningWuxuan Shi, Mang Ye. [doi]
- Revisiting Score Propagation in Graph Out-of-Distribution DetectionLongfei Ma, Yiyou Sun, Kaize Ding, Zemin Liu, Fei Wu 0001. [doi]
- L-TTA: Lightweight Test-Time Adaptation Using a Versatile Stem LayerJin Shin, Hyun Kim. [doi]
- Self-Supervised Adversarial Training via Diverse Augmented Queries and Self-Supervised Double PerturbationRuize Zhang, Sheng Tang, Juan Cao 0001. [doi]
- Credal Learning TheoryMichele Caprio, Maryam Sultana, Eleni Elia, Fabio Cuzzolin. [doi]
- Molecule Design by Latent Prompt TransformerDeqian Kong, Yuhao Huang, Jianwen Xie, Edouardo Honig, Ming Xu, Shuanghong Xue, Pei Lin, Sanping Zhou, Sheng Zhong, Nanning Zheng 0001, Ying Nian Wu. [doi]
- Changing the Training Data Distribution to Reduce Simplicity Bias Improves In-distribution GeneralizationDang Nguyen, Paymon Haddad, Eric Gan, Baharan Mirzasoleiman. [doi]
- Do Finetti: On Causal Effects for Exchangeable DataSiyuan Guo, Chi Zhang, Karthika Mohan, Ferenc Huszar, Bernhard Schölkopf. [doi]
- RedCode: Risky Code Execution and Generation Benchmark for Code AgentsChengquan Guo, Xun Liu, Chulin Xie, Andy Zhou, Yi Zeng, Zinan Lin 0001, Dawn Song, Bo Li. [doi]
- SpatialRGPT: Grounded Spatial Reasoning in Vision-Language ModelsAn-Chieh Cheng, Hongxu Yin, Yang Fu, Qiushan Guo, Ruihan Yang, Jan Kautz, Xiaolong Wang 0004, Sifei Liu. [doi]
- AV-Cloud: Spatial Audio Rendering Through Audio-Visual Cloud SplattingMingfei Chen, Eli Shlizerman. [doi]
- Theoretical Investigations and Practical Enhancements on Tail Task Risk Minimization in Meta LearningYiqin Lv, Qi Wang, Dong Liang, Zheng Xie. [doi]
- Local Linearity: the Key for No-regret Reinforcement Learning in Continuous MDPsDavide Maran, Alberto Maria Metelli, Matteo Papini, Marcello Restelli. [doi]
- Multi-model Ensemble Conformal Prediction in Dynamic EnvironmentsErfan Hajihashemi, Yanning Shen. [doi]
- IMPACT: A Large-scale Integrated Multimodal Patent Analysis and Creation Dataset for Design PatentsHomaira Huda Shomee, Zhu Wang, Sathya N. Ravi, Sourav Medya. [doi]
- Bridging OOD Detection and Generalization: A Graph-Theoretic ViewHan Wang, Sharon Li 0001. [doi]
- Context-Aware Testing: A New Paradigm for Model Testing with Large Language ModelsPaulius Rauba, Nabeel Seedat, Max Ruiz Luyten, Mihaela van der Schaar. [doi]
- DapperFL: Domain Adaptive Federated Learning with Model Fusion Pruning for Edge DevicesYongzhe Jia, Xuyun Zhang, Hongsheng Hu, Kim-Kwang Raymond Choo, Lianyong Qi, Xiaolong Xu 0001, Amin Beheshti, Wanchun Dou. [doi]
- Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?Zhanke Zhou, Rong Tao, Jianing Zhu, Yiwen Luo, Zengmao Wang, Bo Han 0003. [doi]
- Boundary Decomposition for Nadir Objective Vector EstimationRuihao Zheng, Zhenkun Wang 0001. [doi]
- Bayesian-guided Label Mapping for Visual ReprogrammingChengyi Cai, Zesheng Ye, Lei Feng 0006, Jianzhong Qi 0001, Feng Liu. [doi]
- Harnessing Multiple Correlated Networks for Exact Community RecoveryMiklós Z. Rácz, Jifan Zhang. [doi]
- ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video GenerationShenghai Yuan, Jinfa Huang, Yongqi Xu, Yaoyang Liu, Shaofeng Zhang, Yujun Shi, Ruijie Zhu 0003, Xinhua Cheng, Jiebo Luo 0001, Li Yuan 0007. [doi]
- Stabilize the Latent Space for Image Autoregressive Modeling: A Unified PerspectiveYongxin Zhu 0003, Bocheng Li, Hang Zhang, Xin Li 0056, Linli Xu, Lidong Bing. [doi]
- Unsupervised Object Detection with Theoretical GuaranteesMarian Longa, João F. Henriques. [doi]
- Visual Prompt Tuning in Null Space for Continual LearningYue Lu, Shizhou Zhang, De Cheng, Yinghui Xing, Nannan Wang 0001, Peng Wang 0015, Yanning Zhang. [doi]
- Stepping on the Edge: Curvature Aware Learning Rate TunersVincent Roulet, Atish Agarwala, Jean-Bastien grill, Grzegorz Swirszcz, Mathieu Blondel, Fabian Pedregosa. [doi]
- Persistent Homology for High-dimensional Data Based on Spectral MethodsSebastian Damrich, Philipp Berens, Dmitry Kobak. [doi]
- Nonstationary Sparse Spectral Permanental ProcessZicheng Sun, Yixuan Zhang 0006, Zenan Ling, Xuhui Fan 0001, Feng Zhou 0011. [doi]
- DDK: Distilling Domain Knowledge for Efficient Large Language ModelsJiaheng Liu, Chenchen Zhang, Jinyang Guo, Yuanxing Zhang, Haoran Que, Ken Deng, Zhiqi Bai, Jie Liu, Ge Zhang, Jiakai Wang, Yanan Wu, Congnan Liu, Jiamang Wang, Lin Qu, Wenbo Su, Bo Zheng. [doi]
- Can Transformers Smell Like Humans?Farzaneh Taleb, Miguel Vasco, Antônio H. Ribeiro, Mårten Björkman, Danica Kragic. [doi]
- Make-An-Agent: A Generalizable Policy Network Generator with Behavior-Prompted DiffusionYongyuan Liang, Tingqiang Xu, Kaizhe Hu, Guangqi Jiang, Furong Huang, Huazhe Xu. [doi]
- A Walsh Hadamard Derived Linear Vector Symbolic ArchitectureMohammad Mahmudul Alam, Alexander Oberle, Edward Raff, Stella Biderman, Tim Oates 0001, James Holt. [doi]
- Continuous Heatmap Regression for Pose Estimation via Implicit Neural RepresentationShengxiang Hu 0001, HuaiJiang Sun, Dong Wei 0007, Xiaoning Sun, Jin Wang. [doi]
- DeTrack: In-model Latent Denoising Learning for Visual Object TrackingXinyu Zhou, Jinglun Li, Lingyi Hong, Kaixun Jiang, Pinxue Guo, Weifeng Ge, Wenqiang Zhang. [doi]
- Directional Smoothness and Gradient Methods: Convergence and AdaptivityAaron Mishkin, Ahmed Khaled 0001, Yuanhao Wang 0001, Aaron Defazio, Robert M. Gower. [doi]
- AutoGuide: Automated Generation and Selection of Context-Aware Guidelines for Large Language Model AgentsYao Fu, Dong Ki Kim, Jaekyeom Kim, Sungryull Sohn, Lajanugen Logeswaran, Kyunghoon Bae, Honglak Lee. [doi]
- Operator World Models for Reinforcement LearningPietro Novelli, Marco Pratticò, Massimiliano Pontil, Carlo Ciliberto. [doi]
- Slot-VLM: Object-Event Slots for Video-Language ModelingJiaqi Xu, Cuiling Lan, Wenxuan Xie, Xuejin Chen, Yan Lu 0001. [doi]
- PrivacyLens: Evaluating Privacy Norm Awareness of Language Models in ActionYijia Shao, Tianshi Li 0006, Weiyan Shi, Yanchen Liu, Diyi Yang. [doi]
- fMRI predictors based on language models of increasing complexity recover brain left lateralizationLaurent Bonnasse-Gahot, Christophe Pallier. [doi]
- PowerGraph: A power grid benchmark dataset for graph neural networksAnna Varbella, Kenza Amara, Blazhe Gjorgiev, Mennatallah El-Assady, Giovanni Sansavini. [doi]
- SARAD: Spatial Association-Aware Anomaly Detection and Diagnosis for Multivariate Time SeriesZhihao Dai, Ligang He, Shuanghua Yang, Matthew Leeke. [doi]
- Universality in Transfer Learning for Linear ModelsReza Ghane, Danil Akhtiamov, Babak Hassibi. [doi]
- Semantics and Spatiality of Emergent CommunicationRotem Ben Zion, Boaz Carmeli, Orr Paradise, Yonatan Belinkov. [doi]
- Universal Sample CodingSzymon Kobus, Tze-Yang Tung, Deniz Gündüz. [doi]
- Model Based Inference of Synaptic Plasticity RulesYash Mehta, Danil Tyulmankov, Adithya Rajagopalan, Glenn Turner, James FitzGerald, Jan Funke. [doi]
- Continuous Product Graph Neural NetworksAref Einizade, Fragkiskos D. Malliaros, Jhony H. Giraldo. [doi]
- Faster Neighborhood Attention: Reducing the O(n^2) Cost of Self Attention at the Threadblock LevelAli Hassani 0001, Wen-mei Hwu, Humphrey Shi. [doi]
- Decision-Making Behavior Evaluation Framework for LLMs under Uncertain ContextJingru Jia, Zehua Yuan, Junhao Pan, Paul McNamara, Deming Chen. [doi]
- Exploration by Learning Diverse Skills through Successor State RepresentationsPaul-Antoine Le Tolguenec, Yann Besse, Florent Teichteil-Königsbuch, Dennis Wilson, Emmanuel Rachelson. [doi]
- Exploring Adversarial Robustness of Deep State Space ModelsBiqing Qi, Yiang Luo, Junqi Gao, Pengfei Li, Kai Tian, Zhiyuan Ma 0005, Bowen Zhou 0002. [doi]
- LSH-MoE: Communication-efficient MoE Training via Locality-Sensitive HashingXiaonan Nie, Qibin Liu, Fangcheng Fu, Shenhan Zhu, Xupeng Miao, Xiaoyang Li, Yang Zhang, Shouda Liu, Bin Cui 0001. [doi]
- TEG-DB: A Comprehensive Dataset and Benchmark of Textual-Edge GraphsZhuofeng Li, Zixing Gou, Xiangnan Zhang, Zhongyuan Liu, Sirui Li, Yuntong Hu, Chen Ling 0003, Zheng Zhang 0047, Liang Zhao. [doi]
- EgoSim: An Egocentric Multi-view Simulator and Real Dataset for Body-worn Cameras during Motion and ActivityDominik Hollidt, Paul Streli, Jiaxi Jiang, Yasaman Haghighi, Changlin Qian, Xintong Liu, Christian Holz 0001. [doi]
- MAmmoTH2: Scaling Instructions from the WebXiang Yue, Tianyu Zheng, Ge Zhang, Wenhu Chen. [doi]
- AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-wise Pruning of Large Language ModelsHaiquan Lu, Yefan Zhou, Shiwei Liu 0003, Zhangyang Wang, Michael W. Mahoney, Yaoqing Yang. [doi]
- Private Edge Density Estimation for Random Graphs: Optimal, Efficient and RobustHongjie Chen 0004, Jingqiu Ding, Yiding Hua, David Steurer. [doi]
- Shape analysis for time seriesThibaut Germain, Samuel Gruffaz, Charles Truong, Alain Durmus, Laurent Oudre. [doi]
- Knowledge-Empowered Dynamic Graph Network for Irregularly Sampled Medical Time SeriesYicheng Luo, Zhen Liu 0023, Linghao Wang, Binquan Wu, Junhao Zheng, Qianli Ma 0001. [doi]
- Learning Successor Features the Simple WayRaymond Chua, Arna Ghosh, Christos Kaplanis, Blake A. Richards, Doina Precup. [doi]
- Structured Multi-Track Accompaniment Arrangement via Style Prior ModellingJingwei Zhao, Gus Xia, Ziyu Wang 0008, Ye Wang. [doi]
- Neural Collapse Inspired Feature Alignment for Out-of-Distribution GeneralizationZhikang Chen, Min Zhang, Sen Cui, Haoxuan Li, Gang Niu 0001, Mingming Gong, Changshui Zhang, Kun Zhang 0001. [doi]
- DF40: Toward Next-Generation Deepfake DetectionZhiyuan Yan, Taiping Yao, Shen Chen, Yandan Zhao, Xinghe Fu, Junwei Zhu, Donghao Luo, Chengjie Wang, Shouhong Ding, Yunsheng Wu, Li Yuan. [doi]
- Lean Workbook: A large-scale Lean problem set formalized from natural language math problemsHuaiyuan Ying, Zijian Wu, Yihan Geng, Jiayu Wang, Dahua Lin, Kai Chen 0026. [doi]
- SampDetox: Black-box Backdoor Defense via Perturbation-based Sample DetoxificationYanxin Yang, Chentao Jia, Dengke Yan, Ming Hu, Tianlin Li, Xiaofei Xie, Xian Wei, Mingsong Chen 0001. [doi]
- GenRec: Unifying Video Generation and Recognition with Diffusion ModelsZejia Weng, Xitong Yang, Zhen Xing, Zuxuan Wu, Yu-Gang Jiang. [doi]
- Constrained Human-AI Cooperation: An Inclusive Embodied Social Intelligence ChallengeWeihua Du, Qiushi Lyu, Jiaming Shan, Zhenting Qi, Hongxin Zhang, Sunli Chen, Andi Peng, Tianmin Shu, Kwonjoon Lee, Behzad Dariush, Chuang Gan. [doi]
- SustainDC: Benchmarking for Sustainable Data Center ControlAvisek Naug, Antonio Guillen, Ricardo Luna Gutierrez, Vineet Gundecha, Cullen E. Bash, Sahand Ghorbanpour, Sajad Mousavi, Ashwin Ramesh Babu, Dejan Markovikj, Lekhapriya Dheeraj Kashyap, Desik Rengarajan, Soumyendu Sarkar. [doi]
- Boosting the Transferability of Adversarial Attack on Vision Transformer with Adaptive Token TuningDi Ming, Peng Ren, Yunlong Wang, Xin Feng. [doi]
- Optimal Algorithms for Online Convex Optimization with Adversarial ConstraintsAbhishek Sinha, Rahul Vaze. [doi]
- Federated Black-Box Adaptation for Semantic SegmentationJay N. Paranjape, Shameema Sikder, S. Swaroop Vedula, Vishal M. Patel. [doi]
- Instruction Tuning Large Language Models to Understand Electronic Health RecordsZhenbang Wu, Anant Dadu, Mike A. Nalls, Faraz Faghri, Jimeng Sun 0001. [doi]
- Model-GLUE: Democratized LLM Scaling for A Large Model Zoo in the WildXinyu Zhao, Guoheng Sun, Ruisi Cai, Yukun Zhou, Pingzhi Li, Peihao Wang, Bowen Tan, Yexiao He, Li Chen, Yi Liang, Beidi Chen, Binhang Yuan, Hongyi Wang 0001, Ang Li 0005, Zhangyang Wang, Tianlong Chen. [doi]
- BELM: Bidirectional Explicit Linear Multi-step Sampler for Exact Inversion in Diffusion ModelsFangyikang Wang, Hubery Yin, Yuejiang Dong, Huminhao Zhu, Zhang Chao, Hanbin Zhao, Hui Qian 0001, Chen Li. [doi]
- MatFormer: Nested Transformer for Elastic InferenceDevvrit, Sneha Kudugunta, Aditya Kusupati, Tim Dettmers, Kaifeng Chen, Inderjit S. Dhillon, Yulia Tsvetkov, Hanna Hajishirzi, Sham M. Kakade, Ali Farhadi, Prateek Jain 0002. [doi]
- Shuffling Gradient-Based Methods for Nonconvex-Concave Minimax OptimizationQuoc Tran-Dinh, Trang H. Tran, Lam M. Nguyen. [doi]
- Rethinking Reconstruction-based Graph-Level Anomaly Detection: Limitations and a Simple RemedySunwoo Kim, Soo Yong Lee, Fanchen Bu, Shinhwan Kang, KyungHo Kim, Jaemin Yoo, Kijung Shin. [doi]
- On the Inductive Bias of Stacking Towards Improving ReasoningNikunj Saunshi, Stefani Karp, Shankar Krishnan, Sobhan Miryoosefi, Sashank Jakkam Reddi, Sanjiv Kumar. [doi]
- Cluster-Learngene: Inheriting Adaptive Clusters for Vision TransformersQiufeng Wang 0002, Xu Yang 0021, Fu Feng, Jing Wang 0113, Xin Geng 0001. [doi]
- EEGPT: Pretrained Transformer for Universal and Reliable Representation of EEG SignalsGuangyu Wang, Wenchao Liu 0004, Yuhong He, Cong Xu, Lin Ma 0003, Haifeng Li 0001. [doi]
- Enhancing Graph Transformers with Hierarchical Distance Structural EncodingYuankai Luo, Hongkang Li, Lei Shi 0002, Xiao-Ming Wu 0003. [doi]
- Towards Unsupervised Model Selection for Domain Adaptive Object DetectionHengfu Yu, Jinhong Deng, Wen Li 0001, Lixin Duan. [doi]
- Expert-level protocol translation for self-driving labsYu-Zhe Shi, Fanxu Meng, Haofei Hou, Zhangqian Bi, Qiao Xu, Lecheng Ruan, Qining Wang. [doi]
- SongCreator: Lyrics-based Universal Song GenerationShun Lei, Yixuan Zhou 0002, Boshi Tang, Max W. Y. Lam, Feng Liu, Hangyu Liu, Jingcheng Wu, Shiyin Kang, Zhiyong Wu 0001, Helen Meng. [doi]
- TabEBM: A Tabular Data Augmentation Method with Distinct Class-Specific Energy-Based ModelsAndrei Margeloiu, Xiangjian Jiang, Nikola Simidjievski, Mateja Jamnik. [doi]
- Logical characterizations of recurrent graph neural networks with reals and floatsVeeti Ahvonen, Damian Heiman, Antti Kuusisto, Carsten Lutz. [doi]
- MILP-StuDio: MILP Instance Generation via Block Structure DecompositionHaoyang Liu, Jie Wang 0005, Wanbo Zhang, Zijie Geng, Yufei Kuang, Xijun Li, Bin Li 0025, Yongdong Zhang 0001, Feng Wu 0005. [doi]
- DU-Shapley: A Shapley Value Proxy for Efficient Dataset ValuationFelipe Garrido-Lucero, Benjamin Heymann, Maxime Vono, Patrick Loiseau, Vianney Perchet. [doi]
- Neural Residual Diffusion Models for Deep Scalable Vision GenerationZhiyuan Ma 0005, Liangliang Zhao, Biqing Qi, Bowen Zhou 0002. [doi]
- MAGIS: LLM-Based Multi-Agent Framework for GitHub Issue ResolutionWei Tao 0003, Yucheng Zhou, Yanlin Wang 0001, Wenqiang Zhang, Hongyu Zhang 0002, Yu Cheng. [doi]
- Fully Distributed, Flexible Compositional Visual Representations via Soft Tensor ProductsBethia Sun, Maurice Pagnucco, Yang Song 0001. [doi]
- Stochastic Concept Bottleneck ModelsMoritz Vandenhirtz, Sonia Laguna, Ricards Marcinkevics, Julia E. Vogt. [doi]
- On the Stability and Generalization of Meta-LearningYunjuan Wang, Raman Arora. [doi]
- Large Language Models as Urban Residents: An LLM Agent Framework for Personal Mobility GenerationJiawei Wang, Renhe Jiang, Chuang Yang, Zengqing Wu, Makoto Onizuka, Ryosuke Shibasaki, Noboru Koshizuka, Chuan Xiao 0001. [doi]
- Improving Generalization in Federated Learning with Model-Data Mutual Information Regularization: A Posterior Inference ApproachHao Zhang, Chenglin Li, Nuowen Kan, Ziyang Zheng, Wenrui Dai, Junni Zou, Hongkai Xiong. [doi]
- Frustratingly Easy Test-Time Adaptation of Vision-Language ModelsMatteo Farina, Gianni Franchi, Giovanni Iacca, Massimiliano Mancini, Elisa Ricci 0001. [doi]
- JiuZhang3.0: Efficiently Improving Mathematical Reasoning by Training Small Data Synthesis ModelsKun Zhou 0002, Beichen Zhang, Jiapeng Wang, Zhipeng Chen 0001, Xin Zhao 0018, Jing Sha, Zhichao Sheng, Shijin Wang 0001, Ji-Rong Wen. [doi]
- In-Context Learning of a Linear Transformer Block: Benefits of the MLP Component and One-Step GD InitializationRuiqi Zhang, Jingfeng Wu, Peter L. Bartlett. [doi]
- A Careful Examination of Large Language Model Performance on Grade School ArithmeticHugh Zhang, Jeff Da, Dean Lee, Vaughn Robinson, Catherine Wu, William Song, Tiffany Zhao, Pranav Raja, Charlotte Zhuang, Dylan Slack, Qin Lyu, Sean Hendryx, Russell Kaplan, Michele Lunati, Summer Yue. [doi]
- HairFastGAN: Realistic and Robust Hair Transfer with a Fast Encoder-Based ApproachMaxim Nikolaev, Mikhail Kuznetsov, Dmitry P. Vetrov, Aibek Alanov. [doi]
- pcaGAN: Improving Posterior-Sampling cGANs via Principal Component RegularizationMatthew Bendel, Rizwan Ahmad, Philip Schniter. [doi]
- Score-based generative models are provably robust: an uncertainty quantification perspectiveNikiforos Mimikos-Stamatopoulos, Benjamin J. Zhang, Markos A. Katsoulakis. [doi]
- Tactile DreamFusion: Exploiting Tactile Sensing for 3D GenerationRuihan Gao, Kangle Deng, Gengshan Yang, Wenzhen Yuan 0001, Jun-Yan Zhu. [doi]
- Magnet: We Never Know How Text-to-Image Diffusion Models Work, Until We Learn How Vision-Language Models FunctionChenyi Zhuang, Ying Hu, Pan Gao. [doi]
- DeltaDEQ: Exploiting Heterogeneous Convergence for Accelerating Deep Equilibrium IterationsZuowen Wang, Longbiao Cheng, Pehuen Moure, Niklas Hahn, Shih-Chii Liu. [doi]
- TPR: Topology-Preserving Reservoirs for Generalized Zero-Shot LearningHui Chen, Yanbin Liu, Yongqiang Ma, Nanning Zheng 0001, Xin Yu 0002. [doi]
- Aligning Model Properties via Conformal Risk ControlWilliam Overman, Jacqueline Jil Vallon, Mohsen Bayati. [doi]
- FairMedFM: Fairness Benchmarking for Medical Imaging Foundation ModelsRuinan Jin, Zikang Xu, Yuan Zhong 0003, Qingsong Yao, Qi Dou 0001, S. Kevin Zhou, Xiaoxiao Li. [doi]
- OPEL: Optimal Transport Guided ProcedurE LearningSayeed Shafayet Chowdhury, Soumyadeep Chandra, Kaushik Roy 0001. [doi]
- Closed-Loop Visuomotor Control with Generative Expectation for Robotic ManipulationQingwen Bu, Jia Zeng, Li Chen 0008, Yanchao Yang, Guyue Zhou, Junchi Yan, Ping Luo, Heming Cui, Yi Ma, Hongyang Li. [doi]
- Improved Distribution Matching Distillation for Fast Image SynthesisTianwei Yin, Michaël Gharbi, Taesung Park, Richard Zhang 0001, Eli Shechtman, Frédo Durand, Bill Freeman. [doi]
- Neural Signed Distance Function Inference through Splatting 3D Gaussians Pulled on Zero-Level SetWenyuan Zhang, Yu-Shen Liu, Zhizhong Han. [doi]
- Model-based Diffusion for Trajectory OptimizationChaoyi Pan, Zeji Yi, Guanya Shi, Guannan Qu. [doi]
- EnOF-SNN: Training Accurate Spiking Neural Networks via Enhancing the Output FeatureYufei Guo, Weihang Peng 0001, Xiaode Liu, Yuanpei Chen, Yuhan Zhang, Xin Tong, Zhou Jie, Zhe Ma 0001. [doi]
- 0-minimizationYinuo Jiang, Xiuchuan Tang, Cheng Cheng, Ye Yuan. [doi]
- Kernel Language Entropy: Fine-grained Uncertainty Quantification for LLMs from Semantic SimilaritiesAlexander Nikitin, Jannik Kossen, Yarin Gal, Pekka Marttinen. [doi]
- CLIPAway: Harmonizing focused embeddings for removing objects via diffusion modelsYigit Ekin, Ahmet Burak Yildirim, Erdem Eren Caglar, Aykut Erdem, Erkut Erdem, Aysegul Dundar. [doi]
- DHA: Learning Decoupled-Head Attention from Transformer Checkpoints via Adaptive Heads FusionYilong Chen, Linhao Zhang, Junyuan Shang, Zhenyu Zhang, Tingwen Liu, Shuohuan Wang, Yu Sun. [doi]
- Efficiently Learning Significant Fourier Feature Pairs for Statistical Independence TestingYixin Ren, Yewei Xia, Hao Zhang, Jihong Guan, Shuigeng Zhou. [doi]
- Statistical-Computational Trade-offs for Density EstimationAnders Aamand, Alexandr Andoni, Justin Y. Chen, Piotr Indyk, Shyam Narayanan, Sandeep Silwal, Haike Xu. [doi]
- Self-Calibrated Tuning of Vision-Language Models for Out-of-Distribution DetectionGeng Yu, Jianing Zhu, Jiangchao Yao, Bo Han 0003. [doi]
- Nonconvex Federated Learning on Compact Smooth Submanifolds With Heterogeneous DataJiaojiao Zhang, Jiang Hu, Anthony Man-Cho So, Mikael Johansson 0001. [doi]
- Learning Low-Rank Feature for Thorax Disease ClassificationYancheng Wang, Rajeev Goel, Utkarsh Nath, Alvin C. Silva, Teresa Wu, Yingzhen Yang. [doi]
- Resource-Aware Federated Self-Supervised Learning with Global Class RepresentationsMingyi Li, Xiao Zhang 0015, Qi Wang, Tengfei Liu, Ruofan Wu, Weiqiang Wang, Fuzhen Zhuang, Hui Xiong 0001, Dongxiao Yu. [doi]
- ReF-LDM: A Latent Diffusion Model for Reference-based Face Image RestorationChi-Wei Hsiao, Yu-Lun Liu, Cheng-Kun Yang, Sheng-Po Kuo, Kevin Jou, Chia-Ping Chen. [doi]
- ScaleKD: Strong Vision Transformers Could Be Excellent TeachersJiawei Fan, Chao Li, Xiaolong Liu, Anbang Yao. [doi]
- Normal-GS: 3D Gaussian Splatting with Normal-Involved RenderingMeng Wei, Qianyi Wu, Jianmin Zheng, Hamid Rezatofighi, Jianfei Cai 0001. [doi]
- Unified Insights: Harnessing Multi-modal Data for Phenotype Imputation via View DecouplingQiannan Zhang, Weishen Pan, Zilong Bai, Chang Su, Fei Wang. [doi]
- Scaling the Codebook Size of VQ-GAN to 100, 000 with a Utilization Rate of 99%Lei Zhu 0012, Fangyun Wei, Yanye Lu, Dong Chen. [doi]
- DreamCatcher: A Wearer-aware Multi-modal Sleep Event Dataset Based on Earables in Non-restrictive EnvironmentsZeyu Wang, Xiyuxing Zhang, Ruotong Yu, Yuntao Wang 0001, Kenneth Christofferson, Jingru Zhang, Alex Mariakakis, Yuanchun Shi. [doi]
- Energy-based Epistemic Uncertainty for Graph Neural NetworksDominik Fuchsgruber, Tom Wollschläger, Stephan Günnemann. [doi]
- Parameter-free Clipped Gradient Descent Meets PolyakYuki Takezawa, Han Bao 0002, Ryoma Sato, Kenta Niwa, Makoto Yamada. [doi]
- BackTime: Backdoor Attacks on Multivariate Time Series ForecastingXiao Lin, Zhining Liu 0002, Dongqi Fu, Ruizhong Qiu, Hanghang Tong. [doi]
- Bag of Tricks: Benchmarking of Jailbreak Attacks on LLMsZhao Xu, Fan Liu 0008, Hao Liu. [doi]
- Information Re-Organization Improves Reasoning in Large Language ModelsXiaoxia Cheng, Zeqi Tan, Wei Xue, Weiming Lu 0001. [doi]
- Stopping Bayesian Optimization with Probabilistic Regret BoundsJames Wilson. [doi]
- Effective Exploration Based on the Structural Information PrinciplesXianghua Zeng, Hao Peng 0001, Angsheng Li. [doi]
- Identifying Spatio-Temporal Drivers of Extreme EventsMohamad Hakam Shams Eddin, Jürgen Gall. [doi]
- Mutli-Armed Bandits with Network InterferenceAbhineet Agarwal, Anish Agarwal, Lorenzo Masoero, Justin Whitehouse. [doi]
- DASH: Warm-Starting Neural Network Training in Stationary Settings without Loss of PlasticityBaekrok Shin, Junsoo Oh, Hanseul Cho 0002, Chulhee Yun. [doi]
- Inversion-based Latent Bayesian OptimizationJaewon Chu, Jinyoung Park, Seunghun Lee, Hyunwoo J. Kim. [doi]
- DiscoveryWorld: A Virtual Environment for Developing and Evaluating Automated Scientific Discovery AgentsPeter A. Jansen, Marc-Alexandre Côté, Tushar Khot, Erin Bransom, Bhavana Dalvi Mishra, Bodhisattwa Prasad Majumder, Oyvind Tafjord, Peter Clark. [doi]
- PCoTTA: Continual Test-Time Adaptation for Multi-Task Point Cloud UnderstandingJincen Jiang, Qianyu Zhou 0001, Yuhang Li, Xinkui Zhao, Meili Wang, Lizhuang Ma, Jian Chang, Jian-Jun Zhang, Xuequan Lu. [doi]
- MotionTTT: 2D Test-Time-Training Motion Estimation for 3D Motion Corrected MRITobit Klug, Kun Wang, Stefan Ruschke, Reinhard Heckel. [doi]
- DiffuBox: Refining 3D Object Detection with Point DiffusionXiangyu Chen, Zhenzhen Liu, Katie Luo, Siddhartha Datta, Adhitya Polavaram, Yan Wang 0051, Yurong You, Boyi Li, Marco Pavone 0001, Wei-Lun Chao, Mark E. Campbell, Bharath Hariharan, Kilian Q. Weinberger. [doi]
- The Collusion of Memory and Nonlinearity in Stochastic Approximation With Constant StepsizeDongyan Lucy Huo, Yixuan Zhang, Yudong Chen 0001, Qiaomin Xie. [doi]
- COSMIC: Compress Satellite Image Efficiently via Diffusion CompensationZiyuan Zhang, Han Qiu 0001, Maosen Zhang, Jun Liu 0063, Bin Chen 0011, Tianwei Zhang 0004, Hewu Li. [doi]
- Unleashing Multispectral Video's Potential in Semantic Segmentation: A Semi-supervised Viewpoint and New UAV-View BenchmarkWei Ji, Jingjing Li, Wenbo Li 0001, Yilin Shen, Li Cheng 0001, Hongxia Jin. [doi]
- Scaling Proprioceptive-Visual Learning with Heterogeneous Pre-trained TransformersLirui Wang, Xinlei Chen, Jialiang Zhao, Kaiming He. [doi]
- Tri-Level Navigator: LLM-Empowered Tri-Level Learning for Time Series OOD GeneralizationChengtao Jian, Kai Yang 0001, Yang Jiao. [doi]
- Online Budgeted Matching with General BidsJianyi Yang 0001, Pengfei Li 0008, Adam Wierman, Shaolei Ren. [doi]
- DeBaRA: Denoising-Based 3D Room Arrangement GenerationLéopold Maillard, Nicolas Sereyjol-Garros, Tom Durand, Maks Ovsjanikov. [doi]
- The Multimodal Universe: Enabling Large-Scale Machine Learning with 100 TB of Astronomical Scientific DataEirini Angeloudi, Jeroen Audenaert, Micah Bowles, Benjamin M. Boyd, David Chemaly, Brian Cherinka, Ioana Ciuca, Miles D. Cranmer, Aaron Do, Matthew Grayling, Erin E. Hayes, Tom Hehir, Shirley Ho, Marc Huertas-Company, Kartheik Iyer, Maja Jablonska, François Lanusse, Henry Leung, Kaisey Mandel, Rafael Martínez-Galarza, Peter Melchior, Lucas Meyer, Liam Holden Parker, Helen Qu, Jeff Shen, Michael T. Smith, Connor Stone, Mike Walmsley, John F. Wu. [doi]
- Can large language models explore in-context?Akshay Krishnamurthy, Keegan Harris, Dylan J. Foster, Cyril Zhang, Aleksandrs Slivkins. [doi]
- Learning Cut Generating Functions for Integer ProgrammingHongyu Cheng, Amitabh Basu. [doi]
- Predictor-Corrector Enhanced Transformers with Exponential Moving Average Coefficient LearningBei Li, Tong Zheng, Rui Wang 0028, Jiahao Liu, Qingyan Guo, Junliang Guo, Xu Tan 0003, Tong Xiao, Jingbo Zhu, Jingang Wang, Xunliang Cai. [doi]
- The Reliability of OKRidge Method in Solving Sparse Ridge Regression ProblemsXiyuan Li, Youjun Wang, Weiwei Liu 0001. [doi]
- Scalable Constrained Policy Optimization for Safe Multi-agent Reinforcement LearningLijun Zhang, Lin Li, Wei Wei, Huizhong Song, Yaodong Yang, Jiye Liang. [doi]
- Speculative Monte-Carlo Tree SearchScott Cheng, Mahmut T. Kandemir, Ding-Yong Hong. [doi]
- Exclusively Penalized Q-learning for Offline Reinforcement LearningJunghyuk Yeom, Yonghyeon Jo, Jeongmo Kim, Sanghyeon Lee, Seungyul Han. [doi]
- PowerPM: Foundation Model for Power SystemsShihao Tu, Yupeng Zhang, Jing Zhang, Zhendong Fu, Yin Zhang, Yang Yang. [doi]
- Evidential Mixture Machines: Deciphering Multi-Label Correlations for Active Learning SensitivityDayou Yu, Minghao Li, Weishi Shi, Qi Yu 0001. [doi]
- NeuralPlane: An Efficiently Parallelizable Platform for Fixed-wing Aircraft Control with Reinforcement LearningChuanyi Xue, Qihan Liu, Xiaoteng Ma, Xinyao Qin, Gui Ning, Yang Qi, Jinsheng Ren, Bin Liang, Jun Yang. [doi]
- A Practitioner's Guide to Real-World Continual Multimodal PretrainingVishaal Udandarao, Karsten Roth, Sebastian Dziadzio, Ameya Prabhu, Mehdi Cherti, Oriol Vinyals, Olivier J. Hénaff, Samuel Albanie, Zeynep Akata, Matthias Bethge. [doi]
- Detecting and Measuring Confounding Using Causal Mechanism ShiftsAbbavaram Gowtham Reddy, Vineeth N. Balasubramanian. [doi]
- Hamiltonian Score Matching and Generative FlowsPeter Holderrieth, Yilun Xu, Tommi S. Jaakkola. [doi]
- VideoTetris: Towards Compositional Text-to-Video GenerationYe Tian, Ling Yang 0006, Haotian Yang, Yuan Gao, Yufan Deng, Xintao Wang, Zhaochen Yu, Xin Tao, Pengfei Wan, Di Zhang, Bin Cui 0001. [doi]
- Accelerating Matroid Optimization through Fast Imprecise OraclesFranziska Eberle, Felix Hommelsheim, Alexander Lindermayr, Zhenwei Liu, Nicole Megow, Jens Schlöter. [doi]
- Are Large-scale Soft Labels Necessary for Large-scale Dataset Distillation?Lingao Xiao, Yang He 0002. [doi]
- MiSO: Optimizing brain stimulation to create neural activity statesYuki Minai, Joana Soldado-Magraner, Matthew A. Smith 0001, Byron M. Yu. [doi]
- An Equivalence Between Static and Dynamic Regret MinimizationAndrew Jacobsen, Francesco Orabona. [doi]
- Regularized Q-LearningHan-Dong Lim, Donghwan Lee. [doi]
- Activation Map Compression through Tensor Decomposition for Deep LearningLe-Trung Nguyen, Aël Quélennec, Enzo Tartaglione, Samuel Tardieu, Van Tam Nguyen. [doi]
- Who's Gaming the System? A Causally-Motivated Approach for Detecting Strategic AdaptationTrenton Chang, Lindsay A. Warrenburg, Sae-Hwan Park, Ravi B. Parikh, Maggie Makar, Jenna Wiens. [doi]
- DiffPhyCon: A Generative Approach to Control Complex Physical SystemsLong Wei, Peiyan Hu, Ruiqi Feng, Haodong Feng, Yixuan Du, Tao Zhang 0033, Rui Wang 0017, Yue Wang 0017, Zhi-Ming Ma, Tailin Wu. [doi]
- Exploiting Representation Curvature for Boundary Detection in Time SeriesYooju Shin, Jaehyun Park, Susik Yoon, Hwanjun Song, Byung Suk Lee 0001, Jae-Gil Lee 0001. [doi]
- Aligner: Efficient Alignment by Learning to CorrectJiaming Ji, Boyuan Chen 0008, Hantao Lou, Donghai Hong, Borong Zhang, Xuehai Pan, Tianyi Qiu, Juntao Dai, Yaodong Yang 0001. [doi]
- UnSeg: One Universal Unlearnable Example Generator is Enough against All Image SegmentationYe Sun, Hao Zhang 0047, Tiehua Zhang, Xingjun Ma, Yu-Gang Jiang 0001. [doi]
- ClashEval: Quantifying the tug-of-war between an LLM's internal prior and external evidenceKevin Wu, Eric Wu, James Y. Zou. [doi]
- Multivariate Stochastic Dominance via Optimal Transport and Applications to Models BenchmarkingGabriel Rioux, Apoorva Nitsure, Mattia Rigotti, Kristjan H. Greenewald, Youssef Mroueh. [doi]
- Boundary Matters: A Bi-Level Active Finetuning MethodHan Lu, Yichen Xie 0002, Xiaokang Yang, Junchi Yan. [doi]
- Benchmarking Structural Inference Methods for Interacting Dynamical Systems with Synthetic DataAoran Wang, Tsz Pan Tong, Andrzej Mizera, Jun Pang 0001. [doi]
- Diffusion Forcing: Next-token Prediction Meets Full-Sequence DiffusionBoyuan Chen 0003, Diego Marti Monso, Yilun Du, Max Simchowitz, Russ Tedrake, Vincent Sitzmann. [doi]
- Spectral Graph Pruning Against Over-Squashing and Over-SmoothingAdarsh Jamadandi, Celia Rubio-Madrigal, Rebekka Burkholz. [doi]
- Multiview Scene GraphJuexiao Zhang, Gao Zhu, Sihang Li 0001, Xinhao Liu 0003, Haorui Song, Xinran Tang, Chen Feng 0002. [doi]
- Data Attribution for Text-to-Image Models by Unlearning Synthesized ImagesSheng-yu Wang, Aaron Hertzmann, Alexei A. Efros, Jun-Yan Zhu, Richard Zhang 0001. [doi]
- Dual-Perspective Activation: Efficient Channel Denoising via Joint Forward-Backward Criterion for Artificial Neural NetworksTian Qiu, Chenchao Gao, Zunlei Feng, Jie Lei 0002, Bingde Hu, Xingen Wang, Yi Gao, Mingli Song. [doi]
- NoisyGL: A Comprehensive Benchmark for Graph Neural Networks under Label NoiseZhonghao Wang 0002, Danyu Sun, Sheng Zhou 0004, Haobo Wang, Jiapei Fan, Longtao Huang, Jiajun Bu. [doi]
- Reinforcement Learning Under Latent Dynamics: Toward Statistical and Algorithmic ModularityPhilip Amortila, Dylan J. Foster, Nan Jiang 0008, Akshay Krishnamurthy, Zakaria Mhammedi. [doi]
- PTQ4DiT: Post-training Quantization for Diffusion TransformersJunyi Wu, Haoxuan Wang, Yuzhang Shang, Mubarak Shah, Yan Yan 0002. [doi]
- How does Gradient Descent Learn Features - A Local Analysis for Regularized Two-Layer Neural NetworksMo Zhou, Rong Ge 0001. [doi]
- Are Multiple Instance Learning Algorithms Learnable for Instances?Jaeseok Jang, Hyuk-Yoon Kwon. [doi]
- SeeA*: Efficient Exploration-Enhanced A* Search by Selective SamplingDengwei Zhao, Shikui Tu, Lei Xu 0001. [doi]
- Are High-Degree Representations Really Unnecessary in Equivariant Graph Neural Networks?Jiacheng Cen, Anyi Li, Ning Lin, Yuxiang Ren, Zihe Wang, Wenbing Huang 0001. [doi]
- Geometry-aware training of factorized layers in tensor Tucker formatEmanuele Zangrando, Steffen Schotthöfer, Gianluca Ceruti, Jonas Kusch, Francesco Tudisco. [doi]
- Towards Understanding How Transformers Learn In-context Through a Representation Learning LensRuifeng Ren, Yong Liu. [doi]
- Differentially Private Graph Diffusion with Applications in Personalized PageRanksRongzhe Wei, Eli Chien, Pan Li 0005. [doi]
- GC-Bench: An Open and Unified Benchmark for Graph CondensationQingyun Sun, Ziying Chen, Beining Yang, Cheng Ji 0001, Xingcheng Fu, Sheng Zhou 0004, Hao Peng 0001, Jianxin Li 0002, Philip S. Yu. [doi]
- Integrating GNN and Neural ODEs for Estimating Non-Reciprocal Two-Body Interactions in Mixed-Species Collective MotionMasahito Uwamichi, Simon K. Schnyder, Tetsuya J. Kobayashi, Satoshi Sawai. [doi]
- No-Regret Bandit Exploration based on Soft Tree Ensemble ModelShogo Iwazaki, Shinya Suzumura. [doi]
- MGF: Mixed Gaussian Flow for Diverse Trajectory PredictionJiahe Chen, Jinkun Cao, Dahua Lin, Kris Kitani, Jiangmiao Pang. [doi]
- Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-TrainingWenyu Du, Tongxu Luo, Zihan Qiu, Zeyu Huang, Yikang Shen, Reynold Cheng, Yike Guo, Jie Fu 0001. [doi]
- First-Explore, then Exploit: Meta-Learning to Solve Hard Exploration-Exploitation Trade-OffsBen Norman, Jeff Clune. [doi]
- Spatio-Temporal Interactive Learning for Efficient Image Reconstruction of Spiking CamerasBin Fan 0002, Jiaoyang Yin, Yuchao Dai, Chao Xu 0006, Tiejun Huang 0001, Boxin Shi. [doi]
- Aligning Embeddings and Geometric Random Graphs: Informational Results and Computational Approaches for the Procrustes-Wasserstein ProblemMathieu Even, Luca Ganassali, Jakob Maier, Laurent Massoulié. [doi]
- Scaling Laws with Vocabulary: Larger Models Deserve Larger VocabulariesChaofan Tao, Qian Liu, Longxu Dou, Niklas Muennighoff, Zhongwei Wan, Ping Luo, Min Lin, Ngai Wong. [doi]
- MMM-RS: A Multi-modal, Multi-GSD, Multi-scene Remote Sensing Dataset and Benchmark for Text-to-Image GenerationJialin Luo, Yuanzhi Wang, Ziqi Gu, Yide Qiu, Shuaizhen Yao, Fuyun Wang, Chunyan Xu, Wenhua Zhang, Dan Wang, Zhen Cui 0001. [doi]
- A two-scale Complexity Measure for Deep Learning ModelsMassimiliano Datres, Gian Paolo Leonardi, Alessio Figalli, David Sutter. [doi]
- Direct Preference-Based Evolutionary Multi-Objective Optimization with Dueling BanditsTian Huang, Shengbo Wang, Ke Li 0001. [doi]
- EASI: Evolutionary Adversarial Simulator Identification for Sim-to-Real TransferHaoyu Dong, Huiqiao Fu, Wentao Xu, Zhehao Zhou, Chunlin Chen. [doi]
- Learning Spatially-Aware Language and Audio EmbeddingsBhavika Devnani, Skyler Seto, Zakaria Aldeneh, Alessandro Toso, Elena Menyaylenko, Barry-John Theobald, Jonathan Sheaffer, Miguel Sarabia. [doi]
- No Free Delivery Service: Epistemic limits of passive data collection in complex social systemsMaximilian Nickel. [doi]
- The Benefits of Balance: From Information Projections to Variance ReductionLang Liu, Ronak Mehta, Soumik Pal, Zaïd Harchaoui. [doi]
- Identifiable Shared Component Analysis of Unpaired Multimodal MixturesSubash Timilsina, Sagar Shrestha, Xiao Fu 0001. [doi]
- Provably Faster Algorithms for Bilevel Optimization via Without-Replacement SamplingJunyi Li, Heng Huang. [doi]
- Non-Asymptotic Uncertainty Quantification in High-Dimensional LearningFrederik Hoppe, Claudio Mayrink Verdun, Hannah Laus, Felix Krahmer, Holger Rauhut. [doi]
- SpreadsheetBench: Towards Challenging Real World Spreadsheet ManipulationZeyao Ma, Bohan Zhang, Jing Zhang, Jifan Yu, Xiaokang Zhang, Xiaohan Zhang, Sijia Luo, Xi Wang, Jie Tang 0001. [doi]
- CRONOS: Enhancing Deep Learning with Scalable GPU Accelerated Convex Neural NetworksMiria Feng, Zachary Frangella, Mert Pilanci. [doi]
- Designing Cell-Type-Specific Promoter Sequences Using Conservative Model-Based OptimizationAniketh Janardhan Reddy, Xinyang Geng, Michael Herschl, Sathvik Kolli, Aviral Kumar, Patrick Hsu, Sergey Levine, Nilah Ioannidis. [doi]
- Transformers on Markov data: Constant depth sufficesNived Rajaraman, Marco Bondaschi, Ashok Vardhan Makkuva, Kannan Ramchandran, Michael Gastpar. [doi]
- 3: Exploring Embodied Emotion Through A Large-Scale Egocentric Video DatasetWang Lin, Yueying Feng, WenKang Han, Tao Jin 0004, Zhou Zhao 0001, Fei Wu 0001, Chang Yao, Jingyuan Chen. [doi]
- Are Uncertainty Quantification Capabilities of Evidential Deep Learning a Mirage?Maohao Shen, Jongha Jon Ryu, Soumya Ghosh, Yuheng Bu, Prasanna Sattigeri, Subhro Das, Gregory W. Wornell. [doi]
- Discovery of the Hidden World with Large Language ModelsChenxi Liu, Yongqiang Chen, Tongliang Liu, Mingming Gong, James Cheng, Bo Han, Kun Zhang. [doi]
- Constrained Diffusion Models via Dual TrainingShervin Khalafi, Dongsheng Ding, Alejandro Ribeiro. [doi]
- Indoor Air Quality Dataset with Activities of Daily Living in Low to Middle-income CommunitiesPrasenjit Karmakar, Swadhin Pradhan, Sandip Chakraborty 0001. [doi]
- Why Transformers Need Adam: A Hessian PerspectiveYushun Zhang, Congliang Chen, Tian Ding, Ziniu Li, Ruoyu Sun 0001, Zhi-Quan Luo. [doi]
- Diffusion PID: Interpreting Diffusion via Partial Information DecompositionShaurya Dewan, Rushikesh Zawar, Prakanshul Saxena, Yingshan Chang, Andrew Luo, Yonatan Bisk. [doi]
- Fit for our purpose, not yours: Benchmark for a low-resource, Indigenous languageSuzanne Duncan, Gianna Leoni, Lee Steven, Keoni Mahelona, Peter-Lucas Jones. [doi]
- Generalizable Implicit Motion Modeling for Video Frame InterpolationZujin Guo, Wei Li 0190, Chen Change Loy. [doi]
- Subsurface Scattering for Gaussian SplattingJan-Niklas Dihlmann, Arjun Majumdar, Andreas Engelhardt, Raphael Braun, Hendrik P. A. Lensch. [doi]
- AudioMarkBench: Benchmarking Robustness of Audio WatermarkingHongbin Liu 0005, Moyang Guo, Zhengyuan Jiang, Lun Wang, Neil Gong 0001. [doi]
- Adversarially Trained Weighted Actor-Critic for Safe Offline Reinforcement LearningHonghao Wei, Xiyue Peng, Arnob Ghosh, Xin Liu. [doi]
- Exploring Behavior-Relevant and Disentangled Neural Dynamics with Generative Diffusion ModelsYule Wang, Chengrui Li, Weihan Li, Anqi Wu. [doi]
- Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement LearningSimon Zhai, Hao Bai, Zipeng Lin, Jiayi Pan, Peter Tong, Yifei Zhou, Alane Suhr, Saining Xie, Yann LeCun, Yi Ma 0001, Sergey Levine. [doi]
- RClicks: Realistic Click Simulation for Benchmarking Interactive SegmentationAnton Antonov, Andrey Moskalenko, Denis Shepelev, Alexander Krapukhin, Konstantin Soshin, Anton Konushin, Vlad Shakhuro. [doi]
- G3: An Effective and Adaptive Framework for Worldwide Geolocalization Using Large Multi-Modality ModelsPengyue Jia, Yiding Liu, Xiaopeng Li, Xiangyu Zhao 0001, Yuhao Wang 0006, Yantong Du, Xiao Han 0004, Xuetao Wei, Shuaiqiang Wang, Dawei Yin. [doi]
- Benchmarking Uncertainty Disentanglement: Specialized Uncertainties for Specialized TasksBálint Mucsányi, Michael Kirchhof, Seong Joon Oh. [doi]
- M3LEO: A Multi-Modal, Multi-Label Earth Observation Dataset Integrating Interferometric SAR and Multispectral DataMatthew J. Allen 0001, Francisco Dorr, Joseph Alejandro Gallego Mejia, Laura Martínez-Ferrer, Anna Jungbluth, Freddie Kalaitzis, Raúl Ramos-Pollán. [doi]
- Towards Comprehensive Detection of Chinese Harmful MemesJunyu Lu, Bo Xu, Xiaokun Zhang, Hongbo Wang, Haohao Zhu, Dongyu Zhang, Liang Yang, Hongfei Lin. [doi]
- Benchmarking Generative Models on Computational Thinking Tests in Elementary Visual ProgrammingVictor-Alexandru Padurean, Adish Singla. [doi]
- Hardness of Learning Neural Networks under the Manifold HypothesisBobak T. Kiani, Jason Wang, Melanie Weber 0001. [doi]
- Lower Bounds and Optimal Algorithms for Non-Smooth Convex Decentralized Optimization over Time-Varying NetworksDmitry Kovalev, Ekaterina Borodich, Alexander V. Gasnikov, Dmitrii Feoktistov. [doi]
- Learning from Uncertain Data: From Possible Worlds to Possible ModelsJiongli Zhu, Su Feng, Boris Glavic, Babak Salimi. [doi]
- Contrastive-Equivariant Self-Supervised Learning Improves Alignment with Primate Visual Area ITThomas E. Yerxa, Jenelle Feather, Eero P. Simoncelli, SueYeon Chung. [doi]
- GaussianCut: Interactive segmentation via graph cut for 3D Gaussian SplattingUmangi Jain, Ashkan Mirzaei, Igor Gilitschenski. [doi]
- Spectral-Risk Safe Reinforcement Learning with Convergence GuaranteesDohyeong Kim, Taehyun Cho, Seungyub Han, Hojun Chung, Kyungjae Lee 0001, Songhwai Oh. [doi]
- Score Distillation via Reparametrized DDIMArtem Lukoianov, Haitz Sáez de Ocáriz Borde, Kristjan H. Greenewald, Vitor Guizilini, Timur M. Bagautdinov, Vincent Sitzmann, Justin M. Solomon. [doi]
- Generalized Linear Bandits with Limited AdaptivityAyush Sawarni, Nirjhar Das, Siddharth Barman, Gaurav Sinha 0001. [doi]
- AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge BasesZhaorun Chen, Zhen Xiang, Chaowei Xiao, Dawn Song, Bo Li. [doi]
- Black-Box ForgettingYusuke Kuwana, Yuta Goto, Takashi Shibata 0001, Go Irie. [doi]
- Rainbow Teaming: Open-Ended Generation of Diverse Adversarial PromptsMikayel Samvelyan, Sharath Chandra Raparthy, Andrei Lupu, Eric Hambro, Aram H. Markosyan, Manish Bhatt, Yuning Mao, Minqi Jiang, Jack Parker-Holder, Jakob Foerster, Tim Rocktäschel, Roberta Raileanu. [doi]
- Are Graph Neural Networks Optimal Approximation Algorithms?Morris Yau, Nikolaos Karalias, Eric Lu, Jessica Xu, Stefanie Jegelka. [doi]
- The Importance of Online Data: Understanding Preference Fine-tuning via CoverageYuda Song 0001, Gokul Swamy, Aarti Singh, J. Andrew Bagnell, Wen Sun 0002. [doi]
- ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise OptimizationLuca Eyring, Shyamgopal Karthik, Karsten Roth, Alexey Dosovitskiy, Zeynep Akata. [doi]
- Multi-Object Hallucination in Vision Language ModelsXuweiyi Chen, Ziqiao Ma, Xuejun Zhang 0003, Sihan Xu, Shengyi Qian 0001, Jianing Yang, David Fouhey, Joyce Chai. [doi]
- Log-concave Sampling from a Convex Body with a Barrier: a Robust and Unified Dikin WalkYuzhou Gu, Nikki Lijing Kuang, Yian Ma, Zhao Song 0002, Lichen Zhang 0003. [doi]
- OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AIZhen Huang, Zengzhi Wang, Shijie Xia, Xuefeng Li, Haoyang Zou, Ruijie Xu, Run-Ze Fan, Lyumanshan Ye, Ethan Chern, Yixin Ye, Yikai Zhang 0003, Yuqing Yang 0004, Ting Wu, Binjie Wang, Shichao Sun, Yang Xiao, Yiyuan Li, Fan Zhou, Steffi Chern, Yiwei Qin, Yan Ma, Jiadi Su, Yixiu Liu, Yuxiang Zheng, Shaoting Zhang 0001, Dahua Lin, Yu Qiao 0001, Pengfei Liu 0003. [doi]
- A Closer Look at AUROC and AUPRC under Class ImbalanceMatthew B. A. McDermott, Haoran Zhang 0003, Lasse Hyldig Hansen, Giovanni Angelotti, Jack Gallifant. [doi]
- CigTime: Corrective Instruction Generation Through Inverse Motion EditingQihang Fang, Chengcheng Tang, Bugra Tekin, Yanchao Yang. [doi]
- Why Do We Need Weight Decay in Modern Deep Learning?Francesco D'Angelo, Maksym Andriushchenko, Aditya Vardhan Varre, Nicolas Flammarion. [doi]
- Flexible task abstractions emerge in linear networks with fast and bounded unitsKai Sandbrink, Jan P. Bauer, Alexandra M. Proca, Andrew M. Saxe, Christopher Summerfield, Ali Hummos. [doi]
- TOPA: Extending Large Language Models for Video Understanding via Text-Only Pre-AlignmentWei Li, Hehe Fan, Yongkang Wong, Mohan S. Kankanhalli, Yi Yang 0001. [doi]
- CoSy: Evaluating Textual Explanations of NeuronsLaura Kopf, Philine Lou Bommer, Anna Hedström, Sebastian Lapuschkin, Marina M.-C. Höhne, Kirill Bykov. [doi]
- DisC-GS: Discontinuity-aware Gaussian SplattingHaoxuan Qu, Zhuoling Li, Hossein Rahmani 0001, Yujun Cai, Jun Liu. [doi]
- Rethinking Model-based, Policy-based, and Value-based Reinforcement Learning via the Lens of Representation ComplexityGuhao Feng, Han Zhong 0001. [doi]
- Mutual Information Estimation via Normalizing FlowsIvan Butakov, Aleksander Tolmachev, Sofia Malanchuk, Anna Neopryatnaya, Alexey A. Frolov. [doi]
- Exploring and Exploiting the Asymmetric Valley of Deep Neural NetworksXin-Chun Li, Jin-Lin Tang, Bo Zhang, Lan Li, De-Chuan Zhan. [doi]
- BehaviorGPT: Smart Agent Simulation for Autonomous Driving with Next-Patch PredictionZikang Zhou, Haibo Hu, Xinhong Chen 0003, Jianping Wang 0001, Nan Guan, Kui Wu 0001, Yung-hui Li, Yu-Kai Huang, Chun Jason Xue. [doi]
- DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World ModelYuqi Wang, Ke Cheng, Jiawei He 0002, Qitai Wang, Hengchen Dai, YunTao Chen, Fei Xia, Zhao-Xiang Zhang. [doi]
- Credit Attribution and Stable CompressionRoi Livni, Shay Moran, Kobbi Nissim, Chirag Pabbaraju. [doi]
- PrivCirNet: Efficient Private Inference via Block Circulant TransformationTianshi Xu, Lemeng Wu, Runsheng Wang, Meng Li 0004. [doi]
- Consistency Purification: Effective and Efficient Diffusion Purification towards Certified RobustnessYiquan Li, Zhongzhu Chen, Kun Jin, Jiongxiao Wang, Jiachen Lei, Bo Li 0026, Chaowei Xiao. [doi]
- Randomized Sparse Matrix Compression for Large-Scale Constrained Optimization in Cancer RadiotherapyShima Adeli, Mojtaba Tefagh, Gourav Jhanwar, Masoud Zarepisheh. [doi]
- Monomial Matrix Group Equivariant Neural Functional NetworksHoang Tran, Thieu Vo, Tho Huu, An Nguyen The, Tan Nguyen. [doi]
- Few-Shot Diffusion Models Escape the Curse of DimensionalityRuofeng Yang, Bo Jiang, Cheng Chen, Ruinan Jin, Baoxiang Wang 0001, Shuai Li 0010. [doi]
- Relating Hopfield Networks to Episodic ControlHugo Chateau-Laurent, Frédéric Alexandre. [doi]
- State Space Models on Temporal Graphs: A First-Principles StudyJintang Li, Ruofan Wu, Xinzhou Jin, Boqun Ma, Liang Chen 0001, Zibin Zheng. [doi]
- Image Understanding Makes for A Good Tokenizer for Image GenerationLuting Wang 0001, Yang Zhao 0003, Zijian Zhang, Jiashi Feng, Si Liu 0001, Bingyi Kang. [doi]
- Fast yet Safe: Early-Exiting with Risk ControlMetod Jazbec, Alexander Timans, Tin Hadzi Veljkovic, Kaspar Sakmann, Dan Zhang, Christian Andersson Naesseth, Eric T. Nalisnick. [doi]
- Dynamic Tuning Towards Parameter and Inference Efficiency for ViT AdaptationWangbo Zhao, Jiasheng Tang, Yizeng Han, Yibing Song, Kai Wang, Gao Huang 0001, Fan Wang, Yang You. [doi]
- Divide-and-Conquer Posterior Sampling for Denoising Diffusion priorsYazid Janati, Badr Moufad, Alain Durmus, Eric Moulines, Jimmy Olsson. [doi]
- On the Role of Information Structure in Reinforcement Learning for Partially-Observable Sequential Teams and GamesAwni Altabaa, Zhuoran Yang. [doi]
- SWE-agent: Agent-Computer Interfaces Enable Automated Software EngineeringJohn Yang, Carlos E. Jimenez, Alexander Wettig, Kilian Lieret, Shunyu Yao, Karthik Narasimhan, Ofir Press. [doi]
- Subwords as Skills: Tokenization for Sparse-Reward Reinforcement LearningDavid Yunis, Justin Jung, Falcon Z. Dai, Matthew R. Walter. [doi]
- Dual Prototype Evolving for Test-Time Generalization of Vision-Language ModelsCe Zhang 0009, Simon Stepputtis, Katia P. Sycara, Yaqi Xie. [doi]
- Stochastic Optimal Control and Estimation with Multiplicative and Internal NoiseFrancesco Damiani, Akiyuki Anzai, Jan Drugowitsch, Gregory C. DeAngelis, Rubén Moreno-Bote. [doi]
- Accelerating Non-Maximum Suppression: A Graph Theory PerspectiveKing-Siong Si, Lu Sun, Weizhan Zhang, Tieliang Gong, Jiahao Wang, Jiang Liu, Hao Sun. [doi]
- DeformableTST: Transformer for Time Series Forecasting without Over-reliance on PatchingDonghao Luo, Xue Wang. [doi]
- pfl-research: simulation framework for accelerating research in Private Federated LearningFilip Granqvist, Congzheng Song, Áine Cahill, Rogier C. van Dalen, Martin Pelikan, Yi-Sheng Chan, Xiaojun Feng, Natarajan Krishnaswami, Vojta Jina, Mona Chitnis. [doi]
- Self-supervised Transformation Learning for Equivariant RepresentationsJaemyung Yu, Jaehyun Choi, Dong-Jae Lee, Hyeong Gwon Hong, Junmo Kim 0002. [doi]
- The iNaturalist Sounds DatasetMustafa Chasmai, Alexander Shepard, Subhransu Maji, Grant Van Horn. [doi]
- What Matters in Graph Class Incremental Learning? An Information Preservation PerspectiveJialu Li, Yu Wang 0106, Pengfei Zhu 0001, Wanyu Lin, Qinghua Hu. [doi]
- DynaMITE-RL: A Dynamic Model for Improved Temporal Meta-Reinforcement LearningAnthony Liang, Guy Tennenholtz, Chih-Wei Hsu, Yinlam Chow, Erdem Biyik, Craig Boutilier. [doi]
- Gradient Guidance for Diffusion Models: An Optimization PerspectiveYingqing Guo, Hui Yuan 0002, Yukang Yang, Minshuo Chen, Mengdi Wang. [doi]
- Look, Listen, and Answer: Overcoming Biases for Audio-Visual Question AnsweringJie Ma 0001, Min Hu, Pinghui Wang, Wangchun Sun, Lingyun Song, Hongbin Pei, Jun Liu 0002, Youtian Du. [doi]
- The Factorization Curse: Which Tokens You Predict Underlie the Reversal Curse and MoreOuail Kitouni, Niklas Nolte, Adina Williams, Michael Rabbat, Diane Bouchacourt, Mark Ibrahim. [doi]
- Tracing Hyperparameter Dependencies for Model Parsing via Learnable Graph Pooling NetworkXiao Guo, Vishal Asnani, Sijia Liu 0001, Xiaoming Liu. [doi]
- A Simple yet Scalable Granger Causal Structural Learning Approach for Topological Event SequencesMingjia Li 0002, Shuo Liu 0017, Hong Qian, Aimin Zhou. [doi]
- To Believe or Not to Believe Your LLM: Iterative Prompting for Estimating Epistemic UncertaintyYasin Abbasi-Yadkori, Ilja Kuzborskij, András György 0001, Csaba Szepesvári. [doi]
- Poisson Variational AutoencoderHadi Vafaii, Dekel Galor, Jacob L. Yates. [doi]
- HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-TuningChunlin Tian, Zhan Shi, Zhijiang Guo, Li Li 0064, Cheng-Zhong Xu 0001. [doi]
- Soft-Label Integration for Robust Toxicity ClassificationZelei Cheng, Xian Wu 0007, Jiahao Yu 0001, Shuo Han, Xin-Qiang Cai, Xinyu Xing 0001. [doi]
- Provably Robust Score-Based Diffusion Posterior Sampling for Plug-and-Play Image ReconstructionXingyu Xu 0001, Yuejie Chi. [doi]
- High Rank Path Development: an approach to learning the filtration of stochastic processesJiajie Tao, Hao Ni, Chong Liu. [doi]
- FT-AED: Benchmark Dataset for Early Freeway Traffic Anomalous Event DetectionAustin Coursey, Junyi Ji, Marcos Quiñones-Grueiro, William Barbour, Yuhang Zhang 0009, Tyler Derr, Gautam Biswas, Daniel B. Work. [doi]
- Achievable Fairness on Your Data With Utility GuaranteesMuhammad Faaiz Taufiq, Jean-Francois Ton, Yang Liu 0018. [doi]
- MV2Cyl: Reconstructing 3D Extrusion Cylinders from Multi-View ImagesEunji Hong, Minh Hieu Nguyen, Mikaela Angelina Uy, Minhyuk Sung. [doi]
- Enhancing vision-language models for medical imaging: bridging the 3D gap with innovative slice selectionYuli Wang, Peng Jian, Yuwei Dai, Craig K. Jones, Haris I. Sair, Jinglai Shen, Nicolas Loizou, Jing Wu, Wen-Chi Hsu, Maliha Imami, Zhicheng Jiao, Paul Zhang, Harrison Bai. [doi]
- An Analysis of Elo Rating Systems via Markov ChainsSam Olesker-Taylor, Luca Zanetti. [doi]
- Prospective Learning: Learning for a Dynamic FutureAshwin De Silva, Rahul Ramesh, Rubing Yang, Siyu Yu, Joshua T. Vogelstein, Pratik Chaudhari. [doi]
- Measuring Multimodal Mathematical Reasoning with MATH-Vision DatasetKe Wang, Junting Pan, Weikang Shi, Zimu Lu, Houxing Ren, Aojun Zhou, Mingjie Zhan, Hongsheng Li. [doi]
- Curriculum Fine-tuning of Vision Foundation Model for Medical Image Classification Under Label NoiseYeonguk Yu, Minhwan Ko, Sungho Shin, Kangmin Kim, Kyoobin Lee. [doi]
- The Price of Implicit Bias in Adversarially Robust GeneralizationNikolaos Tsilivis 0002, Natalie Frank, Nati Srebro, Julia Kempe. [doi]
- What Is Missing For Graph Homophily? Disentangling Graph Homophily For Graph Neural NetworksYilun Zheng, Sitao Luan, Lihui Chen 0001. [doi]
- Learnability Matters: Active Learning for Video CaptioningYiqian Zhang, Buyu Liu, Jun Bao, Qiang Huang, Min Zhang, Jun Yu. [doi]
- Q-VLM: Post-training Quantization for Large Vision-Language ModelsChangYuan Wang, Ziwei Wang 0001, Xiuwei Xu, Yansong Tang, Jie Zhou 0001, Jiwen Lu. [doi]
- SleeperNets: Universal Backdoor Poisoning Attacks Against Reinforcement Learning AgentsEthan Rathbun, Christopher Amato, Alina Oprea. [doi]
- Hierarchical Federated Learning with Multi-Timescale Gradient CorrectionWenzhi Fang, Dong-Jun Han, Evan Chen, Shiqiang Wang 0001, Christopher G. Brinton. [doi]
- Quantifying and Optimizing Global Faithfulness in Persona-driven Role-playingLetian Peng, Jingbo Shang. [doi]
- SRFUND: A Multi-Granularity Hierarchical Structure Reconstruction Benchmark in Form UnderstandingJiefeng Ma, Yan Wang, Chenyu Liu, Jun Du, Yu Hu, Zhenrong Zhang, Pengfei Hu, Qing Wang, Jianshu Zhang. [doi]
- Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their DefensesXiaosen Zheng, Tianyu Pang, Chao Du, Qian Liu, Jing Jiang 0001, Min Lin. [doi]
- Learn To be Efficient: Build Structured Sparsity in Large Language ModelsHaizhong Zheng, Xiaoyan Bai, Xueshen Liu, Zhuoqing Morley Mao, Beidi Chen, Fan Lai, Atul Prakash 0001. [doi]
- KnowGPT: Knowledge Graph based Prompting for Large Language ModelsQinggang Zhang, Junnan Dong, Hao Chen 0062, Daochen Zha, Zailiang Yu, Xiao Huang 0001. [doi]
- Cascade Speculative Drafting for Even Faster LLM InferenceZiyi Chen 0003, Xiaocong Yang, Jiacheng Lin, Chenkai Sun, Kevin Chen-Chuan Chang, Jie Huang 0009. [doi]
- Stochastic Extragradient with Flip-Flop Shuffling & Anchoring: Provable ImprovementsJiseok Chae, Chulhee Yun, Donghwan Kim. [doi]
- Cross-Device Collaborative Test-Time AdaptationGuohao Chen, Shuaicheng Niu, Deyu Chen, Shuhai Zhang, Changsheng Li, Yuanqing Li 0001, Mingkui Tan. [doi]
- Mercury: A Code Efficiency Benchmark for Code Large Language ModelsMingzhe Du, Anh Tuan Luu, bin Ji, Qian Liu, See-Kiong Ng. [doi]
- Exploring Structured Semantic Priors Underlying Diffusion Score for Test-time AdaptationMingjia Li 0003, Shuang Li 0008, Tongrui Su, Longhui Yuan, Jian Liang 0002, Wei Li 0111. [doi]
- Mixture of Adversarial LoRAs: Boosting Robust Generalization in Meta-TuningXu Yang, Chen Liu, Ying Wei. [doi]
- Lambda: Learning Matchable Prior For Entity Alignment with Unlabeled Dangling CasesHang Yin, Liyao Xiang, Dong Ding, Yuheng He, Yihan Wu, Pengzhi Chu, Xinbing Wang, Chenghu Zhou. [doi]
- How Do Large Language Models Acquire Factual Knowledge During Pretraining?Hoyeon Chang, Jinho Park, Seonghyeon Ye, Sohee Yang, Youngkyung Seo, Du-Seong Chang, Minjoon Seo. [doi]
- OW-VISCapTor: Abstractors for Open-World Video Instance Segmentation and CaptioningAnwesa Choudhuri, Girish Chowdhary 0001, Alexander G. Schwing. [doi]
- Active Perception for Grasp Detection via Neural Graspness FieldHaoxiang Ma, Modi Shi, Boyang Gao, Di Huang 0001. [doi]
- One-Layer Transformer Provably Learns One-Nearest Neighbor In ContextZihao Li, Yuan Cao, Cheng Gao, Yihan He, Han Liu, Jason M. Klusowski, Jianqing Fan, Mengdi Wang. [doi]
- Era3D: High-Resolution Multiview Diffusion using Efficient Row-wise AttentionPeng Li, Yuan Liu 0025, Xiaoxiao Long, Feihu Zhang, Cheng Lin, Mengfei Li, Xingqun Qi, Shanghang Zhang, Wei Xue, Wenhan Luo, Ping Tan, Wenping Wang, Qifeng Liu, Yike Guo. [doi]
- Parametric model reduction of mean-field and stochastic systems via higher-order action matchingJules Berman, Tobias Blickhan, Benjamin Peherstorfer. [doi]
- LLMs as Zero-shot Graph Learners: Alignment of GNN Representations with LLM Token EmbeddingsDuo Wang, Yuan Zuo, Fengzhi Li, Junjie Wu 0002. [doi]
- On the Noise Robustness of In-Context Learning for Text GenerationHongfu Gao, Feipeng Zhang, Wenyu Jiang, Jun Shu, Feng Zheng, Hongxin Wei. [doi]
- SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language ModelsJianyi Zhang, Da-Cheng Juan, Cyrus Rashtchian, Chun-Sung Ferng, Heinrich Jiang, Yiran Chen 0001. [doi]
- Direct Consistency Optimization for Robust Customization of Text-to-Image Diffusion modelsKyungmin Lee, Sangkyung Kwak, Kihyuk Sohn, Jinwoo Shin. [doi]
- Adaptive Visual Scene Understanding: Incremental Scene Graph GenerationNaitik Khandelwal, Xiao Liu, Mengmi Zhang. [doi]
- Addressing Spatial-Temporal Heterogeneity: General Mixed Time Series Analysis via Latent Continuity Recovery and AlignmentJiawei Chen 0007, Chunhui Zhao 0001. [doi]
- DeTeCtive: Detecting AI-generated Text via Multi-Level Contrastive LearningXun Guo, Yongxin He, Shan Zhang, Ting Zhang, Wanquan Feng, Haibin Huang, Chongyang Ma. [doi]
- Beyond Aesthetics: Cultural Competence in Text-to-Image ModelsNithish Kannen, Arif Ahmad, Marco Andreetto, Vinodkumar Prabhakaran, Utsav Prabhu, Adji Bousso Dieng, Pushpak Bhattacharyya, Shachi Dave. [doi]
- DDN: Dual-domain Dynamic Normalization for Non-stationary Time Series ForecastingTao Dai 0001, Beiliang Wu, Peiyuan Liu, Naiqi Li, Xue Yuerong, Shu-Tao Xia, Zexuan Zhu. [doi]
- Strategic Littlestone Dimension: Improved Bounds on Online Strategic ClassificationSaba Ahmadi, Kunhe Yang, Hanrui Zhang 0001. [doi]
- TAIA: Large Language Models are Out-of-Distribution Data LearnersShuyang Jiang, Yusheng Liao, Ya Zhang, Yanfeng Wang, Yu Wang 0027. [doi]
- Lighting Every Darkness with 3DGS: Fast Training and Real-Time Rendering for HDR View SynthesisXin Jin 0005, Pengyi Jiao, Zheng-Peng Duan, Xingchao Yang, Chongyi Li, Chun-Le Guo, Bo Ren. [doi]
- ESPACE: Dimensionality Reduction of Activations for Model CompressionCharbel Sakr, Brucek Khailany. [doi]
- Generative Adversarial Model-Based Optimization via Source Critic RegularizationMichael S. Yao, Yimeng Zeng, Hamsa Bastani, Jacob R. Gardner, James C. Gee, Osbert Bastani. [doi]
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree SearchDan Zhang, Sining Zhoubian, Ziniu Hu, Yisong Yue, Yuxiao Dong, Jie Tang 0001. [doi]
- Dynamic 3D Gaussian Fields for Urban AreasTobias Fischer 0004, Jonas Kulhanek, Samuel Rota Bulò, Lorenzo Porzi, Marc Pollefeys, Peter Kontschieder. [doi]
- GoMatching: A Simple Baseline for Video Text Spotting via Long and Short Term MatchingHaibin He, Maoyuan Ye, Jing Zhang 0037, Juhua Liu, Bo Du 0001, Dacheng Tao. [doi]
- General bounds on the quality of Bayesian coresetsTrevor Campbell. [doi]
- Episodic Future Thinking Mechanism for Multi-agent Reinforcement LearningDongSu Lee, Minhae Kwon. [doi]
- Adaptive Q-Aid for Conditional Supervised Learning in Offline Reinforcement LearningJeonghye Kim, Suyoung Lee, Woojun Kim, Youngchul Sung. [doi]
- FairWire: Fair Graph GenerationOyku Deniz Kose, Yanning Shen. [doi]
- Off to new Shores: A Dataset & Benchmark for (near-)coastal Flood Inundation ForecastingBrandon Victor, Mathilde Letard, Peter Naylor, Karim Douch, Nicolas Longépé, Zhen He, Patrick Ebel 0002. [doi]
- Scaling laws for learning with real and surrogate dataAyush Jain, Andrea Montanari, Eren Sasoglu. [doi]
- Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model BiasShan Chen, Jack Gallifant, Mingye Gao, Pedro Moreira, Nikolaj Munch, Ajay Muthukkumar, Arvind Rajan, Jaya Kolluri, Amelia Fiske, Janna Hastings, Hugo J. W. L. Aerts, Brian Anthony, Leo Anthony Celi, William G. La Cava, Danielle S. Bitterman. [doi]
- The Representation Landscape of Few-Shot Learning and Fine-Tuning in Large Language ModelsDiego Doimo, Alessandro Serra, Alessio Ansuini, Alberto Cazzaniga. [doi]
- Apathetic or Empathetic? Evaluating LLMs' Emotional Alignments with HumansJen-Tse Huang, Man Ho Lam, Eric John Li, Shujie Ren, Wenxuan Wang 0001, Wenxiang Jiao, Zhaopeng Tu, Michael R. Lyu. [doi]
- Hamiltonian Monte Carlo on ReLU Neural Networks is InefficientVu C. Dinh, Lam S. Ho, Cuong V. Nguyen. [doi]
- Talking Heads: Understanding Inter-Layer Communication in Transformer Language ModelsJack Merullo, Carsten Eickhoff, Ellie Pavlick. [doi]
- Frieren: Efficient Video-to-Audio Generation Network with Rectified Flow MatchingYongqi Wang, Wenxiang Guo, Rongjie Huang, Jiawei Huang 0008, Zehan Wang 0001, Fuming You, Ruiqi Li, Zhou Zhao. [doi]
- Achieving Optimal Clustering in Gaussian Mixture Models with Anisotropic Covariance StructuresXin Chen, Anderson Ye Zhang. [doi]
- ShareGPT4Video: Improving Video Understanding and Generation with Better CaptionsLin Chen 0016, Xilin Wei, Jinsong Li, Xiaoyi Dong, Pan Zhang 0001, Yuhang Zang, Zehui Chen, Haodong Duan, Lin Bin, Zhenyu Tang 0004, Li Yuan 0007, Yu Qiao 0001, Dahua Lin, Feng Zhao 0004, Jiaqi Wang 0003. [doi]
- Diffusion Policies Creating a Trust Region for Offline Reinforcement LearningTianyu Chen, Zhendong Wang, Mingyuan Zhou. [doi]
- TinyLUT: Tiny Look-Up Table for Efficient Image Restoration at the EdgeHuanan Li, Juntao Guan, Lai Rui, Sijun Ma, Lin Gu 0003, Noperson. [doi]
- GuardT2I: Defending Text-to-Image Models from Adversarial PromptsYijun Yang, Ruiyuan Gao 0001, Xiao Yang, Jianyuan Zhong, Qiang Xu 0001. [doi]
- What makes unlearning hard and what to do about itKairan Zhao, Meghdad Kurmanji, George-Octavian Barbulescu, Eleni Triantafillou, Peter Triantafillou. [doi]
- Efficient Temporal Action Segmentation via Boundary-aware Query VotingPeiyao Wang, Yuewei Lin, Erik Blasch, Jie Wei, Haibin Ling. [doi]
- NeuroGauss4D-PCI: 4D Neural Fields and Gaussian Deformation Fields for Point Cloud InterpolationChaokang Jiang, Dalong Du, Jiuming Liu, Siting Zhu, Zhenqiang Liu, Zhuang Ma, Zhujin Liang, Jie Zhou 0001. [doi]
- Preventing Dimensional Collapse in Self-Supervised Learning via Orthogonality RegularizationJunlin He, Jinxiao Du, Wei Ma. [doi]
- Calibrating Reasoning in Language Models with Internal ConsistencyZhihui Xie 0002, Jizhou Guo, Tong Yu 0001, Shuai Li 0010. [doi]
- Off-Dynamics Reinforcement Learning via Domain Adaptation and Reward Augmented ImitationYihong Guo, Yixuan Wang, Yuanyuan Shi, Pan Xu, Anqi Liu. [doi]
- Towards Universal Mesh Movement NetworksMingrui Zhang, Chunyang Wang, Stephan C. Kramer, Joseph G. Wallwork, Siyi Li, Jiancheng Liu, Xiang Chen, Matthew D. Piggott. [doi]
- DARNet: Dual Attention Refinement Network with Spatiotemporal Construction for Auditory Attention DetectionSheng Yan, Cunhang Fan, Hongyu Zhang, Xiaoke Yang, Jianhua Tao 0001, Zhao Lv. [doi]
- Neural decoding from stereotactic EEG: accounting for electrode variability across subjectsGeorgios Mentzelopoulos, Evangelos Chatzipantazis, Ashwin G. Ramayya, Michelle J. Hedlund, Vivek P. Buch, Kostas Daniilidis, Konrad P. Kording, Flavia Vitale. [doi]
- An Adaptive Approach for Infinitely Many-armed Bandits under Generalized Rotting ConstraintsJung Hun Kim, Milan Vojnovic, Se-Young Yun. [doi]
- Benchmark Data Repositories for Better BenchmarkingRachel Longjohn, Markelle Kelly, Sameer Singh 0001, Padhraic Smyth. [doi]
- Penalty-based Methods for Simple Bilevel Optimization under Hölderian Error BoundsPengyu Chen, Xu Shi, Rujun Jiang, Jiulin Wang. [doi]
- LT-Defense: Searching-free Backdoor Defense via Exploiting the Long-tailed EffectYixiao Xu, Binxing Fang, Mohan Li, Keke Tang, Zhihong Tian. [doi]
- Mitigating Backdoor Attack by Injecting Proactive Defensive BackdoorShaokui Wei, Hongyuan Zha, Baoyuan Wu. [doi]
- CryoSPIN: Improving Ab-Initio Cryo-EM Reconstruction with Semi-Amortized Pose InferenceShayan Shekarforoush, David B. Lindell, Marcus A. Brubaker, David J. Fleet. [doi]
- Motif-oriented influence maximization for viral marketing in large-scale social networksMingyang Zhou 0001, Weiji Cao, Hao Liao, Rui Mao 0001. [doi]
- On the Limitations of Fractal Dimension as a Measure of GeneralizationCharlie Tan, Inés García-Redondo, Qiquan Wang, Michael M. Bronstein, Anthea Monod. [doi]
- Incremental Learning of Retrievable Skills For Efficient Continual Task AdaptationDaehee Lee, Minjong Yoo, Woo Kyung Kim, Wonje Choi 0003, Honguk Woo. [doi]
- Meta-Learning Universal Priors Using Non-Injective Change of VariablesYilang Zhang, Alireza Sadeghi, Georgios B. Giannakis. [doi]
- Diffusing Differentiable RepresentationsYash Savani, Marc Finzi, J. Zico Kolter. [doi]
- Towards Multi-dimensional Explanation Alignment for Medical ClassificationLijie Hu, Songning Lai, Wenshuo Chen, Hongru Xiao, Hongbin Lin, Lu Yu, Jingfeng Zhang, Di Wang 0015. [doi]
- SuperVLAD: Compact and Robust Image Descriptors for Visual Place RecognitionFeng Lu, Xinyao Zhang, Canming Ye, Shuting Dong, Lijun Zhang, Xiangyuan Lan, Chun Yuan. [doi]
- Are We on the Right Way for Evaluating Large Vision-Language Models?Lin Chen, Jinsong Li, Xiaoyi Dong, Pan Zhang 0001, Yuhang Zang, Zehui Chen, Haodong Duan, Jiaqi Wang 0003, Yu Qiao 0001, Dahua Lin, Feng Zhao 0004. [doi]
- ImOV3D: Learning Open Vocabulary Point Clouds 3D Object Detection from Only 2D ImagesTiming Yang, Yuanliang Ju, Li Yi. [doi]
- Near-Optimal Streaming Heavy-Tailed Statistical Estimation with Clipped SGDAniket Das, Dheeraj Nagaraj, Soumyabrata Pal, Arun Sai Suggala, Prateek Varshney. [doi]
- FasterDiT: Towards Faster Diffusion Transformers Training without Architecture ModificationJingfeng Yao, Cheng Wang, Wenyu Liu 0001, Xinggang Wang. [doi]
- Latent Diffusion for Neural Spiking DataJaivardhan Kapoor, Auguste Schulz, Julius Vetter, Felix Pei, Richard Gao, Jakob H. Macke. [doi]
- Localize, Understand, Collaborate: Semantic-Aware Dragging via Intention ReasonerXing Cui, Peipei Li, Zekun Li 0008, Xuannan Liu, Yueying Zou, Zhaofeng He. [doi]
- Learning Place Cell Representations and Context-Dependent RemappingMarkus Pettersen, Frederik Rogge, Mikkel E. Lepperød. [doi]
- Unveiling the Potential of Robustness in Selecting Conditional Average Treatment Effect EstimatorsYiyan Huang, Cheuk Hang Leung, Siyi Wang, Yijun Li 0005, Qi Wu 0009. [doi]
- Random Function DescentFelix Benning, Leif Döring. [doi]
- 4-bit Shampoo for Memory-Efficient Network TrainingSike Wang, Pan Zhou, Jia Li, Hua Huang. [doi]
- Improved off-policy training of diffusion samplersMarcin Sendera, Minsu Kim, Sarthak Mittal, Pablo Lemos, Luca Scimeca, Jarrid Rector-Brooks, Alexandre Adam, Yoshua Bengio, Nikolay Malkin. [doi]
- Practical Shuffle CodingJulius Kunze, Daniel Severo 0001, Jan-Willem van de Meent, James Townsend. [doi]
- UNION: Unsupervised 3D Object Detection using Object Appearance-based Pseudo-ClassesTed de Vries Lentsch, Holger Caesar, Dariu Gavrila. [doi]
- In-Context Learning with Transformers: Softmax Attention Adapts to Function LipschitznessLiam Collins, Advait Parulekar, Aryan Mokhtari, Sujay Sanghavi, Sanjay Shakkottai. [doi]
- DEL: Discrete Element Learner for Learning 3D Particle Dynamics with Neural RenderingJiaxu Wang, Jingkai Sun, Ziyi Zhang, Junhao He, Qiang Zhang 0019, Mingyuan Sun, Renjing Xu. [doi]
- D-LLM: A Token Adaptive Computing Resource Allocation Strategy for Large Language ModelsYikun Jiang, Huanyu Wang, Lei Xie, Hanbin Zhao, Zhang Chao, Hui Qian 0001, John C. S. Lui. [doi]
- NeuroBOLT: Resting-state EEG-to-fMRI Synthesis with Multi-dimensional Feature MappingYamin Li, Ange Lou, Ziyuan Xu, Shengchao Zhang, Shiyu Wang, Dario J. Englot, Soheil Kolouri, Daniel Moyer, Roza G. Bayrak, Catie Chang. [doi]
- SLowcalSGD : Slow Query Points Improve Local-SGD for Stochastic Convex OptimizationTehila Dahan, Kfir Y. Levy. [doi]
- Transferring disentangled representations: bridging the gap between synthetic and real imagesJacopo Dapueto, Nicoletta Noceti, Francesca Odone. [doi]
- Neural collapse vs. low-rank bias: Is deep neural collapse really optimal?Peter Súkeník, Christoph H. Lampert, Marco Mondelli. [doi]
- Computational Aspects of Bayesian Persuasion under Approximate Best ResponseKunhe Yang, Hanrui Zhang 0001. [doi]
- Theoretical and Empirical Insights into the Origins of Degree Bias in Graph Neural NetworksArjun Subramonian, Jian Kang 0008, Yizhou Sun. [doi]
- Causal vs. Anticausal merging of predictorsSergio Hernan Garrido Mejia, Patrick Blöbaum, Bernhard Schölkopf, Dominik Janzing. [doi]
- Diffusion4D: Fast Spatial-temporal Consistent 4D generation via Video Diffusion ModelsHanwen Liang, Yuyang Yin, Dejia Xu, Hanxue Liang, Zhangyang Wang, Konstantinos N. Plataniotis, Yao Zhao 0001, Yunchao Wei. [doi]
- Cross-video Identity Correlating for Person Re-identification Pre-trainingJialong Zuo, Ying Nie, Hanyu Zhou, Huaxin Zhang, Haoyu Wang 0003, Tianyu Guo 0001, Nong Sang, Changxin Gao. [doi]
- OPERA: Automatic Offline Policy Evaluation with Re-weighted Aggregates of Multiple EstimatorsAllen Nie, Yash Chandak, Christina J. Yuan, Anirudhan Badrinath, Yannis Flet-Berliac, Emma Brunskill. [doi]
- Mining and Transferring Feature-Geometry Coherence for Unsupervised Point Cloud RegistrationKezheng Xiong, Haoen Xiang, Qingshan Xu, Chenglu Wen, Siqi Shen, Jonathan Jun LI, Cheng Wang 0003. [doi]
- Class Distribution Shifts in Zero-Shot Learning: Learning Robust RepresentationsYuli Slavutsky, Yuval Benjamini. [doi]
- MotionBooth: Motion-Aware Customized Text-to-Video GenerationJianzong Wu, Xiangtai Li, Yanhong Zeng, Jiangning Zhang, Qianyu Zhou 0001, Yining Li, Yunhai Tong, Kai Chen 0026. [doi]
- NN4SysBench: Characterizing Neural Network Verification for Computer SystemsShuyi Lin, Haoyu He, Tianhao Wei, Kaidi Xu, Huan Zhang, Gagandeep Singh 0001, Changliu Liu, Cheng Tan. [doi]
- SEL-BALD: Deep Bayesian Active Learning with Selective LabelsRuijiang Gao, Mingzhang Yin, Maytal Saar-Tsechansky. [doi]
- Generalize or Detect? Towards Robust Semantic Segmentation Under Multiple Distribution ShiftsZhitong Gao, Bingnan Li, Mathieu Salzmann, Xuming He 0001. [doi]
- Transition Constrained Bayesian Optimization via Markov Decision ProcessesJose Pablo Folch, Calvin Tsay, Robert M. Lee, Behrang Shafei, Weronika Ormaniec, Andreas Krause 0001, Mark van der Wilk, Ruth Misener, Mojmir Mutny. [doi]
- Trading Place for Space: Increasing Location Resolution Reduces Contextual Capacity in Hippocampal CodesSpencer Rooke, Zhaoze Wang, Ronald W. Di Tullio, Vijay Balasubramanian. [doi]
- MemoryFormer : Minimize Transformer Computation by Removing Fully-Connected LayersNing Ding, Yehui Tang, Haochen Qin, Zhenli Zhou, Chao Xu 0006, Lin Li, Kai Han 0002, Heng Liao, Yunhe Wang 0001. [doi]
- DiffAug: A Diffuse-and-Denoise Augmentation for Training Robust ClassifiersChandramouli Shama Sastry, Sri Harsha Dumpala, Sageev Oore. [doi]
- Zero-Shot Scene Reconstruction from Single Images with Deep Prior AssemblyJunsheng Zhou, Yu-Shen Liu, Zhizhong Han. [doi]
- RA-PbRL: Provably Efficient Risk-Aware Preference-Based Reinforcement LearningYujie Zhao, Jose Aguilar Escamilla, Weyl Lu, Huazheng Wang. [doi]
- Toward a Well-Calibrated Discrimination via Survival Outcome-Aware Contrastive LearningDongJoon Lee, Hyeryn Park, ChangHee Lee. [doi]
- CulturePark: Boosting Cross-cultural Understanding in Large Language ModelsCheng Li, Damien Teney, Linyi Yang, Qingsong Wen, Xing Xie 0001, Jindong Wang 0001. [doi]
- FedAvP: Augment Local Data via Shared Policy in Federated LearningMinui Hong, Junhyeog Yun, Insu Jeon, Gunhee Kim. [doi]
- TAPVid-3D: A Benchmark for Tracking Any Point in 3DSkanda Koppula, Ignacio Rocco, Yi Yang 0007, Joseph Heyward, João Carreira 0001, Andrew Zisserman, Gabriel Brostow, Carl Doersch. [doi]
- Just Add $100 More: Augmenting Pseudo-LiDAR Point Cloud for Resolving Class-imbalance ProblemMincheol Chang, Siyeong Lee, Jinkyu Kim, Namil Kim. [doi]
- Differentiable Structure Learning with Partial OrdersTaiyu Ban, Lyuzhou Chen, Xiangyu Wang, Xin Wang, Derui Lyu, Huanhuan Chen. [doi]
- Adversarial Environment Design via Regret-Guided Diffusion ModelsHojun Chung, Junseo Lee, Minsoo Kim, Dohyeong Kim, Songhwai Oh. [doi]
- Discrete Dictionary-based Decomposition Layer for Structured Representation LearningTaewon Park, Hyun-Chul Kim, Minho Lee 0001. [doi]
- G2D: From Global to Dense Radiography Representation Learning via Vision-Language Pre-trainingChe Liu, Cheng Ouyang, Sibo Cheng, Anand Shah, Wenjia Bai, Rossella Arcucci. [doi]
- Maximizing utility in multi-agent environments by anticipating the behavior of other learnersAngelos Assos, Yuval Dagan, Constantinos Daskalakis. [doi]
- Vision Transformer Neural Architecture Search for Out-of-Distribution Generalization: Benchmark and InsightsSy-Tuyen Ho, Tuan Van Vo, Somayeh Ebrahimkhani, Ngai-Man Cheung. [doi]
- Lever LM: Configuring In-Context Sequence to Lever Large Vision Language ModelsXu Yang, Yingzhe Peng, Haoxuan Ma, Shuo Xu, Chi Zhang, Yucheng Han, Hanwang Zhang. [doi]
- Exploring the trade-off between deep-learning and explainable models for brain-machine interfacesLuis Cubillos, Guy Revach, Matthew Mender, Joseph T. Costello, Hisham Temmar, Aren Hite, Diksha Anoop Kumar Zutshi, Dylan Wallace, Xiaoyong Ni, Madison Kelberman, Matt S. Willsey, Ruud van Sloun, Nir Shlezinger, Parag G. Patil, Anne Draelos, Cynthia A. Chestek. [doi]
- Distributional Successor Features Enable Zero-Shot Policy OptimizationChuning Zhu, Xinqi Wang, Tyler Han, Simon S. Du, Abhishek Gupta 0004. [doi]
- Disentangled Style Domain for Implicit z-Watermark Towards Copyright ProtectionJunqiang Huang, Zhaojun Guo, Ge Luo 0003, Zhenxing Qian, Sheng Li 0006, Xinpeng Zhang 0001. [doi]
- The Elephant in the Room: Towards A Reliable Time-Series Anomaly Detection BenchmarkQinghua Liu, John Paparrizos. [doi]
- ReLIZO: Sample Reusable Linear Interpolation-based Zeroth-order OptimizationXiaoxing Wang, Xiaohan Qin, Xiaokang Yang, Junchi Yan. [doi]
- PaDeLLM-NER: Parallel Decoding in Large Language Models for Named Entity RecognitionJinghui Lu, Yanjie Wang, Ziwei Yang, Xuejing Liu, Brian Mac Namee, Can Huang. [doi]
- Adaptive Labeling for Efficient Out-of-distribution Model EvaluationDaksh Mittal, Yuanzhe Ma, Shalmali Joshi, Hongseok Namkoong. [doi]
- 2 under the water. A global multi-temporal satellite dataset for rapid flood mappingNikolaos-Ioannis Bountos, Maria Sdraka, Angelos Zavras, Andreas Karavias, Ilektra Karasante, Themistocles Herekakis, Angeliki Thanasou, Dimitrios Michail 0001, Ioannis Papoutsis. [doi]
- Probing the Decision Boundaries of In-context Learning in Large Language ModelsSiyan Zhao, Tung Nguyen, Aditya Grover. [doi]
- Safe and Sparse Newton Method for Entropic-Regularized Optimal TransportZihao Tang, Yixuan Qiu. [doi]
- MiraData: A Large-Scale Video Dataset with Long Durations and Structured CaptionsXuan Ju, Yiming Gao 0007, Zhaoyang Zhang 0004, Ziyang Yuan, Xintao Wang, Ailing Zeng, Yu Xiong, Qiang Xu 0001, Ying Shan. [doi]
- Graph Neural Networks Need Cluster-Normalize-Activate ModulesArseny Skryagin, Felix Divo, Mohammad Amin Ali, Devendra Singh Dhami, Kristian Kersting. [doi]
- A Foundation Model for Zero-shot Logical Query ReasoningMichael Galkin, Jincheng Zhou, Bruno Ribeiro 0001, Jian Tang 0005, Zhaocheng Zhu. [doi]
- A probability contrastive learning framework for 3D molecular representation learningJiayu Qin 0001, Jian Chen 0043, Rohan Sharma, Jingchen Sun, Changyou Chen. [doi]
- SILENCE: Protecting privacy in offloaded speech understanding on resource-constrained devicesDongqi Cai, Shangguang Wang, Zeling Zhang, Felix Xiaozhu Lin, Mengwei Xu. [doi]
- Parameter Disparities Dissection for Backdoor Defense in Heterogeneous Federated LearningWenke Huang, Mang Ye, Zekun Shi, Guancheng Wan, He Li, Bo Du 0001. [doi]
- Probabilistic Weather Forecasting with Hierarchical Graph Neural NetworksJoel Oskarsson, Tomas Landelius, Marc Peter Deisenroth, Fredrik Lindsten. [doi]
- OmniTokenizer: A Joint Image-Video Tokenizer for Visual GenerationJunke Wang, Yi Jiang, Zehuan Yuan, Bingyue Peng, Zuxuan Wu, Yu-Gang Jiang. [doi]
- Full-Atom Peptide Design with Geometric Latent DiffusionXiangzhe Kong, Yinjun Jia, Wenbing Huang 0001, Yang Liu 0005. [doi]
- Proportional Fairness in Non-Centroid ClusteringIoannis Caragiannis, Evi Micha, Nisarg Shah 0001. [doi]
- DiffusionPDE: Generative PDE-Solving under Partial ObservationJiahe Huang, Guandao Yang, Zichen Wang, Jeong-Joon Park. [doi]
- Differentially Private Set RepresentationsSarvar Patel, Giuseppe Persiano, Joon Young Seo, Kevin Yeo. [doi]
- Embedding Trajectory for Out-of-Distribution Detection in Mathematical ReasoningYiming Wang, Pei Zhang 0011, Baosong Yang, Derek F. Wong, Zhuosheng Zhang 0001, Rui Wang 0015. [doi]
- DisenGCD: A Meta Multigraph-assisted Disentangled Graph Learning Framework for Cognitive DiagnosisShangshang Yang, Mingyang Chen, Ziwen Wang 0006, Xiaoshan Yu 0002, Panpan Zhang, Haiping Ma, Xingyi Zhang 0001. [doi]
- SPEAR: Exact Gradient Inversion of Batches in Federated LearningDimitar I. Dimitrov, Maximilian Baader, Mark Niklas Müller, Martin T. Vechev. [doi]
- On the Parameter Identifiability of Partially Observed Linear Causal ModelsXinshuai Dong, Ignavier Ng, Biwei Huang, Yuewen Sun, Songyao Jin, Roberto Legaspi, Peter Spirtes, Kun Zhang 0001. [doi]
- LexEval: A Comprehensive Chinese Legal Benchmark for Evaluating Large Language ModelsHaitao Li 0006, You Chen, Qingyao Ai, Yueyue Wu, Ruizhe Zhang 0005, Yiqun Liu 0001. [doi]
- Hamba: Single-view 3D Hand Reconstruction with Graph-guided Bi-Scanning MambaHaoye Dong, Aviral Chharia, Wenbo Gou, Francisco Vicente Carrasco 0001, Fernando De la Torre. [doi]
- Noether's Razor: Learning Conserved QuantitiesTycho F. A. van der Ouderaa, Mark van der Wilk, Pim de Haan. [doi]
- Exogenous Matching: Learning Good Proposals for Tractable Counterfactual EstimationYikang Chen, Dehui Du, Lili Tian. [doi]
- Mitigating Covariate Shift in Behavioral Cloning via Robust Stationary Distribution CorrectionSeokin Seo, Byung-Jun Lee 0001, Jongmin Lee 0004, HyeongJoo Hwang, Hongseok Yang, Kee-Eung Kim. [doi]
- SOFTS: Efficient Multivariate Time Series Forecasting with Series-Core FusionHan Lu, Xu-Yang Chen, Han-Jia Ye, De-Chuan Zhan. [doi]
- On Sampling Strategies for Spectral Model ShardingDenis Korzhenkov, Christos Louizos. [doi]
- Rough Transformers: Lightweight and Continuous Time Series Modelling through Signature PatchingFernando Moreno-Pino, Alvaro Arroyo, Harrison Waldon, Xiaowen Dong 0001, Álvaro Cartea. [doi]
- Optimal ablation for interpretabilityMaximilian Li, Lucas Janson. [doi]
- Rethinking Decoders for Transformer-based Semantic Segmentation: A Compression PerspectiveQishuai Wen, Chun-Guang Li. [doi]
- SPIQA: A Dataset for Multimodal Question Answering on Scientific PapersShraman Pramanick, Rama Chellappa, Subhashini Venugopalan. [doi]
- This Too Shall Pass: Removing Stale Observations in Dynamic Bayesian OptimizationAnthony Bardou, Patrick Thiran, Giovanni Ranieri. [doi]
- Coarse-to-Fine Concept Bottleneck ModelsKonstantinos P. Panousis, Dino Ienco, Diego Marcos. [doi]
- Is Value Learning Really the Main Bottleneck in Offline RL?Seohong Park, Kevin Frans, Sergey Levine, Aviral Kumar. [doi]
- Quantitative Convergences of Lie Group Momentum OptimizersLingkai Kong, Molei Tao. [doi]
- Articulate your NeRF: Unsupervised articulated object modeling via conditional view synthesisJianning Deng, Kartic Subr, Hakan Bilen. [doi]
- Federated Learning from Vision-Language Foundation Models: Theoretical Analysis and MethodBikang Pan, Wei Huang, Ye Shi 0001. [doi]
- Faster Diffusion: Rethinking the Role of the Encoder for Diffusion Model InferenceSenmao Li, Taihang Hu, Joost van de Weijer 0001, Fahad Shahbaz Khan, Tao Liu, Linxuan Li, Shiqi Yang, Yaxing Wang, Ming-Ming Cheng, Jian Yang 0003. [doi]
- Towards Exact Gradient-based Training on Analog In-memory ComputingZhaoxian Wu, Tayfun Gokmen, Malte J. Rasch, Tianyi Chen. [doi]
- Language-Driven Interactive Traffic Trajectory GenerationJunkai Xia, Chenxin Xu, Qingyao Xu, Yanfeng Wang 0001, Siheng Chen. [doi]
- Algorithmic Capabilities of Random TransformersZiqian Zhong, Jacob Andreas. [doi]
- End-to-end Learnable Clustering for Intent Learning in RecommendationYue Liu, Shihao Zhu, Jun Xia, Yingwei Ma, Jian Ma, Xinwang Liu, Shengju Yu, Kejun Zhang, Wenliang Zhong. [doi]
- Large Stepsize Gradient Descent for Non-Homogeneous Two-Layer Networks: Margin Improvement and Fast OptimizationYuhang Cai, Jingfeng Wu, Song Mei, Michael Lindsey, Peter L. Bartlett. [doi]
- Density-based User Representation using Gaussian Process Regression for Multi-interest Personalized RetrievalHaolun Wu, Ofer Meshi, Masrour Zoghi, Fernando Diaz 0001, Xue (Steve) Liu, Craig Boutilier, Maryam Karimzadehgan. [doi]
- Sequence-Augmented SE(3)-Flow Matching For Conditional Protein GenerationGuillaume Huguet, James Vuckovic, Kilian Fatras, Eric Thibodeau-Laufer, Pablo Lemos, Riashat Islam, Cheng-Hao Liu, Jarrid Rector-Brooks, Tara Akhound-Sadegh, Michael M. Bronstein, Alexander Tong 0001, Avishek Joey Bose. [doi]
- Transformers as Game Players: Provable In-context Game-playing Capabilities of Pre-trained ModelsChengshuai Shi, Kun Yang, Jing Yang, Cong Shen. [doi]
- A New Neural Kernel Regime: The Inductive Bias of Multi-Task LearningJulia B. Nakhleh, Joseph Shenouda, Robert D. Nowak. [doi]
- FFAM: Feature Factorization Activation Map for Explanation of 3D DetectorsShuai Liu, Boyang Li 0009, Zhiyu Fang, Mingyue Cui, Kai Huang 0001. [doi]
- An exactly solvable model for emergence and scaling laws in the multitask sparse parity problemYoonsoo Nam, Nayara Fonseca, Seok Hyeong Lee, Chris Mingard, Ard A. Louis. [doi]
- OneActor: Consistent Subject Generation via Cluster-Conditioned GuidanceJiahao Wang, Caixia Yan, Haonan Lin, Weizhan Zhang, Mengmeng Wang, Tieliang Gong, Guang Dai, Hao Sun 0015. [doi]
- Search for Efficient Large Language ModelsXuan Shen, Pu Zhao 0001, Yifan Gong 0004, Zhenglun Kong, Zheng Zhan 0001, Yushu Wu, Ming Lin, Chao Wu, Xue Lin, Yanzhi Wang. [doi]
- What matters when building vision-language models?Hugo Laurençon, Léo Tronchon, Matthieu Cord, Victor Sanh. [doi]
- LLM-Check: Investigating Detection of Hallucinations in Large Language ModelsGaurang Sriramanan, Siddhant Bharti, Vinu Sankar Sadasivan, Shoumik Saha, Priyatham Kattakinda, Soheil Feizi. [doi]
- Enhancing Multiple Dimensions of Trustworthiness in LLMs via Sparse Activation ControlYuxin Xiao, Chaoqun Wan, Yonggang Zhang 0003, Wenxiao Wang 0001, Binbin Lin, Xiaofei He 0001, Xu Shen 0001, Jieping Ye. [doi]
- MoTE: Reconciling Generalization with Specialization for Visual-Language to Video Knowledge TransferMinghao Zhu, Zhengpu Wang, Mengxian Hu, Ronghao Dang, Xiao Lin, Xun Zhou, Chengju Liu, Qijun Chen. [doi]
- When Is Inductive Inference Possible?Zhou Lu. [doi]
- Block Sparse Bayesian Learning: A Diversified SchemeYanhao Zhang, Zhihan Zhu, Yong Xia. [doi]
- Images that Sound: Composing Images and Sounds on a Single CanvasZiyang Chen, Daniel Geng, Andrew Owens. [doi]
- VLMimic: Vision Language Models are Visual Imitation Learner for Fine-grained ActionsGuangyan Chen, Meiling Wang 0002, Te Cui, Yao Mu 0001, Haoyang Lu, Tianxing Zhou, Zicai Peng, Mengxiao Hu, Haizhou Li 0004, Li Yuan, Yi Yang 0009, Yufeng Yue. [doi]
- Dual Risk Minimization: Towards Next-Level Robustness in Fine-tuning Zero-Shot ModelsKaican Li, Weiyan Xie, Yongxiang Huang, Didan Deng, Lanqing Hong, Zhenguo Li, Ricardo Silva 0001, Nevin L. Zhang. [doi]
- Almost Free: Self-concordance in Natural Exponential Families and an Application to BanditsShuai Liu, Alex Ayoub, Flore Sentenac, Xiaoqi Tan, Csaba Szepesvári. [doi]
- Unsupervised Homography Estimation on Multimodal Image Pair via Alternating OptimizationSanghyeob Song, Jaihyun Lew, Hyemi Jang, Sungroh Yoon. [doi]
- Inevitable Trade-off between Watermark Strength and Speculative Sampling Efficiency for Language ModelsZhengmian Hu, Heng Huang. [doi]
- UQ-Guided Hyperparameter Optimization for Iterative LearnersJiesong Liu, Feng Zhang 0007, Jiawei Guan, Xipeng Shen. [doi]
- PointMamba: A Simple State Space Model for Point Cloud AnalysisDingkang Liang, Xin Zhou 0013, Wei Xu 0017, Xingkui Zhu, Zhikang Zou, Xiaoqing Ye, Xiao Tan 0001, Xiang Bai. [doi]
- Sample and Computationally Efficient Robust Learning of Gaussian Single-Index ModelsPuqian Wang, Nikos Zarifis, Ilias Diakonikolas, Jelena Diakonikolas. [doi]
- Gated Inference Network: Inference and Learning State-Space ModelsHamidreza Hashempoorikderi, Wan Choi. [doi]
- Towards Croppable Implicit Neural RepresentationsMaor Ashkenazi, Eran Treister. [doi]
- On the Complexity of Teaching a Family of Linear Behavior Cloning LearnersShubham Kumar Bharti, Stephen Wright, Adish Singla, Xiaojin (Jerry) Zhu. [doi]
- Enhancing LLM's Cognition via StructurizationKai Liu, Zhihang Fu, Chao Chen, Wei Zhang, Rongxin Jiang 0001, Fan Zhou 0007, Yaowu Chen, Yue Wu, Jieping Ye. [doi]