Abstract is missing.
- Optimizing Watermarks for Large Language ModelsBram Wouters. [doi]
- Seizing Serendipity: Exploiting the Value of Past Success in Off-Policy Actor-CriticTianying Ji, Yu Luo, Fuchun Sun 0001, Xianyuan Zhan, Jianwei Zhang 0001, Huazhe Xu. [doi]
- Enhancing Implicit Shape Generators Using Topological RegularizationsLiyan Chen, Yan Zheng, Yang Li, Lohit Anirudh Jagarapu, Haoxiang Li, Hao Kang, Gang Hua 0001, Qixing Huang. [doi]
- ODIM: Outlier Detection via Likelihood of Under-Fitted Generative ModelsDongha Kim, Jaesung Hwang, Jongjin Lee, Kunwoong Kim, Yongdai Kim. [doi]
- SCoRe: Submodular Combinatorial Representation LearningAnay Majee, Suraj Kothawade, KrishnaTeja Killamsetty, Rishabh K. Iyer. [doi]
- An Embodied Generalist Agent in 3D WorldJiangyong Huang, Silong Yong, Xiaojian Ma, Xiongkun Linghu, Puhao Li, Yan Wang, Qing Li 0003, Song Chun Zhu, Baoxiong Jia, Siyuan Huang 0001. [doi]
- Navigating Scaling Laws: Compute Optimality in Adaptive Model TrainingSotiris Anagnostidis, Gregor Bachmann, Imanol Schlag, Thomas Hofmann. [doi]
- Smooth Tchebycheff Scalarization for Multi-Objective OptimizationXi Lin 0001, Xiaoyuan Zhang, Zhiyuan Yang 0003, Fei Liu 0044, Zhenkun Wang, Qingfu Zhang 0001. [doi]
- Position: Amazing Things Come From Having Many Good ModelsCynthia Rudin, Chudi Zhong, Lesia Semenova, Margo I. Seltzer, Ronald Parr, Jiachang Liu 0001, Srikar Katta, Jon Donnelly, Harry Chen, Zachery Boner. [doi]
- InfoNet: Neural Estimation of Mutual Information without Test-Time OptimizationZhengyang Hu, Song Kang, Qunsong Zeng, Kaibin Huang, Yanchao Yang. [doi]
- Distinguishing the Knowable from the Unknowable with Language ModelsGustaf Ahdritz, Tian Qin, Nikhil Vyas 0001, Boaz Barak, Benjamin L. Edelman. [doi]
- How to Escape Sharp Minima with Random PerturbationsKwangjun Ahn, Ali Jadbabaie, Suvrit Sra. [doi]
- SpikeLM: Towards General Spike-Driven Language Modeling via Elastic Bi-Spiking MechanismsXingrun Xing, Zheng Zhang, Ziyi Ni, Shitao Xiao, Yiming Ju, Siqi Fan 0001, Yequan Wang, Jiajun Zhang, Guoqi Li. [doi]
- Measures of diversity and space-filling designs for categorical dataCédric Malherbe, Emilio Domínguez-Sánchez, Merwan Barlier, Igor Colin, Haitham Bou-Ammar, Tom Diethe. [doi]
- Controlled Decoding from Language ModelsSidharth Mudgal, Jong Lee, Harish Ganapathy, Yaguang Li, Tao Wang, Yanping Huang, Zhifeng Chen, Heng Tze Cheng, Michael Collins, Trevor Strohman, Jilin Chen, Alex Beutel, Ahmad Beirami. [doi]
- Revisiting the Role of Language Priors in Vision-Language ModelsZhiqiu Lin, Xinyue Chen, Deepak Pathak, Pengchuan Zhang, Deva Ramanan. [doi]
- Lightweight Image Super-Resolution via Flexible Meta PruningYulun Zhang, Kai Zhang 0008, Luc Van Gool, Martin Danelljan, Fisher Yu 0001. [doi]
- Triadic-OCD: Asynchronous Online Change Detection with Provable Robustness, Optimality, and ConvergenceYancheng Huang, Kai Yang, Zelin Zhu, Leian Chen. [doi]
- Graph Neural Networks Use Graphs When They Shouldn'tMaya Bechler-Speicher, Ido Amos, Ran Gilad-Bachrach, Amir Globerson. [doi]
- CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language TransformersDachuan Shi, Chaofan Tao, Anyi Rao, Zhendong Yang, Chun Yuan, Jiaqi Wang 0003. [doi]
- Mechanistic Design and Scaling of Hybrid ArchitecturesMichael Poli, Armin W. Thomas, Eric Nguyen, Pragaash Ponnusamy, Björn Deiseroth, Kristian Kersting, Taiji Suzuki, Brian Hie, Stefano Ermon, Christopher Ré, Ce Zhang 0001, Stefano Massaroli. [doi]
- UP2ME: Univariate Pre-training to Multivariate Fine-tuning as a General-purpose Framework for Multivariate Time Series AnalysisYunhao Zhang, Minghao Liu, Shengyang Zhou, Junchi Yan. [doi]
- UniCorn: A Unified Contrastive Learning Approach for Multi-view Molecular Representation LearningShikun Feng, Yuyan Ni, Minghao Li, Yanwen Huang, Zhi-Ming Ma, Wei-Ying Ma, Yanyan Lan. [doi]
- Energy-Efficient Gaussian Processes Using Low-Precision ArithmeticNicolas Alder, Ralf Herbrich. [doi]
- MD tree: a model-diagnostic tree grown on loss landscapeYefan Zhou, Jianlong Chen, Qinxue Cao, Konstantin Schürholt, Yaoqing Yang. [doi]
- Double Momentum Method for Lower-Level Constrained Bilevel OptimizationWanli Shi, Yi Chang, Bin Gu 0001. [doi]
- Better Safe than Sorry: Pre-training CLIP against Targeted Data Poisoning and Backdoor AttacksWenhan Yang, Jingdong Gao, Baharan Mirzasoleiman. [doi]
- Getting the most out of your tokenizer for pre-training and domain adaptationGautier Dagan, Gabriel Synnaeve, Baptiste Rozière. [doi]
- ODIN: Disentangled Reward Mitigates Hacking in RLHFLichang Chen, Chen Zhu 0001, Jiuhai Chen, Davit Soselia, Tianyi Zhou 0001, Tom Goldstein, Heng Huang, Mohammad Shoeybi, Bryan Catanzaro. [doi]
- Matrix Information Theory for Self-Supervised LearningYifan Zhang, Zhiquan Tan, Jingqin Yang, Weiran Huang 0001, Yang Yuan. [doi]
- Multi-Agent Reinforcement Learning with Hierarchical Coordination for Emergency Responder StationingAmutheezan Sivagnanam, Ava Pettet, Hunter Lee, Ayan Mukhopadhyay, Abhishek Dubey, Aron Laszka. [doi]
- Differentially Private Domain Adaptation with Theoretical GuaranteesRaef Bassily, Corinna Cortes, Anqi Mao, Mehryar Mohri. [doi]
- Positional Knowledge is All You Need: Position-induced Transformer (PiT) for Operator LearningJunfeng Chen, Kailiang Wu. [doi]
- On the Emergence of Cross-Task Linearity in Pretraining-Finetuning ParadigmZhanpeng Zhou, Zijun Chen, Yilan Chen 0002, Bo Zhang 0069, Junchi Yan. [doi]
- Optimal Coresets for Low-Dimensional Geometric MedianPeyman Afshani, Chris Schwiegelshohn. [doi]
- Locally Estimated Global Perturbations are Better than Local Perturbations for Federated Sharpness-aware MinimizationZiqing Fan, Shengchao Hu, Jiangchao Yao, Gang Niu 0001, Ya Zhang 0002, Masashi Sugiyama, Yanfeng Wang. [doi]
- HAMLET: Graph Transformer Neural Operator for Partial Differential EquationsAndrey Bryutkin, Jiahao Huang, Zhongying Deng, Guang Yang 0006, Carola-Bibiane Schönlieb, Angelica I. Avilés-Rivero. [doi]
- Model-based Reinforcement Learning for Confounded POMDPsMao Hong, Zhengling Qi, Yanxun Xu. [doi]
- RLVF: Learning from Verbal Feedback without OvergeneralizationMoritz Stephan, Alexander Khazatsky, Eric Mitchell, Annie S. Chen, Sheryl Hsu, Archit Sharma, Chelsea Finn. [doi]
- Generative Marginalization ModelsSulin Liu, Peter J. Ramadge, Ryan P. Adams. [doi]
- Adaptive Conformal Inference by BettingAleksandr Podkopaev, Dong Xu, Kuang-Chih Lee. [doi]
- Forget Sharpness: Perturbed Forgetting of Model Biases Within SAM DynamicsAnkit Vani, Frederick Tung, Gabriel L. Oliveira, Hossein Sharifi Noghabi. [doi]
- Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language ModelsBilgehan Sel, Ahmad Al-Tawaha, Vanshaj Khattar, Ruoxi Jia 0001, Ming Jin 0002. [doi]
- ACM-MILP: Adaptive Constraint Modification via Grouping and Selection for Hardness-Preserving MILP Instance GenerationZiao Guo, Yang Li, Chang Liu, Wenli Ouyang, Junchi Yan. [doi]
- Learning Constraints from Offline Demonstrations via Superior Distribution Correction EstimationGuorui Quan, Zhiqiang Xu, Guiliang Liu. [doi]
- Shifted Interpolation for Differential PrivacyJinho Bok, Weijie J. Su, Jason M. Altschuler. [doi]
- Neuro-Symbolic Temporal Point ProcessesYang Yang, Chao Yang, Boyang Li, Yinghao Fu, Shuang Li 0002. [doi]
- Codebook Features: Sparse and Discrete Interpretability for Neural NetworksAlex Tamkin, Mohammad Taufeeque, Noah D. Goodman. [doi]
- ATraDiff: Accelerating Online Reinforcement Learning with Imaginary TrajectoriesQianlan Yang, Yu-Xiong Wang. [doi]
- Position: What Can Large Language Models Tell Us about Time Series AnalysisMing Jin 0005, Yifan Zhang, Wei Chen, Kexin Zhang, Yuxuan Liang, Bin Yang 0002, Jindong Wang, Shirui Pan, Qingsong Wen. [doi]
- Rethinking Momentum Knowledge Distillation in Online Continual LearningNicolas Michel, Maorong Wang, Ling Xiao 0001, Toshihiko Yamasaki. [doi]
- Bayesian Program Learning by Decompiling Amortized KnowledgeAlessandro B. Palmarini, Christopher G. Lucas, N. Siddharth 0001. [doi]
- Efficient and Effective Time-Series Forecasting with Spiking Neural NetworksChangze Lv, Yansen Wang, Dongqi Han, Xiaoqing Zheng, Xuanjing Huang 0001, Dongsheng Li 0002. [doi]
- Position: Intent-aligned AI Systems Must Optimize for Agency PreservationCatalin Mitelut, Benjamin J. Smith, Peter Vamplew 0001. [doi]
- Tandem Transformers for Inference Efficient LLMsAishwarya P. S., Pranav Ajit Nair, Yashas Samaga, Toby Boyd, Sanjiv Kumar, Prateek Jain 0002, Praneeth Netrapalli. [doi]
- Nash Incentive-compatible Online Mechanism Learning via Weakly Differentially Private Online LearningJoon Suk Huh, Kirthevasan Kandasamy. [doi]
- Remembering to Be Fair: Non-Markovian Fairness in Sequential Decision MakingParand A. Alamdari, Toryn Q. Klassen, Elliot Creager, Sheila A. McIlraith. [doi]
- Sample-Efficient Multiagent Reinforcement Learning with Reset ReplayYaodong Yang 0002, Guangyong Chen, Jianye Hao, Pheng-Ann Heng. [doi]
- DynSyn: Dynamical Synergistic Representation for Efficient Learning and Control in Overactuated Embodied SystemsKaibo He, Chenhui Zuo, Chengtian Ma, Yanan Sui. [doi]
- SciBench: Evaluating College-Level Scientific Problem-Solving Abilities of Large Language ModelsXiaoxuan Wang, Ziniu Hu, Pan Lu, Yanqiao Zhu 0001, Jieyu Zhang, Satyen Subramaniam, Arjun R. Loomba, Shichang Zhang, Yizhou Sun, Wei Wang 0010. [doi]
- Non-convex Stochastic Composite Optimization with Polyak MomentumYuan Gao, Anton Rodomanov, Sebastian U. Stich. [doi]
- LLark: A Multimodal Instruction-Following Language Model for MusicJoshua Patrick Gardner, Simon Durand, Daniel Stoller, Rachel M. Bittner. [doi]
- When Do Skills Help Reinforcement Learning? A Theoretical Analysis of Temporal AbstractionsZhening Li, Gabriel Poesia, Armando Solar-Lezama. [doi]
- Position: A Call for Embodied AIGiuseppe Paolo, Jonas Gonzalez-Billandon, Balázs Kégl. [doi]
- Class-Imbalanced Graph Learning without Class RebalancingZhining Liu 0002, Ruizhong Qiu, Zhichen Zeng, Hyunsik Yoo, David Zhou, Zhe Xu 0007, Yada Zhu, Kommy Weldemariam, Jingrui He, Hanghang Tong. [doi]
- Reducing Fine-Tuning Memory Overhead by Approximate and Memory-Sharing BackpropagationYuchen Yang, Yingdong Shi, Cheems Wang, Xiantong Zhen, Yuxuan Shi, Jun Xu 0019. [doi]
- Membership Inference Attacks on Diffusion Models via Quantile RegressionShuai Tang, Steven Wu 0001, Sergül Aydöre, Michael Kearns, Aaron Roth 0001. [doi]
- QUEST: Query-Aware Sparsity for Efficient Long-Context LLM InferenceJiaming Tang, Yilong Zhao, Kan Zhu, Guangxuan Xiao, Baris Kasikci, Song Han. [doi]
- Neural Networks Learn Statistics of Increasing ComplexityNora Belrose, Quintin Pope, Lucia Quirke, Alex Mallen, Xiaoli Z. Fern. [doi]
- Characterizing Overfitting in Kernel Ridgeless Regression Through the EigenspectrumTin Sum Cheng, Aurélien Lucchi, Anastasis Kratsios, David Belius. [doi]
- How Do Nonlinear Transformers Learn and Generalize in In-Context Learning?Hongkang Li, Meng Wang 0003, Songtao Lu, Xiaodong Cui, Pin-Yu Chen. [doi]
- Language Models with Conformal Factuality GuaranteesChristopher Mohri, Tatsunori Hashimoto. [doi]
- Recovering the Pre-Fine-Tuning Weights of Generative ModelsEliahu Horwitz, Jonathan Kahana, Yedid Hoshen. [doi]
- Progressive Inference: Explaining Decoder-Only Sequence Classification Models Using Intermediate PredictionsSanjay Kariyappa, Freddy Lécué, Saumitra Mishra, Christopher Pond, Daniele Magazzeni, Manuela Veloso. [doi]
- Enhancing Value Function Estimation through First-Order State-Action Dynamics in Offline Reinforcement LearningYun-Hsuan Lien, Ping-Chun Hsieh, Tzu-Mao Li, Yu-Shuen Wang. [doi]
- Extreme Compression of Large Language Models via Additive QuantizationVage Egiazarian, Andrei Panferov, Denis Kuznedelev, Elias Frantar, Artem Babenko, Dan Alistarh. [doi]
- Efficient Value Iteration for s-rectangular Robust Markov Decision ProcessesNavdeep Kumar, Kaixin Wang, Kfir Yehuda Levy, Shie Mannor. [doi]
- MLLM-as-a-Judge: Assessing Multimodal LLM-as-a-Judge with Vision-Language BenchmarkDongping Chen, Ruoxi Chen, Shilin Zhang, Yaochen Wang, Yinuo Liu, Huichi Zhou, Qihui Zhang, Yao Wan 0001, Pan Zhou 0001, Lichao Sun 0001. [doi]
- Towards Neural Architecture Search through Hierarchical Generative ModelingLichuan Xiang, Lukasz Dudziak, Mohamed S. Abdelfattah, Abhinav Mehrotra, Nicholas Donald Lane, Hongkai Wen 0001. [doi]
- Neural Collapse for Cross-entropy Class-Imbalanced Learning with Unconstrained ReLU Features ModelHien Dang 0003, Tho Tran Huu, Tan Minh Nguyen, Nhat Ho. [doi]
- Probabilistic Subgoal Representations for Hierarchical Reinforcement LearningVivienne Huiling Wang, Tinghuai Wang, Wenyan Yang, Joni-Kristian Kämäräinen, Joni Pajarinen. [doi]
- MagicLens: Self-Supervised Image Retrieval with Open-Ended InstructionsKai Zhang 0033, Yi Luan, Hexiang Hu, Kenton Lee, Siyuan Qiao, Wenhu Chen, Yu Su 0001, Ming-Wei Chang. [doi]
- Physics-Informed Neural Network Policy Iteration: Algorithms, Convergence, and VerificationYiming Meng, Ruikun Zhou, Amartya Mukherjee, Maxwell Fitzsimmons, Christopher Song, Jun Liu 0015. [doi]
- Creative Text-to-Audio Generation via Synthesizer ProgrammingManuel Cherep, Nikhil Singh 0003, Jessica Shand. [doi]
- Compressible Dynamics in Deep Overparameterized Low-Rank Learning & AdaptationCan Yaras, Peng Wang 0098, Laura Balzano, Qing Qu 0001. [doi]
- Learning Exceptional Subgroups by End-to-End Maximizing KL-DivergenceSascha Xu, Nils Philipp Walter, Janis Kalofolias, Jilles Vreeken. [doi]
- Fine-grained Local Sensitivity Analysis of Standard Dot-Product Self-AttentionAaron J. Havens, Alexandre Araujo, Huan Zhang, Bin Hu 0002. [doi]
- Mean Field Langevin Actor-Critic: Faster Convergence and Global Optimality beyond Lazy LearningKakei Yamamoto, Kazusato Oko, Zhuoran Yang, Taiji Suzuki. [doi]
- Leveraging VLM-Based Pipelines to Annotate 3D ObjectsRishabh Kabra, Loic Matthey, Alexander Lerchner, Niloy J. Mitra. [doi]
- Memorization Through the Lens of Curvature of Loss Function Around SamplesIsha Garg, Deepak Ravikumar, Kaushik Roy 0001. [doi]
- QuRating: Selecting High-Quality Data for Training Language ModelsAlexander Wettig, Aatmik Gupta, Saumya Malik, Danqi Chen 0001. [doi]
- Revisiting the Power of Prompt for Visual TuningYuzhu Wang, Lechao Cheng, Chaowei Fang, Dingwen Zhang, Manni Duan, Meng Wang. [doi]
- Position: Social Environment Design Should be Further Developed for AI-based Policy-MakingEdwin Zhang, Sadie Zhao, Tonghan Wang 0003, Safwan Hossain, Henry Gasztowtt, Stephan Zheng, David C. Parkes, Milind Tambe, Yiling Chen 0001. [doi]
- Characteristic Guidance: Non-linear Correction for Diffusion Model at Large Guidance ScaleCandi Zheng, Yuan Lan. [doi]
- Robust Yet Efficient Conformal Prediction SetsSoroush H. Zargarbashi, Mohammad Sadegh Akhondzadeh, Aleksandar Bojchevski. [doi]
- Self-Rewarding Language ModelsWeizhe Yuan, Richard Yuanzhe Pang, KyungHyun Cho, Xian Li, Sainbayar Sukhbaatar, Jing Xu, Jason Weston. [doi]
- Beyond Point Prediction: Score Matching-based Pseudolikelihood Estimation of Neural Marked Spatio-Temporal Point ProcessZichong Li, Qunzhi Xu, Zhenghao Xu, Yajun Mei, Tuo Zhao, Hongyuan Zha. [doi]
- Switching the Loss Reduces the Cost in Batch Reinforcement LearningAlex Ayoub, Kaiwen Wang, Vincent Liu, Samuel Robertson, James McInerney, Dawen Liang, Nathan Kallus, Csaba Szepesvári. [doi]
- How Smooth Is Attention?Valérie Castin, Pierre Ablin, Gabriel Peyré. [doi]
- No Free Prune: Information-Theoretic Barriers to Pruning at InitializationTanishq Kumar, Kevin Luo, Mark Sellke. [doi]
- Hierarchical State Space Models for Continuous Sequence-to-Sequence ModelingRaunaq M. Bhirangi, Chenyu Wang, Venkatesh Pattabiraman, Carmel Majidi, Abhinav Gupta 0001, Tess Lee Hellebrekers, Lerrel Pinto. [doi]
- Bridging Model Heterogeneity in Federated Learning via Uncertainty-based Asymmetrical Reciprocity LearningJiaqi Wang 0002, Chenxu Zhao, Lingjuan Lyu, Quanzeng You, Mengdi Huai, Fenglong Ma. [doi]
- Neural Diffusion ModelsGrigory Bartosh, Dmitry P. Vetrov, Christian A. Naesseth. [doi]
- Accelerating Look-ahead in Bayesian Optimization: Multilevel Monte Carlo is All you NeedShangda Yang, Vitaly Zankin, Maximilian Balandat, Stefan Scherer, Kevin T. Carlberg, Neil Walton, Kody J. H. Law. [doi]
- Causal Inference from Competing TreatmentsAna-Andreea Stoica, Vivian Y. Nastl, Moritz Hardt. [doi]
- Predictive Dynamic FusionBing Cao, Yinan Xia, Yi Ding, Changqing Zhang, Qinghua Hu. [doi]
- Feasibility Consistent Representation Learning for Safe Reinforcement LearningZhepeng Cen, Yihang Yao, Zuxin Liu, Ding Zhao. [doi]
- Sliding Down the Stairs: How Correlated Latent Variables Accelerate Learning with Neural NetworksLorenzo Bardone, Sebastian Goldt. [doi]
- Prompt Sketching for Large Language ModelsLuca Beurer-Kellner, Mark Niklas Müller, Marc Fischer 0002, Martin T. Vechev. [doi]
- Manifold Integrated Gradients: Riemannian Geometry for Feature AttributionEslam Zaher, Maciej Trzaskowski, Quan Nguyen, Fred Roosta. [doi]
- Mollification Effects of Policy Gradient MethodsTao Wang, Sylvia L. Herbert, Sicun Gao. [doi]
- Differentiability and Optimization of Multiparameter Persistent HomologyLuis Scoccola, Siddharth Setlur, David Loiseaux, Mathieu Carrière, Steve Oudot. [doi]
- Disentangled 3D Scene Generation with Layout LearningDave Epstein, Ben Poole, Ben Mildenhall, Alexei A. Efros, Aleksander Holynski. [doi]
- Adaptive Stabilization Based on Machine Learning for Column GenerationYunzhuang Shen, Yuan Sun 0003, Xiaodong Li 0001, Zhiguang Cao, Andrew C. Eberhard, Guangquan Zhang 0001. [doi]
- Applying language models to algebraic topology: generating simplicial cycles using multi-labeling in Wu's formulaKirill Brilliantov, Fedor Pavutnitskiy, Dmitry Pasechnyuk, German Magai. [doi]
- Context-Guided Diffusion for Out-of-Distribution Molecular and Protein DesignLeo Klarner, Tim G. J. Rudner, Garrett M. Morris, Charlotte M. Deane, Yee Whye Teh. [doi]
- Towards Interpretable Deep Local Learning with Successive Gradient ReconciliationYibo Yang, Xiaojie Li, Motasem Alfarra, Hasan Abed Al Kader Hammoud, Adel Bibi, Philip Torr 0001, Bernard Ghanem. [doi]
- Position: The Reasonable Person Standard for AISunayana Rane. [doi]
- Symbolic Music Generation with Non-Differentiable Rule Guided DiffusionYujia Huang, Adishree Ghatare, Yuanzhe Liu, Ziniu Hu, Qinsheng Zhang, Chandramouli Shama Sastry, Siddharth Gururani, Sageev Oore, Yisong Yue. [doi]
- NDOT: Neuronal Dynamics-based Online Training for Spiking Neural NetworksHaiyan Jiang, Giulia De Masi, Huan Xiong, Bin Gu 0001. [doi]
- Online Isolation ForestFilippo Leveni, Guilherme Weigert Cassales, Bernhard Pfahringer, Albert Bifet, Giacomo Boracchi. [doi]
- Can Looped Transformers Learn to Implement Multi-step Gradient Descent for In-context Learning?Khashayar Gatmiry, Nikunj Saunshi, Sashank J. Reddi, Stefanie Jegelka, Sanjiv Kumar. [doi]
- Nonlinear Filtering with Brenier Optimal Transport MapsMohammad Al-Jarrah, Niyizhen Jin, Bamdad Hosseini, Amirhossein Taghvaei. [doi]
- Stop Regressing: Training Value Functions via Classification for Scalable Deep RLJesse Farebrother, Jordi Orbay, Quan Vuong, Adrien Ali Taïga, Yevgen Chebotar, Ted Xiao, Alex Irpan, Sergey Levine, Pablo Samuel Castro, Aleksandra Faust, Aviral Kumar, Rishabh Agarwal. [doi]
- Position: Building Guardrails for Large Language Models Requires Systematic DesignYi Dong 0002, Ronghui Mu, Gaojie Jin, Yi Qi, Jinwei Hu, Xingyu Zhao 0001, Jie Meng, Wenjie Ruan, Xiaowei Huang 0001. [doi]
- Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and FeedbackSongyang Gao, Qiming Ge, Wei Shen, Shihan Dou, Junjie Ye, Xiao Wang 0001, Rui Zheng, Yicheng Zou, Zhi Chen, Hang Yan 0001, Qi Zhang 0001, Dahua Lin. [doi]
- Revealing Vision-Language Integration in the Brain with Multimodal NetworksVighnesh Subramaniam, Colin Conwell, Christopher Wang, Gabriel Kreiman, Boris Katz, Ignacio Cases, Andrei Barbu. [doi]
- Evolution of Heuristics: Towards Efficient Automatic Algorithm Design Using Large Language ModelFei Liu 0044, Xialiang Tong, Mingxuan Yuan, Xi Lin 0001, Fu Luo, Zhenkun Wang, Zhichao Lu, Qingfu Zhang 0001. [doi]
- Causal Effect Identification in LiNGAM Models with Latent ConfoundersDaniele Tramontano, Yaroslav Kivva, Saber Salehkaleybar, Mathias Drton, Negar Kiyavash. [doi]
- All-in-one simulation-based inferenceManuel Glöckler, Michael Deistler, Christian Dietrich Weilbach, Frank Wood, Jakob H. Macke. [doi]
- Online Cascade Learning for Efficient Inference over StreamsLunyiu Nie, Zhimin Ding, Erdong Hu, Christopher M. Jermaine, Swarat Chaudhuri. [doi]
- Truly No-Regret Learning in Constrained MDPsAdrian Müller, Pragnya Alatur, Volkan Cevher, Giorgia Ramponi, Niao He. [doi]
- On Which Nodes Does GCN Fail? Enhancing GCN From the Node PerspectiveJincheng Huang, Jialie Shen 0001, Xiaoshuang Shi, Xiaofeng Zhu 0001. [doi]
- Promptbreeder: Self-Referential Self-Improvement via Prompt EvolutionChrisantha Fernando, Dylan Banarse, Henryk Michalewski, Simon Osindero, Tim Rocktäschel. [doi]
- Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding HeadsTianle Cai, Yuhong Li, Zhengyang Geng, Hongwu Peng, Jason D. Lee, Deming Chen, Tri Dao. [doi]
- Probabilistic Conceptual Explainers: Trustworthy Conceptual Explanations for Vision Foundation ModelsHengyi Wang, Shiwei Tan, Hao Wang. [doi]
- Efficient Exploration for LLMsVikranth Dwaracherla, Seyed Mohammad Asghari, Botao Hao, Benjamin Van Roy. [doi]
- Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank ModificationsBoyi Wei, Kaixuan Huang, Yangsibo Huang, Tinghao Xie, Xiangyu Qi, Mengzhou Xia, Prateek Mittal, Mengdi Wang, Peter Henderson 0002. [doi]
- Exact Soft Analytical Side-Channel Attacks using Tractable CircuitsThomas Wedenig, Rishub Nagpal, Gaëtan Cassiers, Stefan Mangard, Robert Peharz. [doi]
- LASER: Linear Compression in Wireless Distributed OptimizationAshok Vardhan Makkuva, Marco Bondaschi, Thijs Vogels, Martin Jaggi, Hyeji Kim, Michael Gastpar. [doi]
- Explaining Probabilistic Models with Distributional ValuesLuca Franceschi 0001, Michele Donini, Cédric Archambeau, Matthias W. Seeger. [doi]
- Online Algorithms with Uncertainty-Quantified PredictionsBo Sun 0004, Jerry Huang, Nicolas Christianson, Mohammad Hajiesmaili, Adam Wierman, Raouf Boutaba. [doi]
- Think Before You Act: Decision Transformers with Working MemoryJikun Kang, Romain Laroche, Xingdi Yuan, Adam Trischler, Xue Liu 0001, Jie Fu. [doi]
- AlphaZero-Like Tree-Search can Guide Large Language Model Decoding and TrainingZiyu Wan, Xidong Feng, Muning Wen, Stephen Marcus McAleer, Ying Wen 0001, Weinan Zhang 0001, Jun Wang 0012. [doi]
- Position: AI-Powered Autonomous Weapons Risk Geopolitical Instability and Threaten AI ResearchRiley Simmons-Edler, Ryan Paul Badman, Shayne Longpre, Kanaka Rajan. [doi]
- Locality-Sensitive Hashing-Based Efficient Point Transformer with Applications in High-Energy PhysicsSiqi Miao 0001, Zhiyuan Lu, Mia Liu, Javier M. Duarte, Pan Li 0005. [doi]
- An Infinite-Width Analysis on the Jacobian-Regularised Training of a Neural NetworkTaeyoung Kim, Hongseok Yang. [doi]
- GliDe with a CaPE: A Low-Hassle Method to Accelerate Speculative DecodingCunxiao Du, Jing Jiang 0001, Yuanchen Xu, Jiawei Wu, Sicheng Yu, Yongqi Li 0001, Shenggui Li, Kai Xu, Liqiang Nie, Zhaopeng Tu, Yang You. [doi]
- How Transformers Learn Causal Structure with Gradient DescentEshaan Nichani, Alex Damian, Jason D. Lee. [doi]
- Minimax Optimality of Score-based Diffusion Models: Beyond the Density Lower Bound AssumptionsKaihong Zhang, Heqi Yin, Feng Liang, Jingbo Liu. [doi]
- Hybrid2 Neural ODE Causal Modeling and an Application to Glycemic ResponseBob Junyi Zou, Matthew E. Levine, Dessi P. Zaharieva, Ramesh Johari, Emily B. Fox. [doi]
- Quality-Diversity with Limited ResourcesRen-Jian Wang, Ke Xue 0001, Cong Guan, Chao Qian 0001. [doi]
- Accelerating Federated Learning with Quick Distributed Mean EstimationRan Ben-Basat, Shay Vargaftik, Amit Portnoy, Gil Einziger, Yaniv Ben-Itzhak, Michael Mitzenmacher. [doi]
- Purifying Quantization-conditioned Backdoors via Layer-wise Activation Correction with Distribution ApproximationBoheng Li, Yishuo Cai, Jisong Cai, Yiming Li 0004, Han Qiu 0001, Run Wang, Tianwei Zhang 0004. [doi]
- Conformal Validity Guarantees Exist for Any Data Distribution (and How to Find Them)Drew Prinster, Samuel Don Stanton, Anqi Liu, Suchi Saria. [doi]
- Compositional Few-Shot Class-Incremental LearningYixiong Zou, Shanghang Zhang, Haichen Zhou, Yuhua Li 0003, Ruixuan Li 0001. [doi]
- Differentiable Model Scaling using Differentiable TopkKai Liu, Ruohui Wang, Jianfei Gao 0003, Kai Chen. [doi]
- EquiAV: Leveraging Equivariance for Audio-Visual Contrastive LearningJongsuk Kim, Hyeongkeun Lee, Kyeongha Rho, Junmo Kim, Joon Son Chung. [doi]
- When and How Does In-Distribution Label Help Out-of-Distribution Detection?Xuefeng Du, Yiyou Sun, Yixuan Li 0001. [doi]
- A Theoretical Analysis of Backdoor Poisoning Attacks in Convolutional Neural NetworksBoqi Li, Weiwei Liu. [doi]
- Non-confusing Generation of Customized Concepts in Diffusion ModelsWang Lin, Jingyuan Chen, Jiaxin Shi, Yichen Zhu, Chen Liang, Junzhong Miao, Tao Jin 0004, Zhou Zhao, Fei Wu 0001, Shuicheng Yan, Hanwang Zhang. [doi]
- Sparsest Models Elude Pruning: An Exposé of Pruning's Current CapabilitiesStephen Zhang, Vardan Papyan. [doi]
- CompeteAI: Understanding the Competition Dynamics of Large Language Model-based AgentsQinlin Zhao, Jindong Wang 0001, Yixuan Zhang, Yiqiao Jin, Kaijie Zhu, Hao Chen 0102, Xing Xie 0001. [doi]
- Optimal Eye Surgeon: Finding image priors through sparse generators at initializationAvrajit Ghosh, Xitong Zhang, Kenneth K. Sun, Qing Qu 0001, Saiprasad Ravishankar, Rongrong Wang. [doi]
- Learning Reward for Robot Skills Using Large Language Models via Self-AlignmentYuwei Zeng, Yao Mu, Lin Shao 0002. [doi]
- DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based ReasoningSiyuan Guo, Cheng Deng, Ying Wen 0001, Hechang Chen, Yi Chang 0001, Jun Wang 0012. [doi]
- HyperFields: Towards Zero-Shot Generation of NeRFs from TextSudarshan Babu, Richard Liu, Avery Zhou, Michael Maire, Greg Shakhnarovich, Rana Hanocka. [doi]
- On PI Controllers for Updating Lagrange Multipliers in Constrained OptimizationMotahareh Sohrabi, Juan Ramirez, Tianyue H. Zhang, Simon Lacoste-Julien, Jose Gallego-Posada. [doi]
- Mitigating Privacy Risk in Membership Inference by Convex-Concave LossZhenlong Liu, Lei Feng 0006, Huiping Zhuang, Xiaofeng Cao, Hongxin Wei. [doi]
- Optimal Kernel Choice for Score Function-based Causal DiscoveryWenjie Wang, Biwei Huang, Feng Liu 0003, Xinge You, Tongliang Liu, Kun Zhang 0001, Mingming Gong. [doi]
- Disentangled Continual Graph Neural Architecture Search with Invariant Modular SupernetZeyang Zhang, Xin Wang 0019, Yijian Qin, Hong Chen, Ziwei Zhang, Xu Chu, Wenwu Zhu 0001. [doi]
- Amend to Alignment: Decoupled Prompt Tuning for Mitigating Spurious Correlation in Vision-Language ModelsJie Zhang 0076, Xiaosong Ma, Song Guo 0001, Peng Li 0017, Wenchao Xu 0001, Xueyang Tang, Zicong Hong. [doi]
- TVE: Learning Meta-attribution for Transferable Vision ExplainerGuanchu Wang, Yu-Neng Chuang, Fan Yang 0023, Mengnan Du, Chia-Yuan Chang, Shaochen Zhong, Zirui Liu, Zhaozhuo Xu, Kaixiong Zhou, Xuanting Cai, Xia Hu 0001. [doi]
- Investigating Pre-Training Objectives for Generalization in Vision-Based Reinforcement LearningDonghu Kim, HoJoon Lee, Kyungmin Lee, Dongyoon Hwang, Jaegul Choo. [doi]
- Structured Chemistry Reasoning with Large Language ModelsSiru Ouyang, Zhuosheng Zhang 0001, Bing Yan, Xuan Liu, Yejin Choi 0001, Jiawei Han 0001, Lianhui Qin. [doi]
- RNAFlow: RNA Structure & Sequence Design via Inverse Folding-Based Flow MatchingDivya Nori, Wengong Jin. [doi]
- Algorithm and Hardness for Dynamic Attention Maintenance in Large Language ModelsJan van den Brand, Zhao Song 0002, Tianyi Zhou 0001. [doi]
- Improved Dimensionality Dependence for Zeroth-Order Optimisation over Cross-PolytopesWeijia Shao. [doi]
- Minimizing f-Divergences by Interpolating Velocity FieldsSong Liu, Jiahao Yu, Jack Simons, Mingxuan Yi, Mark Beaumont. [doi]
- ILILT: Implicit Learning of Inverse Lithography TechnologiesHaoyu Yang, Haoxing Ren. [doi]
- Compositional Text-to-Image Generation with Dense Blob RepresentationsWeili Nie, Sifei Liu, Morteza Mardani, Chao Liu 0064, Benjamin Eckart, Arash Vahdat. [doi]
- Position: Leverage Foundational Models for Black-Box OptimizationXingyou Song, Yingtao Tian, Robert Tjarko Lange, Chansoo Lee, Yujin Tang, Yutian Chen 0001. [doi]
- Make-A-Shape: a Ten-Million-scale 3D Shape ModelKa-Hei Hui, Aditya Sanghi, Arianna Rampini, Kamal Rahimi Malekshan, Zhengzhe Liu, Hooman Shayani, Chi-Wing Fu. [doi]
- When Will Gradient Regularization Be Harmful?Yang Zhao 0016, Hao Zhang 0005, Xiuyuan Hu. [doi]
- LangCell: Language-Cell Pre-training for Cell Identity UnderstandingSuyuan Zhao, Jiahuan Zhang, Yushuai Wu, Yizhen Luo, Zaiqing Nie. [doi]
- Conformal Prediction for Deep Classifier via Label RankingJianguo Huang, Huajun Xi, Linjun Zhang, Huaxiu Yao, Yue Qiu, Hongxin Wei. [doi]
- Closing the Gap: Achieving Global Convergence (Last Iterate) of Actor-Critic under Markovian Sampling with Neural Network ParametrizationMudit Gaur, Amrit S. Bedi, Di Wang 0015, Vaneet Aggarwal. [doi]
- Domain Generalisation via Imprecise LearningAnurag Singh, Siu Lun Chau, Shahine Bouabid, Krikamol Muandet. [doi]
- DeCoOp: Robust Prompt Tuning with Out-of-Distribution DetectionZhi Zhou 0007, Ming Yang, Jiang-Xin Shi, Lan-Zhe Guo, Yu-Feng Li. [doi]
- Implicit meta-learning may lead language models to trust more reliable sourcesDmitrii Krasheninnikov, Egor Krasheninnikov, Bruno Kacper Mlodozeniec, Tegan Maharaj, David Krueger 0001. [doi]
- Test-Time Regret Minimization in Meta Reinforcement LearningMirco Mutti, Aviv Tamar. [doi]
- IM-Unpack: Training and Inference with Arbitrarily Low Precision IntegersZhanpeng Zeng, Karthikeyan Sankaralingam, Vikas Singh. [doi]
- Fundamental Benefit of Alternating Updates in Minimax OptimizationJaewook Lee, Hanseul Cho 0002, Chulhee Yun. [doi]
- MILP-FBGen: LP/MILP Instance Generation with Feasibility/BoundednessYahong Zhang, Chenchen Fan, Donghui Chen, Congrui Li, Wenli Ouyang, Mingda Zhu, Junchi Yan. [doi]
- Use Your INSTINCT: INSTruction optimization for LLMs usIng Neural bandits Coupled with TransformersXiaoqiang Lin, Zhaoxuan Wu, Zhongxiang Dai, Wenyang Hu, Yao Shu, See-Kiong Ng, Patrick Jaillet, Bryan Kian Hsiang Low. [doi]
- Submodular framework for structured-sparse optimal transportPiyushi Manupriya, Pratik Jawanpuria, Karthik S. Gurumoorthy, Saketha Nath Jagarlapudi, Bamdev Mishra. [doi]
- An Information-Theoretic Analysis of In-Context LearningHong Jun Jeon, Jason D. Lee, Qi Lei, Benjamin Van Roy. [doi]
- Harmony in Diversity: Merging Neural Networks with Canonical Correlation AnalysisStefan Horoi, Albert Manuel Orozco Camacho, Eugene Belilovsky, Guy Wolf. [doi]
- Hieros: Hierarchical Imagination on Structured State Space Sequence World ModelsPaul Mattes, Rainer Schlosser, Ralf Herbrich. [doi]
- Graph Structure Extrapolation for Out-of-Distribution GeneralizationXiner Li, Shurui Gui, Youzhi Luo, Shuiwang Ji. [doi]
- Graph-based Time Series Clustering for End-to-End Hierarchical ForecastingAndrea Cini, Danilo P. Mandic, Cesare Alippi. [doi]
- Fine-grained Classes and How to Find ThemMatej Grcic, Artyom Gadetsky, Maria Brbic. [doi]
- Proactive Detection of Voice Cloning with Localized WatermarkingRobin San Roman, Pierre Fernandez, Hady ElSahar, Alexandre Défossez, Teddy Furon, Tuan Tran. [doi]
- Diffusion Models Encode the Intrinsic Dimension of Data ManifoldsJan Stanczuk, Georgios Batzolis, Teo Deveney, Carola-Bibiane Schönlieb. [doi]
- On the Generalization of Equivariant Graph Neural NetworksRafal Karczewski, Amauri H. Souza, Vikas Garg 0001. [doi]
- Removing Spurious Concepts from Neural Network Representations via Joint Subspace EstimationFloris Holstege, Bram Wouters, Noud P. A. van Giersbergen, Cees Diks. [doi]
- Scale-Free Image Keypoints Using Differentiable Persistent HomologyGiovanni Barbarani, Francesco Vaccarino, Gabriele Trivigno, Marco Guerra, Gabriele Moreno Berton, Carlo Masone. [doi]
- Variance-reduced Zeroth-Order Methods for Fine-Tuning Language ModelsTanmay Gautam, Youngsuk Park, Hao Zhou, Parameswaran Raman, Wooseok Ha. [doi]
- StrWAEs to Invariant RepresentationsHyunjong Lee, Yedarm Seong, Sungdong Lee, Joong-Ho Won. [doi]
- Stability Evaluation through Distributional Perturbation AnalysisJosé H. Blanchet, Peng Cui 0001, Jiajin Li, Jiashuo Liu. [doi]
- Accelerated Speculative Sampling Based on Tree Monte CarloZhengmian Hu, Heng Huang. [doi]
- Learning and Forgetting Unsafe Examples in Large Language ModelsJiachen Zhao, Zhun Deng, David Madras, James Zou 0001, Mengye Ren. [doi]
- Improving Generalization in Offline Reinforcement Learning via Adversarial Data SplittingDa Wang, Lin Li, Wei Wei 0018, Qixian Yu, Jianye Hao, Jiye Liang. [doi]
- Position: Measure Dataset Diversity, Don't Just Claim ItDora Zhao, Jerone T. A. Andrews, Orestis Papakyriakopoulos, Alice Xiang. [doi]
- CasCast: Skillful High-resolution Precipitation Nowcasting via Cascaded ModellingJunchao Gong, Lei Bai 0001, Peng Ye, Wanghan Xu, Na Liu, Jianhua Dai, Xiaokang Yang, Wanli Ouyang. [doi]
- Is DPO Superior to PPO for LLM Alignment? A Comprehensive StudyShusheng Xu, Wei Fu, Jiaxuan Gao, Wenjie Ye, Weilin Liu, Zhiyu Mei, Guangju Wang, Chao Yu 0005, Yi Wu 0013. [doi]
- Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric RegularizationJinlu Zhang 0002, Yiyi Zhou, Qiancheng Zheng, Xiaoxiong Du, Gen Luo, Jun Peng, Xiaoshuai Sun, Rongrong Ji. [doi]
- OMPO: A Unified Framework for RL under Policy and Dynamics ShiftsYu Luo, Tianying Ji, Fuchun Sun 0001, Jianwei Zhang 0001, Huazhe Xu, Xianyuan Zhan. [doi]
- Statistical Inference Under Constrained Selection BiasSantiago Cortes-Gomez, Mateo Dulce Rubio, Carlos Miguel Patiño, Bryan Wilder. [doi]
- DsDm: Model-Aware Dataset Selection with DatamodelsLogan Engstrom, Axel Feldmann, Aleksander Madry. [doi]
- TroVE: Inducing Verifiable and Efficient Toolboxes for Solving Programmatic TasksZhiruo Wang, Graham Neubig, Daniel Fried. [doi]
- Preference Optimization for Molecule Synthesis with Conditional Residual Energy-based ModelsSongtao Liu, Hanjun Dai, Yue Zhao, Peng Liu. [doi]
- How Language Model Hallucinations Can SnowballMuru Zhang, Ofir Press, William Merrill, Alisa Liu, Noah A. Smith. [doi]
- State-Free Inference of State-Space Models: The *Transfer Function* ApproachRom N. Parnichkun, Stefano Massaroli, Alessandro Moro, Jimmy T. H. Smith, Ramin M. Hasani, Mathias Lechner, Qi An, Christopher Ré, Hajime Asama, Stefano Ermon, Taiji Suzuki, Michael Poli, Atsushi Yamashita. [doi]
- Private Vector Mean Estimation in the Shuffle Model: Optimal Rates Require Many MessagesHilal Asi, Vitaly Feldman, Jelani Nelson, Huy L. Nguyen, Kunal Talwar, Samson Zhou. [doi]
- Robust and Conjugate Gaussian Process RegressionMatías Altamirano, François-Xavier Briol, Jeremias Knoblauch. [doi]
- Learning Surrogates for Offline Black-Box Optimization via Gradient MatchingMinh Hoang, Azza Fadhel, Aryan Deshwal, Jana Doppa, Trong Nghia Hoang. [doi]
- LIDAO: Towards Limited Interventions for Debiasing (Large) Language ModelsTianci Liu 0003, Haoyu Wang 0004, Shiyang Wang, Yu Cheng, Jing Gao 0004. [doi]
- More Benefits of Being Distributional: Second-Order Bounds for Reinforcement LearningKaiwen Wang, Owen Oertell, Alekh Agarwal, Nathan Kallus, Wen Sun 0002. [doi]
- Prompting4Debugging: Red-Teaming Text-to-Image Diffusion Models by Finding Problematic PromptsZhi-Yi Chin, Chieh-Ming Jiang, Ching-Chun Huang, Pin-Yu Chen, Wei-chen Chiu. [doi]
- Learning to Explore in POMDPs with Informational RewardsAnnie Xie, Logan M. Bhamidipaty, Evan Zheran Liu, Joey Hong, Sergey Levine, Chelsea Finn. [doi]
- Position: AI/ML Influencers Have a Place in the Academic ProcessIain Weissburg, Mehir Arora, Xinyi Wang, Liangming Pan, William Yang Wang. [doi]
- Effects of Exponential Gaussian Distribution on (Double Sampling) Randomized SmoothingYouwei Shu, Xi Xiao, Derui Wang, Yuxin Cao, Siji Chen, Jason Xue, Linyi Li 0001, Bo Li 0026. [doi]
- TimeX++: Learning Time-Series Explanations with Information BottleneckZichuan Liu, Tianchun Wang, Jimeng Shi, Xu Zheng 0003, Zhuomin Chen, Lei Song, Wenqian Dong, Jayantha Obeysekera, Farhad Shirani 0001, Dongsheng Luo. [doi]
- Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free LunchLe Yu, Bowen Yu 0002, Haiyang Yu, Fei Huang 0004, Yongbin Li. [doi]
- GALA3D: Towards Text-to-3D Complex Scene Generation via Layout-guided Generative Gaussian SplattingXiaoyu Zhou, Xingjian Ran, Yajiao Xiong, Jinlin He, Zhiwei Lin, Yongtao Wang, Deqing Sun, Ming-Hsuan Yang 0001. [doi]
- DecisionNCE: Embodied Multimodal Representations via Implicit Preference LearningJianxiong Li, Jinliang Zheng, Yinan Zheng, Liyuan Mao, Xiao Hu, Sijie Cheng, Haoyi Niu, Jihao Liu, Yu Liu, Jingjing Liu, Ya-Qin Zhang, Xianyuan Zhan. [doi]
- Not all distributional shifts are equal: Fine-grained robust conformal inferenceJiahao Ai, Zhimei Ren. [doi]
- Listwise Reward Estimation for Offline Preference-based Reinforcement LearningHeewoong Choi, Sangwon Jung, Hongjoon Ahn, Taesup Moon. [doi]
- eCeLLM: Generalizing Large Language Models for E-commerce from Large-scale, High-quality Instruction DataBo Peng 0009, Xinyi Ling, Ziru Chen, Huan Sun 0001, Xia Ning. [doi]
- Towards Efficient Spiking Transformer: a Token Sparsification Framework for Training and Inference AccelerationZhengyang Zhuge, Peisong Wang, Xingting Yao, Jian Cheng 0001. [doi]
- Private Heterogeneous Federated Learning Without a Trusted Server Revisited: Error-Optimal and Communication-Efficient Algorithms for Convex LossesChangyu Gao, Andrew Lowy, Xingyu Zhou, Stephen J. Wright 0001. [doi]
- Is Epistemic Uncertainty Faithfully Represented by Evidential Deep Learning Methods?Mira Jürgens, Nis Meinert, Viktor Bengs, Eyke Hüllermeier, Willem Waegeman. [doi]
- Embodied CoT Distillation From LLM To Off-the-shelf AgentsWonje Choi 0003, Woo Kyung Kim, Minjong Yoo, Honguk Woo. [doi]
- Vector Quantization Pretraining for EEG Time Series with Random Projection and Phase AlignmentHaokun Gui, Xiucheng Li, Xinyang Chen. [doi]
- Dr. Strategy: Model-Based Generalist Agents with Strategic DreamingHany Hamed, Subin Kim, Dongyeong Kim, Jaesik Yoon, Sungjin Ahn. [doi]
- Sliced Wasserstein with Random-Path Projecting DirectionsKhai Nguyen, Shujian Zhang, Tam Le, Nhat Ho. [doi]
- Autaptic Synaptic Circuit Enhances Spatio-temporal Predictive Learning of Spiking Neural NetworksLihao Wang, Zhaofei Yu. [doi]
- Discrete Diffusion Modeling by Estimating the Ratios of the Data DistributionAaron Lou, Chenlin Meng, Stefano Ermon. [doi]
- Self-Supervised Coarsening of Unstructured Grid with Automatic DifferentiationSergei Shumilin, Alexander Ryabov, Nikolay B. Yavich, Evgeny Burnaev, Vladimir Vanovskiy. [doi]
- Model-Based RL for Mean-Field Games is not Statistically Harder than Single-Agent RLJiawei Huang, Niao He, Andreas Krause 0001. [doi]
- Rich-Observation Reinforcement Learning with Continuous Latent DynamicsYuda Song 0001, Lili Wu, Dylan J. Foster, Akshay Krishnamurthy. [doi]
- Finding NEM-U: Explaining unsupervised representation learning through neural network generated explanation masksBjørn Leth Møller, Christian Igel, Kristoffer Knutsen Wickstrøm, Jon Sporring, Robert Jenssen, Bulat Ibragimov. [doi]
- Watermarks in the Sand: Impossibility of Strong Watermarking for Language ModelsHanlin Zhang, Benjamin L. Edelman, Danilo Francati, Daniele Venturi 0001, Giuseppe Ateniese, Boaz Barak. [doi]
- A Single-Loop Robust Policy Gradient Method for Robust Markov Decision ProcessesZhenwei Lin, Chenyu Xue, Qi Deng, Yinyu Ye 0001. [doi]
- FADAS: Towards Federated Adaptive Asynchronous OptimizationYujia Wang, Shiqiang Wang, Songtao Lu, Jinghui Chen. [doi]
- Discovering Multiple Solutions from a Single Task in Offline Reinforcement LearningTakayuki Osa, Tatsuya Harada. [doi]
- Two-Stage Shadow Inclusion Estimation: An IV Approach for Causal Inference under Latent Confounding and Collider BiasBaohong Li, Anpeng Wu, Ruoxuan Xiong, Kun Kuang. [doi]
- OptiMUS: Scalable Optimization Modeling with (MI)LP Solvers and Large Language ModelsAli AhmadiTeshnizi, Wenzhi Gao, Madeleine Udell. [doi]
- UPOCR: Towards Unified Pixel-Level OCR InterfaceDezhi Peng, Zhenhua Yang, Jiaxin Zhang 0003, Chongyu Liu, Yongxin Shi, Kai Ding 0009, Fengjun Guo, Lianwen Jin. [doi]
- Beyond ELBOs: A Large-Scale Evaluation of Variational Methods for SamplingDenis Blessing, Xiaogang Jia, Johannes Esslinger, Francisco Vargas 0001, Gerhard Neumann. [doi]
- Residual-Conditioned Optimal Transport: Towards Structure-Preserving Unpaired and Paired Image RestorationXiaole Tang, Xin Hu, Xiang Gu 0005, Jian Sun 0009. [doi]
- Practical Performance Guarantees for Pipelined DNN InferenceAaron Archer, Matthew Fahrbach, Kuikui Liu, Prakash Prabhu. [doi]
- Multi-layer Rehearsal Feature Augmentation for Class-Incremental LearningBowen Zheng, Da-Wei Zhou 0001, Han-Jia Ye, De-Chuan Zhan. [doi]
- Image Fusion via Vision-Language ModelZixiang Zhao, Lilun Deng, Haowen Bai, Yukun Cui, Zhipeng Zhang, Yulun Zhang, Haotong Qin, Dongdong Chen, Jiangshe Zhang 0001, Peng Wang, Luc Van Gool. [doi]
- A Dense Reward View on Aligning Text-to-Image Diffusion with PreferenceShentao Yang, TianQi Chen, Mingyuan Zhou. [doi]
- Algorithmic Stability Unleashed: Generalization Bounds with Unbounded LossesShaojie Li, Bowei Zhu, Yong Liu 0018. [doi]
- Weakly Convex Regularisers for Inverse Problems: Convergence of Critical Points and Primal-Dual OptimisationZakhar Shumaylov, Jeremy Budd, Subhadip Mukherjee, Carola-Bibiane Schönlieb. [doi]
- Interpretability Illusions in the Generalization of Simplified ModelsDan Friedman, Andrew Kyle Lampinen, Lucas Dixon, Danqi Chen 0001, Asma Ghandeharioun. [doi]
- SLEB: Streamlining LLMs through Redundancy Verification and Elimination of Transformer BlocksJiwon Song, Kyungseok Oh, Taesu Kim, HyungJun Kim, Yulhwa Kim, Jae-Joon Kim. [doi]
- Improving Transformers with Dynamically Composable Multi-Head AttentionDa Xiao, Qingye Meng, Shengping Li, Xingyuan Yuan. [doi]
- Slicedit: Zero-Shot Video Editing With Text-to-Image Diffusion Models Using Spatio-Temporal SlicesNathaniel Cohen, Vladimir Kulikov, Matan Kleiner, Inbar Huberman-Spiegelglas, Tomer Michaeli. [doi]
- Semantically-correlated memories in a dense associative modelThomas F. Burns. [doi]
- Interpreting and Improving Large Language Models in Arithmetic CalculationWei Zhang, Chaoqun Wan, Yonggang Zhang, Yiu-ming Cheung, Xinmei Tian 0001, Xu Shen, Jieping Ye. [doi]
- Mean-field Analysis on Two-layer Neural Networks from a Kernel PerspectiveShokichi Takakura, Taiji Suzuki. [doi]
- NExT-GPT: Any-to-Any Multimodal LLMShengqiong Wu, Hao Fei 0001, Leigang Qu, Wei Ji 0008, Tat-Seng Chua. [doi]
- Online Matching with Stochastic Rewards: Provable Better Bound via Adversarial Reinforcement LearningQiankun Zhang, Aocheng Shen, Boyu Zhang, Hanrui Jiang, Bingqian Du. [doi]
- Q-Probe: A Lightweight Approach to Reward Maximization for Language ModelsKenneth Li 0002, Samy Jelassi, Hugh Zhang, Sham M. Kakade, Martin Wattenberg, David Brandfonbrener. [doi]
- On the Weight Dynamics of Deep Normalized NetworksChristian H. X. Ali Mehmeti-Göpel, Michael Wand 0001. [doi]
- Diffusion Rejection SamplingByeonghu Na, Yeongmin Kim, Minsang Park, Donghyeok Shin, Wanmo Kang, Il-Chul Moon. [doi]
- Graph Automorphism Group Equivariant Neural NetworksEdward Pearce-Crump, William J. Knottenbelt. [doi]
- FedBAT: Communication-Efficient Federated Learning via Learnable BinarizationShiwei Li, Wenchao Xu, Haozhao Wang, Xing Tang 0007, Yining Qi, Shijie Xu, Weihong Luo, Yuhua Li 0003, Xiuqiang He, Ruixuan Li 0001. [doi]
- Isometric Representation Learning for Disentangled Latent Space of Diffusion ModelsJaehoon Hahm, Junho Lee, Sunghyun Kim, Joonseok Lee. [doi]
- Towards an Understanding of Stepwise Inference in Transformers: A Synthetic Graph Navigation ModelMikail Khona, Maya Okawa, Jan Hula, Rahul Ramesh, Kento Nishi, Robert P. Dick, Ekdeep Singh Lubana, Hidenori Tanaka. [doi]
- Interpreting and Improving Diffusion Models from an Optimization PerspectiveFrank Permenter, Chenyang Yuan. [doi]
- Delving into Differentially Private TransformerYoulong Ding, Xueyang Wu 0001, Yining Meng, Yonggang Luo, Hao Wang 0014, Weike Pan. [doi]
- Sobolev Space Regularised Pre Density ModelsMark Kozdoba, Binyamin Perets, Shie Mannor. [doi]
- Implicit Bias of Policy Gradient in Linear Quadratic Control: Extrapolation to Unseen Initial StatesNoam Razin, Yotam Alexander, Edo Cohen-Karlik, Raja Giryes, Amir Globerson, Nadav Cohen. [doi]
- An Efficient Self-Learning Framework For Interactive Spoken Dialog SystemsHitesh Tulsiani, David M. Chan, Shalini Ghosh, Garima Lalwani, Prabhat Pandey, Ankish Bansal, Sri Garimella, Ariya Rastrow, Björn Hoffmeister. [doi]
- Mixtures of Experts Unlock Parameter Scaling for Deep RLJohan Samir Obando-Ceron, Ghada Sokar, Timon Willi, Clare Lyle, Jesse Farebrother, Jakob Nicolaus Foerster, Gintare Karolina Dziugaite, Doina Precup, Pablo Samuel Castro. [doi]
- Sampling in Unit Time with Kernel Fisher-Rao FlowAimee Maurais, Youssef M. Marzouk. [doi]
- A General Theory for Softmax Gating Multinomial Logistic Mixture of ExpertsHuy Nguyen, Pedram Akbarian, TrungTin Nguyen, Nhat Ho. [doi]
- Estimating Barycenters of Distributions with Neural Optimal TransportAlexander Kolesov, Petr Mokrov, Igor Udovichenko, Milena Gazdieva, Gudmund Pammer, Evgeny Burnaev, Alexander Korotin. [doi]
- Practical Hamiltonian Monte Carlo on Riemannian Manifolds via Relativity TheoryKai Xu, Hong Ge. [doi]
- Performance Bounds for Active Binary Testing with Information MaximizationAditya Chattopadhyay, Benjamin David Haeffele, René Vidal, Donald Geman. [doi]
- Handling Heterogeneous Curvatures in Bandit LQR ControlYu-Hu Yan, Jing Wang, Peng Zhao 0006. [doi]
- MLI Formula: A Nearly Scale-Invariant Solution with Noise PerturbationBowen Tao, Xin-Chun Li, De-Chuan Zhan. [doi]
- CLLMs: Consistency Large Language ModelsSiqi Kou, Lanxiang Hu, Zhezhi He, Zhijie Deng, Hao Zhang. [doi]
- EvTexture: Event-driven Texture Enhancement for Video Super-ResolutionDachun Kai, Jiayao Lu, Yueyi Zhang, Xiaoyan Sun 0001. [doi]
- Diffusion Model-Augmented Behavioral CloningShang-Fu Chen, Hsiang-Chun Wang, Ming-Hao Hsu, Chun-Mao Lai, Shao-Hua Sun. [doi]
- Efficient Stochastic Approximation of Minimax Excess Risk OptimizationLijun Zhang 0005, Haomin Bai, Wei-Wei Tu, Ping Yang, Yao Hu. [doi]
- KnowFormer: Revisiting Transformers for Knowledge Graph ReasoningJunnan Liu, Qianren Mao, Weifeng Jiang, Jianxin Li 0002. [doi]
- Knowledge-aware Reinforced Language Models for Protein Directed EvolutionYuhao Wang, Qiang Zhang, Ming Qin, Xiang Zhuang, Xiaotong Li, Zhichen Gong, Zeyuan Wang, Yu Zhao 0009, Jianhua Yao 0001, Keyan Ding, Huajun Chen. [doi]
- Fair Federated Learning via the Proportional Veto CoreBhaskar Ray Chaudhury, Aniket Murhekar, Zhuowen Yuan, Bo Li 0026, Ruta Mehta, Ariel D. Procaccia. [doi]
- Efficient Algorithms for Empirical Group Distributionally Robust Optimization and BeyondDingzhi Yu, Yunuo Cai, Wei Jiang, Lijun Zhang. [doi]
- Benchmarking and Building Long-Context Retrieval Models with LoCo and M2-BERTJon Saad-Falcon, Daniel Y. Fu, Simran Arora, Neel Guha, Christopher Ré. [doi]
- Information Complexity of Stochastic Convex Optimization: Applications to Generalization, Memorization, and TracingIdan Attias, Gintare Karolina Dziugaite, Mahdi Haghifam, Roi Livni, Daniel M. Roy 0001. [doi]
- Prompt-tuning Latent Diffusion Models for Inverse ProblemsHyungjin Chung, Jong Chul Ye, Peyman Milanfar, Mauricio Delbracio. [doi]
- Differentiable Mapper for Topological Optimization of Data RepresentationZiyad Oulhaj, Mathieu Carrière, Bertrand Michel. [doi]
- Fault Tolerant ML: Efficient Meta-Aggregation and Synchronous TrainingTehila Dahan, Kfir Yehuda Levy. [doi]
- SAMformer: Unlocking the Potential of Transformers in Time Series Forecasting with Sharpness-Aware Minimization and Channel-Wise AttentionRomain Ilbert, Ambroise Odonnat, Vasilii Feofanov, Aladin Virmaux, Giuseppe Paolo, Themis Palpanas, Ievgen Redko. [doi]
- Variational Inference with Coverage Guarantees in Simulation-Based InferenceYash P. Patel, Declan McNamara, Jackson Loper, Jeffrey Regier, Ambuj Tewari. [doi]
- Provable Contrastive Continual LearningYichen Wen, Zhiquan Tan, Kaipeng Zheng, Chuanlong Xie, Weiran Huang 0001. [doi]
- Extending Test-Time Augmentation with Metamorphic Relations for Combinatorial ProblemsSiwei Wei, Xudong Zhang, Zhiyang Zhou, Yan Cai 0001. [doi]
- Beyond the Federation: Topology-aware Federated Learning for Generalization to Unseen ClientsMengmeng Ma 0002, Tang Li 0005, Xi Peng 0005. [doi]
- Learning-Efficient Yet Generalizable Collaborative Filtering for Item RecommendationYuanhao Pu, Xiaolong Chen, Xu Huang 0008, Jin Chen 0008, Defu Lian, Enhong Chen. [doi]
- PARCv2: Physics-aware Recurrent Convolutional Neural Networks for Spatiotemporal Dynamics ModelingPhong C. H. Nguyen, Xinlun Cheng, Shahab Azarfar, Pradeep K. Seshadri, Yen Thi Nguyen, MunHo Kim, Sanghun Choi, H. S. Udaykumar, Stephen Baek. [doi]
- Mean-field Underdamped Langevin Dynamics and its Spacetime DiscretizationQiang Fu, Ashia Camage Wilson. [doi]
- Graph-Triggered Rising BanditsGianmarco Genalti, Marco Mussi, Nicola Gatti 0001, Marcello Restelli, Matteo Castiglioni, Alberto Maria Metelli. [doi]
- A Theory of Fault-Tolerant LearningChanglong Wu, Yifan Wang, Ananth Grama. [doi]
- An Online Optimization Perspective on First-Order and Zero-Order Decentralized Nonsmooth Nonconvex Stochastic OptimizationEmre Sahinoglu, Shahin Shahrampour. [doi]
- Two-timescale Derivative Free Optimization for Performative Prediction with Markovian DataHaitong Liu, Qiang Li, Hoi-To Wai. [doi]
- Accurate LoRA-Finetuning Quantization of LLMs via Information RetentionHaotong Qin, Xudong Ma, Xingyu Zheng, Xiaoyang Li, Yang Zhang 0088, Shouda Liu, Jie Luo 0004, Xianglong Liu 0001, Michele Magno. [doi]
- Convergence of Some Convex Message Passing Algorithms to a Fixed PointVáclav Vorácek, Tomás Werner. [doi]
- Adaptive-Gradient Policy Optimization: Enhancing Policy Learning in Non-Smooth Differentiable SimulationsFeng Gao, Liangzhi Shi, Shenao Zhang, Zhaoran Wang 0001, Yi Wu. [doi]
- Towards Theoretical Understanding of Learning Large-scale Dependent Data via Random FeaturesChao Wang, Xin Bing, Xin He, Caixing Wang. [doi]
- Stochastic Quantum Sampling for Non-Logconcave Distributions and Estimating Partition FunctionsGuneykan Ozgul, Xiantao Li, Mehrdad Mahdavi, Chunhao Wang. [doi]
- No Dimensional Sampling Coresets for ClassificationMeysam Alishahi, Jeff M. Phillips. [doi]
- Standardized Interpretable Fairness Measures for Continuous Risk ScoresAnn-Kristin Becker, Oana Dumitrasc, Klaus Broelemann. [doi]
- Rotational Equilibrium: How Weight Decay Balances Learning Across Neural NetworksAtli Kosson, Bettina Messmer, Martin Jaggi. [doi]
- Exploration and Anti-Exploration with Distributional Random Network DistillationKai Yang, Jian Tao, Jiafei Lyu, Xiu Li 0001. [doi]
- Position: What makes an image realistic?Lucas Theis. [doi]
- RoboMP2: A Robotic Multimodal Perception-Planning Framework with Multimodal Large Language ModelsQi Lv, Hao Li, Xiang Deng, Rui Shao, Michael Y. Wang, Liqiang Nie. [doi]
- Understanding Adam Optimizer via Online Learning of Updates: Adam is FTRL in DisguiseKwangjun Ahn, Zhiyu Zhang, Yunbum Kook, Yan Dai 0002. [doi]
- Meta Evidential Transformer for Few-Shot Open-Set RecognitionHitesh Sapkota, Krishna Prasad Neupane, Qi Yu 0001. [doi]
- MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-ExpertsGuanjie Chen, Xinyu Zhao, Tianlong Chen, Yu Cheng. [doi]
- Vocabulary for Universal Approximation: A Linguistic Perspective of Mapping CompositionsYongqiang Cai. [doi]
- Adaptive Hierarchical Certification for Segmentation using Randomized SmoothingAlaa Anani, Tobias Lorenz 0002, Bernt Schiele, Mario Fritz. [doi]
- On the Consistency of Kernel Methods with Dependent ObservationsPierre-François Massiani, Sebastian Trimpe, Friedrich Solowjow. [doi]
- Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-TuningHao Zhao, Maksym Andriushchenko, Francesco Croce, Nicolas Flammarion. [doi]
- Towards Scalable and Versatile Weight Space LearningKonstantin Schürholt, Michael W. Mahoney, Damian Borth. [doi]
- The Balanced-Pairwise-Affinities Feature TransformDaniel Shalam, Simon Korman. [doi]
- Theoretical Analysis of Learned Database Operations under Distribution Shift through Distribution LearnabilitySepanta Zeighami, Cyrus Shahabi. [doi]
- Sub-token ViT Embedding via Stochastic Resonance TransformersDong Lao, Yangchao Wu, Tian-Yu Liu, Alex Wong 0001, Stefano Soatto. [doi]
- Generalized Smooth Variational Inequalities: Methods with Adaptive StepsizesDaniil Vankov, Angelia Nedich, Lalitha Sankar. [doi]
- ESM All-Atom: Multi-Scale Protein Language Model for Unified Molecular ModelingKangjie Zheng, Siyu Long, Tianyu Lu, Junwei Yang, Xinyu Dai, Ming Zhang 0004, Zaiqing Nie, Wei-Ying Ma, Hao Zhou 0012. [doi]
- Reinformer: Max-Return Sequence Modeling for Offline RLZifeng Zhuang, Dengyun Peng, Jinxin Liu, Ziqi Zhang, Donglin Wang. [doi]
- Vague Prototype-Oriented Diffusion Model for Multi-Class Anomaly DetectionYuxin Li, Yaoxuan Feng, Bo Chen 0001, Wenchao Chen, Yubiao Wang, Xinyue Hu, Baolin Sun, Chunhui Qu, Mingyuan Zhou. [doi]
- Model Assessment and Selection under Temporal Distribution ShiftElise Han, Chengpiao Huang, Kaizheng Wang. [doi]
- Learning High-Order Relationships of Brain RegionsWeikang Qiu, Huangrui Chu, Selena Wang, Haolan Zuo, Xiaoxiao Li, Yize Zhao, Rex Ying. [doi]
- Liouville Flow Importance SamplerYifeng Tian, Nishant Panda, Yen-Ting Lin. [doi]
- On the Hardness of Probabilistic Neurosymbolic LearningJaron Maene, Vincent Derkinderen, Luc De Raedt. [doi]
- Adaptive Accompaniment with ReaLchordsYusong Wu, Tim Cooijmans, Kyle Kastner, Adam Roberts, Ian Simon, Alexander Scarlatos, Chris Donahue, Cassie Tarakajian, Shayegan Omidshafiei, Aaron C. Courville, Pablo Samuel Castro, Natasha Jaques, Cheng-Zhi Anna Huang. [doi]
- Scalable Pre-training of Large Autoregressive Image ModelsAlaaeldin El-Nouby, Michal Klein, Shuangfei Zhai, Miguel Ángel Bautista 0001, Vaishaal Shankar, Alexander T. Toshev, Joshua M. Susskind, Armand Joulin. [doi]
- Offline Inverse RL: New Solution Concepts and Provably Efficient AlgorithmsFilippo Lazzati, Mirco Mutti, Alberto Maria Metelli. [doi]
- Unified Generation, Reconstruction, and Representation: Generalized Diffusion with Adaptive Latent Encoding-DecodingGuangyi Liu, Yu Wang, Zeyu Feng, Qiyu Wu 0001, Liping Tang, Yuan Gao, Zhen Li 0026, Shuguang Cui, Julian J. McAuley, Zichao Yang, Eric P. Xing, Zhiting Hu. [doi]
- Block Acceleration Without Momentum: On Optimal Stepsizes of Block Gradient Descent for Least-SquaresLiangzu Peng, Wotao Yin. [doi]
- Toward Adaptive Reasoning in Large Language Models with Thought RollbackSijia Chen, Baochun Li. [doi]
- A Federated Stochastic Multi-level Compositional Minimax Algorithm for Deep AUC MaximizationXinwen Zhang, Ali Payani, Myungjin Lee, Richard Souvenir, Hongchang Gao. [doi]
- Long Range Propagation on Continuous-Time Dynamic GraphsAlessio Gravina, Giulio Lovisotto, Claudio Gallicchio, Davide Bacciu, Claas Grohnfeldt. [doi]
- High-Performance Temporal Reversible Spiking Neural Networks with O(L) Training Memory and O(1) Inference CostJiakui Hu, Man Yao, Xuerui Qiu, Yuhong Chou, Yuxuan Cai, Ning Qiao, Yonghong Tian 0001, Bo Xu 0002, Guoqi Li. [doi]
- Individualized Privacy Accounting via Subsampling with Applications in Combinatorial OptimizationBadih Ghazi, Pritish Kamath, Ravi Kumar 0001, Pasin Manurangsi, Adam Sealfon. [doi]
- Balancing Similarity and Complementarity for Federated LearningKunda Yan, Sen Cui, Abudukelimu Wuerkaixi, Jingfeng Zhang, Bo Han 0003, Gang Niu 0001, Masashi Sugiyama, Changshui Zhang. [doi]
- Double Variance Reduction: A Smoothing Trick for Composite Optimization Problems without First-Order GradientHao Di, Haishan Ye, Yueling Zhang, Xiangyu Chang, Guang Dai, Ivor W. Tsang. [doi]
- Ameliorate Spurious Correlations in Dataset CondensationJustin Cui, Ruochen Wang, Yuanhao Xiong, Cho-Jui Hsieh. [doi]
- Understanding MLP-Mixer as a wide and sparse MLPTomohiro Hayase, Ryo Karakida. [doi]
- Rethinking the Flat Minima Searching in Federated LearningTaehwan Lee, Sung Whan Yoon. [doi]
- Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary DynamicsXinyu Zhang, Wenjie Qiu 0005, Yi-Chen Li 0001, Lei Yuan 0001, Chengxing Jia, Zongzhang Zhang, Yang Yu 0001. [doi]
- Enhancing Vision Transformer: Amplifying Non-Linearity in Feedforward Network ModuleYixing Xu, Chao Li, Dong Li, Xiao Sheng, Fan Jiang, Lu Tian, Ashish Sirasao, Emad Barsoum. [doi]
- HexGen: Generative Inference of Large Language Model over Heterogeneous EnvironmentYouhe Jiang, Ran Yan, Xiaozhe Yao, Yang Zhou, Beidi Chen, Binhang Yuan. [doi]
- Taylor Videos for Action RecognitionLei Wang 0001, Xiuyuan Yuan, Tom Gedeon, Liang Zheng 0001. [doi]
- Pseudo-Calibration: Improving Predictive Uncertainty Estimation in Unsupervised Domain AdaptationDapeng Hu, Jian Liang, Xinchao Wang, Chuan-Sheng Foo. [doi]
- Spider: A Unified Framework for Context-dependent Concept SegmentationXiaoqi Zhao, Youwei Pang, Wei Ji 0011, Baicheng Sheng, Jiaming Zuo, Lihe Zhang, Huchuan Lu. [doi]
- Eluder-based Regret for Stochastic Contextual MDPsOrin Levy, Asaf B. Cassel, Alon Cohen, Yishay Mansour. [doi]
- Prompt-guided Precise Audio Editing with Diffusion ModelsManjie Xu, Chenxing Li, Duzhen Zhang, Dan Su 0002, Wei Liang, Dong Yu 0001. [doi]
- Overcoming Saturation in Density Ratio Estimation by Iterated RegularizationLukas Gruber, Markus Holzleitner, Johannes Lehner, Sepp Hochreiter, Werner Zellinger. [doi]
- Perturb-and-Project: Differentially Private Similarities and MarginalsVincent Cohen-Addad, Tommaso d'Orsi, Alessandro Epasto, Vahab Mirrokni, Peilin Zhong. [doi]
- Exploring the Benefit of Activation Sparsity in Pre-trainingZhengyan Zhang, Chaojun Xiao, Qiujieli Qin, Yankai Lin, Zhiyuan Zeng, Xu Han 0007, Zhiyuan Liu 0001, Ruobing Xie, Maosong Sun 0001, Jie Zhou 0016. [doi]
- Controlling Behavioral Diversity in Multi-Agent Reinforcement LearningMatteo Bettini, Ryan Kortvelesy, Amanda Prorok. [doi]
- FuRL: Visual-Language Models as Fuzzy Rewards for Reinforcement LearningYuwei Fu, Haichao Zhang, Di Wu 0044, Wei Xu, Benoit Boulet. [doi]
- Learning Cognitive Maps from Transformer Representations for Efficient Planning in Partially Observed EnvironmentsAntoine Dedieu, Wolfgang Lehrach, Guangyao Zhou, Dileep George, Miguel Lázaro-Gredilla. [doi]
- A Provable Decision Rule for Out-of-Distribution DetectionXinsong Ma, Xin Zou, Weiwei Liu. [doi]
- Exploring Training on Heterogeneous Data with Mixture of Low-rank AdaptersYuhang Zhou, Zihua Zhao, Siyuan Du, Haolin Li, Jiangchao Yao, Ya Zhang 0002, Yanfeng Wang. [doi]
- Non-clairvoyant Scheduling with Partial PredictionsZiyad Benomar, Vianney Perchet. [doi]
- Position: Do Not Explain Vision Models Without ContextPaulina Tomaszewska, Przemyslaw Biecek. [doi]
- HelmFluid: Learning Helmholtz Dynamics for Interpretable Fluid PredictionLanxiang Xing, Haixu Wu, Yuezhou Ma, Jianmin Wang 0001, Mingsheng Long. [doi]
- Graph Neural PDE Solvers with Conservation and Similarity-EquivarianceMasanobu Horie, Naoto Mitsume. [doi]
- Subsampling is not Magic: Why Large Batch Sizes Work for Differentially Private Stochastic OptimisationOssi Räisä, Joonas Jälkö, Antti Honkela. [doi]
- On a Neural Implementation of Brenier's Polar FactorizationNina Vesseron, Marco Cuturi. [doi]
- Random features models: a way to study the success of naive imputationAlexis Ayme, Claire Boyer, Aymeric Dieuleveut, Erwan Scornet. [doi]
- Repeat After Me: Transformers are Better than State Space Models at CopyingSamy Jelassi, David Brandfonbrener, Sham M. Kakade, Eran Malach. [doi]
- From Words to Actions: Unveiling the Theoretical Underpinnings of LLM-Driven Autonomous SystemsJianliang He, Siyu Chen, Fengzhuo Zhang, Zhuoran Yang. [doi]
- Position: Embracing Negative Results in Machine LearningFlorian Karl, Lukas Malte Kemeter, Gabriel Dax, Paulina Sierak. [doi]
- LongRoPE: Extending LLM Context Window Beyond 2 Million TokensYiran Ding, Li Lyna Zhang, Chengruidong Zhang, Yuanyuan Xu, Ning Shang, Jiahang Xu, Fan Yang 0024, Mao Yang. [doi]
- Decoupling Learning and Decision-Making: Breaking the O(T) Barrier in Online Resource Allocation with First-Order MethodsWenzhi Gao, Chunlin Sun, Chenyu Xue, Yinyu Ye 0001. [doi]
- Rethinking Data Shapley for Data Selection Tasks: Misleads and MeritsJiachen T. Wang, Tianji Yang, James Zou 0001, Yongchan Kwon, Ruoxi Jia 0001. [doi]
- Efficient Non-stationary Online Learning by Wavelets with Applications to Online Distribution Shift AdaptationYuyang Qian, Peng Zhao 0006, Yu-Jie Zhang, Masashi Sugiyama, Zhi-Hua Zhou. [doi]
- DeepPolar: Inventing Nonlinear Large-Kernel Polar Codes via Deep LearningS. Ashwin Hebbar, Sravan Kumar Ankireddy, Hyeji Kim, Sewoong Oh, Pramod Viswanath. [doi]
- One-Shot Strategic Classification Under Unknown CostsElan Rosenfeld, Nir Rosenfeld. [doi]
- Nonsmooth Implicit Differentiation: Deterministic and Stochastic Convergence RatesRiccardo Grazzi, Massimiliano Pontil, Saverio Salzo. [doi]
- MLIP: Efficient Multi-Perspective Language-Image Pretraining with Exhaustive Data UtilizationYu Zhang 0133, Qi Zhang 0020, Zixuan Gong, Yiwei Shi, Yepeng Liu, Duoqian Miao 0001, Yang Liu, Ke Liu, Kun Yi, Wei Fan 0010, Liang Hu 0004, Changwei Wang. [doi]
- Do Large Code Models Understand Programming Concepts? Counterfactual Analysis for Code PredicatesAshish Hooda, Mihai Christodorescu, Miltiadis Allamanis, Aaron Wilson, Kassem Fawaz, Somesh Jha. [doi]
- Asymptotically Optimal and Computationally Efficient Average Treatment Effect Estimation in A/B testingVikas Deep, Achal Bassamboo, Sandeep K. Juneja. [doi]
- Conformalized Survival Distributions: A Generic Post-Process to Increase CalibrationShiang Qi, Yakun Yu, Russell Greiner. [doi]
- Privacy-Preserving Data Release Leveraging Optimal Transport and Particle Gradient DescentKonstantin Donhauser, Javier Abad Martinez, Neha Hulkund, Fanny Yang. [doi]
- Learning Decision Policies with Instrumental Variables through Double Machine LearningDaqian Shao, Ashkan Soleymani, Francesco Quinzan, Marta Kwiatkowska. [doi]
- Generalist Equivariant Transformer Towards 3D Molecular Interaction LearningXiangzhe Kong, Wenbing Huang 0001, Yang Liu 0005. [doi]
- DiffAug: Enhance Unsupervised Contrastive Learning with Domain-Knowledge-Free Diffusion-based Data AugmentationZelin Zang, Hao Luo 0004, Kai Wang 0036, Panpan Zhang, Fan Wang 0019, Stan Z. Li, Yang You 0001. [doi]
- Compositional Curvature Bounds for Deep Neural NetworksTaha Entesari, Sina Sharifi, Mahyar Fazlyab. [doi]
- 3D Geometric Shape Assembly via Efficient Point Cloud MatchingNahyuk Lee, Juhong Min, Junha Lee, Seungwook Kim, Kanghee Lee, Jaesik Park, Minsu Cho. [doi]
- Learning Causal Domain-Invariant Temporal Dynamics for Few-Shot Action RecognitionYuke Li, Guangyi Chen 0002, Ben Abramowitz, Stefano Anzellotti, Donglai Wei 0001. [doi]
- Differentially Private Synthetic Data via Foundation Model APIs 2: TextChulin Xie, Zinan Lin 0001, Arturs Backurs, Sivakanth Gopi, Da Yu, Huseyin A. Inan, Harsha Nori, Haotian Jiang, Huishuai Zhang, Yin Tat Lee, Bo Li 0026, Sergey Yekhanin. [doi]
- Equilibrium of Data Markets with ExternalitySafwan Hossain, Yiling Chen 0001. [doi]
- Towards Resource-friendly, Extensible and Stable Incomplete Multi-view ClusteringShengju Yu, Zhibin Dong, Siwei Wang 0001, Xinhang Wan, Yue Liu 0008, Weixuan Liang, Pei Zhang 0008, Wenxuan Tu, Xinwang Liu 0002. [doi]
- Amortized Equation Discovery in Hybrid Dynamical SystemsYongtuo Liu, Sara Magliacane, Miltiadis Kofinas, Stratis Gavves. [doi]
- MS3D: A RG Flow-Based Regularization for GAN Training with Limited DataJian Wang, Xin Lan, Yuxin Tian, Jiancheng Lv 0001. [doi]
- Generative Enzyme Design Guided by Functionally Important Sites and Small-Molecule SubstratesZhenqiao Song, Yunlong Zhao, Wenxian Shi, Wengong Jin, Yang Yang, Lei Li 0005. [doi]
- CaPS: Collaborative and Private Synthetic Data Generation from Distributed SourcesSikha Pentyala, Mayana Pereira, Martine De Cock. [doi]
- To Cool or not to Cool? Temperature Network Meets Large Foundation Models via DROZi-Hao Qiu, Siqi Guo, Mao Xu, Tuo Zhao, Lijun Zhang 0005, Tianbao Yang. [doi]
- Learning the Uncertainty Sets of Linear Control Systems via Set Membership: A Non-asymptotic AnalysisYingying Li, Jing Yu 0010, Lauren Conger, Taylan Kargin, Adam Wierman. [doi]
- Learning from Streaming Data when Users ChooseJinyan Su, Sarah Dean. [doi]
- Mitigating Label Noise on Graphs via Topological Sample SelectionYuhao Wu, Jiangchao Yao, Xiaobo Xia, Jun Yu, Ruxin Wang 0002, Bo Han, Tongliang Liu. [doi]
- Prompting a Pretrained Transformer Can Be a Universal ApproximatorAleksandar Petrov, Philip Torr 0001, Adel Bibi. [doi]
- Online Adaptive Anomaly Thresholding with Confidence SequencesSophia Huiwen Sun, Abishek Sankararaman, Balakrishnan Narayanaswamy. [doi]
- Predicting and Interpreting Energy Barriers of Metallic Glasses with Graph Neural NetworksHaoyu Li, Shichang Zhang, Longwen Tang, Mathieu Bauchy, Yizhou Sun. [doi]
- Provable Representation with Efficient Planning for Partially Observable Reinforcement LearningHongming Zhang, Tongzheng Ren, Chenjun Xiao, Dale Schuurmans, Bo Dai 0001. [doi]
- Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space DualityTri Dao, Albert Gu. [doi]
- Towards Understanding Inductive Bias in Transformers: A View From InfinityItay Lavie, Guy Gur-Ari, Zohar Ringel. [doi]
- Vanilla Bayesian Optimization Performs Great in High DimensionsCarl Hvarfner, Erik Orm Hellsten, Luigi Nardi. [doi]
- Coactive Learning for Large Language Models using Implicit User FeedbackAaron David Tucker, Kianté Brantley, Adam Cahall, Thorsten Joachims. [doi]
- Privacy Attacks in Decentralized LearningAbdellah El Mrini, Edwige Cyffers, Aurélien Bellet. [doi]
- Stability and Generalization for Stochastic Recursive Momentum-based Algorithms for (Strongly-)Convex One to K-Level Stochastic OptimizationsXiaokang Pan, Xingyu Li, Jin Liu, Tao Sun, Kai Sun, Lixing Chen, Zhe Qu. [doi]
- Position: Enforced Amnesia as a Way to Mitigate the Potential Risk of Silent Suffering in the Conscious AIYegor Tkachenko. [doi]
- Batch Singular Value Polarization and Weighted Semantic Augmentation for Universal Domain AdaptationWangzi Qi, Wei Wang, Chao Huang 0008, Jie Wen 0001, Cong Wang. [doi]
- MAGNOLIA: Matching Algorithms via GNNs for Online Value-to-go ApproximationAlexandre Hayderi, Amin Saberi, Ellen Vitercik, Anders Wikum. [doi]
- Understanding Finetuning for Factual Knowledge ExtractionGaurav Rohit Ghosal, Tatsunori Hashimoto, Aditi Raghunathan. [doi]
- Continuous Treatment Effects with Surrogate OutcomesZhenghao Zeng, David Arbour, Avi Feller, Raghavendra Addanki, Ryan A. Rossi, Ritwik Sinha, Edward H. Kennedy. [doi]
- Federated Neuro-Symbolic LearningPengwei Xing, Songtao Lu, Han Yu 0001. [doi]
- AD3: Implicit Action is the Key for World Models to Distinguish the Diverse Visual DistractorsYucen Wang, Shenghua Wan, Le Gan, Shuai Feng, De-Chuan Zhan. [doi]
- Highway Value Iteration NetworksYuhui Wang, Weida Li, Francesco Faccio, Qingyuan Wu, Jürgen Schmidhuber. [doi]
- Image Restoration Through Generalized Ornstein-Uhlenbeck BridgeConghan Yue, Zhengwei Peng, Junlong Ma, Shiyan Du, Pengxu Wei, Dongyu Zhang. [doi]
- Perfect Alignment May be Poisonous to Graph Contrastive LearningJingyu Liu, Huayi Tang, Yong Liu 0018. [doi]
- Improved Bounds for Pure Private Agnostic Learning: Item-Level and User-Level PrivacyBo Li 0001, Wei Wang 0030, Peng Ye. [doi]
- Conditional Language Learning with ContextXiao Zhang, Miao Li, Ji Wu. [doi]
- Multi-Patch Prediction: Adapting Language Models for Time Series Representation LearningYuxuan Bian, Xuan Ju, Jiangtong Li, Zhijian Xu, Dawei Cheng, Qiang Xu 0001. [doi]
- Few-shot Adaptation to Distribution Shifts By Mixing Source and Target EmbeddingsYihao Xue, Ali Payani, Yu Yang 0007, Baharan Mirzasoleiman. [doi]
- FrameQuant: Flexible Low-Bit Quantization for TransformersHarshavardhan Adepu, Zhanpeng Zeng, Li Zhang, Vikas Singh. [doi]
- IBD-PSC: Input-level Backdoor Detection via Parameter-oriented Scaling ConsistencyLinshan Hou, Ruili Feng, Zhongyun Hua, Wei Luo 0001, Leo Yu Zhang, Yiming Li. [doi]
- HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust RefusalMantas Mazeika, Long Phan, Xuwang Yin, Andy Zou, Zifan Wang 0001, Norman Mu, Elham Sakhaee, Nathaniel Li, Steven Basart, Bo Li 0026, David A. Forsyth, Dan Hendrycks. [doi]
- Classification under Nuisance Parameters and Generalized Label Shift in Likelihood-Free InferenceLuca Masserano, Alexander Shen, Michele Doro, Tommaso Dorigo, Rafael Izbicki, Ann B. Lee. [doi]
- CaM: Cache Merging for Memory-efficient LLMs InferenceYuxin Zhang 0002, Yuxuan Du, Gen Luo, Yunshan Zhong, Zhenyu Zhang 0015, Shiwei Liu 0003, Rongrong Ji. [doi]
- Inferring Dynamic Networks from Marginals with Iterative Proportional FittingSerina Chang, Frederic Koehler, Zhaonan Qu, Jure Leskovec, Johan Ugander. [doi]
- Benchmarking Deletion Metrics with the Principled ExplanationsYipei Wang, Xiaoqian Wang. [doi]
- Position: Reinforcement Learning in Dynamic Treatment Regimes Needs Critical ReexaminationZhiyao Luo, Yangchen Pan, Peter J. Watkinson, Tingting Zhu 0001. [doi]
- See More Details: Efficient Image Super-Resolution by Experts MiningEduard Zamfir, Zongwei Wu, Nancy Mehta, Yulun Zhang, Radu Timofte. [doi]
- Learning 1-Bit Tiny Object Detector with Discriminative Feature RefinementSheng Xu 0007, Mingze Wang, Yanjing Li, Mingbao Lin, Baochang Zhang 0001, David S. Doermann, Xiao Sun. [doi]
- Understanding Stochastic Natural Gradient Variational InferenceKaiwen Wu, Jacob R. Gardner. [doi]
- Probabilistic Constrained Reinforcement Learning with Formal InterpretabilityYanran Wang, Qiuchen Qian, David Boyle 0001. [doi]
- MALIBO: Meta-learning for Likelihood-free Bayesian OptimizationJiarong Pan, Stefan Falkner, Felix Berkenkamp, Joaquin Vanschoren. [doi]
- Robust Stable Spiking Neural NetworksJianhao Ding, Zhiyu Pan, Yujia Liu, Zhaofei Yu, Tiejun Huang 0001. [doi]
- Equivariance via Minimal Frame Averaging for More Symmetries and EfficiencyYuchao Lin, Jacob Helwig, Shurui Gui, Shuiwang Ji. [doi]
- LLaGA: Large Language and Graph AssistantRunjin Chen, Tong Zhao 0003, Ajay Kumar Jaiswal, Neil Shah, Zhangyang Wang. [doi]
- Moreau Envelope for Nonconvex Bi-Level Optimization: A Single-Loop and Hessian-Free Solution StrategyRisheng Liu, Zhu Liu, Wei Yao 0014, Shangzhi Zeng, Jin Zhang 0002. [doi]
- Improving Neural Additive Models with Bayesian PrinciplesKouroche Bouchiat, Alexander Immer, Hugo Yèche, Gunnar Rätsch, Vincent Fortuin. [doi]
- Et Tu Certifications: Robustness Certificates Yield Better Adversarial ExamplesAndrew C. Cullen, Shijie Liu, Paul Montague, Sarah Monazam Erfani, Benjamin I. P. Rubinstein. [doi]
- Hybrid Neural Representations for Spherical DataHyomin Kim, Yunhui Jang, Jaeho Lee 0001, Sungsoo Ahn. [doi]
- Feature Importance Disparities for Data Bias InvestigationsPeter W. Chang, Leor Fishman, Seth Neel. [doi]
- Matroid Semi-Bandits in Sublinear TimeRuo-Chun Tzeng, Naoto Ohsaka, Kaito Ariu. [doi]
- Enhancing Storage and Computational Efficiency in Federated Multimodal Learning for Large-Scale ModelsZixin Zhang 0004, Fan Qi, Changsheng Xu. [doi]
- Consistent Adversarially Robust Linear Classification: Non-Parametric SettingElvis Dohmatob. [doi]
- Towards Unified Multi-granularity Text Detection with Interactive AttentionXingyu Wan, Chengquan Zhang, Pengyuan Lyu, Sen Fan, Zihan Ni, Kun Yao, Errui Ding, Jingdong Wang 0001. [doi]
- Few-Shot Character Understanding in Movies as an Assessment to Meta-Learning of Theory-of-MindMo Yu, Qiujing Wang, Shunchi Zhang, Yisi Sang, Kangsheng Pu, Zekai Wei, Han Wang, Liyan Xu, Jing Li, Yue Yu, Jie Zhou 0016. [doi]
- Evaluating Quantized Large Language ModelsShiyao Li, Xuefei Ning, Luning Wang, Tengxuan Liu, Xiangsheng Shi, Shengen Yan, Guohao Dai, Huazhong Yang, Yu Wang 0002. [doi]
- Graph Generation with Diffusion MixtureJaehyeong Jo, Dongki Kim, Sung Ju Hwang. [doi]
- Faster Sampling via Stochastic Gradient Proximal SamplerXunpeng Huang, Difan Zou, Hanze Dong, Yian Ma, Tong Zhang 0001. [doi]
- Learning Modality Knowledge Alignment for Cross-Modality TransferWenxuan Ma 0001, Shuang Li 0008, Lincan Cai, Jingxuan Kang. [doi]
- When Representations Align: Universality in Representation Learning DynamicsLoek van Rossem, Andrew M. Saxe. [doi]
- DNCs Require More Planning StepsYara Shamshoum, Nitzan Hodos, Yuval Sieradzki, Assaf Schuster. [doi]
- Agent Instructs Large Language Models to be General Zero-Shot ReasonersNicholas Crispino, Kyle Montgomery, Fankun Zeng, Dawn Song, Chenguang Wang 0001. [doi]
- Connect Later: Improving Fine-tuning for Robustness with Targeted AugmentationsHelen Qu, Sang Michael Xie. [doi]
- A decoder-only foundation model for time-series forecastingAbhimanyu Das, Weihao Kong, Rajat Sen, Yichen Zhou. [doi]
- SelfVC: Voice Conversion With Iterative Refinement using Self TransformationsPaarth Neekhara, Shehzeen Samarah Hussain, Rafael Valle, Boris Ginsburg, Rishabh Ranjan, Shlomo Dubnov, Farinaz Koushanfar, Julian J. McAuley. [doi]
- On a Combinatorial Problem Arising in Machine TeachingJoakim Sunde, Brigt Arve Toppe Håvardstun, Jan Kratochvíl, Jan Arne Telle. [doi]
- Look Ahead or Look Around? A Theoretical Comparison Between Autoregressive and Masked PretrainingQi Zhang, Tianqi Du, Haotian Huang, Yifei Wang, Yisen Wang. [doi]
- Bounded and Uniform Energy-based Out-of-distribution Detection for GraphsShenzhi Yang, Bin Liang, An Liu, Lin Gui, Xingkai Yao, Xiaofang Zhang. [doi]
- Byzantine Resilient and Fast Federated Few-Shot LearningAnkit Pratap Singh, Namrata Vaswani. [doi]
- DPOT: Auto-Regressive Denoising Operator Transformer for Large-Scale PDE Pre-TrainingZhongkai Hao, Chang Su, Songming Liu, Julius Berner, Chengyang Ying, Hang Su 0006, Anima Anandkumar, Jian Song, Jun Zhu 0001. [doi]
- Provable Benefits of Local Steps in Heterogeneous Federated Learning for Neural Networks: A Feature Learning PerspectiveYajie Bao, Michael Crawshaw, Mingrui Liu. [doi]
- One for All: A Universal Generator for Concept Unlearnability via Multi-Modal AlignmentChaochao Chen, Jiaming Zhang, Yuyuan Li, Zhongxuan Han. [doi]
- Adversarial Attacks on Combinatorial Multi-Armed BanditsRishab Balasubramanian, Jiawei Li, Prasad Tadepalli, Huazheng Wang, Qingyun Wu, Haoyu Zhao. [doi]
- Neural operators meet conjugate gradients: The FCG-NO method for efficient PDE solvingAlexander Rudikov, Vladimir Fanaskov, Ekaterina A. Muravleva, Yuri M. Laevsky, Ivan V. Oseledets. [doi]
- Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language ModelsJinhao Li 0004, Haopeng Li, Sarah Monazam Erfani, Lei Feng, James Bailey 0001, Feng Liu. [doi]
- Covert Malicious Finetuning: Challenges in Safeguarding LLM AdaptationDanny Halawi, Alexander Wei 0001, Eric Wallace, Tony Tong Wang, Nika Haghtalab, Jacob Steinhardt. [doi]
- Learning with Partial-Label and Unlabeled Data: A Uniform Treatment for Supervision Redundancy and InsufficiencyYangfan Liu, Jiaqi Lv, Xin Geng 0001, Ning Xu 0009. [doi]
- Iterative Preference Learning from Human Feedback: Bridging Theory and Practice for RLHF under KL-constraintWei Xiong 0015, Hanze Dong, Chenlu Ye, Ziqi Wang, Han Zhong 0001, Heng Ji, Nan Jiang 0008, Tong Zhang 0001. [doi]
- IOI: Invisible One-Iteration Adversarial Attack on No-Reference Image- and Video-Quality MetricsEkaterina Shumitskaya, Anastasia Antsiferova, Dmitriy S. Vatolin. [doi]
- Quantum Positional Encodings for Graph Neural NetworksSlimane Thabet, Mehdi Djellabi, Igor Olegovich Sokolov, Sachin Kasture, Louis-Paul Henry, Loïc Henriet. [doi]
- Explorations of Self-Repair in Language ModelsCody Rushing, Neel Nanda. [doi]
- Improving Accuracy-robustness Trade-off via Pixel Reweighted Adversarial TrainingJiacheng Zhang, Feng Liu 0003, Dawei Zhou 0004, Jingfeng Zhang, Tongliang Liu. [doi]
- Learning Solution-Aware Transformers for Efficiently Solving Quadratic Assignment ProblemZhentao Tan, Yadong Mu. [doi]
- How Flawed Is ECE? An Analysis via Logit SmoothingMuthu Chidambaram, Holden Lee, Colin McSwiggen, Semon Rezchikov. [doi]
- A New Linear Scaling Rule for Private Adaptive Hyperparameter OptimizationAshwinee Panda, Xinyu Tang, Saeed Mahloujifar, Vikash Sehwag, Prateek Mittal. [doi]
- ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane ReflectionsMassimo Bini, Karsten Roth, Zeynep Akata, Anna Khoreva. [doi]
- Reinforcement Learning within Tree Search for Fast Macro PlacementZijie Geng, Jie Wang 0005, Ziyan Liu, Siyuan Xu, Zhentao Tang, Mingxuan Yuan, Jianye Hao, Yongdong Zhang 0001, Feng Wu 0001. [doi]
- Unsupervised Representation Learning of Brain Activity via Bridging Voxel Activity and Functional ConnectivityAli Behrouz, Parsa Delavari, Farnoosh Hashemi. [doi]
- Interaction-based Retrieval-augmented Diffusion Models for Protein-specific 3D Molecule GenerationZhilin Huang, Ling Yang 0006, Xiangxin Zhou, Chujun Qin, Yijie Yu, Xiawu Zheng, Zikun Zhou, Wentao Zhang, Yu Wang, Wenming Yang. [doi]
- Conformal Predictions under Markovian DataFrédéric Zheng, Alexandre Proutière. [doi]
- Flora: Low-Rank Adapters Are Secretly Gradient CompressorsYongchang Hao, Yanshuai Cao, Lili Mou. [doi]
- AI Alignment with Changing and Influenceable Reward FunctionsMicah Carroll, Davis Foote, Anand Siththaranjan, Stuart Russell 0001, Anca D. Dragan. [doi]
- Implicit Compressibility of Overparametrized Neural Networks Trained with Heavy-Tailed SGDYijun Wan, Melih Barsbey, Abdellatif Zaidi, Umut Simsekli. [doi]
- Enhancing Adversarial Robustness in SNNs with Sparse GradientsYujia Liu, Tong Bu, Jianhao Ding, Zecheng Hao, Tiejun Huang 0001, Zhaofei Yu. [doi]
- How Private are DP-SGD Implementations?Lynn Chua, Badih Ghazi, Pritish Kamath, Ravi Kumar 0001, Pasin Manurangsi, Amer Sinha, Chiyuan Zhang. [doi]
- KISA: A Unified Keyframe Identifier and Skill Annotator for Long-Horizon Robotics DemonstrationsLongxin Kou, Fei Ni, Yan Zheng 0002, Jinyi Liu 0002, Yifu Yuan, Zibin Dong, Jianye Hao. [doi]
- Scalable AI Safety via Doubly-Efficient DebateJonah Brown-Cohen, Geoffrey Irving, Georgios Piliouras. [doi]
- Position: Rethinking Post-Hoc Search-Based Neural Approaches for Solving Large-Scale Traveling Salesman ProblemsYifan Xia, Xianliang Yang, Zichuan Liu, Zhihao Liu, Lei Song, Jiang Bian 0002. [doi]
- Collective Certified Robustness against Graph Injection AttacksYuni Lai, Bailin Pan, Kaihuang Chen, Yancheng Yuan, Kai Zhou 0001. [doi]
- Using Uncertainty Quantification to Characterize and Improve Out-of-Domain Learning for PDEsS. Chandra Mouli, Danielle C. Maddix, Shima Alizadeh, Gaurav Gupta, Andrew Stuart, Michael W. Mahoney, Bernie Wang 0001. [doi]
- Monotone Individual FairnessYahav Bechavod. [doi]
- Understanding Server-Assisted Federated Learning in the Presence of Incomplete Client ParticipationHaibo Yang 0001, Peiwen Qiu, Prashant Khanduri, Minghong Fang, Jia Liu 0002. [doi]
- Efficient Mixture Learning in Black-Box Variational InferenceAlexandra Hotti, Oskar Kviman, Ricky Molén, Víctor Elvira, Jens Lagergren. [doi]
- OSN: Infinite Representations of Dynamic 3D Scenes from Monocular VideosZiyang Song, Jinxi Li, Bo Yang 0027. [doi]
- An Improved Finite-time Analysis of Temporal Difference Learning with Deep Neural NetworksZhifa Ke, Zaiwen Wen, Junyu Zhang. [doi]
- Sampling is as easy as keeping the consistency: convergence guarantee for Consistency ModelsJunlong Lyu, Zhitang Chen, Shoubo Feng. [doi]
- Offline Transition Modeling via Contrastive Energy LearningRuifeng Chen 0003, Chengxing Jia, Zefang Huang, Tian-Shuo Liu, Xu-Hui Liu, Yang Yu 0001. [doi]
- BAGEL: Bootstrapping Agents by Guiding Exploration with LanguageShikhar Murty, Christopher D. Manning, Peter Shaw, Mandar Joshi, Kenton Lee. [doi]
- Improved Communication-Privacy Trade-offs in L2 Mean Estimation under Streaming Differential PrivacyWei-Ning Chen, Berivan Isik, Peter Kairouz, Albert No, Sewoong Oh, Zheng Xu 0002. [doi]
- Viewing Transformers Through the Lens of Long Convolutions LayersItamar Zimerman, Lior Wolf. [doi]
- Iterated Denoising Energy Matching for Sampling from Boltzmann DensitiesTara Akhound-Sadegh, Jarrid Rector-Brooks, Avishek Joey Bose, Sarthak Mittal, Pablo Lemos, Cheng-Hao Liu, Marcin Sendera, Siamak Ravanbakhsh, Gauthier Gidel, Yoshua Bengio, Nikolay Malkin, Alexander Tong 0001. [doi]
- DPN: Decoupling Partition and Navigation for Neural Solvers of Min-max Vehicle Routing ProblemsZhi Zheng, Shunyu Yao, Zhenkun Wang, Xialiang Tong, Mingxuan Yuan, Ke Tang. [doi]
- A Geometric Decomposition of Finite Games: Convergence vs. Recurrence under Exponential WeightsDavide Legacci, Panayotis Mertikopoulos, Bary S. R. Pradelski. [doi]
- Trustworthy Actionable PerturbationsJesse Friedbaum, Sudarshan Adiga, Ravi Tandon. [doi]
- StableSSM: Alleviating the Curse of Memory in State-space Models through Stable ReparameterizationShida Wang, Qianxiao Li. [doi]
- MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use CasesZechun Liu, Changsheng Zhao 0002, Forrest N. Iandola, Chen Lai, Yuandong Tian, Igor Fedorov, Yunyang Xiong, Ernie Chang, Yangyang Shi, Raghuraman Krishnamoorthi, Liangzhen Lai, Vikas Chandra. [doi]
- An Image is Worth Multiple Words: Discovering Object Level Concepts using Multi-Concept Prompt LearningChen Jin, Ryutaro Tanno, Amrutha Saseendran, Tom Diethe, Philip Teare. [doi]
- Mol-AE: Auto-Encoder Based Molecular Representation Learning With 3D Cloze Test ObjectiveJunwei Yang, Kangjie Zheng, Siyu Long, Zaiqing Nie, Ming Zhang 0004, Xinyu Dai, Wei-Ying Ma, Hao Zhou 0012. [doi]
- The Fundamental Limits of Least-Privilege LearningTheresa Stadler, Bogdan Kulynych, Michael Gastpar, Nicolas Papernot, Carmela Troncoso. [doi]
- Envisioning Outlier Exposure by Large Language Models for Out-of-Distribution DetectionChentao Cao, Zhun Zhong, Zhanke Zhou, Yang Liu 0018, Tongliang Liu, Bo Han 0003. [doi]
- Beyond the ROC Curve: Classification Trees Using Cost-Optimal Curves, with Application to Imbalanced DatasetsMagzhan Gabidolla, Arman Zharmagambetov, Miguel Á. Carreira-Perpiñán. [doi]
- Differentially Private Decentralized Learning with Random WalksEdwige Cyffers, Aurélien Bellet, Jalaj Upadhyay. [doi]
- C-RAG: Certified Generation Risks for Retrieval-Augmented Language ModelsMintong Kang, Nezihe Merve Gürel, Ning Yu, Dawn Song, Bo Li 0026. [doi]
- Adapting Static Fairness to Sequential Decision-Making: Bias Mitigation Strategies towards Equal Long-term Benefit RateYuancheng Xu, Chenghao Deng, Yanchao Sun, Ruijie Zheng, Xiyao Wang, Jieyu Zhao, Furong Huang. [doi]
- Pruner-Zero: Evolving Symbolic Pruning Metric From Scratch for Large Language ModelsPeijie Dong, Lujun Li, Zhenheng Tang, Xiang Liu, Xinglin Pan, Qiang Wang 0022, Xiaowen Chu 0001. [doi]
- Position: Understanding LLMs Requires More Than Statistical GeneralizationPatrik Reizinger, Szilvia Ujváry, Anna Mészáros, Anna Kerekes, Wieland Brendel, Ferenc Huszár. [doi]
- Multi-View Clustering by Inter-cluster Connectivity Guided RewardHao Dai, Yang Liu, Peng Su, Hecheng Cai, Shudong Huang, Jiancheng Lv 0001. [doi]
- Flexible Residual Binarization for Image Super-ResolutionYulun Zhang, Haotong Qin, Zixiang Zhao, Xianglong Liu 0001, Martin Danelljan, Fisher Yu 0001. [doi]
- Differentially private exact recovery for stochastic block modelsDung Nguyen 0002, Anil Kumar S. Vullikanti. [doi]
- Differentiable Annealed Importance Sampling Minimizes The Jensen-Shannon Divergence Between Initial and Target DistributionJohannes Zenn, Robert Bamler. [doi]
- More Flexible PAC-Bayesian Meta-Learning by Learning Learning AlgorithmsHossein Zakerinia, Amin Behjati, Christoph H. Lampert. [doi]
- Convex and Bilevel Optimization for Neural-Symbolic Inference and LearningCharles Andrew Dickens, Changyu Gao, Connor Pryor, Stephen J. Wright 0001, Lise Getoor. [doi]
- Spectral Phase Transition and Optimal PCA in Block-Structured Spiked ModelsPierre Mergny, Justin Ko, Florent Krzakala. [doi]
- On Convergence of Incremental Gradient for Non-convex Smooth FunctionsAnastasia Koloskova, Nikita Doikov, Sebastian U. Stich, Martin Jaggi. [doi]
- Category-Aware Active Domain AdaptationWenxiao Xiao, Jiuxiang Gu, Hongfu Liu 0001. [doi]
- Behavior Generation with Latent ActionsSeungjae Lee 0001, Yibin Wang 0008, Haritheja Etukuru, H. Jin Kim, Nur Muhammad (Mahi) Shafiullah, Lerrel Pinto. [doi]
- On Learning Deep O(n)-Equivariant HyperspheresPavlo Melnyk, Michael Felsberg, Mårten Wadenbäck, Andreas Robinson, Cuong Le. [doi]
- Hypergraph-enhanced Dual Semi-supervised Graph ClassificationWei Ju, Zhengyang Mao, Siyu Yi, Yifang Qin, Yiyang Gu, Zhiping Xiao 0001, Yifan Wang 0014, Xiao Luo 0001, Ming Zhang 0004. [doi]
- OxyGenerator: Reconstructing Global Ocean Deoxygenation Over a Century with Deep LearningBin Lu 0005, Ze Zhao, Luyu Han, Xiaoying Gan, Yuntao Zhou, Lei Zhou, Luoyi Fu, Xinbing Wang, Chenghu Zhou, Jing Zhang. [doi]
- Denoising Autoregressive Representation LearningYazhe Li, Jörg Bornschein, Ting Chen. [doi]
- Statistical Properties of Robust SatisficingZhiyi Li, Yunbei Xu, Ruohan Zhan. [doi]
- Can a Few Decide for Many? The Metric Distortion of SortitionIoannis Caragiannis, Evi Micha, Jannik Peters 0001. [doi]
- An amortized approach to non-linear mixed-effects modeling based on neural posterior estimationJonas Arruda, Yannik Schälte, Clemens Peiter, Olga Teplytska, Ulrich Jaehde, Jan Hasenauer. [doi]
- Monotone, Bi-Lipschitz, and Polyak-Łojasiewicz NetworksRuigang Wang, Krishnamurthy Dj Dvijotham, Ian R. Manchester. [doi]
- Flextron: Many-in-One Flexible Large Language ModelRuisi Cai, Saurav Muralidharan, Greg Heinrich, Hongxu Yin, Zhangyang Wang, Jan Kautz, Pavlo Molchanov 0001. [doi]
- Mind the Boundary: Coreset Selection via Reconstructing the Decision BoundaryShuo Yang, Zhe Cao, Sheng Guo, Ruiheng Zhang, Ping Luo, Shengping Zhang, Liqiang Nie. [doi]
- Mapping the Multiverse of Latent RepresentationsJeremy Wayland, Corinna Coupette, Bastian Rieck. [doi]
- Learning the Target Network in Function SpaceKavosh Asadi, Yao Liu 0009, Shoham Sabach, Ming Yin, Rasool Fakoor. [doi]
- Provable Privacy with Non-Private Pre-ProcessingYaxi Hu, Amartya Sanyal, Bernhard Schölkopf. [doi]
- Universal Gradient Methods for Stochastic Convex OptimizationAnton Rodomanov, Ali Kavis, Yongtao Wu, Kimon Antonakopoulos, Volkan Cevher. [doi]
- PointMC: Multi-instance Point Cloud Registration based on Maximal CliquesYue Wu 0004, Xidao Hu, Yongzhe Yuan, Xiaolong Fan, Maoguo Gong, Hao Li 0009, Mingyang Zhang 0002, Qiguang Miao, Wenping Ma 0001. [doi]
- Robust Universal Adversarial PerturbationsChangming Xu, Gagandeep Singh 0001. [doi]
- Outlier-Efficient Hopfield Layers for Large Transformer-Based ModelsJerry Yao-Chieh Hu, Pei-Hsuan Chang, Haozheng Luo, Hong-Yu Chen, Weijian Li, Wei-Po Wang, Han Liu. [doi]
- Scaling Rectified Flow Transformers for High-Resolution Image SynthesisPatrick Esser, Sumith Kulal, Andreas Blattmann, Rahim Entezari, Jonas Müller, Harry Saini, Yam Levi, Dominik Lorenz, Axel Sauer, Frederic Boesel, Dustin Podell, Tim Dockhorn, Zion English, Robin Rombach. [doi]
- Position: Foundation Agents as the Paradigm Shift for Decision MakingXiaoqian Liu, Xingzhou Lou, Jianbin Jiao, Junge Zhang. [doi]
- Diving into Underwater: Segment Anything Model Guided Underwater Salient Instance Segmentation and A Large-scale DatasetShijie Lian, Ziyi Zhang, Hua Li 0012, Wenjie Li, Laurence Tianruo Yang, Sam Kwong, Runmin Cong. [doi]
- Deletion-Anticipative Data Selection with a Limited BudgetRachael Hwee Ling Sim, Jue Fan, Xiao Tian, Patrick Jaillet, Bryan Kian Hsiang Low. [doi]
- Premise Order Matters in Reasoning with Large Language ModelsXinyun Chen, Ryan A. Chi, Xuezhi Wang 0002, Denny Zhou. [doi]
- Learning Graph Representation via Graph Entropy MaximizationZiheng Sun, Xudong Wang, Chris Ding, Jicong Fan 0001. [doi]
- Acquiring Diverse Skills using Curriculum Reinforcement Learning with Mixture of ExpertsOnur Celik, Aleksandar Taranovic, Gerhard Neumann. [doi]
- X-Oscar: A Progressive Framework for High-quality Text-guided 3D Animatable Avatar GenerationYiwei Ma, Zhekai Lin, Jiayi Ji, Yijun Fan, Xiaoshuai Sun, Rongrong Ji. [doi]
- On the Minimal Degree Bias in Generalization on the Unseen for non-Boolean FunctionsDenys Pushkin, Raphaël Berthier, Emmanuel Abbe. [doi]
- Winner-takes-all learners are geometry-aware conditional density estimatorsVictor Letzelter, David Perera, Cédric Rommel, Mathieu Fontaine 0002, Slim Essid, Gaël Richard, Patrick Pérez. [doi]
- Smoothness Adaptive Hypothesis Transfer LearningHaotian Lin 0002, Matthew Reimherr. [doi]
- Efficient Precision and Recall Metrics for Assessing Generative Models using Hubness-aware SamplingYuanbang Liang, Jing Wu 0004, Yu-Kun Lai, Yipeng Qin. [doi]
- Adaptively Perturbed Mirror Descent for Learning in GamesKenshi Abe, Kaito Ariu, Mitsuki Sakamoto, Atsushi Iwasaki. [doi]
- Automated Loss function Search for Class-imbalanced Node ClassificationXinyu Guo, Kai Wu, Xiaoyu Zhang, Jing Liu. [doi]
- Discovering Mixtures of Structural Causal Models from Time Series DataSumanth Varambally, Yian Ma, Rose Yu. [doi]
- PruNeRF: Segment-Centric Dataset Pruning via 3D Spatial ConsistencyYeonsung Jung, Heecheol Yun, Joonhyung Park, Jin-Hwa Kim, Eunho Yang. [doi]
- BLO-SAM: Bi-level Optimization Based Finetuning of the Segment Anything Model for Overfitting-Preventing Semantic SegmentationLi Zhang, Youwei Liang, Ruiyi Zhang, Amirhosein Javadi, Pengtao Xie. [doi]
- SpikeZIP-TF: Conversion is All You Need for Transformer-based SNNKang You, Zekai Xu, Chen Nie, Zhijie Deng, Qinghai Guo, Xiang Wang, Zhezhi He. [doi]
- Weisfeiler Leman for Euclidean Equivariant Machine LearningSnir Hordan, Tal Amir, Nadav Dym. [doi]
- Diffusion Language Models Are Versatile Protein LearnersXinyou Wang, Zaixiang Zheng, Fei Ye, Dongyu Xue, Shujian Huang, Quanquan Gu. [doi]
- UPAM: Unified Prompt Attack in Text-to-Image Generation Models Against Both Textual Filters and Visual CheckersDuo Peng, Qiuhong Ke, Jun Liu 0036. [doi]
- REST: Efficient and Accelerated EEG Seizure Analysis through Residual State UpdatesArshia Afzal, Grigorios Chrysos 0002, Volkan Cevher, Mahsa Shoaran. [doi]
- Early Time Classification with Accumulated Accuracy Gap ControlLiran Ringel, Regev Cohen, Daniel Freedman, Michael Elad, Yaniv Romano. [doi]
- Generative Active Learning for Long-tailed Instance SegmentationMuzhi Zhu, Chengxiang Fan, Hao Chen 0041, Yang Liu, Weian Mao, Xiaogang Xu, Chunhua Shen. [doi]
- Parameter-Efficient Fine-Tuning with Discrete Fourier TransformZiqi Gao, Qichao Wang, Aochuan Chen, Zijing Liu, Bingzhe Wu, Liang Chen, Jia Li 0009. [doi]
- Light and Optimal Schrödinger Bridge MatchingNikita Gushchin, Sergei Kholkin, Evgeny Burnaev, Alexander Korotin. [doi]
- Causal Representation Learning Made Identifiable by Grouping of Observational VariablesHiroshi Morioka, Aapo Hyvärinen. [doi]
- Profile Reconstruction from Private SketchesHao Wu, Rasmus Pagh. [doi]
- Chatbot Arena: An Open Platform for Evaluating LLMs by Human PreferenceWei-Lin Chiang, Lianmin Zheng, Ying Sheng 0007, Anastasios Nikolas Angelopoulos, Tianle Li, Dacheng Li, Banghua Zhu, Hao Zhang 0108, Michael I. Jordan, Joseph E. Gonzalez, Ion Stoica. [doi]
- From Coarse to Fine: Enable Comprehensive Graph Self-supervised Learning with Multi-granular Semantic EnsembleQianlong Wen, Mingxuan Ju, Zhongyu Ouyang, Chuxu Zhang, Yanfang Ye 0001. [doi]
- Position: Opportunities Exist for Machine Learning in Magnetic Fusion EnergyLucas Spangher, Allen M. Wang, Andrew Maris, Myles Stapelberg, Viraj Mehta, Alex Saperstein, Stephen Lane-Walsh, Akshata Kishore Moharir, Alessandro Pau, Cristina Rea. [doi]
- Self-Correcting Self-Consuming Loops for Generative Model TrainingNate Gillman, Michael Freeman, Daksh Aggarwal, Chia-Hong Hsu, Calvin Luo, Yonglong Tian, Chen Sun 0002. [doi]
- In-context Learning on Function Classes Unveiled for TransformersZhijie Wang, Bo Jiang, Shuai Li 0010. [doi]
- Learning Shadow Variable Representation for Treatment Effect Estimation under Collider BiasBaohong Li, Haoxuan Li, Ruoxuan Xiong, Anpeng Wu, Fei Wu 0001, Kun Kuang. [doi]
- Position: Will we run out of data? Limits of LLM scaling based on human-generated dataPablo Villalobos, Anson Ho, Jaime Sevilla, Tamay Besiroglu, Lennart Heim, Marius Hobbhahn. [doi]
- COLD-Attack: Jailbreaking LLMs with Stealthiness and ControllabilityXingang Guo, Fangxu Yu, Huan Zhang, Lianhui Qin, Bin Hu 0002. [doi]
- Slicing Mutual Information Generalization Bounds for Neural NetworksKimia Nadjahi, Kristjan H. Greenewald, Rickard Brüel Gabrielsson, Justin Solomon 0001. [doi]
- DNA-SE: Towards Deep Neural-Nets Assisted Semiparametric EstimationQinshuo Liu, Zixin Wang, Xi'an Li, Xinyao Ji, Lei Zhang, Lin Liu, Zhonghua Liu. [doi]
- Overcoming Data and Model heterogeneities in Decentralized Federated Learning via Synthetic AnchorsChun-Yin Huang, Kartik Srinivas, Xin Zhang, Xiaoxiao Li. [doi]
- Improved Modelling of Federated Datasets using Mixtures-of-Dirichlet-MultinomialsJonathan Scott, Áine Cahill. [doi]
- Quality-Weighted Vendi Scores And Their Application To Diverse Experimental DesignQuan Nguyen, Adji Bousso Dieng. [doi]
- Knowledge Transfer from Vision Foundation Models for Efficient Training of Small Task-specific ModelsRaviteja Vemulapalli, Hadi Pouransari, Fartash Faghri, Sachin Mehta, Mehrdad Farajtabar, Mohammad Rastegari, Oncel Tuzel. [doi]
- Optimal Hessian/Jacobian-Free Nonconvex-PL Bilevel OptimizationFeihu Huang. [doi]
- Convergence of Online Learning Algorithm for a Mixture of Multiple Linear RegressionsYujing Liu, Zhixin Liu, Lei Guo 0001. [doi]
- Enabling Few-Shot Learning with PID Control: A Layer Adaptive OptimizerLe Yu, Xinde Li, Pengfei Zhang, Zhentong Zhang, Fir Dunkin. [doi]
- Code as Reward: Empowering Reinforcement Learning with VLMsDavid Venuto, Mohammad Sami Nur Islam, Martin Klissarov, Doina Precup, Sherry Yang, Ankit Anand. [doi]
- Data-efficient Large Vision Models through Sequential AutoregressionZhiwei Hao, Jianyuan Guo, Chengcheng Wang, Yehui Tang, Han Wu, Han Hu 0001, Kai Han 0002, Chang Xu 0002. [doi]
- Offline Actor-Critic Reinforcement Learning Scales to Large ModelsJost Tobias Springenberg, Abbas Abdolmaleki, Jingwei Zhang 0001, Oliver Groth, Michael Bloesch, Thomas Lampe, Philemon Brakel, Sarah Bechtle, Steven Kapturowski, Roland Hafner, Nicolas Heess, Martin A. Riedmiller. [doi]
- Parameterized Physics-informed Neural Networks for Parameterized PDEsWoojin Cho, Minju Jo, Haksoo Lim, Kookjin Lee, Dongeun Lee 0001, Sanghyun Hong 0001, Noseong Park. [doi]
- A Circuit Domain Generalization Framework for Efficient Logic Synthesis in Chip DesignZhihai Wang, Lei Chen 0031, Jie Wang 0005, Yinqi Bai, Xing Li, Xijun Li, Mingxuan Yuan, Jianye Hao, Yongdong Zhang 0001, Feng Wu 0001. [doi]
- Improving Instruction Following in Language Models through Proxy-Based Uncertainty EstimationJoonho Lee, Jae Oh Woo, Juree Seok, Parisa Hassanzadeh, Wooseok Jang, JuYoun Son, Sima Didari, Baruch Gutow, Heng Hao, Hankyu Moon, Wenjun Hu, Yeong-Dae Kwon, Taehee Lee, Seungjai Min. [doi]
- MultiMax: Sparse and Multi-Modal Attention LearningYuxuan Zhou 0004, Mario Fritz, Margret Keuper. [doi]
- Human vs. Generative AI in Content Creation Competition: Symbiosis or Conflict?Fan Yao, Chuanhao Li, Denis Nekipelov, Hongning Wang, Haifeng Xu. [doi]
- Transitional Uncertainty with Layered Intermediate PredictionsRyan Benkert, Mohit Prabhushankar, Ghassan Alregib. [doi]
- From Yes-Men to Truth-Tellers: Addressing Sycophancy in Large Language Models with Pinpoint TuningWei Chen, Zhen Huang, Liang Xie 0003, Binbin Lin, Houqiang Li, Le Lu 0001, Xinmei Tian 0001, Deng Cai 0001, Yonggang Zhang, Wenxiao Wang 0001, Xu Shen, Jieping Ye. [doi]
- Risk-Sensitive Reward-Free Reinforcement Learning with CVaRXinyi Ni, Guanlin Liu, Lifeng Lai. [doi]
- How to Trace Latent Generative Model Generated Images without Artificial Watermark?Zhenting Wang, Vikash Sehwag, Chen Chen 0043, Lingjuan Lyu, Dimitris N. Metaxas, ShiQing Ma. [doi]
- Beyond Individual Input for Deep Anomaly Detection on Tabular DataHugo Thimonier, Fabrice Popineau, Arpad Rimmel, Bich-Liên Doan. [doi]
- Mean-field Chaos Diffusion ModelsSungwoo Park, Dongjun Kim, Ahmed Alaa. [doi]
- LESS: Selecting Influential Data for Targeted Instruction TuningMengzhou Xia, Sadhika Malladi, Suchin Gururangan, Sanjeev Arora, Danqi Chen 0001. [doi]
- Improving Computational Complexity in Statistical Models with Local Curvature InformationPedram Akbarian, Tongzheng Ren, Jiacheng Zhuo, Sujay Sanghavi, Nhat Ho. [doi]
- Learning Coverage Paths in Unknown Environments with Deep Reinforcement LearningArvi Jonnarth, Jie Zhao 0014, Michael Felsberg. [doi]
- ReconBoost: Boosting Can Achieve Modality ReconcilementCong Hua, Qianqian Xu, Shilong Bao, Zhiyong Yang 0001, Qingming Huang. [doi]
- DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete LatentsYilun Xu, Gabriele Corso, Tommi S. Jaakkola, Arash Vahdat, Karsten Kreis. [doi]
- How do Large Language Models Navigate Conflicts between Honesty and Helpfulness?Ryan Liu, Theodore R. Sumers, Ishita Dasgupta 0001, Thomas L. Griffiths 0001. [doi]
- Data-free Neural Representation Compression with Riemannian Neural DynamicsZhengqi Pei, Anran Zhang, Shuhui Wang, Xiangyang Ji, Qingming Huang. [doi]
- CARTE: Pretraining and Transfer for Tabular LearningMyung Jun Kim, Léo Grinsztajn, Gaël Varoquaux. [doi]
- Guiding LLMs The Right Way: Fast, Non-Invasive Constrained GenerationLuca Beurer-Kellner, Marc Fischer 0002, Martin T. Vechev. [doi]
- Intersecting-Boundary-Sensitive Fingerprinting for Tampering Detection of DNN ModelsBai Xiaofan, Chaoxiang He, Xiaojing Ma, Bin Benjamin Zhu, Hai Jin 0001. [doi]
- Discounted Adaptive Online Learning: Towards Better RegularizationZhiyu Zhang 0003, David Bombara, Heng Yang. [doi]
- Language Models as Science TutorsAlexis Chevalier, Jiayi Geng, Alexander Wettig, Howard Chen 0003, Sebastian Mizera, Toni Annala, Max Jameson Aragon, Arturo Rodríguez Fanlo, Simon Frieder, Simon Machado, Akshara Prabhakar, Ellie Thieu, Jiachen T. Wang, Zirui Wang, Xindi Wu, Mengzhou Xia, Wenhan Xia, Jiatong Yu, JunJie Zhu, Zhiyong Jason Ren, Sanjeev Arora, Danqi Chen 0001. [doi]
- Score identity Distillation: Exponentially Fast Distillation of Pretrained Diffusion Models for One-Step GenerationMingyuan Zhou, Huangjie Zheng, Zhendong Wang, Mingzhang Yin, Hai Huang. [doi]
- Accelerating Transformer Pre-training with 2: 4 SparsityYuezhou Hu, Kang Zhao, Weiyu Huang, Jianfei Chen, Jun Zhu. [doi]
- Single-Model Attribution of Generative Models Through Final-Layer InversionMike Laszkiewicz, Jonas Ricker, Johannes Lederer, Asja Fischer. [doi]
- On the Unexpected Effectiveness of Reinforcement Learning for Sequential RecommendationAlvaro Labarca, Denis Parra, Rodrigo Toro Icarte. [doi]
- PairNet: Training with Observed Pairs to Estimate Individual Treatment EffectLokesh Nagalapatti, Pranava Singhal, Avishek Ghosh, Sunita Sarawagi. [doi]
- Tell, Don't Show: Language Guidance Eases Transfer Across Domains in Images and VideosTarun Kalluri, Bodhisattwa Prasad Majumder, Manmohan Chandraker. [doi]
- Machine Vision Therapy: Multimodal Large Language Models Can Enhance Visual Robustness via Denoising In-Context LearningZhuo Huang, Chang Liu 0077, Yinpeng Dong, Hang Su 0006, Shibao Zheng, Tongliang Liu. [doi]
- PAC-Bayesian Generalization Bounds for Knowledge Graph Representation LearningJaejun Lee, Minsung Hwang, Joyce Jiyoung Whang. [doi]
- Mechanistic Neural Networks for Scientific Machine LearningAdeel Pervez, Francesco Locatello, Stratis Gavves. [doi]
- Orthogonal Bootstrap: Efficient Simulation of Input UncertaintyKaizhao Liu, José H. Blanchet, Lexing Ying, Yiping Lu 0001. [doi]
- Why Larger Language Models Do In-context Learning Differently?Zhenmei Shi, Junyi Wei, Zhuoyan Xu, Yingyu Liang. [doi]
- Zeroth-Order Methods for Constrained Nonconvex Nonsmooth Stochastic OptimizationZhuanghua Liu, Cheng Chen, Luo Luo, Bryan Kian Hsiang Low. [doi]
- Causal Representation Learning from Multiple Distributions: A General SettingKun Zhang 0001, Shaoan Xie, Ignavier Ng, Yujia Zheng. [doi]
- Advancing DRL Agents in Commercial Fighting Games: Training, Integration, and Agent-Human AlignmentChen Zhang, Qiang He, Yuan Zhou, Elvis S. Liu, Hong Wang, Jian Zhao 0010, Yang Wang. [doi]
- Feature Reuse and Scaling: Understanding Transfer Learning with Protein Language ModelsFrancesca-Zhoufan Li, Ava P. Amini, Yisong Yue, Kevin K. Yang, Alex Xijie Lu. [doi]
- Position: A Safe Harbor for AI Evaluation and Red TeamingShayne Longpre, Sayash Kapoor, Kevin Klyman, Ashwin Ramaswami, Rishi Bommasani, Borhane Blili-Hamelin, Yangsibo Huang, Aviya Skowron, Zheng Xin Yong, Suhas Kotha, Yi Zeng 0005, Weiyan Shi, Xianjun Yang, Reid Southen, Alexander Robey, Patrick Chao, Diyi Yang, Ruoxi Jia 0001, Daniel Kang, Sandy Pentland, Arvind Narayanan, Percy Liang, Peter Henderson 0002. [doi]
- Watermark Stealing in Large Language ModelsNikola Jovanovic 0001, Robin Staab, Martin T. Vechev. [doi]
- SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch NormalizationJialong Guo, Xinghao Chen 0001, Yehui Tang, Yunhe Wang 0001. [doi]
- Diffusive Gibbs SamplingWenlin Chen, Mingtian Zhang, Brooks Paige, José Miguel Hernández-Lobato, David Barber. [doi]
- Temporal Spiking Neural Networks with Synaptic Delay for Graph ReasoningMingqing Xiao 0002, Yixin Zhu, Di He 0001, Zhouchen Lin. [doi]
- Representing Molecules as Random Walks Over Interpretable GrammarsMichael Sun, Minghao Guo, Weize Yuan, Veronika Thost, Crystal Elaine Owens, Aristotle Franklin Grosz, Sharvaa Selvan, Katelyn Zhou, Hassan Mohiuddin, Benjamin J. Pedretti, Zachary P. Smith, Jie Chen 0007, Wojciech Matusik. [doi]
- Estimating Canopy Height at ScaleJan Pauls, Max Zimmer, Una M. Kelly, Martin Schwartz, Sassan Saatchi, Philippe Ciais, Sebastian Pokutta, Martin Brandt, Fabian Gieseke. [doi]
- Bayesian Optimization of Function Networks with Partial EvaluationsPoompol Buathong, Jiayue Wan, Raul Astudillo, Samuel Daulton, Maximilian Balandat, Peter I. Frazier. [doi]
- Theoretical Guarantees for Variational Inference with Fixed-Variance Mixture of GaussiansTom Huix, Anna Korba, Alain Oliviero Durmus, Eric Moulines. [doi]
- Keep the Momentum: Conservation Laws beyond Euclidean Gradient FlowsSibylle Marcotte, Rémi Gribonval, Gabriel Peyré. [doi]
- Total Variation Distance Meets Probabilistic InferenceArnab Bhattacharyya 0001, Sutanu Gayen, Kuldeep S. Meel, Dimitrios Myrisiotis, A. Pavan 0001, N. V. Vinodchandran. [doi]
- Uncertainty-Aware Reward-Free Exploration with General Function ApproximationJunkai Zhang, Weitong Zhang, Dongruo Zhou, Quanquan Gu. [doi]
- Safe Exploration in Dose Finding Clinical Trials with Heterogeneous ParticipantsIsabel Chien, Wessel P. Bruinsma, Javier González Hernández, Richard E. Turner. [doi]
- Principled Gradient-Based MCMC for Conditional Sampling of TextLi Du, Afra Amini, Lucas Torroba Hennigen, Xinyan Velocity Yu, Holden Lee, Jason Eisner, Ryan Cotterell. [doi]
- One Prompt is not Enough: Automated Construction of a Mixture-of-Expert PromptsRuochen Wang, Sohyun An, Minhao Cheng, Tianyi Zhou, Sung Ju Hwang, Cho-Jui Hsieh. [doi]
- Careful with that Scalpel: Improving Gradient Surgery with an EMAYu-Guan Hsieh, James Thornton, Eugène Ndiaye, Michal Klein, Marco Cuturi, Pierre Ablin. [doi]
- Self-Supervised Interpretable End-to-End Learning via Latent Functional ModularityHyunki Seong, David Hyunchul Shim. [doi]
- Time Series Diffusion in the Frequency DomainJonathan Crabbé, Nicolas Huynh, Jan Stanczuk, Mihaela van der Schaar. [doi]
- Rolling Diffusion ModelsDavid Ruhe, Jonathan Heek, Tim Salimans, Emiel Hoogeboom. [doi]
- Feedback Loops With Language Models Drive In-Context Reward HackingAlexander Pan, Erik Jones, Meena Jagadeesan, Jacob Steinhardt. [doi]
- From Neurons to Neutrons: A Case Study in InterpretabilityOuail Kitouni, Niklas Nolte, Víctor Samuel Pérez-Díaz, Sokratis Trifinopoulos, Mike Williams. [doi]
- A Dynamical Model of Neural Scaling LawsBlake Bordelon, Alexander B. Atanasov, Cengiz Pehlevan. [doi]
- Self-Composing Policies for Scalable Continual Reinforcement LearningMikel Malagon, Josu Ceberio, José Antonio Lozano 0001. [doi]
- Learning High-Frequency Functions Made Easy with Sinusoidal Positional EncodingChuanhao Sun, Zhihang Yuan, Kai Xu, Luo Mai, N. Siddharth, Shuo Chen, Mahesh K. Marina. [doi]
- A New Robust Partial p-Wasserstein-Based Metric for Comparing DistributionsSharath Raghvendra, Pouyan Shirzadian, Kaiyi Zhang 0004. [doi]
- Completing Visual Objects via Bridging Generation and SegmentationXiang Li 0106, Yinpeng Chen, Chung-Ching Lin, Hao Chen 0102, Kai Hu 0010, Rita Singh, Bhiksha Raj, Lijuan Wang, Zicheng Liu 0001. [doi]
- Hidden Traveling Waves bind Working Memory Variables in Recurrent Neural NetworksArjun Karuvally, Terrence J. Sejnowski, Hava T. Siegelmann. [doi]
- Iterative Regularized Policy Optimization with Imperfect DemonstrationsXudong Gong, Dawei Feng, Kele Xu, Yuanzhao Zhai, Chengkang Yao, Weijia Wang, Bo Ding, Huaimin Wang. [doi]
- On the Diminishing Returns of Width for Continual LearningEtash Kumar Guha, Vihan Lakshman. [doi]
- Understanding Inter-Concept Relationships in Concept-Based ModelsNaveen Raman, Mateo Espinosa Zarlenga, Mateja Jamnik. [doi]
- Fast-Slow Test-Time Adaptation for Online Vision-and-Language NavigationJunyu Gao 0002, Xuan Yao, Changsheng Xu. [doi]
- AttNS: Attention-Inspired Numerical Solving For Limited Data ScenariosZhongzhan Huang, Mingfu Liang, ShanShan Zhong, Liang Lin. [doi]
- Short-Long Convolutions Help Hardware-Efficient Linear Attention to Focus on Long SequencesZicheng Liu 0006, Siyuan Li, Li Wang, Zedong Wang, Yunfan Liu 0002, Stan Z. Li. [doi]
- Finite Volume Features, Global Geometry Representations, and Residual Training for Deep Learning-based CFD SimulationLoh Sher En Jessica, Naheed Anjum Arafat, Wei Xian Lim, Wai Lee Chan, Adams Wai-Kin Kong. [doi]
- Risk-Sensitive Policy Optimization via Predictive CVaR Policy GradientJu-Hyun Kim, Seungki Min. [doi]
- A Universal Class of Sharpness-Aware Minimization AlgorithmsBehrooz Tahmasebi, Ashkan Soleymani, Dara Bahri, Stefanie Jegelka, Patrick Jaillet. [doi]
- Accelerated Policy Gradient for s-rectangular Robust MDPs with Large State SpacesZiyi Chen, Heng Huang. [doi]
- Position: The No Free Lunch Theorem, Kolmogorov Complexity, and the Role of Inductive Biases in Machine LearningMicah Goldblum, Marc Anton Finzi, Keefer Rowan, Andrew Gordon Wilson. [doi]
- One Size Fits All for Semantic Shifts: Adaptive Prompt Tuning for Continual LearningDoyoung Kim, Susik Yoon, Dongmin Park, YoungJun Lee, Hwanjun Song, Jihwan Bang, Jae-Gil Lee. [doi]
- The Effect of Weight Precision on the Neuron Count in Deep ReLU NetworksSonghua He, Periklis A. Papakonstantinou. [doi]
- AttnLRP: Attention-Aware Layer-Wise Relevance Propagation for TransformersReduan Achtibat, Sayed Mohammad Vakilzadeh Hatefi, Maximilian Dreyer, Aakriti Jain, Thomas Wiegand, Sebastian Lapuschkin, Wojciech Samek. [doi]
- Improving Diffusion Models for Inverse Problems Using Optimal Posterior CovarianceXinyu Peng, Ziyang Zheng, Wenrui Dai, Nuoqian Xiao, Chenglin Li, Junni Zou, Hongkai Xiong. [doi]
- Diffusion Posterior Sampling is Computationally IntractableShivam Gupta 0002, Ajil Jalal, Aditya Parulekar, Eric Price 0001, Zhiyang Xun. [doi]
- Restoring balance: principled under/oversampling of data for optimal classificationEmanuele Loffredo, Mauro Pastore, Simona Cocco, Rémi Monasson. [doi]
- Adaptive Group Personalization for Federated Mutual Transfer LearningHaoqing Xu, Dian Shen, Meng Wang, Beilun Wang. [doi]
- Learning with Complementary Labels Revisited: The Selected-Completely-at-Random Setting Is More PracticalWei Wang, Takashi Ishida 0001, Yu-Jie Zhang, Gang Niu 0001, Masashi Sugiyama. [doi]
- Physics of Language Models: Part 3.1, Knowledge Storage and ExtractionZeyuan Allen Zhu, Yuanzhi Li. [doi]
- KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV CacheZirui Liu, Jiayi Yuan, Hongye Jin, Shaochen Zhong, Zhaozhuo Xu, Vladimir Braverman, Beidi Chen, Xia Hu 0001. [doi]
- Swallowing the Bitter Pill: Simplified Scalable Conformer GenerationYuyang Wang, Ahmed A. A. Elhag, Navdeep Jaitly, Joshua M. Susskind, Miguel Ángel Bautista 0001. [doi]
- Multi-group Learning for Hierarchical GroupsSamuel Deng, Daniel Hsu 0001. [doi]
- Generalization Analysis of Deep Non-linear Matrix CompletionAntoine Ledent, Rodrigo Alves. [doi]
- A Diffusion Model Framework for Unsupervised Neural Combinatorial OptimizationSebastian Sanokowski, Sepp Hochreiter, Sebastian Lehner. [doi]
- SuDA: Support-based Domain Adaptation for Sim2Real Hinge Joint Tracking with Flexible SensorsJiawei Fang, Haishan Song, Chengxu Zuo, Xiaoxia Gao, Xiaowei Chen, Shihui Guo, Yipeng Qin. [doi]
- Reason for Future, Act for Now: A Principled Architecture for Autonomous LLM AgentsZhihan Liu, Hao Hu, Shenao Zhang, Hongyi Guo, Shuqi Ke, Boyi Liu, Zhaoran Wang 0001. [doi]
- Patchscopes: A Unifying Framework for Inspecting Hidden Representations of Language ModelsAsma Ghandeharioun, Avi Caciularu, Adam Pearce, Lucas Dixon, Mor Geva. [doi]
- A Global Geometric Analysis of Maximal Coding Rate ReductionPeng Wang 0098, Huikang Liu, Druv Pai, Yaodong Yu, Zhihui Zhu, Qing Qu 0001, Yi Ma 0001. [doi]
- LQER: Low-Rank Quantization Error Reconstruction for LLMsCheng Zhang, Jianyi Cheng, George Anthony Constantinides, Yiren Zhao. [doi]
- Understanding and Diagnosing Deep Reinforcement LearningEzgi Korkmaz. [doi]
- Exploration-Driven Policy Optimization in RLHF: Theoretical Insights on Efficient Data UtilizationYihan Du, Anna Winnicki, Gal Dalal, Shie Mannor, R. Srikant 0001. [doi]
- Ensemble Pruning for Out-of-distribution GeneralizationFengchun Qiao, Xi Peng 0005. [doi]
- Physics and Lie symmetry informed Gaussian processesDavid Dalton, Dirk Husmeier, Hao Gao 0002. [doi]
- Data-Efficient Molecular Generation with Hierarchical Textual InversionSeojin Kim, Jaehyun Nam, Sihyun Yu, Younghoon Shin, Jinwoo Shin. [doi]
- Probability Distribution of Hypervolume Improvement in Bi-objective Bayesian OptimizationHao Wang 0025, Kaifeng Yang, Michael Affenzeller. [doi]
- Lessons from Generalization Error Analysis of Federated Learning: You May Communicate Less Often!Milad Sefidgaran, Romain Chor, Abdellatif Zaidi, Yijun Wan. [doi]
- Synergistic Integration of Coordinate Network and Tensorial Feature for Improving Neural Radiance Fields from Sparse InputsMingyu Kim 0002, Jun-Seong Kim, Se-Young Yun, Jin-Hwa Kim. [doi]
- Simultaneous identification of models and parameters of scientific simulatorsCornelius Schröder, Jakob H. Macke. [doi]
- Faster Maximum Inner Product Search in High DimensionsMo Tiwari, Ryan Kang, Jaeyong Lee, Donghyun Lee, Christopher Piech, Sebastian Thrun, Ilan Shomorony, Martin Jinye Zhang. [doi]
- QBMK: Quantum-based Matching Kernels for Un-attributed GraphsLu Bai 0001, Lixin Cui, Ming Li 0065, Yue Wang 0014, Edwin R. Hancock. [doi]
- Translating Subgraphs to Nodes Makes Simple GNNs Strong and Efficient for Subgraph Representation LearningDongkwan Kim 0001, Alice Oh. [doi]
- Align Your Steps: Optimizing Sampling Schedules in Diffusion ModelsAmirmojtaba Sabour, Sanja Fidler, Karsten Kreis. [doi]
- Fewer Truncations Improve Language ModelingHantian Ding, Zijian Wang 0002, Giovanni Paolini, Varun Kumar, Anoop Deoras, Dan Roth, Stefano Soatto. [doi]
- Curated LLM: Synergy of LLMs and Data Curation for tabular augmentation in low-data regimesNabeel Seedat, Nicolas Huynh, Boris van Breugel, Mihaela van der Schaar. [doi]
- Improving Sharpness-Aware Minimization by LookaheadRunsheng Yu, Youzhi Zhang 0001, James T. Kwok. [doi]
- ReLU Network with Width d+O(1) Can Achieve Optimal Approximation RateChenghao Liu, Minghua Chen. [doi]
- Improving Context Understanding in Multimodal Large Language Models via Multimodal Composition LearningWei Li, Hehe Fan, Yongkang Wong, Yi Yang 0001, Mohan S. Kankanhalli. [doi]
- By Tying Embeddings You Are Assuming the Distributional HypothesisFrancesco Bertolotti, Walter Cazzola. [doi]
- Position: Social Choice Should Guide AI Alignment in Dealing with Diverse Human FeedbackVincent Conitzer, Rachel Freedman, Jobst Heitzig, Wesley H. Holliday, Bob M. Jacobs, Nathan Lambert 0001, Milan Mossé, Eric Pacuit, Stuart Russell 0001, Hailey Schoelkopf, Emanuel Tewolde, William S. Zwicker. [doi]
- ProtoGate: Prototype-based Neural Networks with Global-to-local Feature Selection for Tabular Biomedical DataXiangjian Jiang, Andrei Margeloiu, Nikola Simidjievski, Mateja Jamnik. [doi]
- Neuroexplicit Diffusion Models for Inpainting of Optical Flow FieldsTom Fischer, Pascal Peter, Joachim Weickert, Eddy Ilg. [doi]
- Weisfeiler-Leman at the margin: When more expressivity mattersBilly Joe Franks, Christopher Morris 0001, Ameya Velingker, Floris Geerts. [doi]
- Long-Tail Learning with Foundation Model: Heavy Fine-Tuning HurtsJiang-Xin Shi, Tong Wei 0001, Zhi Zhou 0007, Jie-Jing Shao, Xin-Yan Han, Yufeng Li 0008. [doi]
- Why Do You Grok? A Theoretical Analysis on Grokking Modular AdditionMohamad Amin Mohamadi, Zhiyuan Li, Lei Wu, Danica J. Sutherland. [doi]
- Exact Conversion of In-Context Learning to Model Weights in Linearized-Attention TransformersBrian K. Chen, Tianyang Hu, Hui Jin, Hwee Kuan Lee, Kenji Kawaguchi. [doi]
- Policy-conditioned Environment Models are More GeneralizableRuifeng Chen 0003, Xiong-Hui Chen, Yihao Sun, Siyuan Xiao, Minhui Li, Yang Yu 0001. [doi]
- Parameter-Dependent Competitive Analysis for Online Capacitated Coverage Maximization through Boostings and AttenuationsPan Xu 0001. [doi]
- The Computational Complexity of Finding Second-Order Stationary PointsAndreas Kontogiannis, Vasilis Pollatos, Sotiris Kanellopoulos, Panayotis Mertikopoulos, Aris Pagourtzis, Ioannis Panageas. [doi]
- Going beyond Compositions, DDPMs Can Produce Zero-Shot InterpolationsJustin Deschenaux, Igor Krawczuk, Grigorios Chrysos 0002, Volkan Cevher. [doi]
- Factored-Reward Bandits with Intermediate ObservationsMarco Mussi, Simone Drago, Marcello Restelli, Alberto Maria Metelli. [doi]
- Counterfactual Reasoning for Multi-Label Image Classification via Patching-Based TrainingMing-Kun Xie, Jiahao Xiao, Pei Peng, Gang Niu 0001, Masashi Sugiyama, Sheng-Jun Huang. [doi]
- RICE: Breaking Through the Training Bottlenecks of Reinforcement Learning with ExplanationZelei Cheng, Xian Wu, Jiahao Yu 0001, Sabrina Yang, Gang Wang 0011, Xinyu Xing. [doi]
- Inverse-Variance Weighting for Estimation of Heterogeneous Treatment EffectsAaron Fisher. [doi]
- Stochastic Optimization with Arbitrary Recurrent Data SamplingWilliam G. Powell, Hanbaek Lyu. [doi]
- EiG-Search: Generating Edge-Induced Subgraphs for GNN Explanation in Linear TimeShengyao Lu, Bang Liu, Keith G. Mills, Jiao He, Di Niu. [doi]
- Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially FastXiangming Gu, Xiaosen Zheng, Tianyu Pang, Chao Du, Qian Liu, Ye Wang 0007, Jing Jiang 0001, Min Lin. [doi]
- Two Heads Are Better Than One: Boosting Graph Sparse Training via Semantic and Topological AwarenessGuibin Zhang, Yanwei Yue, Kun Wang, Junfeng Fang, Yongduo Sui, Kai Wang 0036, Yuxuan Liang, Dawei Cheng, Shirui Pan, Tianlong Chen. [doi]
- How do Transformers Perform In-Context Autoregressive Learning ?Michael Eli Sander, Raja Giryes, Taiji Suzuki, Mathieu Blondel, Gabriel Peyré. [doi]
- Structure-based drug design by denoising voxel gridsPedro O. Pinheiro, Arian Rokkum Jamasb, Omar Mahmood, Vishnu Sresht, Saeed Saremi. [doi]
- Pricing with Contextual Elasticity and Heteroscedastic ValuationJianyu Xu, Yu-Xiang Wang. [doi]
- Differentiable Combinatorial Scheduling at ScaleMingju Liu, Yingjie Li, Jiaqi Yin, Zhiru Zhang, Cunxi Yu. [doi]
- Graph Distillation with Eigenbasis MatchingYang Liu, Deyu Bo, Chuan Shi. [doi]
- Revisiting Inexact Fixed-Point Iterations for Min-Max Problems: Stochasticity and Structured NonconvexityAhmet Alacaoglu, Donghwan Kim, Stephen J. Wright 0001. [doi]
- Task Groupings Regularization: Data-Free Meta-Learning with Heterogeneous Pre-trained ModelsYongxian Wei, Zixuan Hu, Li Shen 0008, Zhenyi Wang, Yu Li, Chun Yuan, Dacheng Tao. [doi]
- Chain of Code: Reasoning with a Language Model-Augmented Code EmulatorChengshu Li 0002, Jacky Liang, Andy Zeng, Xinyun Chen, Karol Hausman, Dorsa Sadigh, Sergey Levine, Li Fei-Fei 0001, Fei Xia, Brian Ichter. [doi]
- Local Feature Selection without Label or Feature Leakage for Interpretable Machine Learning PredictionsHarrie Oosterhuis, Lijun Lyu, Avishek Anand. [doi]
- ACE: Off-Policy Actor-Critic with Causality-Aware Entropy RegularizationTianying Ji, Yongyuan Liang, Yan Zeng 0002, Yu Luo, Guowei Xu, Jiawei Guo, Ruijie Zheng, Furong Huang, Fuchun Sun 0001, Huazhe Xu. [doi]
- Absolute Policy Optimization: Enhancing Lower Probability Bound of Performance with High ConfidenceWeiye Zhao, Feihan Li, Yifan Sun, Rui Chen 0030, Tianhao Wei, Changliu Liu. [doi]
- Sign is Not a Remedy: Multiset-to-Multiset Message Passing for Learning on Heterophilic GraphsLangzhang Liang, Sunwoo Kim, Kijung Shin, Zenglin Xu, Shirui Pan, Yuan Qi 0001. [doi]
- Stability and Generalization of Stochastic Compositional Gradient Descent AlgorithmsMing Yang, Xiyuan Wei, Tianbao Yang, Yiming Ying. [doi]
- Understanding Diffusion Models by Feynman's Path IntegralYuji Hirono, Akinori Tanaka, Kenji Fukushima. [doi]
- Unsupervised Domain Adaptation for Anatomical Structure Detection in Ultrasound ImagesBin Pu, Xingguo Lv, Jiewen Yang, Guannan He, Xingbo Dong, Yiqun Lin, Shengli Li 0001, Tan Ying, Fei Liu, Ming Chen, Zhe Jin, Kenli Li 0001, Xiaomeng Li 0001. [doi]
- ALERT-Transformer: Bridging Asynchronous and Synchronous Machine Learning for Real-Time Event-based Spatio-Temporal DataCarmen Martin-Turrero, Maxence Bouvier, Manuel Breitenstein, Pietro Zanuttigh, Vincent Parret. [doi]
- Proactive DP: A Multiple Target Optimization Framework for DP-SGDMarten van Dijk, Nhuong V. Nguyen, Toan N. Nguyen, Lam M. Nguyen, Phuong Ha Nguyen. [doi]
- On the Feasibility of Single-Pass Full-Capacity Learning in Linear Threshold Neurons with Binary Input VectorsRuipeng Liu, Borui He, Naveed Tahir, Garrett E. Katz. [doi]
- From Self-Attention to Markov Models: Unveiling the Dynamics of Generative TransformersMuhammed Emrullah Ildiz, Yixiao Huang 0004, Yingcong Li, Ankit Singh Rawat, Samet Oymak. [doi]
- Soft Prompt Recovers Compressed LLMs, TransferablyZhaozhuo Xu, Zirui Liu, Beidi Chen, Shaochen Zhong, Yuxin Tang, Jue Wang, Kaixiong Zhou, Xia Hu 0001, Anshumali Shrivastava. [doi]
- A Study of First-Order Methods with a Deterministic Relative-Error Gradient OracleNadav Hallak, Kfir Yehuda Levy. [doi]
- Pedestrian Attribute Recognition as Label-balanced Multi-label LearningYibo Zhou, Hai-Miao Hu, Yirong Xiang, Xiaokang Zhang, Haotian Wu. [doi]
- Partially Stochastic Infinitely Deep Bayesian Neural NetworksSergio Calvo-Ordoñez, Matthieu Meunier, Francesco Piatti, Yuantao Shi. [doi]
- Principled Preferential Bayesian OptimizationWenjie Xu, Wenbin Wang, Yuning Jiang, Bratislav Svetozarevic, Colin N. Jones. [doi]
- Vectorized Conditional Neural Fields: A Framework for Solving Time-dependent Parametric Partial Differential EquationsJan Hagnberger, Marimuthu Kalimuthu, Daniel Musekamp, Mathias Niepert. [doi]
- One Meta-tuned Transformer is What You Need for Few-shot LearningXu Yang, Huaxiu Yao, Ying Wei. [doi]
- HGCN2SP: Hierarchical Graph Convolutional Network for Two-Stage Stochastic ProgrammingYang Wu, Yifan Zhang, Zhenxing Liang, Jian Cheng. [doi]
- Pausing Policy Learning in Non-stationary Reinforcement LearningHyunin Lee, Ming Jin 0002, Javad Lavaei, Somayeh Sojoudi. [doi]
- Accelerating Convergence in Bayesian Few-Shot ClassificationTianjun Ke, Haoqun Cao, Feng Zhou. [doi]
- GiLOT: Interpreting Generative Language Models via Optimal TransportXuhong Li 0002, Jiamin Chen, Yekun Chai, Haoyi Xiong. [doi]
- Model-Free Robust ϕ-Divergence Reinforcement Learning Using Both Offline and Online DataKishan Panaganti, Adam Wierman, Eric Mazumdar. [doi]
- Merging Multi-Task Models via Weight-Ensembling Mixture of ExpertsAnke Tang, Li Shen 0008, Yong Luo 0002, Nan Yin, Lefei Zhang, Dacheng Tao. [doi]
- On the Second-Order Convergence of Biased Policy Gradient AlgorithmsSiqiao Mu, Diego Klabjan. [doi]
- Complexity Matters: Feature Learning in the Presence of Spurious CorrelationsGuanwen Qiu, Da Kuang, Surbhi Goel. [doi]
- Surface-VQMAE: Vector-quantized Masked Auto-encoders on Molecular SurfacesFang Wu, Stan Z. Li. [doi]
- MC-GTA: Metric-Constrained Model-Based Clustering using Goodness-of-fit Tests with AutocorrelationsZhangyu Wang, Gengchen Mai, Krzysztof Janowicz, Ni Lao. [doi]
- DoRA: Weight-Decomposed Low-Rank AdaptationShih-Yang Liu, Chien-Yi Wang, Hongxu Yin, Pavlo Molchanov 0001, Yu-Chiang Frank Wang, Kwang-Ting Cheng, Min-Hung Chen. [doi]
- InterLUDE: Interactions between Labeled and Unlabeled Data to Enhance Semi-Supervised LearningZhe Huang, Xiaowei Yu, Dajiang Zhu, Michael C. Hughes. [doi]
- Dynamic Memory Compression: Retrofitting LLMs for Accelerated InferencePiotr Nawrot, Adrian Lancucki, Marcin Chochowski, David Tarjan, Edoardo M. Ponti. [doi]
- Global Reinforcement Learning : Beyond Linear and Convex Rewards via Submodular Semi-gradient MethodsRiccardo De Santi, Manish Prajapat, Andreas Krause 0001. [doi]
- Junk DNA Hypothesis: Pruning Small Pre-Trained Weights Irreversibly and Monotonically Impairs "Difficult" Downstream Tasks in LLMsLu Yin 0006, Ajay Kumar Jaiswal, Shiwei Liu 0003, Souvik Kundu 0009, Zhangyang Wang. [doi]
- Learning to Remove Cuts in Integer Linear ProgrammingPol Puigdemont, Stratis Skoulakis, Grigorios Chrysos 0002, Volkan Cevher. [doi]
- DISCRET: Synthesizing Faithful Explanations For Treatment Effect EstimationYinjun Wu, Mayank Keoliya, Kan Chen, Neelay Velingker, Ziyang Li, Emily J. Getzen, Qi Long, Mayur Naik, Ravi B. Parikh, Eric Wong 0001. [doi]
- Convergence and Trade-Offs in Riemannian Gradient Descent and Riemannian Proximal PointDavid Martínez-Rubio, Christophe Roux, Sebastian Pokutta. [doi]
- Reward Model Learning vs. Direct Policy Optimization: A Comparative Analysis of Learning from Human PreferencesAndi Nika, Debmalya Mandal, Parameswaran Kamalaruban, Georgios Tzannetos, Goran Radanovic, Adish Singla. [doi]
- Constrained Reinforcement Learning Under Model MismatchZhongchang Sun, Sihong He, Fei Miao, Shaofeng Zou. [doi]
- Position: A Roadmap to Pluralistic AlignmentTaylor Sorensen, Jared Moore, Jillian Fisher, Mitchell L. Gordon, Niloofar Mireshghallah, Christopher Michael Rytting, Andre Ye, Liwei Jiang, Ximing Lu, Nouha Dziri, Tim Althoff, Yejin Choi 0001. [doi]
- Mean Estimation in the Add-Remove Model of Differential PrivacyAlex Kulesza, Ananda Theertha Suresh, Yuyan Wang. [doi]
- PAGER: Accurate Failure Characterization in Deep Regression ModelsJayaraman J. Thiagarajan, Vivek Sivaraman Narayanaswamy, Puja Trivedi, Rushil Anirudh. [doi]
- Nonparametric Teaching of Implicit Neural RepresentationsChen Zhang, Steven Tin Sui Luo, Jason Chun Lok Li, Yik-Chung Wu, Ngai Wong. [doi]
- Entropy-Reinforced Planning with Large Language Models for Drug DiscoveryXuefeng Liu, Chih-chan Tien, Peng Ding, SongHao Jiang, Rick L. Stevens. [doi]
- Online Learning and Information Exponents: The Importance of Batch size & Time/Complexity TradeoffsLuca Arnaboldi 0002, Yatin Dandi, Florent Krzakala, Bruno Loureiro, Luca Pesce, Ludovic Stephan. [doi]
- Reward Shaping for Reinforcement Learning with An Assistant Reward AgentHaozhe Ma, Kuankuan Sima, Thanh Vinh Vo, Di Fu, Tze-Yun Leong. [doi]
- Overcoming the Optimizer's Curse: Obtaining Realistic Prescriptions from Neural NetworksAsterios Tsiourvas, Georgia Perakis. [doi]
- An LLM Compiler for Parallel Function CallingSehoon Kim, Suhong Moon, Ryan Tabrizi, Nicholas Lee, Michael W. Mahoney, Kurt Keutzer, Amir Gholami. [doi]
- The Illusion of State in State-Space ModelsWilliam Merrill, Jackson Petty, Ashish Sabharwal. [doi]
- Decomposing and Editing Predictions by Modeling Model ComputationHarshay Shah, Andrew Ilyas, Aleksander Madry. [doi]
- Connecting the Dots: Collaborative Fine-tuning for Black-Box Vision-Language ModelsZhengbo Wang, Jian Liang, Ran He 0001, Zilei Wang, Tieniu Tan. [doi]
- No Double Descent in Principal Component Regression: A High-Dimensional AnalysisDaniel Gedon, Antônio H. Ribeiro, Thomas B. Schön. [doi]
- Differentially Private Representation Learning via Image CaptioningTom Sander, Yaodong Yu, Maziar Sanjabi, Alain Oliviero Durmus, Yi Ma 0001, Kamalika Chaudhuri, Chuan Guo. [doi]
- Tuning-Free Stochastic OptimizationAhmed Khaled 0001, Chi Jin 0001. [doi]
- Large Language Models are Geographically BiasedRohin Manvi, Samar Khanna, Marshall Burke, David B. Lobell, Stefano Ermon. [doi]
- RoboCodeX: Multimodal Code Generation for Robotic Behavior SynthesisYao Mu 0001, Junting Chen, Qinglong Zhang, Shoufa Chen, Qiaojun Yu, Chongjian Ge, Runjian Chen, Zhixuan Liang, Mengkang Hu, Chaofan Tao, Peize Sun, Haibao Yu, Chao Yang 0026, Wenqi Shao, Wenhai Wang, Jifeng Dai, Yu Qiao 0001, Mingyu Ding, Ping Luo 0002. [doi]
- Transolver: A Fast Transformer Solver for PDEs on General GeometriesHaixu Wu, Huakun Luo, Haowen Wang, Jianmin Wang 0001, Mingsheng Long. [doi]
- FAFE: Immune Complex Modeling with Geodesic Distance Loss on Noisy Group FramesRuidong Wu, Ruihan Guo, Rui Wang, Shitong Luo, Yue Xu, Jiahan Li, Jianzhu Ma, Qiang Liu 0001, Yunan Luo, Jian Peng 0001. [doi]
- Reweighted Solutions for Weighted Low Rank ApproximationDavid P. Woodruff, Taisuke Yasuda 0002. [doi]
- Community-Invariant Graph Contrastive LearningShiyin Tan, Dongyuan Li, Renhe Jiang, Ying Zhang 0065, Manabu Okumura. [doi]
- Counterfactual Image EditingYushu Pan, Elias Bareinboim. [doi]
- Easing Concept Bleeding in Diffusion via Entity Localization and AnchoringJiewei Zhang, Song Guo 0001, Peiran Dong, Jie Zhang, Ziming Liu, Yue Yu, Xiao-Ming Wu. [doi]
- Centralized Selection with Preferences in the Presence of BiasesL. Elisa Celis, Amit Kumar 0001, Nisheeth K. Vishnoi, Andrew Xu. [doi]
- Plug-and-Play image restoration with Stochastic deNOising REgularizationMarien Renaud, Jean Prost, Arthur Leclaire, Nicolas Papadakis. [doi]
- Towards Compositionality in Concept LearningAdam Stein, Aaditya Naik, Yinjun Wu, Mayur Naik, Eric Wong 0001. [doi]
- Reflected Flow MatchingTianyu Xie 0001, Yu Zhu 0004, Longlin Yu, Tong Yang, Ziheng Cheng, Shiyue Zhang, Xiangyu Zhang, Cheng Zhang. [doi]
- Monitoring AI-Modified Content at Scale: A Case Study on the Impact of ChatGPT on AI Conference Peer ReviewsWeixin Liang, Zachary Izzo, Yaohui Zhang, Haley Lepp, Hancheng Cao, Xuandong Zhao, Lingjiao Chen, Haotian Ye, Sheng Liu, Zhi Huang, Daniel A. McFarland, James Y. Zou. [doi]
- MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language ModelsJustin Chih-Yao Chen, Swarnadeep Saha, Elias Stengel-Eskin, Mohit Bansal. [doi]
- Variational Learning is Effective for Large Deep NetworksYuesong Shen, Nico Daheim, Bai Cong, Peter Nickl, Gian Maria Marconi, Clement Bazan, Rio Yokota, Iryna Gurevych, Daniel Cremers, Mohammad Emtiyaz Khan, Thomas Möllenhoff. [doi]
- GPTSwarm: Language Agents as Optimizable GraphsMingchen Zhuge, Wenyi Wang, Louis Kirsch, Francesco Faccio, Dmitrii Khizbullin, Jürgen Schmidhuber. [doi]
- Provable Risk-Sensitive Distributional Reinforcement Learning with General Function ApproximationYu Chen, Xiangcheng Zhang, Siwei Wang 0002, Longbo Huang. [doi]
- Whispering Experts: Neural Interventions for Toxicity Mitigation in Language ModelsXavier Suau, Pieter Delobelle, Katherine Metcalf, Armand Joulin, Nicholas Apostoloff, Luca Zappella, Pau Rodríguez. [doi]
- Position: Beyond Personhood: Agency, Accountability, and the Limits of Anthropomorphic Ethical AnalysisJessica Dai. [doi]
- Learning Decision Trees and Forests with Algorithmic RecourseKentaro Kanamori, Takuya Takagi, Ken Kobayashi, Yuichi Ike. [doi]
- Kernel Semi-Implicit Variational InferenceZiheng Cheng, Longlin Yu, Tianyu Xie 0001, Shiyue Zhang, Cheng Zhang. [doi]
- A Touch, Vision, and Language Dataset for Multimodal AlignmentLetian Fu, Gaurav Datta, Huang Huang, William Chung-Ho Panitch, Jaimyn Drake, Joseph Ortiz, Mustafa Mukadam, Mike Lambeta, Roberto Calandra, Ken Goldberg. [doi]
- Variational Linearized Laplace Approximation for Bayesian Deep LearningLuis A. Ortega Andrés, Simón Rodríguez Santana, Daniel Hernández-Lobato. [doi]
- Memoria: Resolving Fateful Forgetting Problem through Human-Inspired Memory ArchitectureSangjun Park, JinYeong Bak. [doi]
- E(3)-Equivariant Actor-Critic Methods for Cooperative Multi-Agent Reinforcement LearningDingyang Chen 0001, Qi Zhang 0038. [doi]
- Scribble-Supervised Semantic Segmentation with Prototype-based Feature AugmentationGuiyang Chan, Pengcheng Zhang, Hai Dong, Shunhui Ji, Bainian Chen. [doi]
- Graphon Mean Field Games with a Representative Player: Analysis and Learning AlgorithmFuzhong Zhou, Chenyu Zhang 0002, Xu Chen, Xuan Di. [doi]
- tnGPS: Discovering Unknown Tensor Network Structure Search Algorithms via Large Language Models (LLMs)Junhua Zeng, Chao Li 0013, Zhun Sun, Qibin Zhao, GuoXu Zhou. [doi]
- Federated Representation Learning in the Under-Parameterized RegimeRenpu Liu, Cong Shen 0001, Jing Yang 0002. [doi]
- How Deep Networks Learn Sparse and Hierarchical Data: the Sparse Random Hierarchy ModelUmberto M. Tomasini, Matthieu Wyart. [doi]
- STELLA: Continual Audio-Video Pre-training with SpatioTemporal Localized AlignmentJaewoo Lee, Jaehong Yoon, Wonjae Kim, Yunji Kim, Sung Ju Hwang. [doi]
- MVMoE: Multi-Task Vehicle Routing Solver with Mixture-of-ExpertsJianan Zhou 0002, Zhiguang Cao, Yaoxin Wu, Wen Song, Yining Ma 0001, Jie Zhang 0002, Chi Xu. [doi]
- On Gradient-like Explanation under a Black-box Setting: When Black-box Explanations Become as Good as White-boxYi Cai 0005, Gerhard Wunder. [doi]
- Using AI Uncertainty Quantification to Improve Human Decision-MakingLaura Marusich, Jonathan Z. Bakdash, Yan Zhou 0001, Murat Kantarcioglu. [doi]
- Quantum Algorithm for Online Exp-concave OptimizationJianhao He, Chengchang Liu, Xutong Liu 0002, Lvzhou Li, John C. S. Lui. [doi]
- Neural Tangent Kernels Motivate Cross-Covariance Graphs in Neural NetworksShervin Khalafi, Saurabh Sihag, Alejandro Ribeiro. [doi]
- Decoding-time Realignment of Language ModelsTianlin Liu, Shangmin Guo, Leonardo Bianco, Daniele Calandriello, Quentin Berthet, Felipe Llinares-López, Jessica Hoffmann, Lucas Dixon, Michal Valko, Mathieu Blondel. [doi]
- Mitigating Catastrophic Forgetting in Online Continual Learning by Modeling Previous Task Interrelations via Pareto OptimizationYichen Wu, Hong Wang 0021, Peilin Zhao, Yefeng Zheng 0001, Ying Wei 0001, Long-Kai Huang. [doi]
- Bifurcated Attention for Single-Context Large-Batch SamplingBen Athiwaratkun, Sujan Kumar Gonugondla, Sanjay Krishna Gouda, Haifeng Qian, Hantian Ding, Qing Sun, Jun Wang 0022, Jiacheng Guo, Liangfu Chen, Parminder Bhatia, Ramesh Nallapati, Sudipta Sengupta, Bing Xiang. [doi]
- Test-Time Degradation Adaptation for Open-Set Image RestorationYuanbiao Gou, Haiyu Zhao, Boyun Li, Xinyan Xiao, Xi Peng 0001. [doi]
- ReDiffuser: Reliable Decision-Making Using a Diffuser with Confidence EstimationNantian He, Shaohui Li, Zhi Li, Yu Liu, You He. [doi]
- Enhancing Trajectory Prediction through Self-Supervised Waypoint Distortion PredictionPranav Singh Chib, Pravendra Singh. [doi]
- Robust Optimization in Protein Fitness Landscapes Using Reinforcement Learning in Latent SpaceMinji Lee, Luiz Felipe Vecchietti, Hyunkyu Jung, Hyun Joo Ro, Meeyoung Cha, Ho Min Kim. [doi]
- Zero-Shot ECG Classification with Multimodal Learning and Test-time Clinical Knowledge EnhancementChe Liu, Zhongwei Wan, Cheng Ouyang, Anand Shah, Wenjia Bai, Rossella Arcucci. [doi]
- Federated Combinatorial Multi-Agent Multi-Armed BanditsFares Fourati, Mohamed-Slim Alouini, Vaneet Aggarwal. [doi]
- Rejuvenating image-GPT as Strong Visual Representation LearnersSucheng Ren, Zeyu Wang 0008, Hongru Zhu, Junfei Xiao, Alan L. Yuille, Cihang Xie. [doi]
- convSeq: Fast and Scalable Method for Detecting Patterns in Spike DataRoman Koshkin, Tomoki Fukai. [doi]
- Full-Atom Peptide Design based on Multi-modal Flow MatchingJiahan Li, Chaoran Cheng, Zuofan Wu, Ruihan Guo, Shitong Luo, Zhizhou Ren, Jian Peng 0001, Jianzhu Ma. [doi]
- WebLINX: Real-World Website Navigation with Multi-Turn DialogueXing Han Lu, Zdenek Kasner, Siva Reddy. [doi]
- Generalized Sobolev Transport for Probability Measures on a GraphTam Le, Truyen Nguyen, Kenji Fukumizu. [doi]
- Editing Partially Observable Networks via Graph Diffusion ModelsPuja Trivedi, Ryan A. Rossi, David Arbour, Tong Yu 0001, Franck Dernoncourt, SungChul Kim, Nedim Lipka, Namyong Park, Nesreen K. Ahmed, Danai Koutra. [doi]
- Compositional Capabilities of Autoregressive Transformers: A Study on Synthetic, Interpretable TasksRahul Ramesh, Ekdeep Singh Lubana, Mikail Khona, Robert P. Dick, Hidenori Tanaka. [doi]
- tinyBenchmarks: evaluating LLMs with fewer examplesFelipe Maia Polo, Lucas Weber, Leshem Choshen, Yuekai Sun, Gongjun Xu, Mikhail Yurochkin. [doi]
- Amortized Variational Deep Kernel LearningAlan L. S. Matias, César Lincoln C. Mattos, João Paulo Pordeus Gomes, Diego Mesquita. [doi]
- Just Cluster It: An Approach for Exploration in High-Dimensions using Clustering and Pre-Trained RepresentationsStefan Sylvius Wagner, Stefan Harmeling. [doi]
- A Primal-Dual Algorithm for Offline Constrained Reinforcement Learning with Linear MDPsKihyuk Hong, Ambuj Tewari. [doi]
- Federated Optimization with Doubly Regularized Drift CorrectionXiaowen Jiang, Anton Rodomanov, Sebastian U. Stich. [doi]
- S3O: A Dual-Phase Approach for Reconstructing Dynamic Shape and Skeleton of Articulated Objects from Single Monocular VideoHao Zhang, Fang Li, Samyak Rawlekar, Narendra Ahuja. [doi]
- PANDA: Expanded Width-Aware Message Passing Beyond RewiringJeongwhan Choi 0002, Sumin Park, Hyowon Wi, Sung-Bae Cho, Noseong Park. [doi]
- Sample-Efficient Robust Multi-Agent Reinforcement Learning in the Face of Environmental UncertaintyLaixi Shi, Eric Mazumdar, Yuejie Chi, Adam Wierman. [doi]
- Improving Token-Based World Models with Parallel Observation PredictionLior Cohen, Kaixin Wang, Bingyi Kang, Shie Mannor. [doi]
- Meta-Learners for Partially-Identified Treatment Effects Across Multiple EnvironmentsJonas Schweisthal, Dennis Frauen, Mihaela van der Schaar, Stefan Feuerriegel. [doi]
- Ai-sampler: Adversarial Learning of Markov kernels with involutive mapsEvgenii Egorov, Riccardo Valperga, Stratis Gavves. [doi]
- An Empirical Study of Realized GNN ExpressivenessYanbo Wang, Muhan Zhang. [doi]
- Drug Discovery with Dynamic Goal-aware FragmentsSeul Lee, Seanie Lee, Kenji Kawaguchi, Sung Ju Hwang. [doi]
- Learning Multiple Secrets in MastermindMilind Prabhu, David P. Woodruff. [doi]
- Asymptotics of feature learning in two-layer networks after one gradient-stepHugo Cui, Luca Pesce, Yatin Dandi, Florent Krzakala, Yue M. Lu, Lenka Zdeborová, Bruno Loureiro. [doi]
- Learning to Infer Generative Template Programs for Visual ConceptsR. Kenny Jones, Siddhartha Chaudhuri, Daniel Ritchie. [doi]
- Robust Learning-Augmented DictionariesAli Zeynali, Shahin Kamali, Mohammad Hajiesmaili. [doi]
- Diffusion-based Missing-view Generation With the Application on Incomplete Multi-view ClusteringJie Wen 0001, Shijie Deng, Waikeung Wong, Guoqing Chao, Chao Huang 0008, Lunke Fei, Yong Xu 0001. [doi]
- Offline-Boosted Actor-Critic: Adaptively Blending Optimal Historical Behaviors in Deep Off-Policy RLYu Luo, Tianying Ji, Fuchun Sun 0001, Jianwei Zhang 0001, Huazhe Xu, Xianyuan Zhan. [doi]
- Pi-DUAL: Using privileged information to distinguish clean from noisy labelsKe Wang, Guillermo Ortiz-Jiménez, Rodolphe Jenatton, Mark Collier, Efi Kokiopoulou, Pascal Frossard. [doi]
- What is Dataset Distillation Learning?William Yang, Ye Zhu, Zhiwei Deng, Olga Russakovsky. [doi]
- SelMatch: Effectively Scaling Up Dataset Distillation via Selection-Based Initialization and Partial Updates by Trajectory MatchingYongmin Lee, Hye Won Chung. [doi]
- Position: Quo Vadis, Unsupervised Time Series Anomaly Detection?M. Saquib Sarfraz, Mei-Yen Chen, Lukas Layer, Kunyu Peng, Marios Koulakis. [doi]
- Learning from Integral Losses in Physics Informed Neural NetworksEhsan Saleh, Saba Ghaffari, Timothy Bretl, Luke N. Olson, Matthew West 0001. [doi]
- Aligned Objective for Soft-Pseudo-Label Generation in Supervised LearningNing Xu 0009, Yihao Hu, Congyu Qiao, Xin Geng 0001. [doi]
- In-Context Unlearning: Language Models as Few-Shot UnlearnersMartin Pawelczyk, Seth Neel, Himabindu Lakkaraju. [doi]
- Neural Collapse meets Differential Privacy: Curious behaviors of NoisyGD with Near-Perfect Representation LearningChendi Wang, Yuqing Zhu 0005, Weijie J. Su, Yu-Xiang Wang 0003. [doi]
- SleepFM: Multi-modal Representation Learning for Sleep Across Brain Activity, ECG and Respiratory SignalsRahul Thapa, Bryan He, Magnus Ruud Kjær, Hyatt E. Moore IV, Gauri Ganjoo, Emmanuel Mignot, James Zou 0001. [doi]
- Parameter Estimation in DAGs from Incomplete Data via Optimal TransportVy Vo, Trung Le, Long Tung Vuong, He Zhao 0001, Edwin V. Bonilla, Dinh Phung 0001. [doi]
- ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RLYifei Zhou, Andrea Zanette, Jiayi Pan, Sergey Levine, Aviral Kumar. [doi]
- Fully-Dynamic Approximate Decision Trees With Worst-Case Update Time GuaranteesMarco Bressan 0002, Mauro Sozio. [doi]
- A Rate-Distortion View of Uncertainty QuantificationIfigeneia Apostolopoulou, Benjamin Eysenbach, Frank Nielsen, Artur Dubrawski. [doi]
- Structure Your Data: Towards Semantic Graph CounterfactualsAngeliki Dimitriou, Maria Lymperaiou, Giorgos Filandrianos, Konstantinos Thomas, Giorgos Stamou. [doi]
- Pairwise Alignment Improves Graph Domain AdaptationShikun Liu, Deyu Zou, Han Zhao 0002, Pan Li 0005. [doi]
- CaRiNG: Learning Temporal Causal Representation under Non-Invertible Generation ProcessGuangyi Chen 0002, Yifan Shen, Zhenhao Chen, Xiangchen Song, Yuewen Sun, Weiran Yao, Xiao Liu, Kun Zhang 0001. [doi]
- Emergent Equivariance in Deep EnsemblesJan E. Gerken, Pan Kessel. [doi]
- WISER: Weak Supervision and Supervised Representation Learning to Improve Drug Response Prediction in CancerKumar Shubham, Aishwarya Jayagopal, Syed Mohammed Danish, Prathosh A. P., Vaibhav Rajan. [doi]
- Hyperbolic Optimizer as a Dynamical SystemNicolás Alvarado, Hans Löbel. [doi]
- Conformal Prediction Sets Improve Human Decision MakingJesse C. Cresswell, Yi Sui, Bhargava Kumar, Noël Vouitsis. [doi]
- Robust CLIP: Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language ModelsChristian Schlarmann, Naman Deep Singh, Francesco Croce, Matthias Hein 0001. [doi]
- GaLore: Memory-Efficient LLM Training by Gradient Low-Rank ProjectionJiawei Zhao, Zhenyu Zhang 0015, Beidi Chen, Zhangyang Wang, Anima Anandkumar, Yuandong Tian. [doi]
- Debiased Distribution CompressionLingxiao Li, Raaz Dwivedi, Lester Mackey. [doi]
- Data Engineering for Scaling Language Models to 128K ContextYao Fu, Rameswar Panda, Xinyao Niu, Xiang Yue, Hannaneh Hajishirzi, Yoon Kim, Hao Peng 0018. [doi]
- Feature Attribution with Necessity and Sufficiency via Dual-stage Perturbation Test for Causal ExplanationXuexin Chen, Ruichu Cai, Zhengting Huang, Yuxuan Zhu, Julien Horwood, Zhifeng Hao, Zijian Li 0001, José Miguel Hernández-Lobato. [doi]
- Sequential Asynchronous Action Coordination in Multi-Agent Systems: A Stackelberg Decision Transformer ApproachBin Zhang 0052, Hangyu Mao, Lijuan Li 0002, Zhiwei Xu 0005, Dapeng Li 0001, Rui Zhao 0018, Guoliang Fan. [doi]
- DSD-DA: Distillation-based Source Debiasing for Domain Adaptive Object DetectionYongchao Feng, Shiwei Li, Yingjie Gao, Ziyue Huang, Yanan Zhang 0005, Qingjie Liu, Yunhong Wang. [doi]
- Selecting Large Language Model to Fine-tune via Rectified Scaling LawHaowei Lin, Baizhou Huang, Haotian Ye, Qinyu Chen, Zihao Wang, Sujian Li, Jianzhu Ma, Xiaojun Wan 0001, James Zou 0001, Yitao Liang. [doi]
- On the Embedding Collapse when Scaling up Recommendation ModelsXingzhuo Guo, Junwei Pan, Ximei Wang, Baixu Chen, Jie Jiang 0015, Mingsheng Long. [doi]
- HarmoDT: Harmony Multi-Task Decision Transformer for Offline Reinforcement LearningShengchao Hu, Ziqing Fan, Li Shen 0008, Ya Zhang 0002, Yanfeng Wang, Dacheng Tao. [doi]
- DPZero: Private Fine-Tuning of Language Models without BackpropagationLiang Zhang, Bingcong Li, Kiran Koshy Thekumparampil, Sewoong Oh, Niao He. [doi]
- Adaptive Text Watermark for Large Language ModelsYepeng Liu, Yuheng Bu. [doi]
- In-Context Reinforcement Learning for Variable Action SpacesViacheslav Sinii, Alexander Nikulin, Vladislav Kurenkov, Ilya Zisman, Sergey Kolesnikov. [doi]
- On Stronger Computational Separations Between Multimodal and Unimodal Machine LearningAri Karchmer. [doi]
- Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A BenchmarkYihua Zhang, Pingzhi Li, Junyuan Hong, Jiaxiang Li, Yimeng Zhang, Wenqing Zheng, Pin-Yu Chen, Jason D. Lee, Wotao Yin, Mingyi Hong 0001, Zhangyang Wang, Sijia Liu 0001, Tianlong Chen. [doi]
- f-Divergence Based Classification: Beyond the Use of Cross-EntropyNicola Novello, Andrea M. Tonello. [doi]
- IIANet: An Intra- and Inter-Modality Attention Network for Audio-Visual Speech SeparationKai Li, Runxuan Yang, Fuchun Sun 0001, Xiaolin Hu 0001. [doi]
- DITTO: Diffusion Inference-Time T-Optimization for Music GenerationZachary Novack, Julian J. McAuley, Taylor Berg-Kirkpatrick, Nicholas J. Bryan. [doi]
- Position: LLMs Can't Plan, But Can Help Planning in LLM-Modulo FrameworksSubbarao Kambhampati, Karthik Valmeekam, Lin Guan, Mudit Verma, Kaya Stechly, Siddhant Bhambri, Lucas Saldyt, Anil Murthy. [doi]
- PolySketchFormer: Fast Transformers via Sketching Polynomial KernelsPraneeth Kacham, Vahab Mirrokni, Peilin Zhong. [doi]
- InstructZero: Efficient Instruction Optimization for Black-Box Large Language ModelsLichang Chen, Jiuhai Chen, Tom Goldstein, Heng Huang, Tianyi Zhou 0001. [doi]
- Making Old Things New: A Unified Algorithm for Differentially Private ClusteringMax Dupré la Tour, Monika Henzinger, David Saulpic. [doi]
- Box Facets and Cut Facets of Lifted Multicut PolytopesLucas Fabian Naumann, Jannik Irmai, Shengxian Zhao, Bjoern Andres. [doi]
- Unsupervised Zero-Shot Reinforcement Learning via Functional Reward EncodingsKevin Frans, Seohong Park, Pieter Abbeel, Sergey Levine. [doi]
- In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space SteeringSheng Liu, Haotian Ye, Lei Xing 0001, James Y. Zou. [doi]
- What Would Gauss Say About Representations? Probing Pretrained Image Models using Synthetic Gaussian BenchmarksChing Yun Ko, Pin-Yu Chen, Payel Das, Jeet Mohapatra, Luca Daniel. [doi]
- Improving Prototypical Visual Explanations with Reward Reweighing, Reselection, and RetrainingAaron Jiaxun Li, Robin Netzorg, Zhihan Cheng, Zhuoqin Zhang, Bin Yu 0001. [doi]
- Autoencoding Conditional Neural Processes for Representation LearningVictor Prokhorov, Ivan Titov, N. Siddharth 0001. [doi]
- Exploring Correlations of Self-Supervised Tasks for GraphsTaoran Fang, Wei Chow, Yifei Sun 0002, Kaiqiao Han, Lvbin Ma, Yang Yang 0009. [doi]
- CCM: Real-Time Controllable Visual Content Creation Using Text-to-Image Consistency ModelsJie Xiao 0002, Kai Zhu 0004, Han Zhang 0010, Zhiheng Liu, Yujun Shen, Zhantao Yang, Ruili Feng, Yu Liu 0063, Xueyang Fu, Zheng-Jun Zha. [doi]
- When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language ModelsHaoran You, Yichao Fu, Zheng Wang, Amir Yazdanbakhsh, Yingyan Celine Lin. [doi]
- Data-free Distillation of Diffusion Models with BootstrappingJiatao Gu, Chen Wang, Shuangfei Zhai, Yizhe Zhang 0002, Lingjie Liu, Joshua M. Susskind. [doi]
- Discrete Latent Perspective Learning for Segmentation and DetectionDeyi Ji, Feng Zhao, Lanyun Zhu, Wenwei Jin, Hongtao Lu, Jieping Ye. [doi]
- SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied ManipulationJunjie Zhang, Chenjia Bai, Haoran He, Zhigang Wang 0002, Bin Zhao 0001, Xiu Li 0001, Xuelong Li 0001. [doi]
- Optimal Recurrent Network Topologies for Dynamical Systems ReconstructionChristoph Jürgen Hemmer, Manuel Brenner, Florian Hess, Daniel Durstewitz. [doi]
- FRAPPÉ: A Group Fairness Framework for Post-Processing EverythingAlexandru Tifrea, Preethi Lahoti, Ben Packer, Yoni Halpern, Ahmad Beirami, Flavien Prost. [doi]
- Learning Low-dimensional Latent Dynamics from High-dimensional Observations: Non-asymptotics and Lower BoundsYuyang Zhang, Shahriar Talebi, Na Li. [doi]
- Transformers, parallel computation, and logarithmic depthClayton Sanford, Daniel Hsu 0001, Matus Telgarsky. [doi]
- A New Theoretical Perspective on Data Heterogeneity in Federated OptimizationJiayi Wang 0004, Shiqiang Wang 0001, Rong-Rong Chen, Mingyue Ji. [doi]
- Efficient PAC Learnability of Dynamical Systems Over Multilayer NetworksZirou Qiu, Abhijin Adiga, Madhav V. Marathe, S. S. Ravi, Daniel J. Rosenkrantz, Richard Edwin Stearns, Anil Kumar S. Vullikanti. [doi]
- Trustless Audits without Revealing Data or ModelsSuppakit Waiwitlikhit, Ion Stoica, Yi Sun 0010, Tatsunori Hashimoto, Daniel Kang. [doi]
- Imitation Learning in Discounted Linear MDPs without exploration assumptionsLuca Viano, Stratis Skoulakis, Volkan Cevher. [doi]
- Learning with Adaptive Resource AllocationJing Wang, Miao Yu, Peng Zhao 0006, Zhi-Hua Zhou. [doi]
- On Interpolating Experts and Multi-Armed BanditsHoushuang Chen, Yuchen He 0006, Chihao Zhang 0001. [doi]
- Understanding Reasoning Ability of Language Models From the Perspective of Reasoning Paths AggregationXinyi Wang 0003, Alfonso Amayuelas, Kexun Zhang, Liangming Pan, Wenhu Chen, William Yang Wang. [doi]
- Testing the Feasibility of Linear Programs with Bandit FeedbackAditya Gangrade, Aditya Gopalan, Venkatesh Saligrama, Clayton Scott. [doi]
- Seesaw: Compensating for Nonlinear Reduction with Linear Computations for Private InferenceFabing Li, Yuanhao Zhai, Shuangyu Cai, Mingyu Gao 0001. [doi]
- BECoTTA: Input-dependent Online Blending of Experts for Continual Test-time AdaptationDaeun Lee, Jaehong Yoon, Sung Ju Hwang. [doi]
- Scalable High-Resolution Pixel-Space Image Synthesis with Hourglass Diffusion TransformersKatherine Crowson, Stefan Andreas Baumann, Alex Birch, Tanishq Mathew Abraham, Daniel Z. Kaplan, Enrico Shippole. [doi]
- PerceptAnon: Exploring the Human Perception of Image Anonymization Beyond Pseudonymization for GDPRKartik Patwari, Chen-Nee Chuah, Lingjuan Lyu, Vivek Sharma. [doi]
- Time Weaver: A Conditional Time Series Generation ModelSai Shankar Narasimhan, Shubhankar Agarwal, Oguzhan Akcin, Sujay Sanghavi, Sandeep P. Chinchali. [doi]
- Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function ApproximationFengdi Che, Chenjun Xiao, Jincheng Mei, Bo Dai 0001, Ramki Gummadi, Oscar A. Ramirez, Christopher K. Harris, A. Rupam Mahmood, Dale Schuurmans. [doi]
- Tripod: Three Complementary Inductive Biases for Disentangled Representation LearningKyle Hsu, Jubayer Ibn Hamid, Kaylee Burns, Chelsea Finn, Jiajun Wu 0001. [doi]
- Position: Data-driven Discovery with Large Generative ModelsBodhisattwa Prasad Majumder, Harshit Surana, Dhruv Agarwal 0003, Sanchaita Hazra, Ashish Sabharwal, Peter Clark. [doi]
- Prodigy: An Expeditiously Adaptive Parameter-Free LearnerKonstantin Mishchenko, Aaron Defazio. [doi]
- Can Machines Learn the True Probabilities?Jinsook Kim. [doi]
- Stabilizing Policy Gradients for Stochastic Differential Equations via Consistency with Perturbation ProcessXiangxin Zhou, Liang Wang 0001, Yichi Zhou. [doi]
- ELF: Encoding Speaker-Specific Latent Speech Feature for Speech SynthesisJungil Kong, Junmo Lee, Jeongmin Kim, Beomjeong Kim, JiHoon Park, Dohee Kong, Changheon Lee, Sangjin Kim. [doi]
- Parsimonious Learning-Augmented Approximations for Dense Instances of NP-hard ProblemsEvripidis Bampis, Bruno Escoffier, Michalis Xefteris. [doi]
- An Intrinsic Vector Heat NetworkAlexander Gao, Maurice Chu, Mubbasir Kapadia, Ming C. Lin, Hsueh-Ti Derek Liu. [doi]
- Gradient-based Visual Explanation for Transformer-based CLIPChenyang Zhao, Kun Wang, Xingyu Zeng, Rui Zhao, Antoni B. Chan. [doi]
- Evaluation of LLMs on Syntax-Aware Code Fill-in-the-Middle TasksLinyuan Gong, Sida Wang, Mostafa Elhoushi, Alvin Cheung. [doi]
- Asymmetry in Low-Rank Adapters of Foundation ModelsJiacheng Zhu, Kristjan H. Greenewald, Kimia Nadjahi, Haitz Sáez de Ocáriz Borde, Rickard Brüel Gabrielsson, Leshem Choshen, Marzyeh Ghassemi, Mikhail Yurochkin, Justin Solomon 0001. [doi]
- Scalable Multiple Kernel Clustering: Learning Clustering Structure from ExpectationWeixuan Liang, En Zhu, Shengju Yu, Huiying Xu, Xinzhong Zhu, Xinwang Liu 0002. [doi]
- Exploring the Enigma of Neural Dynamics Through A Scattering-Transform Mixer Landscape for Riemannian ManifoldTingting Dan, Ziquan Wei, Won Hwa Kim, Guorong Wu 0001. [doi]
- Speech Self-Supervised Learning Using Diffusion Model Synthetic DataHeting Gao, Kaizhi Qian, Junrui Ni, Chuang Gan, Mark A. Hasegawa-Johnson, Shiyu Chang, Yang Zhang 0001. [doi]
- Incentivized Learning in Principal-Agent Bandit GamesAntoine Scheid, Daniil Tiapkin, Etienne Boursier, Aymeric Capitaine, Eric Moulines, Michael I. Jordan, El Mahdi El Mhamdi, Alain Oliviero Durmus. [doi]
- Generating In-Distribution Proxy Graphs for Explaining Graph Neural NetworksZhuomin Chen, Jiaxing Zhang 0002, Jingchao Ni, Xiaoting Li, Yuchen Bian, Md Mezbahul Islam, Ananda Mondal, Hua Wei 0001, Dongsheng Luo. [doi]
- Learning Divergence Fields for Shift-Robust Graph RepresentationsQitian Wu, Fan Nie, Chenxiao Yang, Junchi Yan. [doi]
- Generalization to New Sequential Decision Making Tasks with In-Context LearningSharath Chandra Raparthy, Eric Hambro, Robert Kirk, Mikael Henaff, Roberta Raileanu. [doi]
- A Geometric Explanation of the Likelihood OOD Detection ParadoxHamidreza Kamkari, Brendan Leigh Ross, Jesse C. Cresswell, Anthony L. Caterini, Rahul G. Krishnan, Gabriel Loaiza-Ganem. [doi]
- An Interpretable Evaluation of Entropy-based Novelty of Generative ModelsJingwei Zhang, Cheuk Ting Li, Farzan Farnia. [doi]
- Sequential Neural Score Estimation: Likelihood-Free Inference with Conditional Score Based Diffusion ModelsLouis Sharrock, Jack Simons, Song Liu, Mark Beaumont. [doi]
- Prospective Side Information for Latent MDPsJeongyeol Kwon, Yonathan Efroni, Shie Mannor, Constantine Caramanis. [doi]
- Efficient Exploration in Average-Reward Constrained Reinforcement Learning: Achieving Near-Optimal Regret With Posterior SamplingDanil Provodin, Maurits Clemens Kaptein, Mykola Pechenizkiy. [doi]
- Trust the Model Where It Trusts Itself - Model-Based Actor-Critic with Uncertainty-Aware Rollout AdaptionBernd Frauenknecht, Artur Eisele, Devdutt Subhasish, Friedrich Solowjow, Sebastian Trimpe. [doi]
- Loss Shaping Constraints for Long-Term Time Series ForecastingIgnacio Hounie, Javier Porras-Valenzuela, Alejandro Ribeiro. [doi]
- PinNet: Pinpoint Instructive Information for Retrieval Augmented Code-to-Text GenerationHan Fu, Jian Tan, Pinhan Zhang, Feifei Li 0001, Jianling Sun. [doi]
- Irregular Multivariate Time Series Forecasting: A Transformable Patching Graph Neural Networks ApproachWeijia Zhang, Chenlong Yin, Hao Liu, Xiaofang Zhou 0001, Hui Xiong 0001. [doi]
- Decomposing Uncertainty for Large Language Models through Input Clarification EnsemblingBairu Hou, Yujian Liu, Kaizhi Qian, Jacob Andreas, Shiyu Chang, Yang Zhang 0001. [doi]
- Sign Rank Limitations for Inner Product Graph DecodersSu-Hyeong Lee, Qingqi Zhang, Risi Kondor. [doi]
- Invariant Risk Minimization Is A Total Variation ModelZhao-Rong Lai, Weiwen Wang. [doi]
- AlphaFold Meets Flow Matching for Generating Protein EnsemblesBowen Jing, Bonnie Berger, Tommi S. Jaakkola. [doi]
- Federated Self-Explaining GNNs with Anti-shortcut AugmentationsLinan Yue, Qi Liu 0003, Weibo Gao, Ye Liu 0011, Kai Zhang 0038, Yichao Du, Li Wang, Fangzhou Yao. [doi]
- An Efficient Maximal Ancestral Graph Listing AlgorithmTian-Zuo Wang, Wen-Bo Du, Zhi-Hua Zhou. [doi]
- A General Framework for Sequential Decision-Making under Adaptivity ConstraintsNuoya Xiong, Zhaoran Wang 0001, Zhuoran Yang. [doi]
- Rethinking Optimization and Architecture for Tiny Language ModelsYehui Tang, Kai Han 0002, Fangcheng Liu, Yunsheng Ni, Yuchuan Tian, Zheyuan Bai, Yi-Qi Hu, Sichao Liu, Shangling Jui, Yunhe Wang 0001. [doi]
- FlashST: A Simple and Universal Prompt-Tuning Framework for Traffic PredictionZhonghang Li, Lianghao Xia, Yong Xu 0007, Chao Huang 0001. [doi]
- Incorporating probabilistic domain knowledge into deep multiple instance learningGhadi S. Al Hajj, Aliaksandr Hubin, Chakravarthi Kanduri, Milena Pavlovic, Knut Dagestad Rand, Michael Widrich, Anne H. Schistad Solberg, Victor Greiff, Johan Pensar, Günter Klambauer, Geir Kjetil Sandve. [doi]
- Fourier Controller Networks for Real-Time Decision-Making in Embodied LearningHengkai Tan, Songming Liu, Kai Ma, Chengyang Ying, Xingxing Zhang, Hang Su 0006, Jun Zhu 0001. [doi]
- Receptive Fields As Experts in Convolutional Neural ArchitecturesDongze Lian, Weihao Yu, Xinchao Wang. [doi]
- Diagnosing the Compositional Knowledge of Vision Language Models from a Game-Theoretic ViewJin Wang, Shichao Dong, Yapeng Zhu, Kelu Yao, Weidong Zhao, Chao Li, Ping Luo. [doi]
- Contextual Feature Selection with Conditional Stochastic GatesRam Dyuthi Sristi, Ofir Lindenbaum, Shira Lifshitz, Maria Lavzin, Jackie Schiller, Gal Mishne, Hadas Benisty. [doi]
- Unleashing the Power of Meta-tuning for Few-shot Generalization Through Sparse Interpolated ExpertsShengzhuang Chen, Jihoon Tack, Yunqiao Yang, Yee Whye Teh, Jonathan Richard Schwarz, Ying Wei 0001. [doi]
- Language-Driven Cross-Modal Classifier for Zero-Shot Multi-Label Image RecognitionYicheng Liu, Jie Wen 0001, Chengliang Liu 0003, Xiaozhao Fang, Zuoyong Li, Yong Xu 0001, Zheng Zhang 0006. [doi]
- Critical windows: non-asymptotic theory for feature emergence in diffusion modelsMarvin Li, Sitan Chen. [doi]
- Symmetric Replay Training: Enhancing Sample Efficiency in Deep Reinforcement Learning for Combinatorial OptimizationHyeonah Kim, Minsu Kim, Sungsoo Ahn, Jinkyoo Park. [doi]
- Offline Multi-Objective OptimizationKe Xue 0001, Rong-Xi Tan, Xiaobin Huang, Chao Qian 0001. [doi]
- Accelerating PDE Data Generation via Differential Operator Action in Solution SpaceHuanshuo Dong, Hong Wang, Haoyang Liu, Jian Luo, Jie Wang 0005. [doi]
- Position: An Inner Interpretability Framework for AI Inspired by Lessons from Cognitive NeuroscienceMartina G. Vilas, Federico Adolfi, David Poeppel, Gemma Roig. [doi]
- SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language ModelsXudong Lu, Aojun Zhou, Yuhui Xu, Renrui Zhang, Peng Gao 0007, Hongsheng Li 0001. [doi]
- InstructSpeech: Following Speech Editing Instructions via Large Language ModelsRongjie Huang, Ruofan Hu, Yongqi Wang, Zehan Wang 0001, Xize Cheng, Ziyue Jiang 0001, Zhenhui Ye, Dongchao Yang, Luping Liu, Peng Gao, Zhou Zhao. [doi]
- WARM: On the Benefits of Weight Averaged Reward ModelsAlexandre Ramé, Nino Vieillard, Léonard Hussenot, Robert Dadashi, Geoffrey Cideron, Olivier Bachem, Johan Ferret. [doi]
- Evaluating Model Bias Requires Characterizing its MistakesIsabela Albuquerque, Jessica Schrouff, David Warde-Farley, Ali Taylan Cemgil, Sven Gowal, Olivia Wiles. [doi]
- On Online Experimentation without Device IdentifiersShiv Shankar, Ritwik Sinha, Madalina Fiterau. [doi]
- End-to-End Neuro-Symbolic Reinforcement Learning with Textual ExplanationsLirui Luo, Guoxi Zhang, Hongming Xu, Yaodong Yang, Cong Fang 0001, Qing Li. [doi]
- Neural NeRF CompressionTuan Pham, Stephan Mandt. [doi]
- Exploiting Code Symmetries for Learning Program SemanticsKexin Pei, Weichen Li, Qirui Jin, Shuyang Liu, Scott Geng, Lorenzo Cavallaro, Junfeng Yang, Suman Jana. [doi]
- The Emergence of Reproducibility and Consistency in Diffusion ModelsHuijie Zhang, Jinfan Zhou, Yifu Lu, Minzhe Guo, Peng Wang, Liyue Shen, Qing Qu 0001. [doi]
- What is the Long-Run Distribution of Stochastic Gradient Descent? A Large Deviations AnalysisWaïss Azizian, Franck Iutzeler, Jérôme Malick, Panayotis Mertikopoulos. [doi]
- Amortizing Pragmatic Program Synthesis with RankingsYewen Pu, Saujas Vaduguru, Priyan Vaithilingam, Elena L. Glassman, Daniel Fried. [doi]
- Learning to Model the World With LanguageJessy Lin, Yuqing Du, Olivia Watkins, Danijar Hafner, Pieter Abbeel, Dan Klein, Anca D. Dragan. [doi]
- Nearest Neighbour Score Estimators for Diffusion Generative ModelsMatthew Niedoba, Dylan Green, Saeid Naderiparizi, Vasileios Lioutas, Jonathan Wilder Lavington, Xiaoxuan Liang 0001, Yunpeng Liu 0007, Ke Zhang, Setareh Dabiri, Adam Scibior, Berend Zwartsenberg, Frank Wood. [doi]
- FedMBridge: Bridgeable Multimodal Federated LearningJiayi Chen, Aidong Zhang. [doi]
- A3S: A General Active Clustering Method with Pairwise ConstraintsXun Deng, Junlong Liu, Han Zhong, Fuli Feng, Chen Shen 0003, Xiangnan He 0001, Jieping Ye, Zheng Wang. [doi]
- CHAI: Clustered Head Attention for Efficient LLM InferenceSaurabh Agarwal, Bilge Acun, Basil Hosmer, Mostafa Elhoushi, Yejin Lee 0010, Shivaram Venkataraman, Dimitris Papailiopoulos, Carole-Jean Wu. [doi]
- Zero-Shot Reinforcement Learning via Function EncodersTyler Ingebrand, Amy Zhang, Ufuk Topcu. [doi]
- VideoPrism: A Foundational Visual Encoder for Video UnderstandingLong Zhao 0003, Nitesh Bharadwaj Gundavarapu, Liangzhe Yuan, Hao Zhou, Shen Yan, Jennifer J. Sun, Luke Friedman, Rui Qian, Tobias Weyand, Yue Zhao 0006, Rachel Hornung, Florian Schroff, Ming-Hsuan Yang 0001, David A. Ross, Huisheng Wang, Hartwig Adam, Mikhail Sirotenko, Ting Liu 0005, Boqing Gong. [doi]
- GenCO: Generating Diverse Designs with Combinatorial ConstraintsAaron M. Ferber, Arman Zharmagambetov, Taoan Huang, Bistra Dilkina, Yuandong Tian. [doi]
- Hierarchical Neural Operator Transformer with Learnable Frequency-aware Loss Prior for Arbitrary-scale Super-resolutionXihaier Luo, Xiaoning Qian, Byung-Jun Yoon. [doi]
- Deep Equilibrium Models are Almost Equivalent to Not-so-deep Explicit Models for High-dimensional Gaussian MixturesZenan Ling, Longbo Li, Zhanbo Feng, Yixuan Zhang 0006, Feng Zhou 0011, Robert C. Qiu, Zhenyu Liao 0001. [doi]
- Get More with LESS: Synthesizing Recurrence with KV Cache Compression for Efficient LLM InferenceHarry Dong, Xinyu Yang, Zhenyu Zhang 0015, Zhangyang Wang, Yuejie Chi, Beidi Chen. [doi]
- Discovering Bias in Latent Space: An Unsupervised Debiasing ApproachDyah Adila, Shuai Zhang, Boran Han, Bernie Wang 0001. [doi]
- Auto-Regressive Next-Token Predictors are Universal LearnersEran Malach. [doi]
- Random matrix theory improved Fréchet mean of symmetric positive definite matricesFlorent Bouchard, Ammar Mian, Malik Tiomoko, Guillaume Ginolhac, Frédéric Pascal 0001. [doi]
- RMIB: Representation Matching Information Bottleneck for Matching Text RepresentationsHaihui Pan, Zhifang Liao, Wenrui Xie, Kun Han. [doi]
- Trained Random Forests Completely Reveal your DatasetJulien Ferry, Ricardo Fukasawa, Timothée Pascal, Thibaut Vidal. [doi]
- Iterative Data Smoothing: Mitigating Reward Overfitting and Overoptimization in RLHFBanghua Zhu, Michael I. Jordan, Jiantao Jiao. [doi]
- NeWRF: A Deep Learning Framework for Wireless Radiation Field Reconstruction and Channel PredictionHaofan Lu, Christopher Vattheuer, Baharan Mirzasoleiman, Omid Abari. [doi]
- Momentum for the Win: Collaborative Federated Reinforcement Learning across Heterogeneous EnvironmentsHan Wang 0016, Sihong He, Zhili Zhang, Fei Miao, James Anderson 0001. [doi]
- ViP: A Differentially Private Foundation Model for Computer VisionYaodong Yu, Maziar Sanjabi, Yi Ma 0001, Kamalika Chaudhuri, Chuan Guo. [doi]
- Principled Penalty-based Methods for Bilevel Reinforcement Learning and RLHFHan Shen, Zhuoran Yang, Tianyi Chen. [doi]
- GPT-4V(ision) is a Generalist Web Agent, if GroundedBoyuan Zheng, Boyu Gou, Jihyung Kil, Huan Sun 0001, Yu Su 0001. [doi]
- What Will My Model Forget? Forecasting Forgotten Examples in Language Model RefinementXisen Jin, Xiang Ren 0001. [doi]
- Random Scaling and Momentum for Non-smooth Non-convex OptimizationQinzi Zhang, Ashok Cutkosky. [doi]
- Differentially Private Sum-Product NetworksXenia Heilmann, Mattia Cerrato, Ernst Althaus. [doi]
- Improving Robustness to Multiple Spurious Correlations by Multi-Objective OptimizationNayeong Kim, Juwon Kang, Sungsoo Ahn, Jungseul Ok, Suha Kwak. [doi]
- Multiply Robust Estimation for Local Distribution Shifts with Multiple DomainsSteven Wilkins-Reeves, Xu Chen, Qi Ma, Christine Agarwal, Aude Hofleitner. [doi]
- Language Models as Semantic IndexersBowen Jin, Hansi Zeng, Guoyin Wang 0001, Xiusi Chen, Tianxin Wei, Ruirui Li 0002, Zhengyang Wang, Zheng Li 0018, Yang Li, Hanqing Lu, Suhang Wang, Jiawei Han 0001, Xianfeng Tang. [doi]
- Reducing Balancing Error for Causal Inference via Optimal TransportYuguang Yan, Hao Zhou, Zeqin Yang, Weilin Chen, Ruichu Cai, Zhifeng Hao. [doi]
- Deciphering RNA Secondary Structure Prediction: A Probabilistic K-Rook Matching PerspectiveCheng Tan 0012, Zhangyang Gao, Hanqun Cao, Xingran Chen, Ge Wang, Lirong Wu, Jun Xia, Jiangbin Zheng, Stan Z. Li. [doi]
- Optimal Exact Recovery in Semi-Supervised Learning: A Study of Spectral Methods and Graph Convolutional NetworksHaixiao Wang, Zhichao Wang. [doi]
- Scaling Beyond the GPU Memory Limit for Large Mixture-of-Experts Model TrainingYechan Kim, Hwijoon Lim, Dongsu Han. [doi]
- A Simple Early Exiting Framework for Accelerated Sampling in Diffusion ModelsTae Hong Moon, Moonseok Choi, Eunggu Yun, Jongmin Yoon, Gayoung Lee, Jaewoong Cho, Juho Lee 0001. [doi]
- Cross-view Masked Diffusion Transformers for Person Image SynthesisTrung X. Pham, Kang Zhang 0008, Chang D. Yoo. [doi]
- Learning from Students: Applying t-Distributions to Explore Accurate and Efficient Formats for LLMsJordan Dotzel, Yuzong Chen 0001, Bahaa Kotb, Sushma Prasad, Gang Wu, Sheng Li 0007, Mohamed S. Abdelfattah, Zhiru Zhang. [doi]
- How Does Goal Relabeling Improve Sample Efficiency?Sirui Zheng, Chenjia Bai, Zhuoran Yang, Zhaoran Wang 0001. [doi]
- Token-level Direct Preference OptimizationYongcheng Zeng, Guoqing Liu, Weiyu Ma, Ning Yang, Haifeng Zhang, Jun Wang. [doi]
- A Hierarchical Adaptive Multi-Task Reinforcement Learning Framework for Multiplier Circuit DesignZhihai Wang, Jie Wang 0005, Dongsheng Zuo, Yunjie Ji, Xilin Xia, Yuzhe Ma, Jianye Hao, Mingxuan Yuan, Yongdong Zhang 0001, Feng Wu 0001. [doi]
- Estimating Unknown Population Sizes Using the Hypergeometric DistributionLiam Hodgson, Danilo Bzdok. [doi]
- EE-LLM: Large-Scale Training and Inference of Early-Exit Large Language Models with 3D ParallelismYanxi Chen, Xuchen Pan, Yaliang Li, Bolin Ding, Jingren Zhou. [doi]
- Bringing Motion Taxonomies to Continuous Domains via GPLVM on Hyperbolic manifoldsNoémie Jaquier, Leonel Rozo, Miguel González Duque, Viacheslav Borovitskiy, Tamim Asfour. [doi]
- Benign Overfitting in Two-Layer ReLU Convolutional Neural Networks for XOR DataXuran Meng, Difan Zou, Yuan Cao 0006. [doi]
- A General Framework for Learning from Weak SupervisionHao Chen 0102, Jindong Wang 0001, Lei Feng, Xiang Li 0106, Yidong Wang, Xing Xie 0001, Masashi Sugiyama, Rita Singh, Bhiksha Raj. [doi]
- LayerMerge: Neural Network Depth Compression through Layer Pruning and MergingJinuk Kim, Marwa El Halabi, Mingi Ji, Hyun Oh Song. [doi]
- Approximate Nearest Neighbor Search with Window FiltersJoshua Engels, Benjamin Landrum, Shangdi Yu, Laxman Dhulipala, Julian Shun. [doi]
- On Positivity Condition for Causal InferenceInwoo Hwang, Yesong Choe, Yeahoon Kwon, Sanghack Lee. [doi]
- Bayesian Power Steering: An Effective Approach for Domain Adaptation of Diffusion ModelsDing Huang, Ting Li, Jian Huang. [doi]
- R2E: Turning any Github Repository into a Programming Agent EnvironmentNaman Jain, Manish Shetty, Tianjun Zhang, King Han, Koushik Sen, Ion Stoica. [doi]
- Potential Based Diffusion Motion PlanningYunhao Luo, Chen Sun 0002, Joshua B. Tenenbaum, Yilun Du. [doi]
- xT: Nested Tokenization for Larger Context in Large ImagesRitwik Gupta, Shufan Li, Tyler Zhu, Jitendra Malik, Trevor Darrell, Karttikeya Mangalam. [doi]
- Sign Gradient Descent-based Neuronal Dynamics: ANN-to-SNN Conversion Beyond ReLU NetworkHyunseok Oh, Youngki Lee. [doi]
- StrokeNUWA - Tokenizing Strokes for Vector Graphic SynthesisZecheng Tang, Chenfei Wu, Zekai Zhang, Minheng Ni, Shengming Yin, Yu Liu, Zhengyuan Yang, Lijuan Wang, Zicheng Liu 0001, Juntao Li, Nan Duan. [doi]
- RigorLLM: Resilient Guardrails for Large Language Models against Undesired ContentZhuowen Yuan, Zidi Xiong, Yi Zeng 0005, Ning Yu, Ruoxi Jia 0001, Dawn Song, Bo Li 0026. [doi]
- Supervised Matrix Factorization: Local Landscape Analysis and ApplicationsJoowon Lee, Hanbaek Lyu, Weixin Yao. [doi]
- Learning Universal PredictorsJordi Grau-Moya, Tim Genewein, Marcus Hutter, Laurent Orseau, Grégoire Delétang, Elliot Catt, Anian Ruoss, Li Kevin Wenliang, Christopher Mattern, Matthew Aitchison, Joel Veness. [doi]
- Optimal Transport for Structure Learning Under Missing DataVy Vo, He Zhao 0001, Trung Le, Edwin V. Bonilla, Dinh Phung 0001. [doi]
- Stereographic Spherical Sliced Wasserstein DistancesHuy Tran, Yikun Bai, Abihith Kothapalli, Ashkan Shahbazi, Xinran Liu, Rocio Diaz Martin, Soheil Kolouri. [doi]
- OODRobustBench: a Benchmark and Large-Scale Analysis of Adversarial Robustness under Distribution ShiftLin Li, Yifei Wang, Chawin Sitawarin, Michael W. Spratling. [doi]
- Self-cognitive Denoising in the Presence of Multiple Noisy Label SourcesYi-Xuan Sun, Ya-Lin Zhang 0001, Bin Han, Longfei Li, Jun Zhou. [doi]
- MagicPose: Realistic Human Poses and Facial Expressions Retargeting with Identity-aware DiffusionDi Chang, Yichun Shi, Quankai Gao, Hongyi Xu, Jessica Fu, Guoxian Song, Qing Yan, Yizhe Zhu, Xiao Yang, Mohammad Soleymani 0001. [doi]
- Exploiting Negative Samples: A Catalyst for Cohort Discovery in Healthcare AnalyticsKaiping Zheng, Horng Ruey Chua, Melanie Herschel, H. V. Jagadish, Beng Chin Ooi, James Wei Luen Yip. [doi]
- Inherent Trade-Offs between Diversity and Stability in Multi-Task BenchmarksGuanhua Zhang, Moritz Hardt. [doi]
- Thermometer: Towards Universal Calibration for Large Language ModelsMaohao Shen, Subhro Das, Kristjan H. Greenewald, Prasanna Sattigeri, Gregory W. Wornell, Soumya Ghosh. [doi]
- Enforcing Constraints in RNA Secondary Structure Predictions: A Post-Processing Framework Based on the Assignment ProblemGeewon Suh, Gyeongjo Hwang, Seokjun Kang, Doojin Baek, Mingeun Kang. [doi]
- Privacy-Preserving Instructions for Aligning Large Language ModelsDa Yu, Peter Kairouz, Sewoong Oh, Zheng Xu 0002. [doi]
- CarbonNovo: Joint Design of Protein Structure and Sequence Using a Unified Energy-based ModelMilong Ren, Tian Zhu, Haicang Zhang. [doi]
- Visual Transformer with Differentiable Channel Selection: An Information Bottleneck Inspired ApproachYancheng Wang, Ping Li 0001, Yingzhen Yang. [doi]
- Latent Optimal Paths by Gumbel Propagation for Variational Bayesian Dynamic ProgrammingXinlei Niu, Christian Walder, Jing Zhang 0052, Charles Patrick Martin. [doi]
- Retrieval-Augmented Score Distillation for Text-to-3D GenerationJunyoung Seo, Susung Hong, Wooseok Jang, Inès Hyeonsu Kim, Minseop Kwak, Doyup Lee, Seungryong Kim. [doi]
- Executable Code Actions Elicit Better LLM AgentsXingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhang 0002, Yunzhu Li, Hao Peng 0009, Heng Ji. [doi]
- Advancing Dynamic Sparse Training by Exploring Optimization OpportunitiesJie Ji, Gen Li, Lu Yin 0006, Minghai Qin, Geng Yuan, Linke Guo, Shiwei Liu 0003, Xiaolong Ma. [doi]
- Preventing Model Collapse in Gaussian Process Latent Variable ModelsYing Li, Zhidi Lin, Feng Yin, Michael Minyi Zhang. [doi]
- BWS: Best Window Selection Based on Sample Scores for Data Pruning across Broad RangesHoyong Choi, Nohyun Ki, Hye Won Chung. [doi]
- On the Maximal Local Disparity of Fairness-Aware ClassifiersJinqiu Jin, Haoxuan Li, Fuli Feng. [doi]
- A Fixed-Point Approach for Causal Generative ModelingMeyer Scetbon, Joel Jennings, Agrin Hilmkil, Cheng Zhang 0005, Chao Ma 0019. [doi]
- Tilting the Odds at the Lottery: the Interplay of Overparameterisation and Curricula in Neural NetworksStefano Sarao Mannelli, Yaraslau Ivashinka, Andrew M. Saxe, Luca Saglietti. [doi]
- IW-GAE: Importance weighted group accuracy estimation for improved calibration and model selection in unsupervised domain adaptationTaejong Joo, Diego Klabjan. [doi]
- COALA: A Practical and Vision-Centric Federated Learning PlatformWeiming Zhuang, Jian Xu, Chen Chen 0043, Jingtao Li, Lingjuan Lyu. [doi]
- Refining Minimax Regret for Unsupervised Environment DesignMichael Beukman, Samuel Coward, Michael Matthews, Mattie Fellows, Minqi Jiang, Michael D. Dennis, Jakob Nicolaus Foerster. [doi]
- Chain-of-Thought Predictive ControlZhiwei Jia, Vineet Thumuluri, Fangchen Liu, Linghao Chen, Zhiao Huang, Hao Su 0001. [doi]
- DataFreeShield: Defending Adversarial Attacks without Training DataHyeyoon Lee, Kanghyun Choi, Dain Kwon, Sunjong Park, Mayoore Selvarasa Jaiswal, Noseong Park, Jonghyun Choi, Jinho Lee. [doi]
- Coresets for Multiple ℓp RegressionDavid P. Woodruff, Taisuke Yasuda 0002. [doi]
- Do Transformer World Models Give Better Policy Gradients?Michel Ma, Tianwei Ni, Clement Gehring, Pierluca D'Oro, Pierre-Luc Bacon. [doi]
- SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDPSubhojyoti Mukherjee, Josiah P. Hanna, Robert D. Nowak. [doi]
- ByMI: Byzantine Machine Identification with False Discovery Rate ControlChengde Qian, Mengyuan Wang, Haojie Ren, Changliang Zou. [doi]
- Contrasting Multiple Representations with the Multi-Marginal Matching GapZoe Piran, Michal Klein, James Thornton, Marco Cuturi. [doi]
- Let Go of Your Labels with Unsupervised TransferArtyom Gadetsky, Yulun Jiang, Maria Brbic. [doi]
- Reservoir Computing for Short High-Dimensional Time Series: an Application to SARS-CoV-2 Hospitalization ForecastThomas Ferté, Dan Dutartre, Boris P. Hejblum, Romain Griffier, Vianney Jouhet, Rodolphe Thiébaut, Pierrick Legrand, Xavier Hinaut. [doi]
- Second-Order Uncertainty Quantification: A Distance-Based ApproachYusuf Sale, Viktor Bengs, Michele Caprio, Eyke Hüllermeier. [doi]
- Symmetric Matrix Completion with ReLU SamplingHuikang Liu, Peng Wang 0098, Longxiu Huang, Qing Qu 0001, Laura Balzano. [doi]
- Combinatorial Multivariant Multi-Armed Bandits with Applications to Episodic Reinforcement Learning and BeyondXutong Liu 0002, Siwei Wang 0002, Jinhang Zuo, Han Zhong, Xuchuang Wang, Zhiyong Wang, Shuai Li 0010, Mohammad Hajiesmaili, John C. S. Lui, Wei Chen 0020. [doi]
- DFA-RAG: Conversational Semantic Router for Large Language Model with Definite Finite AutomatonYiyou Sun, Junjie Hu, Wei Cheng 0002, Haifeng Chen. [doi]
- Pragmatic Feature Preferences: Learning Reward-Relevant Preferences from Human InputAndi Peng, Yuying Sun, Tianmin Shu, David Abel. [doi]
- Consistent Diffusion Meets Tweedie: Training Exact Ambient Diffusion Models with Noisy DataGiannis Daras, Alex Dimakis, Constantinos Daskalakis. [doi]
- Variational Partial Group Convolutions for Input-Aware Partial Equivariance of Rotations and Color-ShiftsHyunsu Kim, Yegon Kim, Hongseok Yang, Juho Lee. [doi]
- Bring Your Own (Non-Robust) Algorithm to Solve Robust MDPs by Estimating The Worst KernelUri Gadot, Kaixin Wang, Navdeep Kumar, Kfir Yehuda Levy, Shie Mannor. [doi]
- Integrated Hardware Architecture and Device Placement SearchIrene Wang, Jakub Tarnawski, Amar Phanishayee, Divya Mahajan 0001. [doi]
- Performative Prediction with Bandit Feedback: Learning through ReparameterizationYatong Chen, Wei Tang, Chien-Ju Ho, Yang Liu 0018. [doi]
- Deep Functional Factor Models: Forecasting High-Dimensional Functional Time Series via Bayesian Nonparametric FactorizationYirui Liu, Xinghao Qiao, Yulong Pei, Liying Wang. [doi]
- Multi-Sender Persuasion: A Computational PerspectiveSafwan Hossain, Tonghan Wang 0003, Tao Lin 0013, Yiling Chen 0001, David C. Parkes, Haifeng Xu. [doi]
- Adaptive Proximal Gradient Methods Are Universal Without ApproximationKonstantinos A. Oikonomidis, Emanuel Laude, Puya Latafat, Andreas Themelis, Panagiotis Patrinos. [doi]
- Agnostic Learning of Mixed Linear Regressions with EM and AM AlgorithmsAvishek Ghosh, Arya Mazumdar. [doi]
- Provably Efficient Reinforcement Learning for Adversarial Restless Multi-Armed Bandits with Unknown Transitions and Bandit FeedbackGuojun Xiong, Jian Li 0008. [doi]
- Analyzing Dα seeding for k-meansÉtienne Bamas, Sai Ganesh Nagarajan, Ola Svensson. [doi]
- Robustly Learning Single-Index Models via Alignment SharpnessNikos Zarifis, Puqian Wang, Ilias Diakonikolas, Jelena Diakonikolas. [doi]
- Slow and Steady Wins the Race: Maintaining Plasticity with Hare and Tortoise NetworksHoJoon Lee, Hyeonseo Cho, Hyunseung Kim, Donghu Kim, Dugki Min, Jaegul Choo, Clare Lyle. [doi]
- Triplet Interaction Improves Graph Transformers: Accurate Molecular Graph Learning with Triplet Graph TransformersMd. Shamim Hussain, Mohammed J. Zaki, Dharmashankar Subramanian. [doi]
- Detecting Any instruction-to-answer interaction relationship: Universal Instruction-to-Answer Navigator for Med-VQAZhongze Wu, Hongyan Xu, Yitian Long, Shan You, Xiu Su, Jun Long, Yueyi Luo, Chang Xu 0002. [doi]
- Unbiased Multi-Label Learning from Crowdsourced AnnotationsMingxuan Xia, Zenan Huang, Runze Wu, Gengyu Lyu, Junbo Zhao 0002, Gang Chen 0001, Haobo Wang. [doi]
- Fair Risk Control: A Generalized Framework for Calibrating Multi-group Fairness RisksLujing Zhang, Aaron Roth 0001, Linjun Zhang. [doi]
- Hard Tasks First: Multi-Task Reinforcement Learning Through Task SchedulingMyungsik Cho, Jongeui Park, Suyoung Lee, Youngchul Sung. [doi]
- RLAIF vs. RLHF: Scaling Reinforcement Learning from Human Feedback with AI FeedbackHarrison Lee 0001, Samrat Phatale, Hassan Mansoor, Thomas Mesnard, Johan Ferret, Kellie Lu, Colton Bishop, Ethan Hall, Victor Carbune, Abhinav Rastogi, Sushant Prakash. [doi]
- Structure-Aware E(3)-Invariant Molecular Conformer Aggregation NetworksDuy Minh Ho Nguyen, Nina Lukashina, Tai Nguyen 0008, An T. Le 0001, TrungTin Nguyen, Nhat Ho, Jan Peters 0001, Daniel Sonntag, Viktor Zaverkin, Mathias Niepert. [doi]
- Sliced-Wasserstein Estimation with Spherical Harmonics as Control VariatesRémi Leluc, Aymeric Dieuleveut, François Portier, Johan Segers, Aigerim Zhuman. [doi]
- A Theory of Non-Linear Feature Learning with One Gradient Step in Two-Layer Neural NetworksBehrad Moniri, Donghwan Lee, Hamed Hassani, Edgar Dobriban. [doi]
- Learning in Feature Spaces via Coupled Covariances: Asymmetric Kernel SVD and Nyström methodQinghua Tao, Francesco Tonin, Alex Lambert, Yingyi Chen, Panagiotis Patrinos, Johan A. K. Suykens. [doi]
- Explaining Graph Neural Networks via Structure-aware Interaction IndexNgoc Bui, Hieu Trung Nguyen, Viet Anh Nguyen, Rex Ying. [doi]
- Image Hijacks: Adversarial Images can Control Generative Models at RuntimeLuke Bailey, Euan Ong, Stuart Russell 0001, Scott Emmons. [doi]
- Deep Stochastic MechanicsElena Orlova, Aleksei Ustimenko, Ruoxi Jia 0001, Peter Y. Lu, Rebecca Willett. [doi]
- E2GAN: Efficient Training of Efficient GANs for Image-to-Image TranslationYifan Gong 0004, Zheng Zhan 0001, Qing Jin, Yanyu Li, Yerlan Idelbayev, Xian Liu, Andrey Zharkov, Kfir Aberman, Sergey Tulyakov, Yanzhi Wang, Jian Ren. [doi]
- Harmonic Self-Conditioned Flow Matching for joint Multi-Ligand Docking and Binding Site DesignHannes Stärk, Bowen Jing, Regina Barzilay, Tommi S. Jaakkola. [doi]
- LLM and Simulation as Bilevel Optimizers: A New Paradigm to Advance Physical Scientific DiscoveryPingchuan Ma, Tsun-Hsuan Wang, Minghao Guo, Zhiqing Sun, Joshua B. Tenenbaum, Daniela Rus, Chuang Gan, Wojciech Matusik. [doi]
- Contrastive Representation for Data Filtering in Cross-Domain Offline Reinforcement LearningXiaoyu Wen, Chenjia Bai, Kang Xu, Xudong Yu, Yang Zhang, Xuelong Li 0001, Zhen Wang 0004. [doi]
- New Bounds on the Cohesion of Complete-link and Other Linkage Methods for Agglomerative ClusteringSanjoy Dasgupta, Eduardo Sany Laber. [doi]
- Local vs. Global Interpretability: A Computational Complexity PerspectiveShahaf Bassan, Guy Amir, Guy Katz. [doi]
- Position: Evolving AI Collectives Enhance Human Diversity and Enable Self-RegulationShiyang Lai, Yujin Potter, Junsol Kim, Richard Zhuang, Dawn Song, James Evans. [doi]
- Referee Can Play: An Alternative Approach to Conditional Generation via Model InversionXuantong Liu, Tianyang Hu, Wenjia Wang, Kenji Kawaguchi, Yuan Yao 0011. [doi]
- Effective Federated Graph MatchingYang Zhou 0001, Zijie Zhang 0001, Zeru Zhang, Lingjuan Lyu, Wei-Shinn Ku. [doi]
- CF-OPT: Counterfactual Explanations for Structured PredictionGermain Vivier-Ardisson, Alexandre Forel, Axel Parmentier, Thibaut Vidal. [doi]
- A2Q+: Improving Accumulator-Aware Weight QuantizationIan Colbert, Alessandro Pappalardo, Jakoba Petri-Koenig, Yaman Umuroglu. [doi]
- Optimal Batched Linear BanditsXuanfei Ren, Tianyuan Jin, Pan Xu 0002. [doi]
- Uncertainty Estimation by Density Aware Evidential Deep LearningTaeseong Yoon, Heeyoung Kim. [doi]
- Fast and Sample Efficient Multi-Task Representation Learning in Stochastic Contextual BanditsJiabin Lin, Shana Moothedath, Namrata Vaswani. [doi]
- Towards Robust Model-Based Reinforcement Learning Against Adversarial CorruptionChenlu Ye, Jiafan He, Quanquan Gu, Tong Zhang 0001. [doi]
- Emergence of In-Context Reinforcement Learning from Noise DistillationIlya Zisman, Vladislav Kurenkov, Alexander Nikulin, Viacheslav Sinii, Sergey Kolesnikov. [doi]
- Expressivity and Generalization: Fragment-Biases for Molecular GNNsTom Wollschläger, Niklas Kemper, Leon Hetzel, Johanna Sommer, Stephan Günnemann. [doi]
- RoboGen: Towards Unleashing Infinite Data for Automated Robot Learning via Generative SimulationYufei Wang, Zhou Xian, Feng Chen, Tsun-Hsuan Wang, Yian Wang, Katerina Fragkiadaki, Zackory Erickson, David Held, Chuang Gan. [doi]
- Information Flow in Self-Supervised LearningZhiquan Tan, Jingqin Yang, Weiran Huang 0001, Yang Yuan, Yifan Zhang. [doi]
- Fast, Scalable, Warm-Start Semidefinite Programming with Spectral Bundling and SketchingRico Angell, Andrew McCallum. [doi]
- Multi-View Stochastic Block ModelsVincent Cohen-Addad, Tommaso d'Orsi, Silvio Lattanzi, Rajai Nasser. [doi]
- CauDiTS: Causal Disentangled Domain Adaptation of Multivariate Time SeriesJunxin Lu, Shiliang Sun. [doi]
- BiSHop: Bi-Directional Cellular Learning for Tabular Data with Generalized Sparse Modern Hopfield ModelChenwei Xu, Yu Chao Huang, Jerry Yao-Chieh Hu, Weijian Li, Ammar Gilani, Hsi-Sheng Goan, Han Liu. [doi]
- Tilt and Average : Geometric Adjustment of the Last Layer for RecalibrationGyusang Cho, Chan-Hyun Youn. [doi]
- Towards Theoretical Understandings of Self-Consuming Generative ModelsShi Fu, Sen Zhang 0006, Yingjie Wang 0007, Xinmei Tian 0001, Dacheng Tao. [doi]
- Learning with 3D rotations, a hitchhiker's guide to SO(3)Andreas René Geist, Jonas Frey, Mikel Zhobro, Anna Levina, Georg Martius. [doi]
- On the Complexity of Finite-Sum Smooth Optimization under the Polyak-Łojasiewicz ConditionYunyan Bai, Yuxing Liu, Luo Luo. [doi]
- Guarantees for Nonlinear Representation Learning: Non-identical Covariates, Dependent Data, Fewer SamplesThomas T. C. K. Zhang, Bruce D. Lee, Ingvar M. Ziemann, George J. Pappas, Nikolai Matni. [doi]
- PAC-Bayesian Error Bound, via Rényi Divergence, for a Class of Linear Time-Invariant State-Space ModelsDeividas Eringis, John Leth, Zheng-Hua Tan, Rafal Wisniewski, Mihály Petreczky. [doi]
- ConTextual: Evaluating Context-Sensitive Text-Rich Visual Reasoning in Large Multimodal ModelsRohan Wadhawan, Hritik Bansal, Kai-Wei Chang, Nanyun Peng. [doi]
- Dirichlet Flow Matching with Applications to DNA Sequence DesignHannes Stärk, Bowen Jing, Chenyu Wang, Gabriele Corso, Bonnie Berger, Regina Barzilay, Tommi S. Jaakkola. [doi]
- Defense against Backdoor Attack on Pre-trained Language Models via Head Pruning and Attention NormalizationXingyi Zhao, Depeng Xu, Shuhan Yuan. [doi]
- Deeper or Wider: A Perspective from Optimal Generalization Error with Sobolev LossYahong Yang, Juncai He. [doi]
- Weakly-Supervised Residual Evidential Learning for Multi-Instance Uncertainty EstimationPei Liu 0008, Luping Ji. [doi]
- On Prompt-Driven Safeguarding for Large Language ModelsChujie Zheng, Fan Yin, Hao Zhou 0012, Fandong Meng, Jie Zhou 0016, Kai-Wei Chang, Minlie Huang, Nanyun Peng. [doi]
- Graph External Attention Enhanced TransformerJianqing Liang, Min Chen, Jiye Liang. [doi]
- DiffFPR: Diffusion Prior for Oversampled Fourier Phase RetrievalJi Li, Chao Wang. [doi]
- On the Last-Iterate Convergence of Shuffling Gradient MethodsZijian Liu, Zhengyuan Zhou. [doi]
- Proteus: Exploring Protein Structure Generation for Enhanced Designability and EfficiencyChentong Wang, Yannan Qu, Zhangzhi Peng, Yukai Wang, Hongli Zhu, Dachuan Chen, Longxing Cao. [doi]
- Private and Federated Stochastic Convex Optimization: Efficient Strategies for Centralized SystemsRoie Reshef, Kfir Yehuda Levy. [doi]
- Differentially Private Post-Processing for Fair RegressionRuicheng Xian, Qiaobo Li, Gautam Kamath 0001, Han Zhao 0002. [doi]
- Unmasking Vulnerabilities: Cardinality Sketches under Adaptive InputsSara Ahmadian, Edith Cohen. [doi]
- On the Independence Assumption in Neurosymbolic LearningEmile van Krieken, Pasquale Minervini, Edoardo M. Ponti, Antonio Vergari. [doi]
- BetterV: Controlled Verilog Generation with Discriminative GuidanceZehua Pei, Hui-Ling Zhen, Mingxuan Yuan, Yu Huang, Bei Yu 0001. [doi]
- Bridging Mini-Batch and Asymptotic Analysis in Contrastive Learning: From InfoNCE to Kernel-Based LossesPanagiotis Koromilas, Giorgos Bouritsas, Theodoros Giannakopoulos, Mihalis Nicolaou, Yannis Panagakis. [doi]
- In-Context Decision Transformer: Reinforcement Learning via Hierarchical Chain-of-ThoughtSili Huang, Jifeng Hu, Hechang Chen, Lichao Sun, Bo Yang 0002. [doi]
- Sequential Disentanglement by Extracting Static Information From A Single Sequence ElementNimrod Berman, Ilan Naiman, Idan Arbiv, Gal Fadlon, Omri Azencot. [doi]
- Sequential Kernel Goodness-of-fit TestingZhengyu Zhou, Weiwei Liu 0003. [doi]
- Conditional Normalizing Flows for Active Learning of Coarse-Grained Molecular RepresentationsHenrik Schopmans, Pascal Friederich. [doi]
- Selective Mixup Helps with Distribution Shifts, But Not (Only) because of MixupDamien Teney, Jindong Wang 0001, Ehsan Abbasnejad. [doi]
- Certifiably Byzantine-Robust Federated Conformal PredictionMintong Kang, Zhen Lin, Jimeng Sun 0001, Cao Xiao, Bo Li 0026. [doi]
- Graph Neural Networks with a Distribution of Parametrized GraphsSee Hian Lee, Feng Ji, Kelin Xia, Wee-Peng Tay. [doi]
- Learning Label Shift Correction for Test-Agnostic Long-Tailed RecognitionTong Wei 0001, Zhen Mao, Zi-Hao Zhou, Yuanyu Wan, Min-Ling Zhang. [doi]
- DFD: Distilling the Feature Disparity Differently for DetectorsKang Liu, Yingyi Zhang, Jingyun Zhang, Jinmin Li, Jun Wang, Shaoming Wang, Chun Yuan, Rizen Guo. [doi]
- Generalized Preference Optimization: A Unified Approach to Offline AlignmentYunhao Tang, Zhaohan Daniel Guo, Zeyu Zheng, Daniele Calandriello, Rémi Munos, Mark Rowland, Pierre Harvey Richemond, Michal Valko, Bernardo Ávila Pires, Bilal Piot. [doi]
- Degeneration-free Policy Optimization: RL Fine-Tuning for Language Models without DegenerationYoungsoo Jang, Geon-hyeong Kim, Byoungjip Kim, Yu-Jin Kim, Honglak Lee, Moontae Lee. [doi]
- Do Language Models Exhibit the Same Cognitive Biases in Problem Solving as Human Learners?Andreas Opedal, Alessandro Stolfo, Haruki Shirakami, Ying Jiao, Ryan Cotterell, Bernhard Schölkopf, Abulhair Saparov, Mrinmaya Sachan. [doi]
- Privacy-Preserving Embedding via Look-up Table Evaluation with Fully Homomorphic EncryptionJaeyun Kim, Saerom Park, Joohee Lee, Jung Hee Cheon. [doi]
- Accelerated Policy Gradient: On the Convergence Rates of the Nesterov Momentum for Reinforcement LearningYen-Ju Chen, Nai-Chieh Huang, Ching-Pei Lee, Ping-Chun Hsieh. [doi]
- Auctionformer: A Unified Deep Learning Algorithm for Solving Equilibrium Strategies in Auction GamesKexin Huang, Ziqian Chen, Xue Wang, Chongming Gao, Jinyang Gao, Bolin Ding, Xiang Wang 0010. [doi]
- Initial Guessing Bias: How Untrained Networks Favor Some ClassesEmanuele Francazi, Aurélien Lucchi, Marco Baity-Jesi. [doi]
- Offline Training of Language Model Agents with Functions as Learnable WeightsShaokun Zhang, Jieyu Zhang, Jiale Liu, Linxin Song, Chi Wang 0001, Ranjay Krishna, Qingyun Wu. [doi]
- Data-Efficient Learning via Clustering-Based Sensitivity Sampling: Foundation Models and BeyondKyriakos Axiotis, Vincent Cohen-Addad, Monika Henzinger, Sammy Jerome, Vahab Mirrokni, David Saulpic, David P. Woodruff, Michael Wunder. [doi]
- Coarse-To-Fine Tensor Trains for Compact Visual RepresentationsSebastian Loeschcke, Dan Wang, Christian Leth-Espensen, Serge J. Belongie, Michael J. Kastoryano, Sagie Benaim. [doi]
- Spike Distance Function as a Learning Objective for Spike PredictionKevin Doran, Marvin Seifert, Carola A. M. Yovanovich, Tom Baden. [doi]
- Debating with More Persuasive LLMs Leads to More Truthful AnswersAkbir Khan, John Hughes, Dan Valentine, Laura Ruis, Kshitij Sachan, Ansh Radhakrishnan, Edward Grefenstette, Samuel R. Bowman, Tim Rocktäschel, Ethan Perez. [doi]
- The Stronger the Diffusion Model, the Easier the Backdoor: Data Poisoning to Induce Copyright BreachesWithout Adjusting Finetuning PipelineHaonan Wang, Qianli Shen, Yao Tong, Yang Zhang, Kenji Kawaguchi. [doi]
- A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and ToxicityAndrew Lee, Xiaoyan Bai, Itamar Pres, Martin Wattenberg, Jonathan K. Kummerfeld, Rada Mihalcea. [doi]
- EMC2: Efficient MCMC Negative Sampling for Contrastive Learning with Global ConvergenceChung-Yiu Yau, Hoi-To Wai, Parameswaran Raman, Soumajyoti Sarkar, Mingyi Hong 0001. [doi]
- Spotting LLMs With Binoculars: Zero-Shot Detection of Machine-Generated TextAbhimanyu Hans, Avi Schwarzschild, Valeriia Cherepanova, Hamid Kazemi, Aniruddha Saha, Micah Goldblum, Jonas Geiping, Tom Goldstein. [doi]
- Polynomial-based Self-Attention for Table Representation LearningJayoung Kim 0002, Yehjin Shin, Jeongwhan Choi 0002, Hyowon Wi, Noseong Park. [doi]
- Understanding Retrieval-Augmented Task Adaptation for Vision-Language ModelsYifei Ming, Yixuan Li 0001. [doi]
- In-context Convergence of TransformersYu Huang, Yuan Cheng, Yingbin Liang. [doi]
- AdsorbDiff: Adsorbate Placement via Conditional Denoising DiffusionAdeesh Kolluru, John R. Kitchin. [doi]
- Imitation Learning from Purified DemonstrationsYunke Wang, Minjing Dong, Yukun Zhao, Bo Du 0001, Chang Xu 0002. [doi]
- The Entropy Enigma: Success and Failure of Entropy MinimizationOri Press, Ravid Shwartz-Ziv, Yann LeCun, Matthias Bethge. [doi]
- The Privacy Power of Correlated Noise in Decentralized LearningYoussef Allouah, Anastasia Koloskova, Aymane El Firdoussi, Martin Jaggi, Rachid Guerraoui. [doi]
- A Nearly Optimal Single Loop Algorithm for Stochastic Bilevel Optimization under Unbounded SmoothnessXiaochuan Gong, Jie Hao, Mingrui Liu. [doi]
- LPGD: A General Framework for Backpropagation through Embedded Optimization LayersAnselm Paulus, Georg Martius, Vít Musil. [doi]
- Harmonizing Generalization and Personalization in Federated Prompt LearningTianyu Cui, Hongxia Li, Jingya Wang, Ye Shi 0001. [doi]
- Discovering Features with Synergistic Interactions in Multiple ViewsChohee Kim, Mihaela van der Schaar, ChangHee Lee. [doi]
- Compute Better Spent: Replacing Dense Layers with Structured MatricesShikai Qiu, Andres Potapczynski, Marc Anton Finzi, Micah Goldblum, Andrew Gordon Wilson. [doi]
- Chasing Convex Functions with Long-term ConstraintsAdam Lechowicz, Nicolas Christianson, Bo Sun 0004, Noman Bashir, Mohammad Hajiesmaili, Adam Wierman, Prashant J. Shenoy. [doi]
- Graph Mixup on Approximate Gromov-Wasserstein GeodesicsZhichen Zeng, Ruizhong Qiu, Zhe Xu 0007, Zhining Liu 0002, Yuchen Yan, Tianxin Wei, Lei Ying 0001, Jingrui He, Hanghang Tong. [doi]
- Rethinking DP-SGD in Discrete Domain: Exploring Logistic Distribution in the Realm of signSGDJonggyu Jang, Seongjin Hwang, Hyun Jong Yang. [doi]
- Beyond Implicit Bias: The Insignificance of SGD Noise in Online LearningNikhil Vyas 0001, Depen Morwani, Rosie Zhao, Gal Kaplun, Sham M. Kakade, Boaz Barak. [doi]
- Benign Overfitting in Adversarial Training of Neural NetworksYunjuan Wang, Kaibo Zhang, Raman Arora. [doi]
- Random Exploration in Bayesian Optimization: Order-Optimal Regret and Computational EfficiencySudeep Salgia, Sattar Vakili, Qing Zhao 0001. [doi]
- HarmonyDream: Task Harmonization Inside World ModelsHaoyu Ma, Jialong Wu 0001, Ningya Feng, Chenjun Xiao, Dong Li 0016, Jianye Hao, Jianmin Wang 0001, Mingsheng Long. [doi]
- Characterizing Truthfulness in Large Language Model Generations with Local Intrinsic DimensionFan Yin, Jayanth Srinivasa, Kai-Wei Chang. [doi]
- Auto-Linear Phenomenon in Subsurface ImagingYinan Feng, Yinpeng Chen, Peng Jin, Shihang Feng, Youzuo Lin. [doi]
- Agnostic Sample Compression Schemes for RegressionIdan Attias, Steve Hanneke, Aryeh Kontorovich, Menachem Sadigurschi. [doi]
- A Resilient and Accessible Distribution-Preserving Watermark for Large Language ModelsYihan Wu, Zhengmian Hu, Junfeng Guo, Hongyang Zhang 0001, Heng Huang. [doi]
- CW Complex Hypothesis for Image DataYi Wang, Zhiren Wang. [doi]
- A Multimodal Automated Interpretability AgentTamar Rott Shaham, Sarah Schwettmann, Franklin Wang, Achyuta Rajaram, Evan Hernandez, Jacob Andreas, Antonio Torralba 0001. [doi]
- Lyapunov-stable Neural Control for State and Output Feedback: A Novel FormulationLujie Yang, Hongkai Dai, Zhouxing Shi, Cho-Jui Hsieh, Russ Tedrake, Huan Zhang 0001. [doi]
- Graph As Point SetXiyuan Wang, Pan Li 0005, Muhan Zhang. [doi]
- UniAudio: Towards Universal Audio Generation with Large Language ModelsDongchao Yang, Jinchuan Tian, Xu Tan 0003, Rongjie Huang, Songxiang Liu, Haohan Guo, Xuankai Chang, Jiatong Shi, Sheng Zhao, Jiang Bian 0002, Zhou Zhao, Xixin Wu, Helen M. Meng. [doi]
- Understanding the Training Speedup from Sampling with Approximate LossesRudrajit Das, Xi Chen, Bertram Ieong, Parikshit Bansal, Sujay Sanghavi. [doi]
- Fine-tuning Reinforcement Learning Models is Secretly a Forgetting Mitigation ProblemMaciej Wolczyk, Bartlomiej Cupial, Mateusz Ostaszewski, Michal Bortkiewicz, Michal Zajac 0005, Razvan Pascanu, Lukasz Kucinski, Piotr Milos. [doi]
- Graph Neural Stochastic Diffusion for Estimating Uncertainty in Node ClassificationXixun Lin, Wenxiao Zhang, Fengzhao Shi, Chuan Zhou 0001, Lixin Zou, Xiangyu Zhao 0001, Dawei Yin, Shirui Pan, Yanan Cao. [doi]
- Disparate Impact on Group Accuracy of Linearization for Private InferenceSaswat Das, Marco Romanelli 0002, Ferdinando Fioretto. [doi]
- Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space ModelLianghui Zhu, Bencheng Liao, Qian Zhang 0009, Xinlong Wang, Wenyu Liu 0001, Xinggang Wang. [doi]
- Improving Interpretation Faithfulness for Vision TransformersLijie Hu, Yixin Liu 0002, Ninghao Liu, Mengdi Huai, Lichao Sun 0001, Di Wang 0015. [doi]
- Gradual Divergence for Seamless Adaptation: A Novel Domain Incremental Learning MethodKishaan Jeeveswaran, Elahe Arani, Bahram Zonooz. [doi]
- Modeling Caption Diversity in Contrastive Vision-Language PretrainingSamuel Lavoie, Polina Kirichenko, Mark Ibrahim, Mido Assran, Andrew Gordon Wilson, Aaron C. Courville, Nicolas Ballas. [doi]
- QuIP#: Even Better LLM Quantization with Hadamard Incoherence and Lattice CodebooksAlbert Tseng, Jerry Chee, Qingyao Sun, Volodymyr Kuleshov, Christopher De Sa. [doi]
- Equivariant Diffusion for Crystal Structure PredictionPeijia Lin, Pin Chen, Rui Jiao, Qing Mo, Jianhuan Cen, Wenbing Huang 0001, Yang Liu 0005, Dan Huang, Yutong Lu. [doi]
- Provably Better Explanations with Optimized Aggregation of Feature AttributionsThomas Decker 0004, Ananta R. Bhattarai, Jindong Gu, Volker Tresp, Florian Buettner 0001. [doi]
- OSSCAR: One-Shot Structured Pruning in Vision and Language Models with Combinatorial OptimizationXiang Meng, Shibal Ibrahim, Kayhan Behdin, Hussein Hazimeh 0001, Natalia Ponomareva 0001, Rahul Mazumder. [doi]
- Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement LearningMichal Nauman, Michal Bortkiewicz, Piotr Milos, Tomasz Trzcinski, Mateusz Ostaszewski, Marek Cygan. [doi]
- PIPER: Primitive-Informed Preference-based Hierarchical Reinforcement Learning via Hindsight RelabelingUtsav Singh, Wesley A. Suttle, Brian M. Sadler, Vinay P. Namboodiri, Amrit S. Bedi. [doi]
- Safety Fine-Tuning at (Almost) No Cost: A Baseline for Vision Large Language ModelsYongshuo Zong, Ondrej Bohdal, Tingyang Yu, Yongxin Yang, Timothy M. Hospedales. [doi]
- LoRA+: Efficient Low Rank Adaptation of Large ModelsSoufiane Hayou, Nikhil Ghosh, Bin Yu 0001. [doi]
- Fast White-Box Adversarial Streaming Without a Random OracleYing Feng, Aayush Jain, David P. Woodruff. [doi]
- Out of the Ordinary: Spectrally Adapting Regression for Covariate ShiftBenjamin Eyre, Elliot Creager, David Madras, Vardan Papyan, Richard S. Zemel. [doi]
- Fast Peer Adaptation with Context-aware ExplorationLong Ma, Yuanfei Wang, Fangwei Zhong, Song Chun Zhu, Yizhou Wang 0001. [doi]
- Verification of Machine Unlearning is FragileBinchi Zhang, Zihan Chen, Cong Shen, Jundong Li. [doi]
- On Hypothesis Transfer Learning of Functional Linear ModelsHaotian Lin 0002, Matthew Reimherr. [doi]
- SΩI: Score-based O-INFORMATION EstimationMustapha Bounoua, Giulio Franzese, Pietro Michiardi. [doi]
- Causally Motivated Personalized Federated Invariant Learning with Shortcut-Averse Information-Theoretic RegularizationXueyang Tang, Song Guo 0001, Jingcai Guo, Jie Zhang 0076, Yue Yu. [doi]
- Sampling-based Multi-dimensional RecalibrationYoungseog Chung, Ian Char, Jeff Schneider 0001. [doi]
- Probabilistic Generating Circuits - DemystifiedSanyam Agarwal, Markus Bläser. [doi]
- From Fourier to Neural ODEs: Flow Matching for Modeling Complex SystemsXin Li, Jingdong Zhang, Qunxi Zhu, Chengli Zhao, Xue Zhang, Xiaojun Duan, Wei Lin 0003. [doi]
- Learning Adaptive and View-Invariant Vision Transformer for Real-Time UAV TrackingYongxin Li, Mengyuan Liu, You Wu, Xucheng Wang, Xiangyang Yang, Shuiwang Li. [doi]
- Deep Regression Representation Learning with TopologyShihao Zhang, Kenji Kawaguchi, Angela Yao. [doi]
- From Biased Selective Labels to Pseudo-Labels: An Expectation-Maximization Framework for Learning from Biased DecisionsTrenton Chang, Jenna Wiens. [doi]
- DistiLLM: Towards Streamlined Distillation for Large Language ModelsJongwoo Ko, Sungnyun Kim, Tianyi Chen, Se-Young Yun. [doi]
- When is Transfer Learning Possible?My Phan, Kianté Brantley, Stephanie Milani, Soroush Mehri, Gokul Swamy, Geoffrey J. Gordon. [doi]
- SILVER: Single-loop variance reduction and application to federated learningKazusato Oko, Shunta Akiyama, Denny Wu, Tomoya Murata, Taiji Suzuki. [doi]
- Expand-and-Cluster: Parameter Recovery of Neural NetworksFlavio Martinelli, Berfin Simsek, Wulfram Gerstner, Johanni Brea. [doi]
- Multi-Region Markovian Gaussian Process: An Efficient Method to Discover Directional Communications Across Multiple Brain RegionsWeihan Li, Chengrui Li, Yule Wang, Anqi Wu. [doi]
- DiracDiffusion: Denoising and Incremental Reconstruction with Assured Data-ConsistencyZalan Fabian, Berk Tinaz, Mahdi Soltanolkotabi. [doi]
- Prospector Heads: Generalized Feature Attribution for Large Models & DataGautam Machiraju, Alexander Derry, Arjun D. Desai, Neel Guha, Amir-Hossein Karimi, James Zou 0001, Russ B. Altman, Christopher Ré, Parag Mallick. [doi]
- Fast Co-Training under Weak Dependence via Stream-Based Active LearningIlias Diakonikolas, Mingchen Ma, Lisheng Ren, Christos Tzamos. [doi]
- An Effective Dynamic Gradient Calibration Method for Continual LearningWeiChen Lin, Jiaxiang Chen, Ruomin Huang, Hu Ding. [doi]
- Detecting and Identifying Selection Structure in Sequential DataYujia Zheng, Zeyu Tang, Yiwen Qiu, Bernhard Schölkopf, Kun Zhang 0001. [doi]
- Langevin Policy for Safe Reinforcement LearningFenghao Lei, Long Yang, Shiting Wen, Zhixiong Huang, Zhiwang Zhang, Chaoyi Pang. [doi]
- Correlation-Induced Label Prior for Semi-Supervised Multi-Label LearningBiao Liu, Ning Xu 0009, Xiangyu Fang, Xin Geng 0001. [doi]
- Infinite-Horizon Distributionally Robust Regret-Optimal ControlTaylan Kargin, Joudi Hajar, Vikrant Malik, Babak Hassibi. [doi]
- FRAG: Frequency Adapting Group for Diffusion Video EditingSunjae Yoon, Gwanhyeong Koo, Geonwoo Kim, Chang D. Yoo. [doi]
- Token-Specific Watermarking with Enhanced Detectability and Semantic Coherence for Large Language ModelsMingjia Huo, Sai Ashish Somayajula, Youwei Liang, Ruisi Zhang, Farinaz Koushanfar, Pengtao Xie. [doi]
- Allocation Requires Prediction Only if Inequality Is LowAli Shirali, Rediet Abebe, Moritz Hardt. [doi]
- Homomorphism Counts for Graph Neural Networks: All About That BasisEmily Jin, Michael M. Bronstein, Ismail Ilkan Ceylan, Matthias Lanzinger. [doi]
- Towards Efficient Training and Evaluation of Robust Models against l0 Bounded Adversarial PerturbationsXuyang Zhong, Yixiao Huang 0004, Chen Liu. [doi]
- ContPhy: Continuum Physical Concept Learning and Reasoning from VideosZhicheng Zheng, Xin Yan 0008, Zhenfang Chen, Jingzhou Wang, Qin Zhi Eddie Lim, Joshua B. Tenenbaum, Chuang Gan. [doi]
- Private Truly-Everlasting Robust-PredictionUri Stemmer. [doi]
- Causal Customer Churn Analysis with Low-rank Tensor Block Hazard ModelChenyin Gao, Zhiming Zhang, Shu Yang. [doi]
- On Discrete Prompt Optimization for Diffusion ModelsRuochen Wang, Ting Liu 0005, Cho-Jui Hsieh, Boqing Gong. [doi]
- Memory-Space Visual Prompting for Efficient Vision-Language Fine-TuningShibo Jie, Yehui Tang, Ning Ding, Zhi-Hong Deng, Kai Han 0002, Yunhe Wang 0001. [doi]
- Reprompting: Automated Chain-of-Thought Prompt Inference Through Gibbs SamplingWeijia Xu, Andrzej Banburski, Nebojsa Jojic. [doi]
- Constrained Exploration via Reflected Replica Exchange Stochastic Gradient Langevin DynamicsHaoyang Zheng, Hengrong Du, Qi Feng 0005, Wei Deng 0002, Guang Lin. [doi]
- Pre-Training Protein Bi-level Representation Through Span Mask Strategy On 3D Protein ChainsJiale Zhao, Wanru Zhuang, Jia Song, Yaqi Li, Shuqi Lu. [doi]
- Erasing the Bias: Fine-Tuning Foundation Models for Semi-Supervised LearningKai Gan, Tong Wei 0001. [doi]
- A Fine-grained Analysis of Fitted Q-evaluation: Beyond Parametric ModelsJiayi Wang, Zhengling Qi, Raymond K. W. Wong. [doi]
- SyCoCa: Symmetrizing Contrastive Captioners with Attentive Masking for Multimodal AlignmentZiping Ma 0003, Furong Xu, Jian Liu, Ming Yang 0007, Qingpei Guo. [doi]
- Position: Video as the New Language for Real-World Decision MakingSherry Yang, Jacob C. Walker, Jack Parker-Holder, Yilun Du, Jake Bruce, André Barreto 0001, Pieter Abbeel, Dale Schuurmans. [doi]
- Model-based Reinforcement Learning for Parameterized Action SpacesRenhao Zhang, Haotian Fu, Yilin Miao, George Konidaris 0001. [doi]
- Finite Smoothing Algorithm for High-Dimensional Support Vector Machines and Quantile RegressionQian Tang, Yikai Zhang, Boxiang Wang. [doi]
- Switched Flow Matching: Eliminating Singularities via Switching ODEsQunxi Zhu, Wei Lin 0003. [doi]
- On the Error-Propagation of Inexact Hotelling's Deflation for Principal Component AnalysisFangshuo Liao, Junhyung Lyle Kim, Cruz Barnum, Anastasios Kyrillidis. [doi]
- Prior Mismatch and Adaptation in PnP-ADMM with a Nonconvex Convergence AnalysisShirin Shoushtari, Jiaming Liu 0001, Edward P. Chandler, M. Salman Asif, Ulugbek S. Kamilov. [doi]
- Larimar: Large Language Models with Episodic Memory ControlPayel Das, Subhajit Chaudhury, Elliot Nelson, Igor Melnyk, Sarathkrishna Swaminathan, Sihui Dai, Aurélie C. Lozano, Georgios Kollias, Vijil Chenthamarakshan, Jirí Navrátil 0001, Soham Dan, Pin-Yu Chen. [doi]
- On the Calibration of Human Pose EstimationKerui Gu, Rongyu Chen, Xuanlong Yu, Angela Yao. [doi]
- FedSC: Provable Federated Self-supervised Learning with Spectral Contrastive Objective over Non-i.i.d. DataShusen Jing, Anlan Yu, Shuai Zhang, Songyang Zhang. [doi]
- Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling LawsNikhil Sardana, Jacob Portes, Sasha Doubov, Jonathan Frankle. [doi]
- FlowMM: Generating Materials with Riemannian Flow MatchingBenjamin Kurt Miller, Ricky T. Q. Chen, Anuroop Sriram, Brandon M. Wood. [doi]
- Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference AdjustmentRui Yang 0010, Xiaoman Pan, Feng Luo, Shuang Qiu, Han Zhong 0001, Dong Yu 0001, Jianshu Chen. [doi]
- SignSGD with Federated Defense: Harnessing Adversarial Attacks through Gradient Sign DecodingChanho Park, Namyoon Lee. [doi]
- BiLLM: Pushing the Limit of Post-Training Quantization for LLMsWei Huang, Yangdong Liu, Haotong Qin, Ying Li, Shiming Zhang, Xianglong Liu 0001, Michele Magno, Xiaojuan Qi 0001. [doi]
- Stable Differentiable Causal DiscoveryAchille Nazaret, Justin Hong, Elham Azizi, David M. Blei. [doi]
- Agnostic Interactive Imitation Learning: New Theory and Practical AlgorithmsYichen Li, Chicheng Zhang. [doi]
- A Contextual Combinatorial Bandit Approach to NegotiationYexin Li, Zhancun Mu, Siyuan Qi. [doi]
- On Least Square Estimation in Softmax Gating Mixture of ExpertsHuy Nguyen, Nhat Ho, Alessandro Rinaldo. [doi]
- Privately Learning Smooth Distributions on the Hypercube by ProjectionsClément Lalanne, Sébastien Gadat. [doi]
- Autoformalizing Euclidean GeometryLogan Murphy, Kaiyu Yang, Jialiang Sun, Zhaoyu Li, Anima Anandkumar, Xujie Si. [doi]
- A Persuasive Approach to Combating MisinformationSafwan Hossain, Andjela Mladenovic, Yiling Chen 0001, Gauthier Gidel. [doi]
- Learning Optimal Projection for Forecast Reconciliation of Hierarchical Time SeriesAsterios Tsiourvas, Wei Sun 0031, Georgia Perakis, Pin-Yu Chen, Yada Zhu. [doi]
- In-Context Freeze-Thaw Bayesian Optimization for Hyperparameter OptimizationHerilalaina Rakotoarison, Steven Adriaensen, Neeratyoy Mallik, Samir Garibov, Eddie Bergman, Frank Hutter. [doi]
- Stacking Deep Set Networks and Pooling by QuantilesZhuojun Chen, Xinghua Zhu, Dongzhe Su, Justin C.-I. Chuang. [doi]
- Latent Noise Segmentation: How Neural Noise Leads to the Emergence of Segmentation and GroupingBen Lonnqvist, Zhengqing Wu, Michael H. Herzog. [doi]
- Calibration Bottleneck: Over-compressed Representations are Less CalibratableDeng-Bao Wang, Min-Ling Zhang. [doi]
- Byzantine-Robust Federated Learning: Impact of Client Subsampling and Local UpdatesYoussef Allouah, Sadegh Farhadkhani, Rachid Guerraoui, Nirupam Gupta, Rafael Pinot, Geovani Rizk, Sasha Voitovych. [doi]
- Self-Infilling Code GenerationLin Zheng, Jianbo Yuan, Zhi Zhang, Hongxia Yang, Lingpeng Kong. [doi]
- Online conformal prediction with decaying step sizesAnastasios Nikolas Angelopoulos, Rina Barber, Stephen Bates. [doi]
- Bottleneck-Minimal Indexing for Generative Document RetrievalXin Du, Lixin Xiu, Kumiko Tanaka-Ishii. [doi]
- Stochastic Gradient Flow Dynamics of Test Risk and its Exact Solution for Weak FeaturesRodrigo Veiga 0001, Anastasia Remizova, Nicolas Macris. [doi]
- Unified Training of Universal Time Series Forecasting TransformersGerald Woo, Chenghao Liu, Akshat Kumar, Caiming Xiong, Silvio Savarese, Doyen Sahoo. [doi]
- Don't trust your eyes: on the (un)reliability of feature visualizationsRobert Geirhos, Roland S. Zimmermann, Blair L. Bilodeau, Wieland Brendel, Been Kim. [doi]
- Tight Partial Identification of Causal Effects with Marginal Distribution of Unmeasured ConfoundersZhiheng Zhang. [doi]
- Scalable Wasserstein Gradient Flow for Generative Modeling through Unbalanced Optimal TransportJaemoo Choi, Jaewoong Choi, Myungjoo Kang. [doi]
- Quality Diversity through Human Feedback: Towards Open-Ended Diversity-Driven OptimizationLi Ding 0010, Jenny Zhang, Jeff Clune, Lee Spector, Joel Lehman. [doi]
- Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement LearningZongmeng Zhang, Yufeng Shi, Jinhua Zhu 0001, Wengang Zhou, Xiang Qi, Peng Zhang, Houqiang Li. [doi]
- Cooperative Graph Neural NetworksBen Finkelshtein, Xingyue Huang, Michael M. Bronstein, Ismail Ilkan Ceylan. [doi]
- Unifying Bayesian Flow Networks and Diffusion Models through Stochastic Differential EquationsKaiwen Xue, Yuhao Zhou, Shen Nie, Xu Min, Xiaolu Zhang, Jun Zhou 0011, Chongxuan Li. [doi]
- Out-of-Domain Generalization in Dynamical Systems ReconstructionNiclas Alexander Göring, Florian Hess, Manuel Brenner, Zahra Monfared, Daniel Durstewitz. [doi]
- Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive LossRuijie Zheng, Yongyuan Liang, Xiyao Wang, Shuang Ma, Hal Daumé III, Huazhe Xu, John Langford 0001, Praveen Palanisamy, Kalyan Shankar Basu, Furong Huang. [doi]
- DFlow: A Generative Model Combining Denoising AutoEncoder and Normalizing Flow for High Fidelity Waveform GenerationChenfeng Miao, Qingying Zhu, Minchuan Chen, Wei Hu, Zijian Li, Shaojun Wang, Jing Xiao 0006. [doi]
- Latent Space Symmetry DiscoveryJianke Yang, Nima Dehmamy, Robin Walters 0001, Rose Yu. [doi]
- Scalable Safe Policy Improvement for Factored Multi-Agent MDPsFederico Bianchi 0002, Edoardo Zorzi, Alberto Castellini, Thiago D. Simão, Matthijs T. J. Spaan, Alessandro Farinelli. [doi]
- Fool Your (Vision and) Language Model with Embarrassingly Simple PermutationsYongshuo Zong, Tingyang Yu, Ruchika Chavhan, Bingchen Zhao, Timothy M. Hospedales. [doi]
- Representation Surgery: Theory and Practice of Affine SteeringShashwat Singh, Shauli Ravfogel, Jonathan Herzig, Roee Aharoni, Ryan Cotterell, Ponnurangam Kumaraguru. [doi]
- Better Locally Private Sparse Estimation Given Multiple Samples Per UserYuheng Ma, Ke Jia, Hanfang Yang. [doi]
- Automating the Selection of Proxy Variables of Unmeasured ConfoundersFeng Xie 0002, Zhengming Chen, Shanshan Luo, Wang Miao, Ruichu Cai, Zhi Geng. [doi]
- Adapt and Diffuse: Sample-adaptive Reconstruction via Latent Diffusion ModelsZalan Fabian, Berk Tinaz, Mahdi Soltanolkotabi. [doi]
- Position: TrustLLM: Trustworthiness in Large Language ModelsYue Huang, Lichao Sun 0001, Haoran Wang, Siyuan Wu, Qihui Zhang, Yuan Li, Chujie Gao, Yixin Huang, Wenhan Lyu, Yixuan Zhang, Xiner Li, Hanchi Sun, Zhengliang Liu, Yixin Liu, Yijue Wang, Zhikun Zhang, Bertie Vidgen, Bhavya Kailkhura, Caiming Xiong, Chaowei Xiao, Chunyuan Li, Eric P. Xing, Furong Huang, Hao Liu, Heng Ji, Hongyi Wang 0001, Huan Zhang 0001, Huaxiu Yao, Manolis Kellis, Marinka Zitnik, Meng Jiang 0001, Mohit Bansal, James Zou 0001, Jian Pei, Jian Liu, Jianfeng Gao 0001, Jiawei Han 0001, Jieyu Zhao, Jiliang Tang, Jindong Wang 0001, Joaquin Vanschoren, John C. Mitchell, Kai Shu, Kaidi Xu, Kai-Wei Chang, Lifang He 0001, Lifu Huang, Michael Backes 0001, Neil Zhenqiang Gong, Philip S. Yu, Pin-Yu Chen, Quanquan Gu, Ran Xu, Rex Ying, Shuiwang Ji, Suman Jana, Tianlong Chen, Tianming Liu 0001, Tianyi Zhou 0001, William Wang 0001, Xiang Li 0001, Xiangliang Zhang 0001, Xiao Wang, Xing Xie 0001, Xun Chen, Xuyu Wang, Yan Liu 0002, Yanfang Ye 0001, Yinzhi Cao, Yong Chen, Yue Zhao 0016. [doi]
- Unveiling the Potential of AI for Nanomaterial Morphology PredictionIvan Dubrovsky, Andrei Dmitrenko, Aleksei Dmitrenko, Nikita Serov, Vladimir Vinogradov. [doi]
- Guidance with Spherical Gaussian Constraint for Conditional DiffusionLingxiao Yang, Shutong Ding, Yifan Cai, Jingyi Yu, Jingya Wang, Ye Shi 0001. [doi]
- AMPA: Adaptive Mixed Precision Allocation for Low-Bit Integer TrainingLi Ding, Wen Fei, Yuyang Huang, Shuangrui Ding, Wenrui Dai, Chenglin Li, Junni Zou, Hongkai Xiong. [doi]
- Achieving Lossless Gradient Sparsification via Mapping to Alternative Space in Federated LearningDo Yeon Kim, Dong-Jun Han, Jun Seo, Jaekyun Moon. [doi]
- Mastering Robot Manipulation with Multimodal Prompts through Pretraining and Multi-task Fine-tuningJiachen Li, Qiaozi Gao, Michael Johnston, Xiaofeng Gao 0002, Xuehai He, Hangjie Shi, Suhaila Shakiah, Reza Ghanadan, William Yang Wang. [doi]
- Outlier-aware Slicing for Post-Training Quantization in Vision TransformerYuexiao Ma, Huixia Li, Xiawu Zheng, Feng Ling, XueFeng Xiao, Rui Wang 0089, Shilei Wen, Fei Chao 0001, Rongrong Ji. [doi]
- Graph Geometry-Preserving AutoencodersJungbin Lim, Jihwan Kim, Yonghyeon Lee, Cheongjae Jang, Frank C. Park 0001. [doi]
- Online Learning in Betting Markets: Profit versus PredictionHaiqing Zhu, Alexander Soen, Yun Kuen Cheung, Lexing Xie. [doi]
- Sparse-IFT: Sparse Iso-FLOP Transformations for Maximizing Training EfficiencyVithursan Thangarasa, Shreyas Saxena, Abhay Gupta, Sean Lie. [doi]
- Position: Machine Learning-powered Assessments of the EU Digital Services Act Aid Quantify Policy Impacts on Online HarmsEleonora Bonel, Luca Nannini, Davide Bassi, Michele Joshua Maggini. [doi]
- Neuro-Visualizer: A Novel Auto-Encoder-Based Loss Landscape Visualization Method With an Application in Knowledge-Guided Machine LearningMohannad Elhamod, Anuj Karpatne. [doi]
- Automated Statistical Model Discovery with Language ModelsMichael Y. Li, Emily B. Fox, Noah D. Goodman. [doi]
- RL-CFR: Improving Action Abstraction for Imperfect Information Extensive-Form Games with Reinforcement LearningBoning Li, Zhixuan Fang, Longbo Huang. [doi]
- Error Feedback Can Accurately Compress PreconditionersIonut-Vlad Modoranu, Aleksei Kalinov, Eldar Kurtic, Elias Frantar, Dan Alistarh. [doi]
- Pursuing Overall Welfare in Federated Learning through Sequential Decision MakingSeok-Ju Hahn, Gi-Soo Kim, Junghye Lee. [doi]
- Local Causal Structure Learning in the Presence of Latent VariablesFeng Xie, Zheng Li, Peng Wu 0012, Yan Zeng, Chunchen Liu, Zhi Geng. [doi]
- Graph Positional and Structural EncoderSemih Cantürk, Renming Liu, Olivier Lapointe-Gagné, Vincent Létourneau, Guy Wolf, Dominique Beaini, Ladislav Rampásek. [doi]
- A Unified Linear Programming Framework for Offline Reward Learning from Human Demonstrations and FeedbackKihyun Kim, Jiawei Zhang 0007, Asuman E. Ozdaglar, Pablo A. Parrilo. [doi]
- Generative Conditional Distributions by Neural (Entropic) Optimal TransportBao Nguyen, Binh Nguyen, Hieu Trung Nguyen, Viet Anh Nguyen. [doi]
- Listening to the noise: Blind Denoising with Gibbs DiffusionDavid Heurtel-Depeiges, Charles Margossian, Ruben Ohana, Bruno Régaldo-Saint Blancard. [doi]
- Position: A Call to Action for a Human-Centered AutoML ParadigmMarius Lindauer, Florian Karl, Anne Klier, Julia Moosbauer, Alexander Tornede, Andreas Müller 0024, Frank Hutter, Matthias Feurer, Bernd Bischl. [doi]
- High-Probability Bound for Non-Smooth Non-Convex Stochastic Optimization with Heavy TailsLangqi Liu, Yibo Wang, Lijun Zhang 0005. [doi]
- Exploring the LLM Journey from Cognition to Expression with Linear RepresentationsYuzi Yan, Jialian Li, Yipin Zhang, Dong Yan. [doi]
- Reducing Item Discrepancy via Differentially Private Robust Embedding Alignment for Privacy-Preserving Cross Domain RecommendationWeiming Liu 0005, Xiaolin Zheng, Chaochao Chen 0001, Jiahe Xu, Xinting Liao, Fan Wang 0020, Yanchao Tan, Yew-Soon Ong. [doi]
- Measuring Stochastic Data Complexity with Boltzmann Influence FunctionsNathan H. Ng, Roger Baker Grosse, Marzyeh Ghassemi. [doi]
- GRATH: Gradual Self-Truthifying for Large Language ModelsWeixin Chen, Dawn Song, Bo Li 0026. [doi]
- Coprocessor Actor Critic: A Model-Based Reinforcement Learning Approach For Adaptive Brain StimulationMichelle Pan, Mariah L. Schrum, Vivek Myers, Erdem Biyik, Anca D. Dragan. [doi]
- Geometric Active Exploration in Markov Decision Processes: the Benefit of AbstractionRiccardo De Santi, Federico Arangath Joseph, Noah Liniger, Mirco Mutti, Andreas Krause 0001. [doi]
- DetKDS: Knowledge Distillation Search for Object DetectorsLujun Li, Yufan Bao, Peijie Dong, Chuanguang Yang, Anggeng Li, Wenhan Luo, Qifeng Liu, Wei Xue, Yike Guo. [doi]
- Layerwise Proximal Replay: A Proximal Point Method for Online Continual LearningJinsoo Yoo, Yunpeng Liu 0007, Frank Wood, Geoff Pleiss. [doi]
- Symmetry Induces Structure and Constraint of LearningZiyin Liu. [doi]
- Improving Group Robustness on Spurious Correlation Requires Preciser Group InferenceYujin Han, Difan Zou. [doi]
- Revisiting Scalable Hessian Diagonal Approximations for Applications in Reinforcement LearningMohamed Elsayed 0003, Homayoon Farrahi, Felix Dangel, A. Rupam Mahmood. [doi]
- Adaptive Sampling of k-Space in Magnetic Resonance for Rapid Pathology PredictionChen-Yu Yen, Raghav Singhal, Umang Sharma, Rajesh Ranganath, Sumit Chopra, Lerrel Pinto. [doi]
- Exponential Spectral Pursuit: An Effective Initialization Method for Sparse Phase RetrievalMengchu Xu, Yuxuan Zhang, Jian Wang 0016. [doi]
- Fair Resource Allocation in Multi-Task LearningHao Ban, Kaiyi Ji. [doi]
- Meta-Reinforcement Learning Robust to Distributional Shift Via Performing Lifelong In-Context LearningTengye Xu, Zihao Li, Qinyuan Ren. [doi]
- Online Linear Regression in Dynamic Environments via DiscountingAndrew Jacobsen, Ashok Cutkosky. [doi]
- Slot Abstractors: Toward Scalable Abstract Visual ReasoningShanka Subhra Mondal, Jonathan D. Cohen 0003, Taylor Whittington Webb. [doi]
- Neural Jump-Diffusion Temporal Point ProcessesShuai Zhang, Chuang Zhou 0004, Yang Aron Liu, Peng Zhang 0001, Xixun Lin, Zhi-Ming Ma. [doi]
- Lookbehind-SAM: k steps back, 1 step forwardGonçalo Mordido, Pranshu Malviya, Aristide Baratin, Sarath Chandar. [doi]
- Combinatorial Approximations for Cluster Deletion: Simpler, Faster, and BetterVicente Balmaseda, Ying Xu, Yixin Cao 0001, Nate Veldt. [doi]
- Temporal Logic Specification-Conditioned Decision Transformer for Offline Safe Reinforcement LearningZijian Guo, Weichao Zhou, Wenchao Li 0001. [doi]
- GeoMFormer: A General Architecture for Geometric Molecular Representation LearningTianlang Chen, Shengjie Luo, Di He 0001, Shuxin Zheng, Tie-Yan Liu, Liwei Wang 0001. [doi]
- FESSNC: Fast Exponentially Stable and Safe Neural ControllerJingdong Zhang, Luan Yang, Qunxi Zhu, Wei Lin. [doi]
- Faithfulness Measurable Masked Language ModelsAndreas Madsen, Siva Reddy, Sarath Chandar. [doi]
- Outlier-robust Kalman Filtering through Generalised BayesGerardo Duran-Martin, Matías Altamirano, Alexander Y. Shestopaloff, Leandro Sánchez-Betancourt, Jeremias Knoblauch, Matt Jones 0002, François-Xavier Briol, Kevin Patrick Murphy. [doi]
- GLoRe: When, Where, and How to Improve LLM Reasoning via Global and Local RefinementsAlexander Havrilla, Sharath Chandra Raparthy, Christoforos Nalmpantis, Jane Dwivedi-Yu, Maksym Zhuravinskyi, Eric Hambro, Roberta Raileanu. [doi]
- Theoretical insights for diffusion guidance: A case study for Gaussian mixture modelsYuchen Wu, Minshuo Chen, Zihao Li, Mengdi Wang, Yuting Wei. [doi]
- Improved Differentially Private and Lazy Online Convex Optimization: Lower Regret without Smoothness RequirementsNaman Agarwal, Satyen Kale, Karan Singh, Abhradeep Guha Thakurta. [doi]
- Logistic Variational Bayes RevisitedMichael Komodromos, Marina Evangelou, Sarah Filippi. [doi]
- Position: Relational Deep Learning - Graph Representation Learning on Relational DatabasesMatthias Fey, Weihua Hu, Kexin Huang, Jan Eric Lenssen, Rishabh Ranjan, Joshua Robinson 0001, Rex Ying, Jiaxuan You, Jure Leskovec. [doi]
- RODEO: Robust Outlier Detection via Exposing Adaptive Out-of-Distribution SamplesHossein Mirzaei, Mohammad Jafari, Hamid Reza Dehbashi, Ali Ansari 0001, Sepehr Ghobadi, Masoud Hadi, Arshia Soltani Moakhar, Mohammad Azizmalayeri, Mahdieh Soleymani Baghshah, Mohammad Hossein Rohban. [doi]
- Position: Cracking the Code of Cascading Disparity Towards Marginalized CommunitiesGolnoosh Farnadi, Mohammad Havaei, Negar Rostamzadeh. [doi]
- Discovering Symmetry Breaking in Physical Systems with Relaxed Group ConvolutionRui Wang 0086, Elyssa F. Hofgard, Hang Gao 0004, Robin Walters 0001, Tess E. Smidt. [doi]
- High-dimensional Linear Bandits with KnapsacksWanteng Ma, Dong Xia, Jiashuo Jiang. [doi]
- Doubly Robust Causal Effect Estimation under Networked Interference via Targeted LearningWeilin Chen, Ruichu Cai, Zeqin Yang, Jie Qiao, Yuguang Yan, Zijian Li 0001, Zhifeng Hao. [doi]
- GistScore: Learning Better Representations for In-Context Example Selection with Gist BottlenecksShivanshu Gupta, Clemens Rosenbaum, Ethan R. Elenberg. [doi]
- Replicable Learning of Large-Margin HalfspacesAlkis Kalavasis, Amin Karbasi, Kasper Green Larsen, Grigoris Velegkas, Felix Zhou 0002. [doi]
- SHINE: Shielding Backdoors in Deep Reinforcement LearningZhuowen Yuan, Wenbo Guo 0002, Jinyuan Jia 0001, Bo Li 0026, Dawn Song. [doi]
- Protein Conformation Generation via Force-Guided SE(3) Diffusion ModelsYan Wang, Lihao Wang, Yuning Shen, Yiqun Wang, Huizhuo Yuan, Yue Wu, Quanquan Gu. [doi]
- Enhancing Class-Imbalanced Learning with Pre-Trained Guidance through Class-Conditional Knowledge DistillationLan Li, Xin-Chun Li, Han-Jia Ye, De-Chuan Zhan. [doi]
- Rethinking Generative Large Language Model Evaluation for Semantic ComprehensionFangyun Wei, Xi Chen, Lin Luo. [doi]
- Sparse-to-dense Multimodal Image Registration via Multi-Task LearningKaining Zhang, Jiayi Ma 0001. [doi]
- Interplay of ROC and Precision-Recall AUCs: Theoretical Limits and Practical Implications in Binary ClassificationMartin Mihelich, François Castagnos, Charles Dognin. [doi]
- Learning to Stabilize Online Reinforcement Learning in Unbounded State SpacesBrahma S. Pavse, Matthew Zurek, Yudong Chen 0001, Qiaomin Xie, Josiah P. Hanna. [doi]
- Accelerated Algorithms for Constrained Nonconvex-Nonconcave Min-Max Optimization and Comonotone InclusionYang Cai 0001, Argyris Oikonomou, Weiqiang Zheng. [doi]
- Explain Temporal Black-Box Models via Functional DecompositionLinxiao Yang, Yunze Tong, Xinyue Gu, Liang Sun 0001. [doi]
- Hybrid Inverse Reinforcement LearningJuntao Ren, Gokul Swamy, Steven Wu 0001, Drew Bagnell, Sanjiban Choudhury. [doi]
- Hyperbolic Geometric Latent Diffusion Model for Graph GenerationXingcheng Fu, Yisen Gao, Yuecen Wei, Qingyun Sun, Hao Peng 0001, Jianxin Li 0002, Xianxian Li. [doi]
- MADA: Meta-Adaptive Optimizers Through Hyper-Gradient DescentKaan Ozkara, Can Karakus, Parameswaran Raman, Mingyi Hong 0001, Shoham Sabach, Branislav Kveton, Volkan Cevher. [doi]
- Can Implicit Bias Imply Adversarial Robustness?Hancheng Min, René Vidal. [doi]
- Quantum Theory and Application of Contextual Optimal TransportNicola Mariella, Albert Akhriev, Francesco Tacchino, Christa Zoufal, Juan Carlos Gonzalez-Espitia, Benedek Harsanyi, Eugene Koskin, Ivano Tavernelli, Stefan Woerner, Marianna Rapsomaniki, Sergiy Zhuk, Jannis Born. [doi]
- Transformers Provably Learn Sparse Token Selection While Fully-Connected Nets CannotZixuan Wang, Stanley Wei, Daniel Hsu 0001, Jason D. Lee. [doi]
- Distributionally Robust Data ValuationXiaoqiang Lin, Xinyi Xu, Zhaoxuan Wu, See-Kiong Ng, Bryan Kian Hsiang Low. [doi]
- CHEMREASONER: Heuristic Search over a Large Language Model's Knowledge Space using Quantum-Chemical FeedbackHenry W. Sprueill, Carl Edwards, Khushbu Agarwal, Mariefel V. Olarte, Udishnu Sanyal, Conrad Johnston, Hongbin Liu, Heng Ji, Sutanay Choudhury. [doi]
- Lie Neurons: Adjoint-Equivariant Neural Networks for Semisimple Lie AlgebrasTzu-Yuan Lin, Minghan Zhu, Maani Ghaffari. [doi]
- Position: Stop Making Unscientific AGI Performance ClaimsPatrick Altmeyer, Andrew M. Demetriou, Antony Bartlett, Cynthia C. S. Liem. [doi]
- OT-CLIP: Understanding and Generalizing CLIP via Optimal TransportLiangliang Shi, Jack Fan, Junchi Yan. [doi]
- Sparse Cocktail: Every Sparse Pattern Every Sparse Ratio All At OnceZhangheng Li, Shiwei Liu 0003, Tianlong Chen, Ajay Kumar Jaiswal, Zhenyu Zhang 0015, Dilin Wang, Raghuraman Krishnamoorthi, Shiyu Chang, Zhangyang Wang. [doi]
- PrE-Text: Training Language Models on Private Federated Data in the Age of LLMsCharlie Hou, Akshat Shrivastava, Hongyuan Zhan, Rylan Conway, Trang Le, Adithya Sagar, Giulia Fanti, Daniel Lazar. [doi]
- Position: Application-Driven Innovation in Machine LearningDavid Rolnick, Alán Aspuru-Guzik, Sara Beery, Bistra Dilkina, Priya L. Donti, Marzyeh Ghassemi, Hannah Kerner, Claire Monteleoni, Esther Rolf, Milind Tambe, Adam White. [doi]
- Position: Standardization of Behavioral Use Clauses is Necessary for the Adoption of Responsible Licensing of AIDaniel McDuff, Tim Korjakow, Scott Cambo, Jesse Josua Benjamin, Jenny Lee, Yacine Jernite, Carlos Muñoz Ferrandis, Aaron Gokaslan, Alek Tarkowski, Joseph Lindley, A. Feder Cooper, Danish Contractor. [doi]
- ESNet: Evolution and Succession Network for High-Resolution Salient Object DetectionHongyu Liu 0003, Runmin Cong, Hua Li 0012, Qianqian Xu, Qingming Huang, Wei Zhang 0021. [doi]
- Learning to Explore for Stochastic Gradient MCMCSeunghyun Kim, Seohyeon Jung, Seonghyeon Kim, Juho Lee 0001. [doi]
- Encodings for Prediction-based Neural Architecture SearchYash Akhauri, Mohamed S. Abdelfattah. [doi]
- The Non-linear F-Design and Applications to Interactive LearningAlekh Agarwal, Jian Qian, Alexander Rakhlin, Tong Zhang 0001. [doi]
- Time-Series Forecasting for Out-of-Distribution Generalization Using Invariant LearningHaoxin Liu, Harshavardhan Kamarthi, Lingkai Kong, Zhiyuan Zhao, Chao Zhang 0014, B. Aditya Prakash. [doi]
- Small-loss Adaptive Regret for Online Convex OptimizationWenhao Yang, Wei Jiang, Yibo Wang, Ping Yang, Yao Hu, Lijun Zhang 0005. [doi]
- A General Online Algorithm for Optimizing Complex Performance MetricsWojciech Kotlowski, Marek Wydmuch, Erik Schultheis, Rohit Babbar, Krzysztof Dembczynski. [doi]
- Quantum Algorithms and Lower Bounds for Finite-Sum OptimizationYexin Zhang, Chenyi Zhang, Cong Fang 0001, Liwei Wang 0001, Tongyang Li. [doi]
- Position: Fundamental Limitations of LLM Censorship Necessitate New ApproachesDavid Glukhov, Ilia Shumailov, Yarin Gal, Nicolas Papernot, Vardan Papyan. [doi]
- Embarrassingly Parallel GFlowNetsTiago da Silva, Luiz Max Carvalho, Amauri H. Souza, Samuel Kaski, Diego Mesquita. [doi]
- Wukong: Towards a Scaling Law for Large-Scale RecommendationBuyun Zhang, Liang Luo, Yuxin Chen, Jade Nie, Xi Liu, Shen Li, Yanli Zhao, Yuchen Hao, Yantao Yao, Ellie Dingqiao Wen, JongSoo Park, Maxim Naumov, Wenlin Chen. [doi]
- MFTN: A Multi-scale Feature Transfer Network Based on IMatchFormer for Hyperspectral Image Super-ResolutionShuying Huang, Mingyang Ren, Yong Yang 0001, Xiaozheng Wang, Yingzhi Wei. [doi]
- Fast Sampling-Based Sketches for TensorsWilliam J. Swartworth, David Woodruff. [doi]
- AutoOS: Make Your OS More Powerful by Exploiting Large Language ModelsHuilai Chen, Yuanbo Wen, Limin Cheng, Shouxu Kuang, Yumeng Liu, Weijia Li, Ling Li 0001, Rui Zhang 0040, Xinkai Song, Wei Li, Qi Guo 0001, Yunji Chen. [doi]
- Generative Modeling on Manifolds Through Mixture of Riemannian Diffusion ProcessesJaehyeong Jo, Sung Ju Hwang. [doi]
- USTAD: Unified Single-model Training Achieving Diverse Scores for Information RetrievalSeungyeon Kim, Ankit Singh Rawat, Manzil Zaheer, Wittawat Jitkrittum, Veeranjaneyulu Sadhanala, Sadeep Jayasumana, Aditya Krishna Menon, Rob Fergus, Sanjiv Kumar. [doi]
- Harnessing Hierarchical Label Distribution Variations in Test Agnostic Long-tail RecognitionZhiyong Yang 0001, Qianqian Xu, Zitai Wang, Sicong Li, Boyu Han, Shilong Bao, Xiaochun Cao, Qingming Huang. [doi]
- Differentially Private Worst-group Risk MinimizationXinyu Zhou, Raef Bassily. [doi]
- A Neural-Guided Dynamic Symbolic Network for Exploring Mathematical Expressions from DataWenqiang Li, Weijun Li, Lina Yu, Min Wu, Linjun Sun, Jingyi Liu, Yanjie Li, Shu Wei, Yusong Deng, Meilan Hao. [doi]
- Indirectly Parameterized Concrete AutoencodersAlfred Nilsson, Klas Wijk, Sai Bharath Chandra Gutha, Erik Englesson, Alexandra Hotti, Carlo Saccardi, Oskar Kviman, Jens Lagergren, Ricardo Vinuesa, Hossein Azizpour. [doi]
- Bivariate Causal Discovery using Bayesian Model SelectionAnish Dhir, Samuel Power, Mark van der Wilk. [doi]
- GNNs Also Deserve Editing, and They Need It More Than OnceShaochen Zhong, Duy Le, Zirui Liu, Zhimeng Jiang, Andrew Ye, Jiamu Zhang, Jiayi Yuan, Kaixiong Zhou, Zhaozhuo Xu, Jing Ma, Shuai Xu, Vipin Chaudhary, Xia Hu 0001. [doi]
- HumanTOMATO: Text-aligned Whole-body Motion GenerationShunlin Lu, Ling-Hao Chen, Ailing Zeng, Jing Lin, Ruimao Zhang, Lei Zhang 0001, Heung-Yeung Shum. [doi]
- Fast Timing-Conditioned Latent Audio DiffusionZach Evans, CJ Carr, Josiah Taylor, Scott H. Hawley, Jordi Pons. [doi]
- EfficientZero V2: Mastering Discrete and Continuous Control with Limited DataShengjie Wang, Shaohuai Liu, Weirui Ye, Jiacheng You, Yang Gao 0029. [doi]
- Learning to Compile Programs to Neural NetworksLogan Weber, Jesse Michel, Alex Renda, Michael Carbin. [doi]
- Relaxed Quantile Regression: Prediction Intervals for Asymmetric NoiseThomas Pouplin, Alan Jeffares, Nabeel Seedat, Mihaela van der Schaar. [doi]
- On the Implicit Bias of AdamMatias D. Cattaneo, Jason M. Klusowski, Boris Shigida. [doi]
- Towards Certified Unlearning for Deep Neural NetworksBinchi Zhang, Yushun Dong, Tianhao Wang, Jundong Li. [doi]
- How Well Can LLMs Negotiate? NegotiationArena Platform and AnalysisFederico Bianchi 0001, Patrick John Chia, Mert Yüksekgönül, Jacopo Tagliabue, Dan Jurafsky, James Zou 0001. [doi]
- Causal Discovery via Conditional Independence Testing with Proxy VariablesMingzhou Liu 0001, Xinwei Sun 0001, Yu Qiao 0001, Yizhou Wang 0001. [doi]
- Multi-Fidelity Residual Neural Processes for Scalable Surrogate ModelingRuijia Niu, Dongxia Wu, Kai Kim, Yian Ma, Duncan Watson-Parris, Rose Yu. [doi]
- Dynamic Metric Embedding into lp SpaceKiarash Banihashem, MohammadTaghi Hajiaghayi, Dariusz Rafal Kowalski, Jan Olkowski, Max Springer. [doi]
- EvGGS: A Collaborative Learning Framework for Event-based Generalizable Gaussian SplattingJiaxu Wang, Junhao He, Ziyi Zhang, Mingyuan Sun, Jingkai Sun, Renjing Xu. [doi]
- Towards a Self-contained Data-driven Global Weather Forecasting FrameworkYi Xiao, Lei Bai 0001, Wei Xue, Hao Chen 0045, Kun Chen, Kang Chen, Tao Han 0002, Wanli Ouyang. [doi]
- Memory Efficient Neural Processes via Constant Memory Attention BlockLeo Feng, Frederick Tung, Hossein Hajimirsadeghi, Yoshua Bengio, Mohamed Osama Ahmed. [doi]
- Bridging discrete and continuous state spaces: Exploring the Ehrenfest process in time-continuous diffusion modelsLudwig Winkler, Lorenz Richter, Manfred Opper. [doi]
- SF-DQN: Provable Knowledge Transfer using Successor Feature for Deep Reinforcement LearningShuai Zhang 0015, Heshan Devaka Fernando, Miao Liu, Keerthiram Murugesan, Songtao Lu, Pin-Yu Chen, Tianyi Chen, Meng Wang 0003. [doi]
- Directly Denoising Diffusion ModelsDan Zhang, Jingjing Wang, Feng Luo. [doi]
- PIDformer: Transformer Meets Control TheoryTam Minh Nguyen, César A. Uribe, Tan Minh Nguyen, Richard G. Baraniuk. [doi]
- DRED: Zero-Shot Transfer in Reinforcement Learning via Data-Regularised Environment DesignSamuel Garcin, James Doran, Shangmin Guo, Christopher G. Lucas, Stefano V. Albrecht. [doi]
- Active Preference Learning for Large Language ModelsWilliam Muldrew, Peter Hayes, Mingtian Zhang, David Barber. [doi]
- Bagged Deep Image Prior for Recovering Images in the Presence of Speckle NoiseXi Chen, Zhewen Hou, Christopher A. Metzler, Arian Maleki, Shirin Jalali. [doi]
- Low-Rank Bandits via Tight Two-to-Infinity Singular Subspace RecoveryYassir Jedra, William Réveillard, Stefan Stojanovic, Alexandre Proutière. [doi]
- Toward Availability Attacks in 3D Point CloudsYifan Zhu, Yibo Miao, Yinpeng Dong, Xiao-Shan Gao. [doi]
- EvoRainbow: Combining Improvements in Evolutionary Reinforcement Learning for Policy SearchPengyi Li, Yan Zheng 0002, Hongyao Tang, Xian Fu, Jianye Hao. [doi]
- Sparser, Better, Deeper, Stronger: Improving Static Sparse Training with Exact Orthogonal InitializationAleksandra Nowak 0001, Lukasz Gniecki, Filip Szatkowski, Jacek Tabor. [doi]
- Reward-Free Kernel-Based Reinforcement LearningSattar Vakili, Farhang Nabiei, Da-shan Shiu, Alberto Bernacchia. [doi]
- Two Fists, One Heart: Multi-Objective Optimization Based Strategy Fusion for Long-tailed LearningZhe Zhao, Pengkun Wang, Haibin Wen, Wei Xu, Song Lai, Qingfu Zhang 0001, Yang Wang 0015. [doi]
- From Geometry to Causality- Ricci Curvature and the Reliability of Causal Inference on NetworksAmirhossein Farzam, Allen R. Tannenbaum, Guillermo Sapiro. [doi]
- Domain-wise Data Acquisition to Improve Performance under Distribution ShiftYue He 0001, Dongbai Li, Pengfei Tian, Han Yu 0009, Jiashuo Liu, Hao Zou 0001, Peng Cui 0001. [doi]
- Understanding the Effects of Iterative Prompting on TruthfulnessSatyapriya Krishna, Chirag Agarwal, Himabindu Lakkaraju. [doi]
- A Graph is Worth K Words: Euclideanizing Graph using Pure TransformerZhangyang Gao, Daize Dong, Cheng Tan 0012, Jun Xia, Bozhen Hu, Stan Z. Li. [doi]
- Discovering Environments with XRMMohammad Pezeshki, Diane Bouchacourt, Mark Ibrahim, Nicolas Ballas, Pascal Vincent, David Lopez-Paz. [doi]
- Novel Spectral Algorithms for the Partial Credit ModelDuc Nguyen, Anderson Ye Zhang. [doi]
- Recovering Labels from Local Updates in Federated LearningHuancheng Chen, Haris Vikalo. [doi]
- Dealing With Unbounded Gradients in Stochastic Saddle-point OptimizationGergely Neu, Nneka Okolo. [doi]
- DAG-Based Column Generation for Adversarial Team GamesYouzhi Zhang 0001, Bo An 0001, Daniel Dajun Zeng. [doi]
- Efficient Adaptation in Mixed-Motive Environments via Hierarchical Opponent Modeling and PlanningYizhe Huang, Anji Liu, Fanqi Kong, Yaodong Yang 0001, Song Chun Zhu, Xue Feng. [doi]
- Robust Graph Matching when Nodes are CorruptTaha Ameen, Bruce E. Hajek. [doi]
- A Bias-Variance-Covariance Decomposition of Kernel Scores for Generative ModelsSebastian Gregor Gruber, Florian Buettner 0001. [doi]
- Towards efficient deep spiking neural networks construction with spiking activity based pruningYaxin Li, Qi Xu, Jiangrong Shen, Hongming Xu, Long Chen, Gang Pan 0001. [doi]
- I/O Complexity of Attention, or How Optimal is FlashAttention?Barna Saha, Christopher Ye 0001. [doi]
- Improved Stability and Generalization Guarantees of the Decentralized SGD AlgorithmBatiste Le Bars, Aurélien Bellet, Marc Tommasi, Kevin Scaman, Giovanni Neglia. [doi]
- Roping in Uncertainty: Robustness and Regularization in Markov GamesJeremy McMahan, Giovanni Artiglio, Qiaomin Xie. [doi]
- Subhomogeneous Deep Equilibrium ModelsPietro Sittoni, Francesco Tudisco. [doi]
- Position: Topological Deep Learning is the New Frontier for Relational LearningTheodore Papamarkou, Tolga Birdal, Michael M. Bronstein, Gunnar E. Carlsson, Justin Curry, Yue Gao, Mustafa Hajij, Roland Kwitt, Pietro Lio, Paolo Di Lorenzo, Vasileios Maroulas, Nina Miolane, Farzana Nasrin, Karthikeyan Natesan Ramamurthy, Bastian Rieck, Simone Scardapane, Michael T. Schaub, Petar Velickovic, Bei Wang 0001, Yusu Wang 0001, Guo-Wei Wei 0001, Ghada Zamzmi. [doi]
- VinT-6D: A Large-Scale Object-in-hand Dataset from Vision, Touch and ProprioceptionZhaoliang Wan, Yonggen Ling, Senlin Yi, Lu Qi, Wang Wei Lee, Minglei Lu, Sicheng Yang, Xiao Teng, Peng Lu, Xu Yang 0004, Ming-Hsuan Yang 0001, Hui Cheng. [doi]
- Position: Tensor Networks are a Valuable Asset for Green AIEva Memmel, Clara Menzen, Jetze Schuurmans, Frederiek Wesel, Kim Batselier. [doi]
- LoRA Training in the NTK Regime has No Spurious Local MinimaUijeong Jang, Jason D. Lee, Ernest K. Ryu. [doi]
- PGODE: Towards High-quality System Dynamics ModelingXiao Luo 0001, Yiyang Gu, HuiYu Jiang, Hang Zhou 0008, Jinsheng Huang, Wei Ju, Zhiping Xiao 0001, Ming Zhang 0004, Yizhou Sun. [doi]
- Accelerating Iterative Retrieval-augmented Language Model Serving with SpeculationZhihao Zhang, Alan Zhu, Lijie Yang, Yihua Xu, Lanting Li, Phitchaya Mangpo Phothilimthana, Zhihao Jia. [doi]
- Open-Vocabulary Calibration for Fine-tuned CLIPShuoyuan Wang, Jindong Wang 0001, Guoqing Wang, Bob Zhang 0001, Kaiyang Zhou, Hongxin Wei. [doi]
- CuTS: Customizable Tabular Synthetic Data GenerationMark Vero, Mislav Balunovic, Martin T. Vechev. [doi]
- Stochastic Bandits with ReLU Neural NetworksKan Xu, Hamsa Bastani, Surbhi Goel, Osbert Bastani. [doi]
- Cross-Domain Policy Adaptation by Capturing Representation MismatchJiafei Lyu, Chenjia Bai, Jingwen Yang, Zongqing Lu, Xiu Li 0001. [doi]
- Random Masking Finds Winning Tickets for Parameter Efficient Fine-tuningJing Xu, Jingzhao Zhang. [doi]
- A sampling theory perspective on activations for implicit neural representationsHemanth Saratchandran, Sameera Ramasinghe, Violetta Shevchenko, Alexander Long, Simon Lucey. [doi]
- Jetfire: Efficient and Accurate Transformer Pretraining with INT8 Data Flow and Per-Block QuantizationHaocheng Xi, Yuxiang Chen, Kang Zhao, Kai Jun Teh, Jianfei Chen, Jun Zhu. [doi]
- DE-COP: Detecting Copyrighted Content in Language Models Training DataAndré V. Duarte, Xuandong Zhao, Arlindo L. Oliveira, Lei Li 0005. [doi]
- Assessing Large Language Models on Climate InformationJannis Bulian, Mike S. Schäfer, Afra Amini, Heidi Lam, Massimiliano Ciaramita, Ben Gaiarin, Michelle Chen Huebscher, Christian Buck, Niels Mede, Markus Leippold, Nadine Strauß. [doi]
- TENG: Time-Evolving Natural Gradient for Solving PDEs With Deep Neural Nets Toward Machine PrecisionZhuo Chen, Jacob McCarran, Esteban Vizcaino, Marin Soljacic, Di Luo. [doi]
- Robust Inverse Constrained Reinforcement Learning under Model MisspecificationSheng Xu, Guiliang Liu. [doi]
- Position: The Platonic Representation HypothesisMinyoung Huh, Brian Cheung, Tongzhou Wang 0001, Phillip Isola. [doi]
- Simulation-Based Inference with Quantile RegressionHe Jia. [doi]
- CLIF: Complementary Leaky Integrate-and-Fire Neuron for Spiking Neural NetworksYulong Huang, Xiaopeng Lin, Hongwei Ren, Haotian Fu, Yue Zhou, Zunchang Liu, Biao Pan, Bojun Cheng. [doi]
- Adapting Pretrained ViTs with Convolution Injector for Visuo-Motor ControlDongyoon Hwang, ByungKun Lee, HoJoon Lee, Hyunseung Kim, Jaegul Choo. [doi]
- Towards General Algorithm Discovery for Combinatorial Optimization: Learning Symbolic Branching Policy from Bipartite GraphYufei Kuang, Jie Wang 0005, Yuyan Zhou, Xijun Li, Fangzhou Zhu, Jianye Hao, Feng Wu 0001. [doi]
- Understanding Unimodal Bias in Multimodal Deep Linear NetworksYedi Zhang, Peter E. Latham, Andrew M. Saxe. [doi]
- Provable Multi-Task Representation Learning by Two-Layer ReLU Neural NetworksLiam Collins, Hamed Hassani, Mahdi Soltanolkotabi, Aryan Mokhtari, Sanjay Shakkottai. [doi]
- Bridging Data Gaps in Diffusion Models with Adversarial Noise-Based Transfer LearningXiyu Wang, Baijiong Lin, Daochang Liu, Ying-Cong Chen, Chang Xu 0002. [doi]
- A Probabilistic Approach to Learning the Degree of Equivariance in Steerable CNNsLars Veefkind, Gabriele Cesa. [doi]
- Value-Evolutionary-Based Reinforcement LearningPengyi Li, Jianye Hao, Hongyao Tang, Yan Zheng 0002, Fazl Barez. [doi]
- FedBPT: Efficient Federated Black-box Prompt Tuning for Large Language ModelsJingwei Sun 0002, Ziyue Xu 0001, Hongxu Yin, Dong Yang 0005, Daguang Xu, Yudong Liu, Zhixu Du, Yiran Chen 0001, Holger R. Roth. [doi]
- Collaborative Learning with Different Labeling FunctionsYuyang Deng, Mingda Qiao. [doi]
- Towards Understanding the Word Sensitivity of Attention Layers: A Study via Random FeaturesSimone Bombari, Marco Mondelli. [doi]
- Language-guided Skill Learning with Temporal Variational InferenceHaotian Fu, Pratyusha Sharma, Elias Stengel-Eskin, George Konidaris 0001, Nicolas Le Roux, Marc-Alexandre Côté, Xingdi Yuan. [doi]
- Rethinking Decision Transformer via Hierarchical Reinforcement LearningYi Ma, Jianye Hao, Hebin Liang, Chenjun Xiao. [doi]
- Sparse Model Inversion: Efficient Inversion of Vision Transformers for Data-Free ApplicationsZixuan Hu, Yongxian Wei, Li Shen 0008, Zhenyi Wang, Lei Li, Chun Yuan, Dacheng Tao. [doi]
- Neural Operators with Localized Integral and Differential KernelsMiguel Liu-Schiaffini, Julius Berner, Boris Bonev, Thorsten Kurth, Kamyar Azizzadenesheli, Anima Anandkumar. [doi]
- MoMo: Momentum Models for Adaptive Learning RatesFabian Schaipp, Ruben Ohana, Michael Eickenberg, Aaron Defazio, Robert M. Gower. [doi]
- Automated Evaluation of Retrieval-Augmented Language Models with Task-Specific Exam GenerationGauthier Guinet, Behrooz Omidvar Tehrani, Anoop Deoras, Laurent Callot. [doi]
- Can We Remove the Square-Root in Adaptive Gradient Methods? A Second-Order PerspectiveWu Lin, Felix Dangel, Runa Eschenhagen, Juhan Bae, Richard E. Turner, Alireza Makhzani. [doi]
- Building Socially-Equitable Public ModelsYejia Liu, Jianyi Yang, Pengfei Li 0008, Tongxin Li, Shaolei Ren. [doi]
- Prediction Accuracy of Learning in Games : Follow-the-Regularized-Leader meets HeisenbergYi Feng, Georgios Piliouras, Xiao Wang 0036. [doi]
- Evolving Subnetwork Training for Large Language ModelsHanqi Li, Lu Chen 0002, Da Ma, Zijian Wu, Su Zhu, Kai Yu 0004. [doi]
- CRoFT: Robust Fine-Tuning with Concurrent Optimization for OOD Generalization and Open-Set OOD DetectionLin Zhu, Yifeng Yang, Qinying Gu, Xinbing Wang, Chenghu Zhou, Nanyang Ye 0001. [doi]
- PPFLOW: Target-Aware Peptide Design with Torsional Flow MatchingHaitao Lin, Odin Zhang, Huifeng Zhao, Dejun Jiang 0002, Lirong Wu, Zicheng Liu 0006, Yufei Huang 0002, Stan Z. Li. [doi]
- Experts Don't Cheat: Learning What You Don't Know By Predicting PairsDaniel D. Johnson 0001, Daniel Tarlow, David Duvenaud, Chris J. Maddison. [doi]
- Predictive Performance Comparison of Decision Policies Under ConfoundingLuke Guerdan, Amanda Coston, Ken Holstein, Steven Wu 0001. [doi]
- Interpretable Deep Clustering for Tabular DataJonathan Svirsky, Ofir Lindenbaum. [doi]
- How Interpretable Are Interpretable Graph Neural Networks?Yongqiang Chen 0002, Yatao Bian, Bo Han 0003, James Cheng. [doi]
- Towards General Neural Surrogate Solvers with Specialized Neural AcceleratorsChenkai Mao, Robert Lupoiu, Tianxiang Dai, Mingkun Chen, Jonathan A. Fan. [doi]
- Robust Classification via a Single Diffusion ModelHuanran Chen, Yinpeng Dong, Zhengyi Wang, Xiao Yang, Chengqi Duan, Hang Su 0006, Jun Zhu 0001. [doi]
- An Information Theoretic Approach to Interaction-Grounded LearningXiaoyan Hu, Farzan Farnia, Ho-Fung Leung. [doi]
- On Universally Optimal Algorithms for A/B TestingPo-An Wang, Kaito Ariu, Alexandre Proutière. [doi]
- Provably Neural Active Learning Succeeds via Prioritizing Perplexing SamplesDake Bu, Wei Huang, Taiji Suzuki, Ji Cheng, Qingfu Zhang 0001, Zhiqiang Xu, Hau-San Wong. [doi]
- Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue AbilitiesZhifeng Kong, Arushi Goel, Rohan Badlani, Wei Ping, Rafael Valle, Bryan Catanzaro. [doi]
- Unifying Image Processing as Visual Prompting Question AnsweringYihao Liu 0001, Xiangyu Chen 0006, Xianzheng Ma, Xintao Wang, Jiantao Zhou 0001, Yu Qiao 0001, Chao Dong. [doi]
- Human Alignment of Large Language Models through Online Preference OptimisationDaniele Calandriello, Zhaohan Daniel Guo, Rémi Munos, Mark Rowland, Yunhao Tang, Bernardo Ávila Pires, Pierre Harvey Richemond, Charline Le Lan, Michal Valko, Tianqi Liu 0002, Rishabh Joshi, Zeyu Zheng, Bilal Piot. [doi]
- Model Alignment as Prospect Theoretic OptimizationKawin Ethayarajh, Winnie Xu, Niklas Muennighoff, Dan Jurafsky, Douwe Kiela. [doi]
- Test-Time Model Adaptation with Only Forward PassesShuaicheng Niu, Chunyan Miao, Guohao Chen, Pengcheng Wu, Peilin Zhao. [doi]
- AquaLoRA: Toward White-box Protection for Customized Stable Diffusion Models via Watermark LoRAWeitao Feng, Wenbo Zhou, Jiyan He, Jie Zhang 0073, Tianyi Wei, Guanlin Li, Tianwei Zhang 0004, Weiming Zhang 0001, Nenghai Yu. [doi]
- Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-MakingVivek Myers, Chongyi Zheng, Anca D. Dragan, Sergey Levine, Benjamin Eysenbach. [doi]
- VNN: Verification-Friendly Neural Networks with Hard Robustness GuaranteesAnahita Baninajjar, Ahmed Rezine, Amir Aminifar. [doi]
- Neural SPH: Improved Neural Modeling of Lagrangian Fluid DynamicsArtur P. Toshev, Jonas A. Erbesdobler, Nikolaus A. Adams, Johannes Brandstetter. [doi]
- On the Asymptotic Distribution of the Minimum Empirical RiskJacob Westerhout, TrungTin Nguyen, Xin Guo, Hien Duy Nguyen. [doi]
- An Analysis of Linear Time Series Forecasting ModelsWilliam Toner, Luke Nicholas Darlow. [doi]
- Disentangled Graph Self-supervised Learning for Out-of-Distribution GeneralizationHaoyang Li 0001, Xin Wang 0019, Zeyang Zhang, Haibo Chen, Ziwei Zhang, Wenwu Zhu 0001. [doi]
- RAUCA: A Novel Physical Adversarial Attack on Vehicle Detectors via Robust and Accurate Camouflage GenerationJiawei Zhou, Linye Lyu, Daojing He, Yu Li 0007. [doi]
- Clifford-Steerable Convolutional Neural NetworksMaksim Zhdanov, David Ruhe, Maurice Weiler, Ana Lucic, Johannes Brandstetter, Patrick Forré. [doi]
- LLM Maybe LongLM: SelfExtend LLM Context Window Without TuningHongye Jin, Xiaotian Han, Jingfeng Yang, Zhimeng Jiang, Zirui Liu, Chia-Yuan Chang, Huiyuan Chen, Xia Hu 0001. [doi]
- DIDI: Diffusion-Guided Diversity for Offline Behavioral GenerationJinxin Liu, Xinghong Guo, Zifeng Zhuang, Donglin Wang. [doi]
- GaussianPro: 3D Gaussian Splatting with Progressive PropagationKai Cheng, Xiaoxiao Long, Kaizhi Yang, Yao Yao, Wei Yin 0006, Yuexin Ma, Wenping Wang, Xuejin Chen. [doi]
- Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak SupervisionCollin Burns, Pavel Izmailov, Jan Hendrik Kirchner, Bowen Baker, Leo Gao, Leopold Aschenbrenner, Yining Chen, Adrien Ecoffet, Manas Joglekar, Jan Leike, Ilya Sutskever, Jeffrey Wu 0003. [doi]
- MindEye2: Shared-Subject Models Enable fMRI-To-Image With 1 Hour of DataPaul S. Scotti, Mihir Tripathy, Cesar Torrico 0002, Reese Kneeland, Tong Chen, Ashutosh Narang, Charan Santhirasegaran, Jonathan Xu, Thomas Naselaris, Kenneth A. Norman, Tanishq Mathew Abraham. [doi]
- Prompt-based Visual Alignment for Zero-shot Policy TransferHaihan Gao, Rui Zhang 0040, Qi Yi, Hantao Yao, Haochen Li, Jiaming Guo, Shaohui Peng, Yunkai Gao, Qicheng Wang, Xing Hu 0001, Yuanbo Wen, Zihao Zhang, Zidong Du, Ling Li 0001, Qi Guo 0001, Yunji Chen. [doi]
- Binning as a Pretext Task: Improving Self-Supervised Learning in Tabular DomainsKyungeun Lee, Ye Seul Sim, Hye-Seung Cho, Moonjung Eo, Suhee Yoon, Sanghyu Yoon, Woohyung Lim. [doi]
- Stochastic Interpolants with Data-Dependent CouplingsMichael S. Albergo, Mark Goldstein, Nicholas Matthew Boffi, Rajesh Ranganath, Eric Vanden-Eijnden. [doi]
- Peeking with PEAK: Sequential, Nonparametric Composite Hypothesis Tests for Means of Multiple Data StreamsBrian Cho 0001, Kyra Gan, Nathan Kallus. [doi]
- Do Topological Characteristics Help in Knowledge Distillation?Jungeun Kim, Junwon You, DongJin Lee, Ha-Young Kim, Jae-Hun Jung. [doi]
- Enabling Uncertainty Estimation in Iterative Neural NetworksNikita Durasov, Doruk Öner, Jonathan Donier, Hieu Le, Pascal Fua. [doi]
- On the Effectiveness of Supervision in Asymmetric Non-Contrastive LearningJeongheon Oh, Kibok Lee 0003. [doi]
- PICLe: Eliciting Diverse Behaviors from Large Language Models with Persona In-Context LearningHyeong Kyu Choi, Yixuan Li. [doi]
- How to Leverage Diverse Demonstrations in Offline Imitation LearningSheng Yue, Jiani Liu 0005, Xingyuan Hua, Ju Ren 0001, Sen Lin 0001, Junshan Zhang, Yaoxue Zhang. [doi]
- diff History for Neural Language AgentsUlyana Piterbarg, Lerrel Pinto, Rob Fergus. [doi]
- AnyTool: Self-Reflective, Hierarchical Agents for Large-Scale API CallsYu Du, Fangyun Wei, Hongyang Zhang 0001. [doi]
- How Deep Do We Need: Accelerating Training and Inference of Neural ODEs via Control PerspectiveKeyan Miao, Konstantinos Gatsis. [doi]
- Position: Do pretrained Transformers Learn In-Context by Gradient Descent?Lingfeng Shen, Aayush Mishra, Daniel Khashabi. [doi]
- Confidence Aware Inverse Constrained Reinforcement LearningSriram Ganapathi Subramanian, Guiliang Liu, Mohammed Elmahgiubi, Kasra Rezaee, Pascal Poupart. [doi]
- Log Neural Controlled Differential Equations: The Lie Brackets Make A DifferenceBenjamin Walker 0001, Andrew D. McLeod, Tiexin Qin, Yichuan Cheng, Haoliang Li, Terry J. Lyons. [doi]
- Stereo Risk: A Continuous Modeling Approach to Stereo MatchingCe Liu 0004, Suryansh Kumar 0001, Shuhang Gu, Radu Timofte, Yao Yao, Luc Van Gool. [doi]
- Combining Experimental and Historical Data for Policy EvaluationTing Li, Chengchun Shi, Qianglin Wen, Yang Sui, Yongli Qin, Chunbo Lai, Hongtu Zhu. [doi]
- Robust Data-driven Prescriptiveness OptimizationMehran Poursoltani, Erick Delage, Angelos Georghiou. [doi]
- Pessimism Meets Risk: Risk-Sensitive Offline Reinforcement LearningDake Zhang, Boxiang Lyu, Shuang Qiu, Mladen Kolar, Tong Zhang 0001. [doi]
- MMPareto: Boosting Multimodal Learning with Innocent Unimodal AssistanceYake Wei, Di Hu 0001. [doi]
- Challenges and Considerations in the Evaluation of Bayesian Causal DiscoveryAmir Mohammad Karimi-Mamaghan, Panagiotis Tigas, Karl Henrik Johansson, Yarin Gal, Yashas Annadani, Stefan Bauer. [doi]
- InfiAgent-DABench: Evaluating Agents on Data Analysis TasksXueyu Hu, Ziyu Zhao, Shuang Wei, Ziwei Chai, Qianli Ma, Guoyin Wang 0002, Xuwu Wang, Jing Su, Jingjing Xu, Ming Zhu, Yao Cheng, Jianbo Yuan, Jiwei Li 0001, Kun Kuang, Yang Yang 0009, Hongxia Yang, Fei Wu 0001. [doi]
- Diffusion Models Demand Contrastive Guidance for Adversarial Purification to AdvanceMingyuan Bai, Wei Huang, Tenghui Li, Andong Wang, Junbin Gao, Cesar F. Caiafa, Qibin Zhao. [doi]
- Unveiling the Dynamics of Information Interplay in Supervised LearningKun Song 0004, Zhiquan Tan, Bochao Zou, Huimin Ma 0001, Weiran Huang 0001. [doi]
- Predictive Linear Online Tracking for Unknown TargetsAnastasios Tsiamis, Aren Karapetyan, Yueshan Li, Efe C. Balta, John Lygeros. [doi]
- Multi-Source Conformal Inference Under Distribution ShiftYi Liu, Alexander Levis, Sharon-Lise T. Normand, Larry Han. [doi]
- Learning Pseudo-Contractive Denoisers for Inverse ProblemsDeliang Wei, Peng Chen, Fang Li 0004. [doi]
- Modeling Language Tokens as Functionals of Semantic FieldsZhengqi Pei, Anran Zhang, Shuhui Wang, Qingming Huang. [doi]
- Realistic Unsupervised CLIP Fine-tuning with Universal Entropy OptimizationJian Liang, Lijun Sheng, Zhengbo Wang, Ran He, Tieniu Tan. [doi]
- Comparing Graph Transformers via Positional EncodingsMitchell Black, Zhengchao Wan, Gal Mishne, Amir Nayyeri, Yusu Wang 0001. [doi]
- DUPLEX: Dual GAT for Complex Embedding of Directed GraphsZhaoru Ke, Hang Yu, Jianguo Li, Haipeng Zhang. [doi]
- Accelerating Parallel Sampling of Diffusion ModelsZhiwei Tang, Jiasheng Tang, Hao Luo 0004, Fan Wang 0019, Tsung-Hui Chang. [doi]
- TravelPlanner: A Benchmark for Real-World Planning with Language AgentsJian Xie, Kai Zhang 0033, Jiangjie Chen, Tinghui Zhu, Renze Lou, Yuandong Tian, Yanghua Xiao, Yu Su 0001. [doi]
- Reinforcement Learning from Reachability Specifications: PAC Guarantees with Expected Conditional DistanceJakub Svoboda, Suguman Bansal, Krishnendu Chatterjee. [doi]
- Dynamic Byzantine-Robust Learning: Adapting to Switching Byzantine WorkersRon Dorfman, Naseem Yehya, Kfir Yehuda Levy. [doi]
- Revisiting Context Aggregation for Image MattingQinglin Liu, Xiaoqian Lv, Quanling Meng, Zonglin Li, Xiangyuan Lan, Shuo Yang, Shengping Zhang, Liqiang Nie. [doi]
- Should we be going MAD? A Look at Multi-Agent Debate Strategies for LLMsAndries P. Smit, Nathan Grinsztajn, Paul Duckworth, Thomas D. Barrett, Arnu Pretorius. [doi]
- Compression of Structured Data with Autoencoders: Provable Benefit of Nonlinearities and DepthKevin Kögler, Aleksandr Shevchenko, Hamed Hassani, Marco Mondelli. [doi]
- HGAP: Boosting Permutation Invariant and Permutation Equivariant in Multi-Agent Reinforcement Learning via Graph Attention NetworkBor-Jiun Lin, Chun-Yi Lee. [doi]
- Bipartite Matching in Massive Graphs: A Tight Analysis of EDCSAmir Azarmehr, Soheil Behnezhad, Mohammad Roghani. [doi]
- Improving Open-Ended Text Generation via Adaptive DecodingWenhong Zhu, Hongkun Hao, Zhiwei He 0002, Yiming Ai, Rui Wang 0015. [doi]
- Simplicity Bias via Global Convergence of Sharpness MinimizationKhashayar Gatmiry, Zhiyuan Li 0005, Sashank J. Reddi, Stefanie Jegelka. [doi]
- Unsupervised Parameter-free Simplicial Representation Learning with Scattering TransformsHiren Madhu, Sravanthi Gurugubelli, Sundeep Prabhakar Chepuri. [doi]
- Prismatic VLMs: Investigating the Design Space of Visually-Conditioned Language ModelsSiddharth Karamcheti, Suraj Nair 0003, Ashwin Balakrishna, Percy Liang, Thomas Kollar, Dorsa Sadigh. [doi]
- Weighted distance nearest neighbor condensingLee-Ad Gottlieb, Timor Sharabi, Roi Weiss. [doi]
- Scalable Online Exploration via CoverabilityPhilip Amortila, Dylan J. Foster, Akshay Krishnamurthy. [doi]
- Two-sided Competing Matching Recommendation Markets With Quota and Complementary Preferences ConstraintsYuantong Li, Guang Cheng 0003, Xiaowu Dai. [doi]
- S2IP-LLM: Semantic Space Informed Prompt Learning with LLM for Time Series ForecastingZijie Pan, Yushan Jiang, Sahil Garg, Anderson Schneider, Yuriy Nevmyvaka, Dongjin Song. [doi]
- Convergence and Complexity Guarantee for Inexact First-order Riemannian Optimization AlgorithmsYuchen Li, Laura Balzano, Deanna Needell, Hanbaek Lyu. [doi]
- An Explicit Frame Construction for Normalizing 3D Point CloudsJustin M. Baker, Shih-Hsin Wang, Tommaso de Fernex, Bao Wang. [doi]
- InstructRetro: Instruction Tuning post Retrieval-Augmented PretrainingBoxin Wang, Wei Ping, Lawrence McAfee, Peng Xu 0008, Bo Li 0026, Mohammad Shoeybi, Bryan Catanzaro. [doi]
- Transforming and Combining Rewards for Aligning Large Language ModelsZihao Wang, Chirag Nagpal, Jonathan Berant, Jacob Eisenstein, Alexander Nicholas D'Amour, Sanmi Koyejo, Victor Veitch. [doi]
- Breaking the Barrier: Enhanced Utility and Robustness in Smoothed DRL AgentsChung-En Sun, Sicun Gao, Tsui-Wei Weng. [doi]
- Graph Neural Network Explanations are FragileJiate Li, Meng Pang, Yun Dong, Jinyuan Jia 0001, Binghui Wang. [doi]
- TimeMIL: Advancing Multivariate Time Series Classification via a Time-aware Multiple Instance LearningXiwen Chen, Peijie Qiu, Wenhui Zhu, Huayu Li, Hao Wang 0176, Aristeidis Sotiras, Yalin Wang 0001, Abolfazl Razi. [doi]
- Balancing Feature Similarity and Label Variability for Optimal Size-Aware One-shot Subset SelectionAbhinab Acharya, Dayou Yu, Qi Yu 0001, Xumin Liu. [doi]
- Riemannian coordinate descent algorithms on matrix manifoldsAndi Han, Pratik Jawanpuria, Bamdev Mishra. [doi]
- Position: Scaling Simulation is Neither Necessary Nor Sufficient for In-the-Wild Robot ManipulationHomanga Bharadhwaj. [doi]
- PEARL: Zero-shot Cross-task Preference Alignment and Robust Reward Learning for Robotic ManipulationRunze Liu, Yali Du 0001, Fengshuo Bai, Jiafei Lyu, Xiu Li 0001. [doi]
- Scaling Tractable Probabilistic Circuits: A Systems PerspectiveAnji Liu, Kareem Ahmed, Guy Van den Broeck. [doi]
- From Generalization Analysis to Optimization Designs for State Space ModelsFusheng Liu, Qianxiao Li. [doi]
- What Can Transformer Learn with Varying Depth? Case Studies on Sequence Learning TasksXingwu Chen, Difan Zou. [doi]
- MOKD: Cross-domain Finetuning for Few-shot Classification via Maximizing Optimized Kernel DependenceHongduan Tian, Feng Liu 0003, Tongliang Liu, Bo Du 0001, Yiu-ming Cheung, Bo Han 0003. [doi]
- Near-Linear Time Approximation Algorithms for k-means with OutliersJunyu Huang, Qilong Feng, Ziyun Huang, Jinhui Xu 0001, Jianxin Wang 0001. [doi]
- Towards Causal Foundation Model: on Duality between Optimal Balancing and AttentionJiaqi Zhang, Joel Jennings, Agrin Hilmkil, Nick Pawlowski, Cheng Zhang 0005, Chao Ma 0019. [doi]
- Position: Scarce Resource Allocations That Rely On Machine Learning Should Be RandomizedShomik Jain, Kathleen Creel, Ashia Camage Wilson. [doi]
- Online Learning in CMDPs: Handling Stochastic and Adversarial ConstraintsFrancesco Emanuele Stradi, Jacopo Germano, Gianmarco Genalti, Matteo Castiglioni, Alberto Marchesi 0001, Nicola Gatti 0001. [doi]
- Position: Levels of AGI for Operationalizing Progress on the Path to AGIMeredith Ringel Morris, Jascha Sohl-Dickstein, Noah Fiedel, Tris Warkentin, Allan Dafoe, Aleksandra Faust, Clément Farabet, Shane Legg. [doi]
- QORA: Zero-Shot Transfer via Interpretable Object-Relational Model LearningGabriel Stella, Dmitri Loguinov. [doi]
- On dimensionality of feature vectors in MPNNsCésar Bravo, Alexander Kozachinskiy, Cristobal Rojas. [doi]
- Tuning-free Estimation and Inference of Cumulative Distribution Function under Local Differential PrivacyYi Liu, Qirui Hu, Linglong Kong. [doi]
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination MitigationShiqi Chen, Miao Xiong, Junteng Liu, Zhengxuan Wu, Teng Xiao, Siyang Gao, Junxian He. [doi]
- Gambling-Based Confidence Sequences for Bounded Random VectorsJongha Jon Ryu, Gregory W. Wornell. [doi]
- Towards the Theory of Unsupervised Federated Learning: Non-asymptotic Analysis of Federated EM AlgorithmsYe Tian, Haolei Weng, Yang Feng 0002. [doi]
- SLOG: An Inductive Spectral Graph Neural Network Beyond Polynomial FilterHaobo Xu, Yuchen Yan, Dingsu Wang, Zhe Xu 0007, Zhichen Zeng, Tarek F. Abdelzaher, Jiawei Han 0001, Hanghang Tong. [doi]
- Jacobian Regularizer-based Neural Granger CausalityWanqi Zhou, Shuanghao Bai, Shujian Yu, Qibin Zhao, Badong Chen. [doi]
- Balanced Resonate-and-Fire NeuronsSaya Higuchi, Sebastian Kairat, Sander M. Bohté, Sebastian Otte. [doi]
- A Tale of Tails: Model Collapse as a Change of Scaling LawsElvis Dohmatob, Yunzhen Feng, Pu Yang, François Charton, Julia Kempe. [doi]
- Gradient Compressed Sensing: A Query-Efficient Gradient Estimator for High-Dimensional Zeroth-Order OptimizationRuizhong Qiu, Hanghang Tong. [doi]
- Position: Benchmarking is Limited in Reinforcement Learning ResearchScott M. Jordan, Adam White 0001, Bruno Castro da Silva, Martha White, Philip S. Thomas. [doi]
- Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy BiasesZiyi Zhang, Sen Zhang 0006, Yibing Zhan, Yong Luo 0002, Yonggang Wen 0001, Dacheng Tao. [doi]
- BOtied: Multi-objective Bayesian optimization with tied multivariate ranksJi-Won Park, Natasa Tagasovska, Michael Maser, Stephen Ra, KyungHyun Cho. [doi]
- DiNADO: Norm-Disentangled Neurally-Decomposed Oracles for Controlling Language ModelsSidi Lu, Wenbo Zhao 0006, Chenyang Tao, Arpit Gupta, Shanchan Wu, Tagyoung Chung, Nanyun Peng. [doi]
- A New Computationally Efficient Algorithm to solve Feature Selection for Functional Data Classification in High-dimensional SpacesTobia Boschi, Francesca Bonin, Rodrigo Ordonez-Hurtado, Alessandra Pascale, Jonathan P. Epperlein. [doi]
- Learning a Diffusion Model Policy from Rewards via Q-Score MatchingMichael Psenka, Alejandro Escontrela, Pieter Abbeel, Yi Ma 0001. [doi]
- Smooth Min-Max Monotonic NetworksChristian Igel. [doi]
- Predicting Dose-Response Curves with Deep Neural NetworksPedro Alonso Campana, Paul Prasse, Tobias Scheffer. [doi]
- Energy-based Backdoor Defense without Task-Specific Samples and Model RetrainingYudong Gao, Honglong Chen, Peng Sun 0003, Zhe Li, Junjian Li, Huajie Shao. [doi]
- Robustness of Deep Learning for Accelerated MRI: Benefits of Diverse Training DataKang Lin, Reinhard Heckel. [doi]
- Ranking-based Client Imitation Selection for Efficient Federated LearningChunlin Tian, Zhan Shi, Xinpeng Qin, Li Li, Chengzhong Xu. [doi]
- Studying K-FAC Heuristics by Viewing Adam through a Second-Order LensRoss M. Clarke, José Miguel Hernández-Lobato. [doi]
- SFC: Achieve Accurate Fast Convolution under Low-precision ArithmeticLiulu He, Yufei Zhao, Rui Gao, Yuan Du, Li Du. [doi]
- Subgraphormer: Unifying Subgraph GNNs and Graph Transformers via Graph ProductsGuy Bar-Shalom, Beatrice Bevilacqua, Haggai Maron. [doi]
- video-SALMONN: Speech-Enhanced Audio-Visual Large Language ModelsGuangzhi Sun, Wenyi Yu, Changli Tang, Xianzhao Chen, Tian Tan 0019, Wei Li 0119, Lu Lu 0015, Zejun Ma, Yuxuan Wang 0002, Chao Zhang 0031. [doi]
- Active Ranking and Matchmaking, with Perfect MatchingsHafedh El Ferchichi, Matthieu Lerasle, Vianney Perchet. [doi]
- Extracting Training Data From Document-Based VQA ModelsFrancesco Pinto, Nathalie Rauschmayr, Florian Tramèr, Philip Torr 0001, Federico Tombari. [doi]
- MM-Vet: Evaluating Large Multimodal Models for Integrated CapabilitiesWeihao Yu, Zhengyuan Yang, Linjie Li, Jianfeng Wang, Kevin Lin, Zicheng Liu 0001, Xinchao Wang, Lijuan Wang. [doi]
- GroupCover: A Secure, Efficient and Scalable Inference Framework for On-device Model Protection based on TEEsZheng Zhang, Na Wang 0003, Ziqi Zhang, Yao Zhang, Tianyi Zhang, Jianwei Liu 0001, Ye Wu. [doi]
- Superpoint Gaussian Splatting for Real-Time High-Fidelity Dynamic Scene ReconstructionDiwen Wan, Ruijie Lu, Gang Zeng. [doi]
- COPAL: Continual Pruning in Large Language Generative ModelsSrikanth Malla, Joon Hee Choi, Chiho Choi. [doi]
- Estimating the Permanent by Nesting Importance SamplingJuha Harviainen, Mikko Koivisto. [doi]
- Position: C∗-Algebraic Machine Learning - Moving in a New DirectionYuka Hashimoto, Masahiro Ikeda, Hachem Kadri. [doi]
- BeigeMaps: Behavioral Eigenmaps for Reinforcement Learning from ImagesSandesh Adhikary, Anqi Li, Byron Boots. [doi]
- BRAIn: Bayesian Reward-conditioned Amortized Inference for natural language generation from feedbackGaurav Pandey 0001, Yatin Nandwani, Tahira Naseem, Mayank Mishra, Guangxuan Xu, Dinesh Raghu, Sachindra Joshi, Asim Munawar, Ramón Fernandez Astudillo. [doi]
- Regression with Multi-Expert DeferralAnqi Mao, Mehryar Mohri, Yutao Zhong 0002. [doi]
- Active Statistical InferenceTijana Zrnic, Emmanuel J. Candès. [doi]
- Analysis for Abductive Learning and Neural-Symbolic Reasoning ShortcutsXiaowen Yang, Wenda Wei, Jie-Jing Shao, Yu-Feng Li, Zhi-Hua Zhou. [doi]
- Learning Iterative Reasoning through Energy DiffusionYilun Du, Jiayuan Mao, Joshua B. Tenenbaum. [doi]
- Position: Mission Critical - Satellite Data is a Distinct Modality in Machine LearningEsther Rolf, Konstantin Klemmer, Caleb Robinson, Hannah Kerner. [doi]
- Uncertainty for Active Learning on GraphsDominik Fuchsgruber, Tom Wollschläger, Bertrand Charpentier, Antonio Oroz, Stephan Günnemann. [doi]
- On the Origins of Linear Representations in Large Language ModelsYibo Jiang, Goutham Rajendran, Pradeep Kumar Ravikumar, Bryon Aragam, Victor Veitch. [doi]
- ULAREF: A Unified Label Refinement Framework for Learning with Inaccurate SupervisionCongyu Qiao, Ning Xu 0009, Yihao Hu, Xin Geng 0001. [doi]
- CogDPM: Diffusion Probabilistic Models via Cognitive Predictive CodingKaiYuan Chen, Xingzhuo Guo, Yu Zhang, Jianmin Wang, Mingsheng Long. [doi]
- Deep Fusion: Efficient Network Training via Pre-trained InitializationsHanna Mazzawi, Javier Gonzalvo, Michael Wunder, Sammy Jerome, Benoit Dherin. [doi]
- Longitudinal Targeted Minimum Loss-based Estimation with Temporal-Difference Heterogeneous TransformerToru Shirakawa, Yi Li, Yulun Wu, Sky Qiu, Yuxuan Li, Mingduo Zhao, Hiroyasu Iso, Mark J. van der Laan. [doi]
- A Statistical Theory of Regularization-Based Continual LearningXuyang Zhao, Huiyuan Wang, Weiran Huang 0001, Wei Lin. [doi]
- Language Generation with Strictly Proper Scoring RulesChenze Shao, Fandong Meng, Yijin Liu, Jie Zhou 0016. [doi]
- Detecting Influence Structures in Multi-Agent Reinforcement LearningFabian Raoul Pieroth, Katherine E. Fitch, Lenz Belzner. [doi]
- Rethinking Transformers in Solving POMDPsChenhao Lu, Ruizhe Shi, Yuyao Liu, Kaizhe Hu, Simon Shaolei Du, Huazhe Xu. [doi]
- Scaling Laws for the Value of Individual Data Points in Machine LearningIan Connick Covert, Wenlong Ji, Tatsunori Hashimoto, James Zou 0001. [doi]
- To Each (Textual Sequence) Its Own: Improving Memorized-Data Unlearning in Large Language ModelsGeorge-Octavian Barbulescu, Peter Triantafillou. [doi]
- Conformalized Adaptive Forecasting of Heterogeneous TrajectoriesYanfei Zhou, Lars Lindemann, Matteo Sesia. [doi]
- LoCoCo: Dropping In Convolutions for Long Context CompressionRuisi Cai, Yuandong Tian, Zhangyang Wang, Beidi Chen. [doi]
- Provably Efficient Exploration in Quantum Reinforcement Learning with Logarithmic Worst-Case RegretHan Zhong 0001, Jiachen Hu, Yecheng Xue, Tongyang Li, Liwei Wang 0001. [doi]
- On The Statistical Complexity of Offline Decision-MakingThanh Nguyen-Tang, Raman Arora. [doi]
- Privacy Backdoors: Stealing Data with Corrupted Pretrained ModelsShanglun Feng, Florian Tramèr. [doi]
- FedLMT: Tackling System Heterogeneity of Federated Learning via Low-Rank Model Training with Theoretical GuaranteesJiahao Liu, Yipeng Zhou, Di Wu 0001, Miao Hu, Mohsen Guizani, Quan Z. Sheng. [doi]
- Is Inverse Reinforcement Learning Harder than Standard Reinforcement Learning? A Theoretical PerspectiveLei Zhao, Mengdi Wang, Yu Bai 0017. [doi]
- ERQ: Error Reduction for Post-Training Quantization of Vision TransformersYunshan Zhong, Jiawei Hu, You Huang, Yuxin Zhang 0002, Rongrong Ji. [doi]
- Transformers Get Stable: An End-to-End Signal Propagation Theory for Language ModelsAkhil Kedia, Mohd Abbas Zaidi, Sushil Khyalia, Jungho Jung, Harshith Goka, Haejun Lee. [doi]
- Transport of Algebraic Structure to Latent EmbeddingsSamuel Pfrommer, Brendon G. Anderson, Somayeh Sojoudi. [doi]
- Improving Sample Efficiency of Model-Free Algorithms for Zero-Sum Markov GamesSongtao Feng, Ming Yin 0003, Yu-Xiang Wang 0003, Jing Yang 0002, Yingbin Liang. [doi]
- Learning Associative Memories with Gradient DescentVivien Cabannes, Berfin Simsek, Alberto Bietti. [doi]
- Understanding Heterophily for Graph Neural NetworksJunfu Wang, Yuanfang Guo, Liang Yang 0002, Yunhong Wang. [doi]
- Leveraging (Biased) Information: Multi-armed Bandits with Offline DataWang Chi Cheung, Lixing Lyu. [doi]
- Adaptive Robust Learning using Latent Bernoulli VariablesAleksandr Karakulev, Dave Zachariah, Prashant Singh. [doi]
- Improving Factuality and Reasoning in Language Models through Multiagent DebateYilun Du, Shuang Li 0013, Antonio Torralba 0001, Joshua B. Tenenbaum, Igor Mordatch. [doi]
- TimeSiam: A Pre-Training Framework for Siamese Time-Series ModelingJiaxiang Dong, Haixu Wu, Yuxuan Wang, Yunzhong Qiu, Li Zhang 0065, Jianmin Wang 0001, Mingsheng Long. [doi]
- Towards a Better Theoretical Understanding of Independent Subnetwork TrainingEgor Shulgin, Peter Richtárik. [doi]
- Off-policy Evaluation Beyond Overlap: Sharp Partial Identification Under SmoothnessSamir Khan, Martin Saveski, Johan Ugander. [doi]
- Efficient Online Set-valued Classification with Bandit FeedbackZhou Wang, Xingye Qiao. [doi]
- The Relative Value of Prediction in Algorithmic Decision MakingJuan Carlos Perdomo. [doi]
- ACPO: A Policy Optimization Algorithm for Average MDPs with ConstraintsAkhil Agnihotri, Rahul Jain 0002, Haipeng Luo. [doi]
- OAK: Enriching Document Representations using Auxiliary Knowledge for Extreme ClassificationShikhar Mohan, Deepak Saini, Anshul Mittal, Sayak Ray Chowdhury, Bhawna Paliwal, Jian Jiao 0007, Manish Gupta, Manik Varma. [doi]
- Particle Denoising Diffusion SamplerAngus Phillips, Hai-Dang Dau, Michael John Hutchinson, Valentin De Bortoli, George Deligiannidis, Arnaud Doucet. [doi]
- Robustness of Nonlinear Representation LearningSimon Buchholz, Bernhard Schölkopf. [doi]
- An Empirical Examination of Balancing Strategy for Counterfactual Estimation on Time SeriesQiang Huang, Chuizheng Meng, Defu Cao, Biwei Huang, Yi Chang 0001, Yan Liu 0002. [doi]
- Bridging Environments and Language with Rendering Functions and Vision-Language ModelsThéo Cachet, Christopher R. Dance, Olivier Sigaud. [doi]
- A Universal Transfer Theorem for Convex Optimization Algorithms Using Inexact First-order OraclesPhillip A. Kerger, Marco Molinaro 0001, Hongyi Jiang, Amitabh Basu. [doi]
- Simple Ingredients for Offline Reinforcement LearningEdoardo Cetin, Andrea Tirinzoni, Matteo Pirotta, Alessandro Lazaric, Yann Ollivier, Ahmed Touati. [doi]
- Position: Towards Unified Alignment Between Agents, Humans, and EnvironmentZonghan Yang, An Liu, Zijun Liu, Kaiming Liu, Fangzhou Xiong, Yile Wang, Zeyuan Yang, Qingyuan Hu, Xinrui Chen, Zhenhe Zhang, Fuwen Luo, Zhicheng Guo, Peng Li 0030, Yang Liu 0005. [doi]
- Dynamic Spectral Clustering with Provable Approximation GuaranteeSteinar Laenen, He Sun 0001. [doi]
- Sample as you Infer: Predictive Coding with Langevin DynamicsUmais Zahid, Qinghai Guo, Zafeirios Fountas. [doi]
- Generalized Neural Collapse for a Large Number of ClassesJiachen Jiang, Jinxin Zhou, Peng Wang 0098, Qing Qu 0001, Dustin G. Mixon, Chong You, Zhihui Zhu. [doi]
- Leveraging Attractor Dynamics in Spatial Navigation for Better Language ParsingXiaolong Zou, Xingxing Cao, Xiaojiao Yang, Bo Hong. [doi]
- CATS: Enhancing Multivariate Time Series Forecasting by Constructing Auxiliary Time Series as Exogenous VariablesJiecheng Lu, Xu Han, Yan Sun, Shihao Yang. [doi]
- ELTA: An Enhancer against Long-Tail for Aesthetics-oriented ModelsLimin Liu, Shuai He, Anlong Ming, Rui Xie, Huadong Ma. [doi]
- On the Role of Edge Dependency in Graph Generative ModelsSudhanshu Chanpuriya, Cameron Musco, Konstantinos Sotiropoulos, Charalampos E. Tsourakakis. [doi]
- Interacting Diffusion Processes for Event Sequence ForecastingMai Zeng, Florence Regol, Mark Coates. [doi]
- Stationarity without mean reversion in improper Gaussian processesLuca Ambrogioni. [doi]
- Various Lengths, Constant Speed: Efficient Language Modeling with Lightning AttentionZhen Qin, Weigao Sun, Dong Li 0033, Xuyang Shen, Weixuan Sun, Yiran Zhong. [doi]
- Autonomous Sparse Mean-CVaR Portfolio OptimizationYizun Lin, Yangyu Zhang, Zhao-Rong Lai, Cheng Li. [doi]
- WAVES: Benchmarking the Robustness of Image WatermarksBang An, Mucong Ding, Tahseen Rabbani, Aakriti Agrawal, Yuancheng Xu, Chenghao Deng, Sicheng Zhu, Abdirisak Mohamed, Yuxin Wen, Tom Goldstein, Furong Huang. [doi]
- First-Order Manifold Data Augmentation for Regression LearningIlya Kaufman, Omri Azencot. [doi]
- Leverage Class-Specific Accuracy to Guide Data Generation for Improving Image ClassificationJay Gala, Pengtao Xie. [doi]
- Fast Adversarial Attacks on Language Models In One GPU MinuteVinu Sankar Sadasivan, Shoumik Saha, Gaurang Sriramanan, Priyatham Kattakinda, Atoosa Malemir Chegini, Soheil Feizi. [doi]
- Federated Offline Reinforcement Learning: Collaborative Single-Policy Coverage SufficesJiin Woo, Laixi Shi, Gauri Joshi, Yuejie Chi. [doi]
- Converting Transformers to Polynomial Form for Secure Inference Over Homomorphic EncryptionItamar Zimerman, Moran Baruch, Nir Drucker, Gilad Ezov, Omri Soceanu, Lior Wolf. [doi]
- Noise-Adaptive Confidence Sets for Linear Bandits and Application to Bayesian OptimizationKwang-Sung Jun, Jungtaek Kim. [doi]
- Promoting External and Internal Equities Under Ex-Ante/Ex-Post Metrics in Online Resource AllocationKarthik Abinav Sankararaman, Aravind Srinivasan, Pan Xu 0001. [doi]
- On Statistical Learning Theory for Distributional InputsChristian Fiedler, Pierre-François Massiani, Friedrich Solowjow, Sebastian Trimpe. [doi]
- GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision TransformerDing Jia, Jianyuan Guo, Kai Han 0002, Han Wu, Chao Zhang 0001, Chang Xu 0002, Xinghao Chen 0001. [doi]
- Incremental Topological Ordering and Cycle Detection with PredictionsSamuel McCauley, Benjamin Moseley, Aidin Niaparast, Shikha Singh 0002. [doi]
- Unsupervised Episode Generation for Graph Meta-learningJihyeong Jung, Sangwoo Seo, Sungwon Kim 0002, Chanyoung Park. [doi]
- Distributed High-Dimensional Quantile Regression: Estimation Efficiency and Support RecoveryCaixing Wang, Ziliang Shen. [doi]
- Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic SegmentationYunheng Li, Zhongyu Li, Quansheng Zeng, Qibin Hou, Ming-Ming Cheng. [doi]
- Bayesian Exploration NetworksMattie Fellows, Brandon Kaplowitz, Christian Schröder de Witt, Shimon Whiteson. [doi]
- Socialized Learning: Making Each Other Better Through Multi-Agent CollaborationXinjie Yao, Yu Wang 0106, Pengfei Zhu 0001, Wanyu Lin, Jialu Li, Weihao Li, Qinghua Hu. [doi]
- Sharpness-Aware Data Generation for Zero-shot QuantizationHoang Anh Dung, Cuong Pham 0007, Trung Le, Jianfei Cai 0001, Thanh-Toan Do. [doi]
- Size-invariance Matters: Rethinking Metrics and Losses for Imbalanced Multi-object Salient Object DetectionFeiran Li, Qianqian Xu, Shilong Bao, Zhiyong Yang 0001, Runmin Cong, Xiaochun Cao, Qingming Huang. [doi]
- Boosting Offline Optimizers with Surrogate SensitivityManh Cuong Dao, Phi-Le Nguyen, Truong Thao Nguyen, Trong Nghia Hoang. [doi]
- Arrows of Time for Large Language ModelsVassilis Papadopoulos, Jérémie Wenger, Clément Hongler. [doi]
- Optimal Differentially Private Model Training with Public DataAndrew Lowy, Zeman Li, Tianjian Huang, Meisam Razaviyayn. [doi]
- Binary Decomposition: A Problem Transformation Perspective for Open-Set Semi-Supervised LearningJun-Yi Hang, Min-Ling Zhang. [doi]
- Double-Step Alternating Extragradient with Increasing Timescale Separation for Finding Local Minimax Points: Provable ImprovementsKyuwon Kim, Donghwan Kim. [doi]
- Non-Asymptotic Analysis for Single-Loop (Natural) Actor-Critic with Compatible Function ApproximationYudan Wang, Yue Wang 0068, Yi Zhou 0017, Shaofeng Zou. [doi]
- On the Convergence of Projected Bures-Wasserstein Gradient Descent under Euclidean Strong ConvexityJunyi Fan, Yuxuan Han, Zijian Liu, Jian-Feng Cai 0001, Yang Wang, Zhengyuan Zhou. [doi]
- Enhancing Size Generalization in Graph Neural Networks through Disentangled Representation LearningZheng Huang, Qihui Yang, Dawei Zhou 0003, Yujun Yan. [doi]
- Defense against Model Extraction Attack by Bayesian Active WatermarkingZhenyi Wang, Yihan Wu, Heng Huang. [doi]
- A Unified Adaptive Testing System Enabled by Hierarchical Structure SearchJunhao Yu, Yan Zhuang, Zhenya Huang, Qi Liu 0003, Xin Li 0064, Rui Li, Enhong Chen. [doi]
- NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion ModelsZeqian Ju, Yuancheng Wang, Kai Shen, Xu Tan 0003, Detai Xin, Dongchao Yang, Eric Liu, Yichong Leng, Kaitao Song, Siliang Tang, Zhizheng Wu 0001, Tao Qin 0001, Xiangyang Li 0001, Wei Ye 0004, Shikun Zhang, Jiang Bian 0002, Lei He 0005, Jinyu Li 0001, Sheng Zhao. [doi]
- Generative Flows on Discrete State-Spaces: Enabling Multimodal Flows with Applications to Protein Co-DesignAndrew Campbell, Jason Yim, Regina Barzilay, Tom Rainforth, Tommi S. Jaakkola. [doi]
- Understanding Forgetting in Continual Learning with Linear RegressionMeng Ding, Kaiyi Ji, Di Wang 0015, Jinhui Xu 0001. [doi]
- Topological Neural Networks go Persistent, Equivariant, and ContinuousYogesh Verma, Amauri H. Souza, Vikas Garg 0001. [doi]
- Position: Is machine learning good or bad for the natural sciences?David W. Hogg, Soledad Villar. [doi]
- Towards Efficient Exact Optimization of Language Model AlignmentHaozhe Ji, Cheng Lu 0011, Yilin Niu, Pei Ke, Hongning Wang, Jun Zhu 0001, Jie Tang 0001, Minlie Huang. [doi]
- Evaluation of Trajectory Distribution Predictions with Energy ScoreNovin Shahroudi, Mihkel Lepson, Meelis Kull. [doi]
- Empowering Graph Invariance Learning with Deep Spurious InfomaxTianjun Yao, Yongqiang Chen 0002, Zhenhao Chen, Kai Hu 0010, Zhiqiang Shen, Kun Zhang 0001. [doi]
- Universal Consistency of Wide and Deep ReLU Neural Networks and Minimax Optimal Convergence Rates for Kolmogorov-Donoho Optimal Function ClassesHyunouk Ko, Xiaoming Huo. [doi]
- STEER: Assessing the Economic Rationality of Large Language ModelsNarun Krishnamurthi Raman, Taylor Lundy, Samuel Joseph Amouyal, Yoav Levine, Kevin Leyton-Brown, Moshe Tennenholtz. [doi]
- Why do Variational Autoencoders Really Promote Disentanglement?Pratik Bhowal, Achint Soni, Sirisha Rambhatla. [doi]
- Breadth-First Exploration on Adaptive Grid for Reinforcement LearningYoungsik Yoon, Gangbok Lee, Sungsoo Ahn, Jungseul Ok. [doi]
- On the Trajectory Regularity of ODE-based Diffusion SamplingDefang Chen 0001, Zhenyu Zhou, Can Wang 0001, Chunhua Shen, Siwei Lyu. [doi]
- Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length ExtrapolationZhenyu He 0012, Guhao Feng, Shengjie Luo, Kai Yang, Liwei Wang 0001, Jingjing Xu, Zhi Zhang, Hongxia Yang, Di He 0001. [doi]
- Position: Exploring the Robustness of Pipeline-Parallelism-Based Decentralized TrainingLin Lu, Chenxi Dai, Wangcheng Tao, Binhang Yuan, Yanan Sun, Pan Zhou 0001. [doi]
- Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy DataFahim Tajwar, Anikait Singh, Archit Sharma, Rafael Rafailov, Jeff Schneider 0001, Tengyang Xie, Stefano Ermon, Chelsea Finn, Aviral Kumar. [doi]
- Training Greedy Policy for Proposal Batch Selection in Expensive Multi-Objective Combinatorial OptimizationDeokjae Lee, Hyun Oh Song, KyungHyun Cho. [doi]
- Fair Off-Policy Learning from Observational DataDennis Frauen, Valentyn Melnychuk, Stefan Feuerriegel. [doi]
- The Surprising Effectiveness of Skip-Tuning in Diffusion SamplingJiaJun Ma, Shuchen Xue, Tianyang Hu, Wenjia Wang, Zhaoqiang Liu, Zhenguo Li, Zhi-Ming Ma, Kenji Kawaguchi. [doi]
- Two Heads are Actually Better than One: Towards Better Adversarial Robustness via Transduction and RejectionNils Palumbo, Yang Guo, Xi Wu 0001, Jiefeng Chen 0001, Yingyu Liang, Somesh Jha. [doi]
- On the Nonlinearity of Layer NormalizationYunhao Ni, Yuxin Guo, Junlong Jia, Lei Huang 0015. [doi]
- Scalable and Flexible Causal Discovery with an Efficient Test for AdjacencyAlan Nawzad Amin, Andrew Gordon Wilson. [doi]
- Federated Continual Learning via Prompt-based Dual Knowledge TransferHongming Piao, Yichen Wu, Dapeng Wu 0001, Ying Wei 0001. [doi]
- Integrating Multimodal Data for Joint Generative Modeling of Complex DynamicsManuel Brenner, Florian Hess, Georgia Koppe, Daniel Durstewitz. [doi]
- MorphGrower: A Synchronized Layer-by-layer Growing Approach for Plausible Neuronal Morphology GenerationNianzu Yang, Kaipeng Zeng, Haotian Lu, Yexin Wu, Zexin Yuan, Danni Chen, Shengdian Jiang, Jiaxiang Wu 0001, Yimin Wang, Junchi Yan. [doi]
- HALC: Object Hallucination Reduction via Adaptive Focal-Contrast DecodingZhaorun Chen, Zhuokai Zhao, Hongyin Luo, Huaxiu Yao, Bo Li 0026, Jiawei Zhou. [doi]
- Exploration by Optimization with Hybrid Regularizers: Logarithmic Regret with Adversarial Robustness in Partial MonitoringTaira Tsuchiya, Shinji Ito, Junya Honda. [doi]
- Distributional Bellman Operators over Mean EmbeddingsLi Kevin Wenliang, Grégoire Delétang, Matthew Aitchison, Marcus Hutter, Anian Ruoss, Arthur Gretton, Mark Rowland. [doi]
- Do Models Explain Themselves? Counterfactual Simulatability of Natural Language ExplanationsYanda Chen, Ruiqi Zhong, Narutatsu Ri, Chen Zhao, He He 0001, Jacob Steinhardt, Zhou Yu, Kathleen R. McKeown. [doi]
- Neighboring Perturbations of Knowledge Editing on Large Language ModelsJun-Yu Ma, Zhen-Hua Ling, Ningyu Zhang 0001, Jia-Chen Gu. [doi]
- BadPart: Unified Black-box Adversarial Patch Attacks against Pixel-wise Regression TasksZhiyuan Cheng 0010, Zhaoyi Liu, Tengda Guo, Shiwei Feng 0002, Dongfang Liu, MingJie Tang, Xiangyu Zhang 0001. [doi]
- A Dynamic Algorithm for Weighted Submodular Cover ProblemKiarash Banihashem, Samira Goudarzi, MohammadTaghi Hajiaghayi, Peyman Jabbarzade, Morteza Monemizadeh. [doi]
- PriorBoost: An Adaptive Algorithm for Learning from Aggregate ResponsesAdel Javanmard, Matthew Fahrbach, Vahab Mirrokni. [doi]
- Quantum Implicit Neural RepresentationsJiaming Zhao, Wenbo Qiao, Peng Zhang, Hui Gao. [doi]
- Improving Antibody Humanness Prediction using Patent DataTalip Ucar, Aubin Ramon, Dino Oglic, Rebecca Croasdale-Wood, Tom Diethe, Pietro Sormanni. [doi]
- VideoPoet: A Large Language Model for Zero-Shot Video GenerationDan Kondratyuk, Lijun Yu, Xiuye Gu, José Lezama, Jonathan Huang, Grant Schindler, Rachel Hornung, Vighnesh Birodkar, Jimmy Yan, Ming-Chang Chiu, Krishna Somandepalli, Hassan Akbari, Yair Alon, Yong Cheng, Joshua V. Dillon, Agrim Gupta, Meera Hahn, Anja Hauth, David Hendon, Alonso-Martinez, David Minnen, Mikhail Sirotenko, Kihyuk Sohn, Xuan Yang, Hartwig Adam, Ming-Hsuan Yang 0001, Irfan Essa, Huisheng Wang, David A. Ross, Bryan Seybold, Lu Jiang 0004. [doi]
- Convergence Guarantees for the DeepWalk Embedding on Block ModelsChristopher Harker, Aditya Bhaskara. [doi]
- Compact Optimality Verification for Optimization ProxiesWenbo Chen 0001, Haoruo Zhao, Mathieu Tanneau, Pascal Van Hentenryck. [doi]
- Learning Mixtures of Gaussian Processes through Random ProjectionEmmanuel Akeweje, Mimi Zhang. [doi]
- Layerwise Change of Knowledge in Neural NetworksXu Cheng, Lei Cheng, Zhaoran Peng, Yang Xu, Tian Han 0001, Quanshi Zhang. [doi]
- A Subquadratic Time Algorithm for Robust Sparse Mean EstimationAnkit Pensia. [doi]
- How Far Can Fairness Constraints Help Recover From Biased Data?Mohit Sharma, Amit Deshpande 0001. [doi]
- Eureka-Moments in Transformers: Multi-Step Tasks Reveal Softmax Induced Optimization ProblemsDavid T. Hoffmann, Simon Schrodi, Jelena Bratulic, Nadine Behrmann, Volker Fischer 0003, Thomas Brox. [doi]
- Privacy Preserving Adaptive Experiment DesignJiachun Li, Kaining Shi, David Simchi-Levi. [doi]
- PARDEN, Can You Repeat That? Defending against Jailbreaks via RepetitionZiyang Zhang, Qizhen Zhang, Jakob Nicolaus Foerster. [doi]
- Better & Faster Large Language Models via Multi-token PredictionFabian Gloeckle, Badr Youbi Idrissi, Baptiste Rozière, David Lopez-Paz, Gabriel Synnaeve. [doi]
- Diffusion Tempering Improves Parameter Estimation with Probabilistic Integrators for Ordinary Differential EquationsJonas Beck, Nathanael Bosch, Michael Deistler, Kyra L. Kadhim, Jakob H. Macke, Philipp Hennig, Philipp Berens. [doi]
- Foundation Policies with Hilbert RepresentationsSeohong Park, Tobias Kreiman, Sergey Levine. [doi]
- Which Frequencies do CNNs Need? Emergent Bottleneck Structure in Feature LearningYuxiao Wen, Arthur Jacot. [doi]
- Stability and Multigroup Fairness in Ranking with Uncertain PredictionsSiddartha Devic, Aleksandra Korolova, David Kempe 0001, Vatsal Sharan. [doi]
- SPADE: Sparsity-Guided Debugging for Deep Neural NetworksArshia Soltani Moakhar, Eugenia Iofinova, Elias Frantar, Dan Alistarh. [doi]
- A Sparsity Principle for Partially Observable Causal Representation LearningDanru Xu, Dingling Yao, Sébastien Lachapelle, Perouz Taslakian, Julius von Kügelgen, Francesco Locatello, Sara Magliacane. [doi]
- How Uniform Random Weights Induce Non-uniform Bias: Typical Interpolating Neural Networks Generalize with Narrow TeachersGon Buzaglo, Itamar Harel, Mor Shpigel Nacson, Alon Brutzkus, Nathan Srebro, Daniel Soudry. [doi]
- Position: Automatic Environment Shaping is the Next Frontier in RLYounghyo Park, Gabriel B. Margolis, Pulkit Agrawal 0001. [doi]
- CoLoRA: Continuous low-rank adaptation for reduced implicit neural modeling of parameterized partial differential equationsJules Berman, Benjamin Peherstorfer. [doi]
- Regression Learning with Limited Observations of Multivariate Outcomes and FeaturesYifan Sun, Grace Yi. [doi]
- AI Control: Improving Safety Despite Intentional SubversionRyan Greenblatt, Buck Shlegeris, Kshitij Sachan, Fabien Roger. [doi]
- Graph-based Forecasting with Missing Data through Spatiotemporal DownsamplingIvan Marisca, Cesare Alippi, Filippo Maria Bianchi. [doi]
- Breaking through the learning plateaus of in-context learning in TransformerJingwen Fu, Tao Yang, Yuwang Wang, Yan Lu 0001, Nanning Zheng 0001. [doi]
- Cell2Sentence: Teaching Large Language Models the Language of BiologyDaniel Levine, Syed Asad Rizvi, Sacha Lévy, Nazreen Pallikkavaliyaveetil, David Zhang, Xingyu Chen, Sina Ghadermarzi, Ruiming Wu, Zihe Zheng, Ivan Vrkic, Anna Zhong, Daphne Raskin, Insu Han, Antonio Henrique de Oliveira Fonseca, Josue Ortega Caro, Amin Karbasi, Rahul Madhav Dhodapkar, David van Dijk. [doi]
- SiT: Symmetry-invariant Transformers for Generalisation in Reinforcement LearningMatthias Weissenbacher, Rishabh Agarwal, Yoshinobu Kawahara. [doi]
- Learning Optimal Deterministic Policies with Stochastic Policy GradientsAlessandro Montenegro, Marco Mussi, Alberto Maria Metelli, Matteo Papini. [doi]
- Delaunay Graph: Addressing Over-Squashing and Over-Smoothing Using Delaunay TriangulationHugo Attali, Davide Buscaldi, Nathalie Pernelle. [doi]
- Riemannian Accelerated Zeroth-order Algorithm: Improved Robustness and Lower Query ComplexityChang-He, Zhaoye Pan, Xiao Wang, Bo Jiang. [doi]
- Cross-domain Open-world DiscoveryShuo Wen, Maria Brbic. [doi]
- Deep Networks Always Grok and Here is WhyAhmed Imtiaz Humayun, Randall Balestriero, Richard G. Baraniuk. [doi]
- Learning to Scale Logits for Temperature-Conditional GFlowNetsMinsu Kim, Joohwan Ko, Taeyoung Yun, Dinghuai Zhang, Ling Pan, Woochang Kim, Jinkyoo Park, Emmanuel Bengio, Yoshua Bengio. [doi]
- Decomposable Submodular Maximization in Federated SettingAkbar Rafiey. [doi]
- Helpful or Harmful Data? Fine-tuning-free Shapley Attribution for Explaining Language Model PredictionsJingtan Wang, Xiaoqiang Lin, Rui Qiao 0006, Chuan-Sheng Foo, Bryan Kian Hsiang Low. [doi]
- VITS : Variational Inference Thompson Sampling for contextual banditsPierre Clavier, Tom Huix, Alain Oliviero Durmus. [doi]
- Promises and Pitfalls of Generative Masked Language Modeling: Theoretical Framework and Practical GuidelinesYuchen Li, Alexandre Kirchmeyer, Aashay Mehta, Yilong Qin, Boris Dadachev, Kishore Papineni, Sanjiv Kumar, Andrej Risteski. [doi]
- Best Arm Identification for Stochastic Rising BanditsMarco Mussi, Alessandro Montenegro, Francesco Trovò, Marcello Restelli, Alberto Maria Metelli. [doi]
- Improved Operator Learning by Orthogonal AttentionZipeng Xiao, Zhongkai Hao, Bokai Lin, Zhijie Deng, Hang Su 0006. [doi]
- FiT: Flexible Vision Transformer for Diffusion ModelZeyu Lu, Zidong Wang, Di Huang, Chengyue Wu, Xihui Liu, Wanli Ouyang, Lei Bai 0001. [doi]
- Position: Towards Implicit Prompt For Text-To-Image ModelsYue Yang, Yuqi Lin, Hong Liu, Wenqi Shao, Runjian Chen, Hailong Shang, Yu Wang, Yu Qiao 0001, Kaipeng Zhang, Ping Luo 0002. [doi]
- From Inverse Optimization to Feasibility to ERMSaurabh Mishra, Anant Raj, Sharan Vaswani. [doi]
- Spectral Preconditioning for Gradient Methods on Graded Non-convex FunctionsNikita Doikov, Sebastian U. Stich, Martin Jaggi. [doi]
- Using Left and Right Brains Together: Towards Vision and Language PlanningJun Cen, Chenfei Wu, Xiao Liu 0029, Shengming Yin, Yixuan Pei, Jinglong Yang, Qifeng Chen, Nan Duan, Jianguo Zhang. [doi]
- Individual Fairness in Graph DecompositionKamesh Munagala, Govind S. Sankar. [doi]
- DiffDA: a Diffusion model for weather-scale Data AssimilationLangwen Huang, Lukas Gianinazzi, Yuejiang Yu, Peter D. Düben, Torsten Hoefler. [doi]
- GFlowNet Training by Policy GradientsPuhua Niu, Shili Wu, Mingzhou Fan, Xiaoning Qian. [doi]
- The Max-Min Formulation of Multi-Objective Reinforcement Learning: From Theory to a Model-Free AlgorithmGiseung Park, Woohyeon Byeon, Seongmin Kim, Elad Havakuk, Amir Leshem, Youngchul Sung. [doi]
- Efficient Contextual Bandits with Uninformed Feedback GraphsMengxiao Zhang, Yuheng Zhang, Haipeng Luo, Paul Mineiro. [doi]
- Offline Imitation from Observation via Primal Wasserstein State Occupancy MatchingKai Yan, Alexander G. Schwing, Yu-Xiong Wang. [doi]
- Linear Explanations for Individual NeuronsTuomas P. Oikarinen, Tsui-Wei Weng. [doi]
- Position: Why Tabular Foundation Models Should Be a Research PriorityBoris van Breugel, Mihaela van der Schaar. [doi]
- Rapid Learning without Catastrophic Forgetting in the Morris Water MazeRaymond Wang, Jaedong Hwang, Akhilan Boopathy, Ila R. Fiete. [doi]
- GeoAB: Towards Realistic Antibody Design and Reliable Affinity MaturationHaitao Lin, Lirong Wu, Yufei Huang 0002, Yunfan Liu 0002, Odin Zhang, Yuanqing Zhou, Rui Sun, Stan Z. Li. [doi]
- An Independence-promoting Loss for Music Generation with Language ModelsJean-Marie Lemercier, Simon Rouard, Jade Copet, Yossi Adi, Alexandre Défossez. [doi]
- A connection between Tempering and Entropic Mirror DescentNicolas Chopin, Francesca R. Crucinio, Anna Korba. [doi]
- SelfIE: Self-Interpretation of Large Language Model EmbeddingsHaozhe Chen, Carl Vondrick, Chengzhi Mao. [doi]
- Projection-Free Variance Reduction Methods for Stochastic Constrained Multi-Level Compositional OptimizationWei Jiang, Sifan Yang, Wenhao Yang, Yibo Wang, Yuanyu Wan, Lijun Zhang 0005. [doi]
- Towards Global Optimality for Practical Average Reward Reinforcement Learning without Mixing Time OraclesBhrij Patel, Wesley A. Suttle, Alec Koppel, Vaneet Aggarwal, Brian M. Sadler, Dinesh Manocha, Amrit S. Bedi. [doi]
- Critical feature learning in deep neural networksKirsten Fischer, Javed Lindner, David Dahmen, Zohar Ringel, Michael Krämer, Moritz Helias. [doi]
- Feasible Reachable Policy IterationShentao Qin, Yujie Yang, Yao Mu, Jie Li 0042, Wenjun Zou, Jingliang Duan, Shengbo Eben Li. [doi]
- Correcting Diffusion-Based Perceptual Image Compression with Privileged End-to-End DecoderYiyang Ma, Wenhan Yang, Jiaying Liu 0001. [doi]
- Risk Aware Benchmarking of Large Language ModelsApoorva Nitsure, Youssef Mroueh, Mattia Rigotti, Kristjan H. Greenewald, Brian Belgodere, Mikhail Yurochkin, Jirí Navrátil 0001, Igor Melnyk, Jarret Ross. [doi]
- The Perception-Robustness Tradeoff in Deterministic Image RestorationGuy Ohayon, Tomer Michaeli, Michael Elad. [doi]
- Probabilistic Inference in Language Models via Twisted Sequential Monte CarloStephen Zhao, Rob Brekelmans, Alireza Makhzani, Roger Baker Grosse. [doi]
- A Computational Framework for Solving Wasserstein Lagrangian FlowsKirill Neklyudov, Rob Brekelmans, Alexander Tong 0001, Lazar Atanackovic, Qiang Liu, Alireza Makhzani. [doi]
- Q-Align: Teaching LMMs for Visual Scoring via Discrete Text-Defined LevelsHaoning Wu 0001, Zicheng Zhang, Weixia Zhang, Chaofeng Chen, Liang Liao, Chunyi Li, Yixuan Gao, Annan Wang, Erli Zhang 0001, Wenxiu Sun, Qiong Yan, Xiongkuo Min, Guangtao Zhai, Weisi Lin. [doi]
- Statistically Optimal Generative Modeling with Maximum Deviation from the Empirical DistributionElen Vardanyan, Sona Hunanyan, Tigran Galstyan, Arshak Minasyan, Arnak S. Dalalyan. [doi]
- Robust Sparse Estimation for Gaussians with Optimal Error under Huber ContaminationIlias Diakonikolas, Daniel Kane 0001, Sushrut Karmalkar, Ankit Pensia, Thanasis Pittas. [doi]
- Learning Scale-Aware Spatio-temporal Implicit Representation for Event-based Motion DeblurringWei Yu, Jianing Li, Shengping Zhang, Xiangyang Ji. [doi]
- Successor Features for Efficient Multi-Subject Controlled Text GenerationMeng Cao, Mehdi Fatemi, Jackie C. K. Cheung, Samira Shabanian. [doi]
- Hybrid Reinforcement Learning from Offline Observation AloneYuda Song 0001, Drew Bagnell, Aarti Singh. [doi]
- Enhancing Sufficient Dimension Reduction via Hellinger CorrelationSeungBeom Hong, Ilmun Kim, Jun Song. [doi]
- Learning to Route Among Specialized Experts for Zero-Shot GeneralizationMohammed Muqeeth, Haokun Liu, Yufan Liu, Colin Raffel. [doi]
- TSLANet: Rethinking Transformers for Time Series Representation LearningEmadeldeen Eldele, Mohamed Ragab 0002, Zhenghua Chen, Min Wu 0008, Xiaoli Li 0001. [doi]
- On Mechanistic Knowledge Localization in Text-to-Image Generative ModelsSamyadeep Basu, Keivan Rezaei, Priyatham Kattakinda, Vlad I. Morariu, Nanxuan Zhao, Ryan A. Rossi, Varun Manjunatha, Soheil Feizi. [doi]
- Multi-Track Message Passing: Tackling Oversmoothing and Oversquashing in Graph Learning via Preventing Heterophily MixingHongbin Pei, Yu Li, Huiqi Deng, Jingxin Hai, Pinghui Wang, Jie Ma 0001, Jing Tao, Yuheng Xiong, Xiaohong Guan. [doi]
- Not Just Pretty Pictures: Toward Interventional Data Augmentation Using Text-to-Image GeneratorsJianhao Yuan, Francesco Pinto, Adam Davies, Philip Torr 0001. [doi]
- The Good, The Bad, and Why: Unveiling Emotions in Generative AICheng Li, Jindong Wang 0001, Yixuan Zhang, Kaijie Zhu, Xinyi Wang, Wenxin Hou, Jianxun Lian, Fang Luo, Qiang Yang 0001, Xing Xie 0001. [doi]
- RIME: Robust Preference-based Reinforcement Learning with Noisy PreferencesJie Cheng, Gang Xiong 0001, Xingyuan Dai, Qinghai Miao, Yisheng Lv, Fei-Yue Wang 0001. [doi]
- Deep Demonstration Tracing: Learning Generalizable Imitator Policy for Runtime Imitation from a Single DemonstrationXiong-Hui Chen, Junyin Ye, Hang Zhao, Yi-Chen Li 0001, Xuhui Liu, Haoran Shi, Yu-Yan Xu, Zhihao Ye, Si-Hang Yang, Yang Yu 0001, Anqi Huang, Kai Xu 0004, Zongzhang Zhang. [doi]
- Learning to Continually Learn with the Bayesian PrincipleSoochan Lee, Hyeonseong Jeon, Jaehyeon Son, Gunhee Kim. [doi]
- Position: Future Directions in the Theory of Graph Machine LearningChristopher Morris 0001, Fabrizio Frasca, Nadav Dym, Haggai Maron, Ismail Ilkan Ceylan, Ron Levie, Derek Lim, Michael M. Bronstein, Martin Grohe, Stefanie Jegelka. [doi]
- Transferable Facial Privacy Protection against Blind Face Restoration via Domain-Consistent Adversarial ObfuscationKui Zhang, Hang Zhou 0007, Jie Zhang 0073, Wenbo Zhou, Weiming Zhang 0001, Nenghai Yu. [doi]
- Self-attention Networks Localize When QK-eigenspectrum ConcentratesHan Bao 0002, Ryuichiro Hataya, Ryo Karakida. [doi]
- TIC-TAC: A Framework For Improved Covariance Estimation In Deep Heteroscedastic RegressionMegh Shukla, Mathieu Salzmann, Alexandre Alahi. [doi]
- On the sample complexity of conditional independence testing with Von Mises estimator with application to causal discoveryFateme Jamshidi, Luca Ganassali, Negar Kiyavash. [doi]
- The Benefits of Reusing Batches for Gradient Descent in Two-Layer Networks: Breaking the Curse of Information and Leap ExponentsYatin Dandi, Emanuele Troiani, Luca Arnaboldi 0002, Luca Pesce, Lenka Zdeborová, Florent Krzakala. [doi]
- Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features CriticsLuca Grillotti, Maxence Faldor, Borja G. León, Antoine Cully. [doi]
- Minimum Norm Interpolation Meets The Local Theory of Banach SpacesGil Kur, Pedro Abdalla, Pierre Bizeul, Fanny Yang. [doi]
- Parameter-Efficient Fine-Tuning with ControlsChi Zhang 0007, Jingpu Cheng, Yanyu Xu, Qianxiao Li. [doi]
- Dynamic Facility Location in High Dimensional Euclidean SpacesSayan Bhattacharya, Gramoz Goranci, Shaofeng H.-C. Jiang, Yi Qian, Yubo Zhang. [doi]
- Probabilistic Routing for Graph-Based Approximate Nearest Neighbor SearchKejing Lu, Chuan Xiao 0001, Yoshiharu Ishikawa. [doi]
- Towards Optimal Adversarial Robust Q-learning with Bellman Infinity-errorHaoran Li, Zicheng Zhang, Wang Luo, Congying Han, Yudong Hu, Tiande Guo, Shichen Liao. [doi]
- Activation-Descent Regularization for Input Optimization of ReLU NetworksHongzhan Yu, Sicun Gao. [doi]
- Fair Classification with Partial Feedback: An Exploration-Based Data Collection ApproachVijay Keswani, Anay Mehrotra, L. Elisa Celis. [doi]
- Scaling Laws for Fine-Grained Mixture of ExpertsJan Ludziejewski, Jakub Krajewski, Kamil Adamczewski, Maciej Pióro, Michal Krutul, Szymon Antoniak, Kamil Ciebiera, Krystian Król, Tomasz Odrzygózdz, Piotr Sankowski, Marek Cygan, Sebastian Jaszczur. [doi]
- InferCept: Efficient Intercept Support for Augmented Large Language Model InferenceReyna Abhyankar, Zijian He, Vikranth Srivatsa, Hao Zhang, Yiying Zhang 0005. [doi]
- Balanced Data, Imbalanced Spectra: Unveiling Class Disparities with Spectral ImbalanceChiraag Kaushik, Ran Liu, Chi-Heng Lin, Amrit Khera, Matthew Y. Jin, Wenrui Ma, Vidya Muthukumar, Eva L. Dyer. [doi]
- Partial Optimality in the Linear Ordering ProblemDavid Stein 0001, Bjoern Andres. [doi]
- Efficient Low-Rank Matrix Estimation, Experimental Design, and Arm-Set-Dependent Low-Rank BanditsKyoungseok Jang, Chicheng Zhang, Kwang-Sung Jun. [doi]
- Fine-Grained Causal Dynamics Learning with Quantization for Improving Robustness in Reinforcement LearningInwoo Hwang, Yunhyeok Kwak, Suhyung Choi, Byoung-Tak Zhang, Sanghack Lee. [doi]
- Adaptive Horizon Actor-Critic for Policy Learning in Contact-Rich Differentiable SimulationIgnat Georgiev, Krishnan Srinivasan, Jie Xu, Eric Heiden, Animesh Garg. [doi]
- Mastering Zero-Shot Interactions in Cooperative and Competitive Simultaneous GamesYannik Mahlau, Frederik Schubert, Bodo Rosenhahn. [doi]
- Gaussian Processes on Cellular ComplexesMathieu Alain, So Takao, Brooks Paige, Marc Peter Deisenroth. [doi]
- Theory of Consistency Diffusion Models: Distribution Estimation Meets Fast SamplingZehao Dou, Minshuo Chen, Mengdi Wang, Zhuoran Yang. [doi]
- Knowledge Distillation with Auxiliary VariableBo Peng, Zhen Fang 0001, Guangquan Zhang 0001, Jie Lu 0001. [doi]
- Parallelized Spatiotemporal Slot Binding for VideosGautam Singh, Yue Wang, Jiawei Yang, Boris Ivanovic, Sungjin Ahn, Marco Pavone 0001, Tong Che. [doi]
- Copyright Traps for Large Language ModelsMatthieu Meeus, Igor Shilov, Manuel Faysse, Yves-Alexandre de Montjoye. [doi]
- Efficient Denoising Diffusion via Probabilistic MaskingWeizhong Zhang, Zhiwei Zhang, Renjie Pi, Zhongming Jin, Yuan Gao 0015, Jieping Ye, Kani Chen. [doi]
- Regularized Q-learning through Robust AveragingPeter Schmitt-Förster, Tobias Sutter. [doi]
- Towards Realistic Model Selection for Semi-supervised LearningMuyang Li, Xiaobo Xia, Runze Wu, Fengming Huang, Jun Yu 0001, Bo Han 0003, Tongliang Liu. [doi]
- Causal Discovery with Fewer Conditional Independence TestsKirankumar Shiragur, Jiaqi Zhang, Caroline Uhler. [doi]
- Failures Are Fated, But Can Be Faded: Characterizing and Mitigating Unwanted Behaviors in Large-Scale Vision and Language ModelsSom Sagar, Aditya Taparia, Ransalu Senanayake. [doi]
- Split-and-Denoise: Protect large language model inference with local differential privacyPeihua Mai, Ran Yan, Zhe Huang, Youjia Yang, Yan Pang. [doi]
- VoroNav: Voronoi-based Zero-shot Object Navigation with Large Language ModelPengying Wu, Yao Mu, Bingxian Wu, Yi Hou, Ji Ma, Shanghang Zhang, Chang Liu 0002. [doi]
- Memory Consolidation Enables Long-Context Video UnderstandingIvana Balazevic, Yuge Shi, Pinelopi Papalampidi, Rahma Chaabouni 0001, Skanda Koppula, Olivier J. Hénaff. [doi]
- Floating Anchor Diffusion Model for Multi-motif ScaffoldingKe Li 0015, Weian Mao, Shuaike Shen, Xiaoran Jiao, Zheng Sun, Hao Cheng 0012, Chunhua Shen. [doi]
- Contamination-Resilient Anomaly Detection via Adversarial Learning on Partially-Observed Normal and Anomalous DataWenxi Lv, Qinliang Su, Hai Wan, Hongteng Xu, Wenchao Xu. [doi]
- Estimating Distributional Treatment Effects in Randomized Experiments: Machine Learning for Variance ReductionUndral Byambadalai, Tatsushi Oka, Shota Yasui. [doi]
- Aligning Transformers with Weisfeiler-LemanLuis Müller, Christopher Morris 0001. [doi]
- RoSA: Accurate Parameter-Efficient Fine-Tuning via Robust AdaptationMahdi Nikdan, Soroush Tabesh, Elvir Crncevic, Dan Alistarh. [doi]
- From Vision to Audio and Beyond: A Unified Model for Audio-Visual Representation and GenerationKun Su, Xiulong Liu, Eli Shlizerman. [doi]
- Fundamental Limits of Distributed Covariance Matrix Estimation Under Communication ConstraintsMohammad-Reza Rahmani, Mohammad Hossein Yassaee, Mohammad Ali Maddah-Ali, Mohammad Reza Aref. [doi]
- Equivariant Frames and the Impossibility of Continuous CanonicalizationNadav Dym, Hannah Lawrence, Jonathan W. Siegel. [doi]
- Bayesian Uncertainty for Gradient Aggregation in Multi-Task LearningIdan Achituve, Idit Diamant, Arnon Netzer, Gal Chechik, Ethan Fetaya. [doi]
- Posterior Sampling-Based Bayesian Optimization with Tighter Bayesian Regret BoundsShion Takeno, Yu Inatsu, Masayuki Karasuyama, Ichiro Takeuchi. [doi]
- Break the Sequential Dependency of LLM Inference Using Lookahead DecodingYichao Fu, Peter Bailis, Ion Stoica, Hao Zhang 0108. [doi]
- PRISE: LLM-Style Sequence Compression for Learning Temporal Action Abstractions in ControlRuijie Zheng, Ching-An Cheng, Hal Daumé III, Furong Huang, Andrey Kolobov. [doi]
- The Pitfalls and Promise of Conformal Inference Under Adversarial AttacksZiquan Liu, Yufei Cui, Yan Yan, Yi Xu 0008, Xiangyang Ji, Xue Liu 0001, Antoni B. Chan. [doi]
- Interpreting Equivariant RepresentationsAndreas Abildtrup Hansen, Anna Calissano, Aasa Feragen. [doi]
- Tag-LLM: Repurposing General-Purpose LLMs for Specialized DomainsJunhong Shen, Neil A. Tenenholtz, James Brian Hall, David Alvarez-Melis, Nicolò Fusi. [doi]
- Stochastic Localization via Iterative Posterior SamplingLouis Grenioux, Maxence Noble, Marylou Gabrié, Alain Oliviero Durmus. [doi]
- Impact of Decentralized Learning on Player Utilities in Stackelberg GamesKate Donahue, Nicole Immorlica, Meena Jagadeesan, Brendan Lucier, Aleksandrs Slivkins. [doi]
- Unlocking the Power of Spatial and Temporal Information in Medical Multimodal Pre-trainingJinxia Yang, Bing Su 0001, Xin Zhao 0018, Ji-Rong Wen. [doi]
- Rethinking Guidance Information to Utilize Unlabeled Samples: A Label Encoding PerspectiveYulong Zhang, Yuan Yao, Shuhao Chen, Pengrong Jin, Yu Zhang, Jian Jin, Jiangang Lu. [doi]
- Total Variation Floodgate for Variable Importance Inference in ClassificationWenshuo Wang, Lucas Janson, Lihua Lei, Aaditya Ramdas. [doi]
- Why Do Animals Need Shaping? A Theory of Task Composition and Curriculum LearningJin Hwa Lee, Stefano Sarao Mannelli, Andrew M. Saxe. [doi]
- PlanDQ: Hierarchical Plan Orchestration via D-Conductor and Q-PerformerChang Chen, Junyeob Baek, Fei Deng, Kenji Kawaguchi, Caglar Gulcehre, Sungjin Ahn. [doi]
- PDHG-Unrolled Learning-to-Optimize Method for Large-Scale Linear ProgrammingBingheng Li, Linxin Yang, Yupeng Chen, Senmiao Wang, Haitao Mao, Qian Chen, Yao Ma 0001, Akang Wang, Tian Ding, Jiliang Tang, Ruoyu Sun 0001. [doi]
- A Dual-module Framework for Counterfactual Estimation over TimeXin Wang, Shengfei Lyu, Lishan Yang, Yibing Zhan, Huanhuan Chen. [doi]
- LaMAGIC: Language-Model-based Topology Generation for Analog Integrated CircuitsChen-Chia Chang, Yikang Shen, Shaoze Fan, Jing Li, Shun Zhang, Ningyuan Cao, Yiran Chen, Xin Zhang. [doi]
- Position: Data Authenticity, Consent, & Provenance for AI are all broken: what will it take to fix them?Shayne Longpre, Robert Mahari, Naana Obeng-Marnu, William Brannon, Tobin South, Katy Ilonka Gero, Alex Pentland, Jad Kabbara. [doi]
- SeMOPO: Learning High-quality Model and Policy from Low-quality Offline Visual DatasetsShenghua Wan, Ziyuan Chen, Le Gan, Shuai Feng, De-Chuan Zhan. [doi]
- Defining Neural Network Architecture through Polytope Structures of DatasetsSangmin Lee 0017, Abbas Mammadov, Jong Chul Ye. [doi]
- Equivariant Graph Neural Operator for Modeling 3D DynamicsMinkai Xu, Jiaqi Han, Aaron Lou, Jean Kossaifi, Arvind Ramanathan, Kamyar Azizzadenesheli, Jure Leskovec, Stefano Ermon, Anima Anandkumar. [doi]
- Environment Design for Inverse Reinforcement LearningThomas Kleine Buening, Victor Villin, Christos Dimitrakakis. [doi]
- Tackling Non-Stationarity in Reinforcement Learning via Causal-Origin RepresentationWanpeng Zhang 0002, Yilin Li, Boyu Yang, Zongqing Lu. [doi]
- Optimal Kernel Quantile Learning with Random FeaturesCaixing Wang, Xingdong Feng. [doi]
- Relaxing the Accurate Imputation Assumption in Doubly Robust Learning for Debiased Collaborative FilteringHaoxuan Li, Chunyuan Zheng, Shuyi Wang, Kunhan Wu, Eric Hao Wang, Peng Wu 0012, Zhi Geng, Xu Chen, Xiao-Hua Zhou. [doi]
- PAPM: A Physics-aware Proxy Model for Process SystemsPengwei Liu, Zhongkai Hao, Xingyu Ren, Hangjie Yuan, Jiayang Ren, Dong Ni 0002. [doi]
- Tackling Prevalent Conditions in Unsupervised Combinatorial Optimization: Cardinality, Minimum, Covering, and MoreFanchen Bu, Hyeonsoo Jo, Soo Yong Lee, Sungsoo Ahn, Kijung Shin. [doi]
- Beyond the Calibration Point: Mechanism Comparison in Differential PrivacyGeorgios Kaissis, Stefan Kolek, Borja Balle, Jamie Hayes, Daniel Rueckert. [doi]
- Dual Operating Modes of In-Context LearningZiqian Lin, Kangwook Lee 0001. [doi]
- Do Large Language Models Perform the Way People Expect? Measuring the Human Generalization FunctionKeyon Vafa, Ashesh Rambachan, Sendhil Mullainathan. [doi]
- BBox-Adapter: Lightweight Adapting for Black-Box Large Language ModelsHaotian Sun, Yuchen Zhuang, Wei Wei 0019, Chao Zhang 0014, Bo Dai 0001. [doi]
- Adversarial Robustness Limits via Scaling-Law and Human-Alignment StudiesBrian R. Bartoldson, James Diffenderfer, Konstantinos Parasyris, Bhavya Kailkhura. [doi]
- Fundamental Limitations of Alignment in Large Language ModelsYotam Wolf, Noam Wies, Oshri Avnery, Yoav Levine, Amnon Shashua. [doi]
- Prediction-powered Generalization of Causal InferencesIlker Demirel, Ahmed M. Alaa, Anthony Philippakis, David A. Sontag. [doi]
- Geometry-Calibrated DRO: Combating Over-Pessimism with Free Energy ImplicationsJiashuo Liu, Jiayun Wu, Tianyu Wang, Hao Zou 0001, Bo Li 0064, Peng Cui 0001. [doi]
- Unraveling the Impact of Heterophilic Structures on Graph Positive-Unlabeled LearningYuhao Wu, Jiangchao Yao, Bo Han 0003, Lina Yao 0001, Tongliang Liu. [doi]
- Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement LearningZhiheng Xi, Wenxiang Chen, Boyang Hong, Senjie Jin, Rui Zheng, Wei He, Yiwen Ding, Shichun Liu, Xin Guo, Junzhe Wang, Honglin Guo, Wei Shen, Xiaoran Fan, Yuhao Zhou, Shihan Dou, Xiao Wang 0001, Xinbo Zhang, Peng Sun 0006, Tao Gui, Qi Zhang 0001, Xuanjing Huang 0001. [doi]
- SAPG: Split and Aggregate Policy GradientsJayesh Singla, Ananye Agarwal, Deepak Pathak. [doi]
- Disguised Copyright Infringement of Latent Diffusion ModelsYiwei Lu 0001, Matthew Y. R. Yang, Zuoqiu Liu, Gautam Kamath 0001, Yaoliang Yu. [doi]
- MusicRL: Aligning Music Generation to Human PreferencesGeoffrey Cideron, Sertan Girgin, Mauro Verzetti, Damien Vincent, Matej Kastelic, Zalán Borsos, Brian McWilliams, Victor Ungureanu, Olivier Bachem, Olivier Pietquin, Matthieu Geist, Léonard Hussenot, Neil Zeghidour, Andrea Agostinelli. [doi]
- Position: Insights from Survey Methodology can Improve Training DataStephanie Eckman, Barbara Plank, Frauke Kreuter. [doi]
- A Generative Approach for Treatment Effect Estimation under Collider Bias: From an Out-of-Distribution PerspectiveBaohong Li, Haoxuan Li, Anpeng Wu, Minqin Zhu, Shiyuan Peng, Qingyu Cao, Kun Kuang. [doi]
- Position: Compositional Generative Modeling: A Single Model is Not All You NeedYilun Du, Leslie Pack Kaelbling. [doi]
- Minimum-Norm Interpolation Under Covariate ShiftNeil Mallinar, Austin Zane, Spencer Frei, Bin Yu 0001. [doi]
- Position: Key Claims in LLM Research Have a Long Tail of FootnotesAnna Rogers, Sasha Luccioni. [doi]
- How Graph Neural Networks Learn: Lessons from Training DynamicsChenxiao Yang, Qitian Wu, David Wipf, Ruoyu Sun 0001, Junchi Yan. [doi]
- Fast Algorithms for Hypergraph PageRank with Applications to Semi-Supervised LearningKonstantinos Ameranis, Adela Frances DePavia, Lorenzo Orecchia, Erasmo Tani. [doi]
- Attack-free Evaluating and Enhancing Adversarial Robustness on Categorical DataYujun Zhou 0002, Yufei Han, Haomin Zhuang, Hongyan Bao, Xiangliang Zhang 0001. [doi]
- Distilling Morphology-Conditioned Hypernetworks for Efficient Universal Morphology ControlZheng Xiong, Risto Vuorio, Jacob Beck, Matthieu Zimmer, Kun Shao, Shimon Whiteson. [doi]
- Diffuse, Sample, Project: Plug-And-Play Controllable Graph GenerationKartik Sharma, Srijan Kumar, Rakshit Trivedi. [doi]
- Simplicity Bias of Two-Layer Networks beyond Linearly Separable DataNikita Tsoy, Nikola Konstantinov. [doi]
- Improving Adversarial Energy-Based Model via Diffusion ProcessCong Geng, Tian Han, Peng-Tao Jiang, Hao Zhang, Jinwei Chen, Søren Hauberg, Bo Li. [doi]
- RoboDreamer: Learning Compositional World Models for Robot ImaginationSiyuan Zhou, Yilun Du, Jiaben Chen, Yandong Li, Dit-Yan Yeung, Chuang Gan. [doi]
- Information-Directed Pessimism for Offline Reinforcement LearningAlec Koppel, Sujay Bhatt, Jiacheng Guo, Joe Eappen, Mengdi Wang, Sumitra Ganesh. [doi]
- Is In-Context Learning in Large Language Models Bayesian? A Martingale PerspectiveFabian Falck, Ziyu Wang, Christopher C. Holmes. [doi]
- Graph Out-of-Distribution Detection Goes Neighborhood ShapingTianyi Bao, Qitian Wu, Zetian Jiang, Yiting Chen, Jiawei Sun, Junchi Yan. [doi]
- Efficient World Models with Context-Aware TokenizationVincent Micheli, Eloi Alonso, François Fleuret. [doi]
- Repoformer: Selective Retrieval for Repository-Level Code CompletionDi Wu 0054, Wasi Uddin Ahmad, Dejiao Zhang, Murali Krishna Ramanathan, Xiaofei Ma 0001. [doi]
- Retrieval Across Any Domains via Large-scale Pre-trained ModelJiexi Yan, Zhihui Yin, Chenghao Xu, Cheng Deng, Heng Huang. [doi]
- TERD: A Unified Framework for Safeguarding Diffusion Models Against BackdoorsYichuan Mo, Hui Huang, Mingjie Li, Ang Li, Yisen Wang 0001. [doi]
- EDISON: Enhanced Dictionary-Induced Tensorized Incomplete Multi-View Clustering with Gaussian Error Rank MinimizationZhibin Gu, Zhendong Li, Songhe Feng. [doi]
- Planning, Fast and Slow: Online Reinforcement Learning with Action-Free Offline Data via Multiscale PlannersChengjie Wu, Hao Hu 0006, Yiqin Yang, Ning Zhang, Chongjie Zhang. [doi]
- Fast Decision Boundary based Out-of-Distribution DetectorLitian Liu, Yao Qin 0001. [doi]
- What's the score? Automated Denoising Score Matching for Nonlinear DiffusionsRaghav Singhal, Mark Goldstein, Rajesh Ranganath. [doi]
- Acquisition Conditioned Oracle for Nongreedy Active Feature AcquisitionMichael Valancius, Max Lennon, Junier Oliva. [doi]
- SparQ Attention: Bandwidth-Efficient LLM InferenceLuka Ribar, Ivan Chelombiev, Luke Hudlass-Galley, Charlie Blake, Carlo Luschi, Douglas Orr. [doi]
- Implicit Representations for Constrained Image SegmentationJan Philipp Schneider, Mishal Fatima, Jovita Lukasik, Andreas Kolb 0001, Margret Keuper, Michael Moeller 0001. [doi]
- Achieving Margin Maximization Exponentially Fast via Progressive Norm RescalingMingze Wang, Zeping Min, Lei Wu. [doi]
- Cluster-Aware Similarity Diffusion for Instance RetrievalJifei Luo, Hantao Yao, Changsheng Xu. [doi]
- Counterfactual Metarules for Local and Global RecourseTom Bewley, Salim I. Amoukou, Saumitra Mishra, Daniele Magazzeni, Manuela Veloso. [doi]
- Optimally Improving Cooperative Learning in a Social SettingShahrzad Haddadan, Cheng Xin, Jie Gao. [doi]
- No Wrong Turns: The Simple Geometry Of Neural Networks Optimization PathsCharles Guille-Escuret, Hiroki Naganuma, Kilian Fatras, Ioannis Mitliagkas. [doi]
- GATE: How to Keep Out Intrusive NeighborsNimrah Mustafa, Rebekka Burkholz. [doi]
- MaxMin-RLHF: Alignment with Diverse Human PreferencesSouradip Chakraborty, Jiahao Qiu, Hui Yuan, Alec Koppel, Dinesh Manocha, Furong Huang, Amrit S. Bedi, Mengdi Wang. [doi]
- Learning to Play Atari in a World of TokensPranav Agarwal, Sheldon Andrews, Samira Ebrahimi Kahou. [doi]
- Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMsYeonhong Park, Jake Hyun, SangLyul Cho, Bonggeun Sim, Jae W. Lee. [doi]
- TinyTrain: Resource-Aware Task-Adaptive Sparse Training of DNNs at the Data-Scarce EdgeYoung D. Kwon, Rui Li 0052, Stylianos I. Venieris, Jagmohan Chauhan, Nicholas Donald Lane, Cecilia Mascolo. [doi]
- WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?Alexandre Drouin, Maxime Gasse, Massimo Caccia, Issam H. Laradji, Manuel Del Verme, Tom Marty, David Vázquez 0001, Nicolas Chapados, Alexandre Lacoste. [doi]
- PcLast: Discovering Plannable Continuous Latent StatesAnurag Koul, Shivakanth Sujit, Shaoru Chen, Ben Evans, Lili Wu, Byron Xu, Rajan Chari, Riashat Islam, Raihan Seraj, Yonathan Efroni, Lekan P. Molu, Miroslav Dudík, John Langford 0001, Alex Lamb. [doi]
- NeuralIndicator: Implicit Surface Reconstruction from Neural Indicator PriorsShi-Sheng Huang, Guo Chen, Chen Li Heng, Hua Huang 0001. [doi]
- Rate-Optimal Policy Optimization for Linear Markov Decision ProcessesUri Sherman, Alon Cohen, Tomer Koren, Yishay Mansour. [doi]
- Is Temperature Sample Efficient for Softmax Gaussian Mixture of Experts?Huy Nguyen, Pedram Akbarian, Nhat Ho. [doi]
- Disentanglement Learning via TopologyNikita Balabin, Daria Voronkova, Ilya Trofimov, Evgeny Burnaev, Serguei Barannikov. [doi]
- Coarse-to-Fine Highlighting: Reducing Knowledge Hallucination in Large Language ModelsQitan Lv, Jie Wang 0005, Hanzhu Chen, Bin Li 0025, Yongdong Zhang 0001, Feng Wu 0001. [doi]
- Sparse Dimensionality Reduction RevisitedMikael Møller Høgsgaard, Lior Kamma, Kasper Green Larsen, Jelani Nelson, Chris Schwiegelshohn. [doi]
- Relational DNN Verification With Cross Executional Bound RefinementDebangshu Banerjee, Gagandeep Singh 0001. [doi]
- RVI-SAC: Average Reward Off-Policy Deep Reinforcement LearningYukinari Hisaki, Isao Ono. [doi]
- Projection-Free Online Convex Optimization with Time-Varying ConstraintsDan Garber, Ben Kretzu. [doi]
- Classification Under Strategic Self-SelectionGuy Horowitz, Yonatan Sommer, Moran Koren, Nir Rosenfeld. [doi]
- Generalization Analysis of Stochastic Weight Averaging with General SamplingPeng Wang, Li Shen 0008, Zerui Tao, Shuaida He, Dacheng Tao. [doi]
- CogBench: a large language model walks into a psychology labJulian Coda-Forno, Marcel Binz, Jane X. Wang, Eric Schulz. [doi]
- Human-like Category Learning by Injecting Ecological Priors from Large Language Models into Neural NetworksAkshay Kumar Jagadish, Julian Coda-Forno, Mirko Thalmann, Eric Schulz, Marcel Binz. [doi]
- Resisting Stochastic Risks in Diffusion Planners with the Trajectory Aggregation TreeLang Feng, Pengjie Gu, Bo An 0001, Gang Pan 0001. [doi]
- Learning Latent Structures in Network Games via Data-Dependent Gated-Prior Graph Variational AutoencodersXue Yu, Muchen Li, Yan Leng, Renjie Liao. [doi]
- Confidence-aware Contrastive Learning for Selective ClassificationYu-Chang Wu, Shen-Huan Lyu, Haopu Shang, Xiangyu Wang, Chao Qian 0001. [doi]
- Network Tight Community DetectionJiayi Deng, Xiaodong Yang, Jun Yu, Jun Liu, Zhaiming Shen, Danyang Huang, Huimin Cheng. [doi]
- A Doubly Recursive Stochastic Compositional Gradient Descent Method for Federated Multi-Level Compositional OptimizationHongchang Gao. [doi]
- Concentration Inequalities for General Functions of Heavy-Tailed Random VariablesShaojie Li, Yong Liu 0018. [doi]
- Networked Inequality: Preferential Attachment Bias in Graph Neural Network Link PredictionArjun Subramonian, Levent Sagun, Yizhou Sun. [doi]
- FedREDefense: Defending against Model Poisoning Attacks for Federated Learning using Model Update Reconstruction ErrorYueqi Xie, Minghong Fang, Neil Zhenqiang Gong. [doi]
- DRCT: Diffusion Reconstruction Contrastive Training towards Universal Detection of Diffusion Generated ImagesBaoying Chen, Jishen Zeng, Jianquan Yang, Rui Yang. [doi]
- Collage: Light-Weight Low-Precision Strategy for LLM TrainingTao Yu, Gaurav Gupta, Karthick Gopalswamy, Amith R. Mamidala, Hao Zhou, Jeffrey Huynh, Youngsuk Park, Ron Diamant, Anoop Deoras, Luke Huan. [doi]
- Adaptive Advantage-Guided Policy Regularization for Offline Reinforcement LearningTenglong Liu, Yang Li 0116, Yixing Lan, Hao Gao, Wei Pan 0004, Xin Xu 0001. [doi]
- DiJiang: Efficient Large Language Models through Compact KernelizationHanting Chen, Liuzhi Cheng, Xutao Wang, Yuchuan Tian, Yunhe Wang 0001. [doi]
- Equivariant Deep Weight Space AlignmentAviv Navon, Aviv Shamsian, Ethan Fetaya, Gal Chechik, Nadav Dym, Haggai Maron. [doi]
- Visual Representation Learning with Stochastic Frame PredictionHuiwon Jang, Dongyoung Kim, Junsu Kim, Jinwoo Shin, Pieter Abbeel, Younggyo Seo. [doi]
- Decentralized Convex Finite-Sum Optimization with Better Dependence on Condition NumbersYuxing Liu, Lesi Chen, Luo Luo. [doi]
- Do Efficient Transformers Really Save Computation?Kai Yang, Jan Ackermann, Zhenyu He 0012, Guhao Feng, Bohang Zhang, Yunzhen Feng, Qiwei Ye, Di He 0001, Liwei Wang 0001. [doi]
- Minimally Modifying a Markov Game to Achieve Any Nash Equilibrium and ValueYoung Wu, Jeremy McMahan, Yiding Chen, Yudong Chen 0001, Jerry Zhu, Qiaomin Xie. [doi]
- Gaussian Plane-Wave Neural Operator for Electron Density Estimationseongsu Kim, Sungsoo Ahn. [doi]
- Dynamic Correlation Clustering in Sublinear Update TimeVincent Cohen-Addad, Silvio Lattanzi, Andreas Maggiori, Nikos Parotsidis. [doi]
- On The Fairness Impacts of Hardware Selection in Machine LearningSree Harsha Nelaturu, Nishaanth Kanna Ravichandran, Cuong Tran 0007, Sara Hooker, Ferdinando Fioretto. [doi]
- Training-Free Long-Context Scaling of Large Language ModelsChenxin An, Fei Huang 0004, Jun Zhang, Shansan Gong, Xipeng Qiu, Chang Zhou, Lingpeng Kong. [doi]
- Riemannian Preconditioned LoRA for Fine-Tuning Foundation ModelsFangzhao Zhang, Mert Pilanci. [doi]
- Sequence Compression Speeds Up Credit Assignment in Reinforcement LearningAditya A. Ramesh, Kenny John Young, Louis Kirsch, Jürgen Schmidhuber. [doi]
- INViT: A Generalizable Routing Problem Solver with Invariant Nested View TransformerHan Fang, Zhihao Song, Paul Weng, Yutong Ban. [doi]
- Mobile Attention: Mobile-Friendly Linear-Attention for Vision TransformersZhiyu Yao, Jian Wang, Haixu Wu, Jingdong Wang 0001, Mingsheng Long. [doi]
- AND: Audio Network Dissection for Interpreting Deep Acoustic ModelsTung-Yu Wu, Yu-Xiang Lin, Tsui-Wei Weng. [doi]
- Non-Vacuous Generalization Bounds for Large Language ModelsSanae Lotfi, Marc Anton Finzi, Yilun Kuang, Tim G. J. Rudner, Micah Goldblum, Andrew Gordon Wilson. [doi]
- Delving into the Convergence of Generalized Smooth Minimax OptimizationWenhan Xian, Ziyi Chen 0002, Heng Huang. [doi]
- Tabular Insights, Visual Impacts: Transferring Expertise from Tables to ImagesJun-Peng Jiang, Han-Jia Ye, Leye Wang, Yang Yang 0074, Yuan Jiang 0001, De-Chuan Zhan. [doi]
- Near-Optimal Regret in Linear MDPs with Aggregate Bandit FeedbackAsaf Cassel, Haipeng Luo, Aviv Rosenberg 0002, Dmitry Sotnikov. [doi]
- Implicit Regularization in Feedback Alignment Learning Mechanisms for Neural NetworksZachary Robertson, Sanmi Koyejo. [doi]
- Scaling Exponents Across Parameterizations and OptimizersKatie E. Everett, Lechao Xiao, Mitchell Wortsman, Alexander A. Alemi, Roman Novak, Peter J. Liu, Izzeddin Gur, Jascha Sohl-Dickstein, Leslie Pack Kaelbling, Jaehoon Lee 0001, Jeffrey Pennington. [doi]
- Adaptive Feature Selection for No-Reference Image Quality Assessment by Mitigating Semantic Noise SensitivityXudong Li, Timin Gao, Runze Hu, Yan Zhang 0002, Shengchuan Zhang, Xiawu Zheng, Jingyuan Zheng, Yunhang Shen, Ke Li 0015, Yutao Liu 0002, Pingyang Dai, Rongrong Ji. [doi]
- Learning Latent Dynamic Robust Representations for World ModelsRuixiang Sun, Hongyu Zang, Xin Li 0033, Riashat Islam. [doi]
- Incorporating Information into Shapley Values: Reweighting via a Maximum Entropy ApproachDarya Biparva, Donatello Materassi. [doi]
- Post-hoc Part-Prototype NetworksAndong Tan, Fengtao Zhou, Hao Chen 0011. [doi]
- A Distributional Analogue to the Successor RepresentationHarley Wiltzer, Jesse Farebrother, Arthur Gretton, Yunhao Tang, André Barreto 0001, Will Dabney, Marc G. Bellemare, Mark Rowland. [doi]
- Modelling Microbial Communities with Graph Neural NetworksAlbane Ruaud, Cansu Sancaktar, Marco Bagatella, Christoph Ratzke, Georg Martius. [doi]
- VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual ContextYunxin Li, Baotian Hu, Haoyuan Shi, Wei Wang 0164, Longyue Wang, Min Zhang 0005. [doi]
- Connecting the Dots: Is Mode-Connectedness the Key to Feasible Sample-Based Inference in Bayesian Neural Networks?Emanuel Sommer, Lisa Wimmer, Theodore Papamarkou, Ludwig Bothmann, Bernd Bischl, David Rügamer. [doi]
- Parallel Affine Transformation Tuning of Markov Chain Monte CarloPhilip Schär, Michael Habeck, Daniel Rudolf. [doi]
- Kernel Debiased Plug-in Estimation: Simultaneous, Automated Debiasing without Influence Functions for Many Target ParametersBrian Cho 0001, Yaroslav Mukhin, Kyra Gan, Ivana Malenica. [doi]
- Saliency strikes back: How filtering out high frequencies improves white-box explanationsSabine Muzellec, Thomas Fel, Victor Boutin, Léo Andéol, Rufin VanRullen, Thomas Serre. [doi]
- Transformers Learn Nonlinear Features In Context: Nonconvex Mean-field Dynamics on the Attention LandscapeJuno Kim, Taiji Suzuki. [doi]
- Causality Based Front-door Defense Against Backdoor Attack on Language ModelsYiran Liu, Xiaoang Xu, Zhiyi Hou, Yang Yu. [doi]
- Kepler codebookJunrong Lian, Ziyue Dong, Pengxu Wei, Wei Ke 0003, Chang Liu 0030, Qixiang Ye, Xiangyang Ji, Liang Lin. [doi]
- PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMsSoroush Nasiriany, Fei Xia, Wenhao Yu 0003, Ted Xiao, Jacky Liang, Ishita Dasgupta 0001, Annie Xie, Danny Driess, Ayzaan Wahid, Zhuo Xu, Quan Vuong, Tingnan Zhang, Tsang-Wei Edward Lee, Kuang-Huei Lee, Peng Xu, Sean Kirmani, Yuke Zhu, Andy Zeng, Karol Hausman, Nicolas Heess, Chelsea Finn, Sergey Levine, Brian Ichter. [doi]
- Latent Logic Tree Extraction for Event Sequence Explanation from LLMsZitao Song, Chao Yang, Chaojie Wang 0001, Bo An 0001, Shuang Li 0002. [doi]
- Beyond Regular Grids: Fourier-Based Neural Operators on Arbitrary DomainsLevi E. Lingsch, Mike Yan Michelis, Emmanuel de Bézenac, Sirani M. Perera, Robert K. Katzschmann, Siddhartha Mishra. [doi]
- Learning Causal Relations from Subsampled Time Series with Two Time-SlicesAnpeng Wu, Haoxuan Li, Kun Kuang, Keli Zhang, Fei Wu 0001. [doi]
- DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based Trajectory StitchingGuanghe Li, Yixiang Shan, Zhengbang Zhu, Ting Long, Weinan Zhang 0001. [doi]
- Stochastic positional embeddings improve masked image modelingAmir Bar, Florian Bordes, Assaf Shocher, Mido Assran, Pascal Vincent, Nicolas Ballas, Trevor Darrell, Amir Globerson, Yann LeCun. [doi]
- In-Context Principle Learning from MistakesTianjun Zhang, Aman Madaan, Luyu Gao, Steven Zheng, Swaroop Mishra, Yiming Yang, Niket Tandon, Uri Alon 0002. [doi]
- Provably Robust DPO: Aligning Language Models with Noisy FeedbackSayak Ray Chowdhury, Anush Kini, Nagarajan Natarajan. [doi]
- Evaluating and Analyzing Relationship Hallucinations in Large Vision-Language ModelsMingrui Wu, Jiayi Ji, Oucheng Huang, Jiale Li, Yuhang Wu 0004, Xiaoshuai Sun, Rongrong Ji. [doi]
- Compress Clean Signal from Noisy Raw Image: A Self-Supervised ApproachZhihao Li, Yufei Wang, Alex C. Kot, Bihan Wen. [doi]
- Intersectional Unfairness DiscoveryGezheng Xu, Qi Chen, Charles Ling 0001, Boyu Wang, Changjian Shui. [doi]
- Robust Multi-Task Learning with Excess RisksYifei He, Shiji Zhou, Guojun Zhang, Hyokun Yun, Yi Xu, Belinda Zeng, Trishul Chilimbi, Han Zhao 0002. [doi]
- Generalizing Knowledge Graph Embedding with Universal Orthogonal ParameterizationRui Li 0086, Chaozhuo Li 0001, Yanming Shen, Zeyu Zhang, Xu Chen. [doi]
- Towards Generalization beyond Pointwise Learning: A Unified Information-theoretic PerspectiveYuxin Dong, Tieliang Gong, Hong Chen 0004, Zhongjiang He, Mengxiang Li, Shuangyong Song, Chen Li 0011. [doi]
- Policy Learning for Balancing Short-Term and Long-Term RewardsPeng Wu 0012, Ziyu Shen, Feng Xie, Zhongyao Wang, Chunchen Liu, Yan Zeng. [doi]
- Does Label Smoothing Help Deep Partial Label Learning?Xiuwen Gong, Nitin Bisht, Guandong Xu 0001. [doi]
- D-Flow: Differentiating through Flows for Controlled GenerationHeli Ben Hamu, Omri Puny, Itai Gat, Brian Karrer, Uriel Singer, Yaron Lipman. [doi]
- An Unsupervised Approach for Periodic Source Detection in Time SeriesBerken Utku Demirel, Christian Holz 0001. [doi]
- A New Branch-and-Bound Pruning Framework for ℓ0-Regularized ProblemsThéo Guyard, Cédric Herzet, Clément Elvira, Ayse-Nur Arslan. [doi]
- Subequivariant Reinforcement Learning in 3D Multi-Entity Physical EnvironmentsRunfa Chen, Ling Wang, Yu Du, Tianrui Xue, Fuchun Sun 0001, Jianwei Zhang 0001, Wenbing Huang 0001. [doi]
- Non-parametric Online Change Point Detection on Riemannian ManifoldsXiuheng Wang, Ricardo Augusto Borsoi, Cédric Richard. [doi]
- A Unified View of FANOVA: A Comprehensive Bayesian Framework for Component Selection and EstimationYosra Marnissi, Maxime Leiber. [doi]
- Optimal Acceleration for Minimax and Fixed-Point Problems is Not UniqueTaeho Yoon, Jaeyeon Kim, Jaewook J. Suh, Ernest K. Ryu. [doi]
- Understanding the Learning Dynamics of Alignment with Human FeedbackShawn Im, Yixuan Li. [doi]
- CodeIt: Self-Improving Language Models with Prioritized Hindsight ReplayNatasha Butt, Blazej Manczak, Auke J. Wiggers, Corrado Rainone, David W. Zhang, Michaël Defferrard, Taco Cohen. [doi]
- Precise Accuracy / Robustness Tradeoffs in Regression: Case of General NormsElvis Dohmatob, Meyer Scetbon. [doi]
- MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGIKaining Ying, Fanqing Meng, Jin Wang, Zhiqian Li, Han Lin, Yue Yang, Hao Zhang, Wenbo Zhang, Yuqi Lin, Shuo Liu, Jiayi Lei, Quanfeng Lu, Runjian Chen, Peng Xu, Renrui Zhang, Haozhe Zhang, Peng Gao 0007, Yali Wang, Yu Qiao 0001, Ping Luo 0002, Kaipeng Zhang, Wenqi Shao. [doi]
- Multiplicative Weights Update, Area Convexity and Random Coordinate Descent for Densest Subgraph ProblemsTa Duy Nguyen, Alina Ene. [doi]
- Stay on Topic with Classifier-Free GuidanceGuillaume Sanchez, Alexander Spangher, Honglu Fan, Elad Levi, Stella Biderman. [doi]
- Keypoint-based Progressive Chain-of-Thought Distillation for LLMsKaituo Feng, Changsheng Li, Xiaolu Zhang, Jun Zhou 0011, Ye Yuan 0001, Guoren Wang. [doi]
- Model-Based Minimum Bayes Risk Decoding for Text GenerationYuu Jinnai, Tetsuro Morimura, Ukyo Honda, Kaito Ariu, Kenshi Abe. [doi]
- On the Duality Between Sharpness-Aware Minimization and Adversarial TrainingYihao Zhang, Hangzhou He, Jingyu Zhu, Huanran Chen, Yifei Wang, Zeming Wei. [doi]
- Reducing sequential change detection to sequential estimationShubhanshu Shekhar, Aaditya Ramdas. [doi]
- Simple linear attention language models balance the recall-throughput tradeoffSimran Arora, Sabri Eyuboglu, Michael Zhang, Aman Timalsina, Silas Alberti, James Zou 0001, Atri Rudra, Christopher Ré. [doi]
- MaSS: Multi-attribute Selective Suppression for Utility-preserving Data Transformation from an Information-theoretic PerspectiveYizhuo Chen, Chun-Fu Chen 0001, Hsiang Hsu, Shaohan Hu, Marco Pistoia, Tarek F. Abdelzaher. [doi]
- Expert Proximity as Surrogate Rewards for Single Demonstration Imitation LearningChia-Cheng Chiang, Li-Cheng Lan, Wei-Fang Sun, Chien Feng, Cho-Jui Hsieh, Chun-Yi Lee. [doi]
- Zero-Sum Positional Differential Games as a Framework for Robust Reinforcement Learning: Deep Q-Learning ApproachAnton Plaksin, Vitaly Kalev. [doi]
- Split-Ensemble: Efficient OOD-aware Ensemble via Task and Model SplittingAnthony Chen, Huanrui Yang, Yulu Gan, Denis A. Gudovskiy, Zhen Dong, Haofan Wang, Tomoyuki Okuno, Yohei Nakata, Kurt Keutzer, Shanghang Zhang. [doi]
- Less is More: on the Over-Globalizing Problem in Graph TransformersYujie Xing, Xiao Wang 0017, Yibo Li, Hai Huang, Chuan Shi. [doi]
- Mitigating Oversmoothing Through Reverse Process of GNNs for Heterophilic GraphsMoonjeong Park, Jaeseung Heo, Dongwoo Kim. [doi]
- Mimicking Better by Matching the Approximate Action DistributionJoão A. Cândido Ramos, Lionel Blondé, Naoya Takeishi, Alexandros Kalousis. [doi]
- KV-Runahead: Scalable Causal LLM Inference by Parallel Key-Value Cache GenerationMinsik Cho, Mohammad Rastegari, Devang Naik. [doi]
- Revisit the Essence of Distilling Knowledge through CalibrationWen-Shu Fan, Su Lu, Xin-Chun Li, De-Chuan Zhan, Le Gan. [doi]
- Positive Concave Deep Equilibrium ModelsMateusz Gabor, Tomasz Piotrowski, Renato L. G. Cavalcante. [doi]
- Auto-Encoding Morph-Tokens for Multimodal LLMKaihang Pan, Siliang Tang, Juncheng Li 0006, Zhaoyu Fan, Wei Chow, Shuicheng Yan, Tat-Seng Chua, Yueting Zhuang, Hanwang Zhang. [doi]
- Run-Time Task Composition with Safety SemanticsKevin Leahy 0001, Makai Mann, Zachary Serlin. [doi]
- Video-LaVIT: Unified Video-Language Pre-training with Decoupled Visual-Motional TokenizationYang Jin, Zhicheng Sun 0001, Kun Xu 0005, Kun Xu, Liwei Chen, Hao Jiang, Quzhe Huang, Chengru Song, Yuliang Liu, Di Zhang, Yang Song 0008, Kun Gai, Yadong Mu. [doi]
- Sarah Frank-Wolfe: Methods for Constrained Optimization with Best Rates and Practical FeaturesAleksandr Beznosikov, David Dobre, Gauthier Gidel. [doi]
- Bespoke Non-Stationary Solvers for Fast Sampling of Diffusion and Flow ModelsNeta Shaul, Uriel Singer, Ricky T. Q. Chen, Matthew Le 0001, Ali K. Thabet, Albert Pumarola, Yaron Lipman. [doi]
- Few-Shot Unsupervised Implicit Neural Shape Representation Learning with Spatial AdversariesAmine Ouasfi, Adnane Boukhayma. [doi]
- Conditionally-Conjugate Gaussian Process Factor Analysis for Spike Count Data via Data AugmentationYididiya Y. Nadew, Xuhui Fan 0001, Christopher John Quinn. [doi]
- NExT: Teaching Large Language Models to Reason about Code ExecutionAnsong Ni, Miltiadis Allamanis, Arman Cohan, Yinlin Deng, Kensen Shi, Charles Sutton, Pengcheng Yin. [doi]
- Neural Image Compression with Text-guided Encoding for both Pixel-level and Perceptual FidelityHagyeong Lee, Minkyu Kim 0004, Jun Hyuk Kim, Seungeon Kim, Dokwan Oh, Jaeho Lee 0001. [doi]
- Modular Learning of Deep Causal Generative Models for High-dimensional Causal InferenceMd. Musfiqur Rahman, Murat Kocaoglu. [doi]
- Stability-Informed Initialization of Neural Ordinary Differential EquationsTheodor Westny, Arman Mohammadi, Daniel Jung 0002, Erik Frisk. [doi]
- Rényi Pufferfish Privacy: General Additive Noise Mechanisms and Privacy Amplification by Iteration via Shift Reduction LemmasClément Pierquin, Aurélien Bellet, Marc Tommasi, Matthieu Boussard. [doi]
- Characterizing ResNet's Universal Approximation CapabilityChenghao Liu, Enming Liang, Minghua Chen 0001. [doi]
- Near-Optimal Reinforcement Learning with Self-Play under Adaptivity ConstraintsDan Qiao 0002, Yu-Xiang Wang 0003. [doi]
- On the Tractability of SHAP Explanations under Markovian DistributionsReda Marzouk, Colin de la Higuera. [doi]
- Online Learning with Bounded RecallJon Schneider, Kiran Vodrahalli. [doi]
- BiE: Bi-Exponent Block Floating-Point for Large Language Models QuantizationLancheng Zou, Wenqian Zhao, Shuo Yin, Chen Bai, Qi Sun 0002, Bei Yu 0001. [doi]
- Turnstile ℓp leverage score sampling with applicationsAlexander Munteanu, Simon Omlor. [doi]
- On the Expressive Power of Spectral Invariant Graph Neural NetworksBohang Zhang, Lingxiao Zhao, Haggai Maron. [doi]
- Accelerating Legacy Numerical Solvers by Non-intrusive Gradient-based Meta-solvingSohei Arisaka, Qianxiao Li. [doi]
- 3D-VLA: A 3D Vision-Language-Action Generative World ModelHaoyu Zhen, Xiaowen Qiu, Peihao Chen, Jincheng Yang, Xin Yan 0008, Yilun Du, Yining Hong, Chuang Gan. [doi]
- Deconstructing the Goldilocks Zone of Neural Network InitializationArtem Vysogorets, Anna Dawid, Julia Kempe. [doi]
- Large Language Models Can Automatically Engineer Features for Few-Shot Tabular LearningSungwon Han 0001, Jinsung Yoon, Sercan Ö. Arik, Tomas Pfister. [doi]
- Maestro: Uncovering Low-Rank Structures via Trainable DecompositionSamuel Horváth, Stefanos Laskaridis, Shashank Rajput, Hongyi Wang 0001. [doi]
- Inferring the Long-Term Causal Effects of Long-Term Treatments from Short-Term ExperimentsAllen Tran, Aurélien Bibaut, Nathan Kallus. [doi]
- Predictive Coding beyond CorrelationsTommaso Salvatori, Luca Pinchetti, Amine M'Charrak, Beren Millidge, Thomas Lukasiewicz. [doi]
- Generalization Bound and New Algorithm for Clean-Label Backdoor AttackLijia Yu, Shuang Liu, Yibo Miao, Xiao-Shan Gao, Lijun Zhang. [doi]
- Accelerating Heterogeneous Federated Learning with Closed-form ClassifiersEros Fanì, Raffaello Camoriano, Barbara Caputo, Marco Ciccone. [doi]
- Knowledge Graphs Can be Learned with Just Intersection FeaturesDuy Le, Shaochen Zhong, Zirui Liu, Shuai Xu, Vipin Chaudhary, Kaixiong Zhou, Zhaozhuo Xu. [doi]
- Conditional Common Entropy for Instrumental Variable Testing and Partial IdentificationZiwei Jiang, Murat Kocaoglu. [doi]
- Score-Based Causal Discovery of Latent Variable Causal ModelsIgnavier Ng, Xinshuai Dong, Haoyue Dai, Biwei Huang, Peter Spirtes, Kun Zhang 0001. [doi]
- Candidate Pseudolabel Learning: Enhancing Vision-Language Models by Prompt Tuning with Unlabeled DataJiahan Zhang, Qi Wei 0004, Feng Liu, Lei Feng 0006. [doi]
- Clustered Federated Learning via Gradient-based PartitioningHeasung Kim, Hyeji Kim, Gustavo de Veciana. [doi]
- Superposition Prompting: Improving and Accelerating Retrieval-Augmented GenerationThomas Merth, Qichen Fu, Mohammad Rastegari, Mahyar Najibi. [doi]
- Semantic-Aware Human Object Interaction Image GenerationZhu Xu, Qingchao Chen, Yuxin Peng, Yang Liu. [doi]
- Risk Estimation in a Markov Cost Process: Lower and Upper BoundsGugan Thoppe, Prashanth L. A., Sanjay P. Bhat. [doi]
- Optimistic Multi-Agent Policy GradientWenshuai Zhao, Yi Zhao, Zhiyuan Li, Juho Kannala, Joni Pajarinen. [doi]
- ReLUs Are Sufficient for Learning Implicit Neural RepresentationsJoseph Shenouda, Yamin Zhou, Robert D. Nowak. [doi]
- Generalization Bounds for Heavy-Tailed SDEs through the Fractional Fokker-Planck EquationBenjamin Dupuis, Umut Simsekli. [doi]
- Reflective Policy OptimizationYaozhong Gan, Renye Yan, Zhe Wu, Junliang Xing. [doi]
- Instruction Tuning for Secure Code GenerationJingxuan He, Mark Vero, Gabriela Krasnopolska, Martin T. Vechev. [doi]
- Structured Inverse-Free Natural Gradient Descent: Memory-Efficient & Numerically-Stable KFACWu Lin, Felix Dangel, Runa Eschenhagen, Kirill Neklyudov, Agustinus Kristiadi, Richard E. Turner, Alireza Makhzani. [doi]
- Conformal prediction for multi-dimensional time series by ellipsoidal setsChen Xu, Hanyang Jiang, Yao Xie 0002. [doi]
- Provably Scalable Black-Box Variational Inference with Structured Variational FamiliesJoohwan Ko, Kyurae Kim, Woochang Kim, Jacob R. Gardner. [doi]
- Optimal Ridge Regularization for Out-of-Distribution PredictionPratik Patil, Jin-Hong Du, Ryan J. Tibshirani. [doi]
- High-Dimensional Bayesian Optimization via Semi-Supervised Learning with Optimized Unlabeled Data SamplingYuxuan Yin, Yu Wang 0002, Peng Li 0001. [doi]
- Stealing part of a production language modelNicholas Carlini, Daniel Paleka, Krishnamurthy Dj Dvijotham, Thomas Steinke 0002, Jonathan Hayase, A. Feder Cooper, Katherine Lee, Matthew Jagielski, Milad Nasr, Arthur Conmy, Eric Wallace, David Rolnick, Florian Tramèr. [doi]
- State-Constrained Zero-Sum Differential Games with One-Sided InformationMukesh Ghimire, Lei Zhang, Zhe Xu 0005, Yi Ren. [doi]
- Representation Surgery for Multi-Task Model MergingEnneng Yang, Li Shen 0008, Zhenyi Wang, Guibing Guo, Xiaojun Chen 0006, Xingwei Wang 0001, Dacheng Tao. [doi]
- Localizing Task Information for Improved Model Merging and CompressionKe Wang, Nikolaos Dimitriadis, Guillermo Ortiz-Jiménez, François Fleuret, Pascal Frossard. [doi]
- Pluvial Flood Emulation with Hydraulics-informed Message PassingArnold Kazadi, James Doss-Gollin, Arlei Lopes da Silva. [doi]
- Diversified Batch Selection for Training AccelerationFeng Hong 0004, Yueming Lyu, Jiangchao Yao, Ya Zhang 0002, Ivor W. Tsang, Yanfeng Wang. [doi]
- Beyond the Norms: Detecting Prediction Errors in Regression ModelsAndrés Altieri, Marco Romanelli 0002, Georg Pichler, Florence Alberge, Pablo Piantanida. [doi]
- Q-Star Meets Scalable Posterior Sampling: Bridging Theory and Practice via HyperAgentYingru Li, Jiawei Xu, Lei Han 0001, Zhi-Quan Luo. [doi]
- Private Gradient Descent for Linear Regression: Tighter Error Bounds and Instance-Specific Uncertainty EstimationGavin Brown 0003, Krishnamurthy Dj Dvijotham, Georgina Evans, Daogao Liu, Adam Smith, Abhradeep Guha Thakurta. [doi]
- Online Learning under Budget and ROI Constraints via Weak AdaptivityMatteo Castiglioni, Andrea Celli, Christian Kroer. [doi]
- Unveiling the Cycloid Trajectory of EM Iterations in Mixed Linear RegressionZhankun Luo, Abolfazl Hashemi. [doi]
- A Field Guide for Pacing Budget and ROS ConstraintsSantiago R. Balseiro, Kshipra Bhawalkar, Zhe Feng 0004, Haihao Lu, Vahab Mirrokni, Balasubramanian Sivan, Di Wang 0005. [doi]
- Provably Efficient Long-Horizon Exploration in Monte Carlo Tree Search through State Occupancy RegularizationLiam Schramm, Abdeslam Boularias. [doi]
- Locally Interdependent Multi-Agent MDP: Theoretical Framework for Decentralized Agents with Dynamic DependenciesAlex DeWeese, Guannan Qu. [doi]
- CLIPZyme: Reaction-Conditioned Virtual Screening of EnzymesPeter Mikhael, Itamar Chinn, Regina Barzilay. [doi]
- Adaptive Online Experimental Design for Causal DiscoveryMuhammad Qasim Elahi, Lai Wei, Murat Kocaoglu, Mahsa Ghasemi. [doi]
- In-Context Language Learning: Architectures and AlgorithmsEkin Akyürek, Bailin Wang, Yoon Kim, Jacob Andreas. [doi]
- Zero-Shot Unsupervised and Text-Based Audio Editing Using DDPM InversionHila Manor, Tomer Michaeli. [doi]
- Probabilistic Time Series Modeling with Decomposable Denoising Diffusion ModelTijin Yan, Hengheng Gong, Yongping He, Yufeng Zhan, Yuanqing Xia. [doi]
- Rethinking Specificity in SBDD: Leveraging Delta Score and Energy-Guided DiffusionBowen Gao, Minsi Ren, Yuyan Ni, Yanwen Huang, Bo Qiang, Zhi-Ming Ma, Wei-Ying Ma, Yanyan Lan. [doi]
- Convex Relaxations of ReLU Neural Networks Approximate Global Optima in Polynomial TimeSungyoon Kim, Mert Pilanci. [doi]
- A Closer Look at the Limitations of Instruction TuningSreyan Ghosh, Chandra Kiran Reddy Evuru, Sonal Kumar, Ramaneswaran S., Deepali Aneja, Zeyu Jin, Ramani Duraiswami, Dinesh Manocha. [doi]
- DéjàVu: KV-cache Streaming for Fast, Fault-tolerant Generative LLM ServingFoteini Strati, Sara McAllister, Amar Phanishayee, Jakub Tarnawski, Ana Klimovic. [doi]
- Trainable Transformer in TransformerAbhishek Panigrahi, Sadhika Malladi, Mengzhou Xia, Sanjeev Arora. [doi]
- Bootstrapping Fisher Market Equilibrium and First-Price Pacing EquilibriumLuofeng Liao, Christian Kroer. [doi]
- KernelWarehouse: Rethinking the Design of Dynamic ConvolutionChao Li, Anbang Yao. [doi]
- Batch and match: black-box variational inference with a score-based divergenceDiana Cai, Chirag Modi 0002, Loucas Pillaud-Vivien, Charles Margossian, Robert M. Gower, David M. Blei, Lawrence K. Saul. [doi]
- Exploring Intrinsic Dimension for Vision-Language Model PruningHanzhang Wang, Jiawen Zhang, Qingyuan Ma. [doi]
- Bayesian Knowledge Distillation: A Bayesian Perspective of Distillation with Uncertainty QuantificationLuyang Fang, Yongkai Chen, Wenxuan Zhong, Ping Ma. [doi]
- A Statistical Framework for Data-dependent Retrieval-Augmented ModelsSoumya Basu 0001, Ankit Singh Rawat, Manzil Zaheer. [doi]
- Navigating Complexity: Toward Lossless Graph Condensation via Expanding Window MatchingYuchen Zhang, Tianle Zhang, Kai Wang 0036, Ziyao Guo, Yuxuan Liang, Xavier Bresson, Wei Jin, Yang You 0001. [doi]
- Hierarchical Integral Probability Metrics: A distance on random probability measures with low sample complexityMarta Catalano, Hugo Lavenant. [doi]
- Smoothing Proximal Gradient Methods for Nonsmooth Sparsity Constrained Optimization: Optimality Conditions and Global ConvergenceGanzhao Yuan. [doi]
- Projecting Molecules into Synthesizable Chemical SpacesShitong Luo, Wenhao Gao 0001, Zuofan Wu, Jian Peng 0001, Connor W. Coley, Jianzhu Ma. [doi]
- Demystifying SGD with Doubly Stochastic GradientsKyurae Kim, Joohwan Ko, Yian Ma, Jacob R. Gardner. [doi]
- Dynamic Evaluation of Large Language Models by Meta Probing AgentsKaijie Zhu, Jindong Wang 0001, Qinlin Zhao, Ruochen Xu, Xing Xie 0001. [doi]
- Noise-Aware Algorithm for Heterogeneous Differentially Private Federated LearningSaber Malekmohammadi, Yaoliang Yu, Yang Cao. [doi]
- Graph Adversarial Diffusion ConvolutionSongtao Liu, Jinghui Chen, Tianfan Fu, Lu Lin 0001, Marinka Zitnik, Dinghao Wu. [doi]
- Designing Decision Support Systems using Counterfactual Prediction SetsEleni Straitouri, Manuel Gomez-Rodriguez. [doi]
- Piecewise Constant and Linear Regression Trees: An Optimal Dynamic Programming ApproachMim van den Bos, Jacobus G. M. van der Linden, Emir Demirovic. [doi]
- Improving SAM Requires Rethinking its Optimization FormulationWanyun Xie, Fabian Latorre, Kimon Antonakopoulos, Thomas Pethick, Volkan Cevher. [doi]
- Feature Contamination: Neural Networks Learn Uncorrelated Features and Fail to GeneralizeTianren Zhang, Chujie Zhao, Guanyu Chen, Yizhou Jiang, Feng Chen 0007. [doi]
- Causal Action Influence Aware Counterfactual Data AugmentationNúria Armengol Urpí, Marco Bagatella, Marin Vlastelica, Georg Martius. [doi]
- Kernel-Based Evaluation of Conditional Biological Sequence ModelsPierre Glaser, Steffanie Paul, Alissa M. Hummer, Charlotte M. Deane, Debora Susan Marks, Alan Nawzad Amin. [doi]
- Controllable Prompt Tuning For Balancing Group Distributional RobustnessHoang Phan, Andrew Gordon Wilson, Qi Lei. [doi]
- Hierarchical Novelty Detection via Fine-Grained Evidence AllocationSpandan Pyakurel, Qi Yu. [doi]
- MLAgentBench: Evaluating Language Agents on Machine Learning ExperimentationQian Huang, Jian Vora, Percy Liang, Jure Leskovec. [doi]
- MGit: A Model Versioning and Management SystemWei Hao, Daniel Mendoza, Rafael Mendes, Deepak Narayanan, Amar Phanishayee, Asaf Cidon, Junfeng Yang. [doi]
- VQDNA: Unleashing the Power of Vector Quantization for Multi-Species Genomic Sequence ModelingSiyuan Li, Zedong Wang, Zicheng Liu 0006, Di Wu 0057, Cheng Tan 0012, Jiangbin Zheng, Yufei Huang 0002, Stan Z. Li. [doi]
- Relational Learning in Pre-Trained Models: A Theory from Hypergraph Recovery PerspectiveYang Chen, Cong Fang 0001, Zhouchen Lin, Bing Liu. [doi]
- Averaging n-step Returns Reduces Variance in Reinforcement LearningBrett Daley, Martha White, Marlos C. Machado. [doi]
- Linguistic Calibration of Long-Form GenerationsNeil Band, Xuechen Li, Tengyu Ma 0001, Tatsunori Hashimoto. [doi]
- A Differentiable Partially Observable Generalized Linear Model with Forward-Backward Message PassingChengrui Li, Weihan Li, Yule Wang, Anqi Wu. [doi]
- Genie: Generative Interactive EnvironmentsJake Bruce, Michael D. Dennis, Ashley Edwards, Jack Parker-Holder, Yuge Shi, Edward Hughes 0001, Matthew Lai, Aditi Mavalankar, Richie Steigerwald, Chris Apps, Yusuf Aytar, Sarah Bechtle, Feryal M. P. Behbahani, Stephanie C. Y. Chan, Nicolas Heess, Lucy Gonzalez, Simon Osindero, Sherjil Ozair, Scott E. Reed, Jingwei Zhang 0001, Konrad Zolna, Jeff Clune, Nando de Freitas, Satinder Singh 0001, Tim Rocktäschel. [doi]
- On The Complexity of First-Order Methods in Stochastic Bilevel OptimizationJeongyeol Kwon, Dohyun Kwon, Hanbaek Lyu. [doi]
- Learning Causal Dynamics Models in Object-Oriented EnvironmentsZhongwei Yu, Jingqing Ruan, Dengpeng Xing. [doi]
- Feedback Efficient Online Fine-Tuning of Diffusion ModelsMasatoshi Uehara, Yulai Zhao 0002, Kevin Black, Ehsan Hajiramezanali, Gabriele Scalia, Nathaniel Lee Diamant, Alex M. Tseng, Sergey Levine, Tommaso Biancalani. [doi]
- On the Recoverability of Causal Relations from Temporally Aggregated I.I.D. DataShunxing Fan, Mingming Gong, Kun Zhang 0001. [doi]
- RL-VLM-F: Reinforcement Learning from Vision Language Foundation Model FeedbackYufei Wang, Zhanyi Sun, Jesse Zhang, Zhou Xian, Erdem Biyik, David Held, Zackory Erickson. [doi]
- Predicting Lagrangian Multipliers for Mixed Integer Linear ProgramsFrancesco Demelas, Joseph Le Roux, Mathieu Lacroix, Axel Parmentier. [doi]
- In value-based deep reinforcement learning, a pruned network is a good networkJohan Samir Obando-Ceron, Aaron C. Courville, Pablo Samuel Castro. [doi]
- Self-Consistency Training for Density-Functional-Theory Hamiltonian PredictionHe Zhang, Chang Liu 0030, Zun Wang, Xinran Wei, Siyuan Liu, Nanning Zheng 0001, Bin Shao, Tie-Yan Liu. [doi]
- Accelerating Convergence of Score-Based Diffusion Models, ProvablyGen Li 0005, Yu Huang, Timofey Efimov, Yuting Wei, Yuejie Chi, Yuxin Chen 0002. [doi]
- ReLU to the Rescue: Improve Your On-Policy Actor-Critic with Positive AdvantagesAndrew Jesson, Chris Lu 0001, Gunshi Gupta, Nicolas Beltran-Velez, Angelos Filos, Jakob Nicolaus Foerster, Yarin Gal. [doi]
- Recurrent Early Exits for Federated Learning with Heterogeneous ClientsRoyson Lee, Javier Fernández-Marqués, Shell Xu Hu, Da Li 0001, Stefanos Laskaridis, Lukasz Dudziak, Timothy M. Hospedales, Ferenc Huszár, Nicholas Donald Lane. [doi]
- Density Ratio Estimation with Doubly Strong RobustnessRyosuke Nagumo, Hironori Fujisawa. [doi]
- Scaling Down Deep Learning with MNIST-1DSamuel Greydanus, Dmitry Kobak. [doi]
- Momentor: Advancing Video Large Language Model with Fine-Grained Temporal ReasoningLong Qian, Juncheng Li 0006, Yu Wu 0011, Yaobo Ye, Hao Fei 0001, Tat-Seng Chua, Yueting Zhuang, Siliang Tang. [doi]
- REMEDI: Corrective Transformations for Improved Neural Entropy EstimationViktor Nilsson, Anirban Samaddar, Sandeep Madireddy, Pierre Nyquist. [doi]
- Two Tales of Single-Phase Contrastive Hebbian LearningRasmus Kjær Høier, Christopher Zach. [doi]
- Low-Cost High-Power Membership Inference AttacksSajjad Zarifzadeh, Philippe Liu, Reza Shokri. [doi]
- Purify Unlearnable Examples via Rate-Constrained Variational AutoencodersYi Yu, Yufei Wang, Song Xia, Wenhan Yang, Shijian Lu, Yap-Peng Tan, Alex C. Kot. [doi]
- No-Regret Reinforcement Learning in Smooth MDPsDavide Maran, Alberto Maria Metelli, Matteo Papini, Marcello Restelli. [doi]
- Auditing Private PredictionKaran Chadha, Matthew Jagielski, Nicolas Papernot, Christopher A. Choquette-Choo, Milad Nasr. [doi]
- DMTG: One-Shot Differentiable Multi-Task GroupingYuan Gao 0015, Shuguo Jiang, Moran Li, Jin-Gang Yu, Gui-Song Xia. [doi]
- Multi-Factor Adaptive Vision Selection for Egocentric Video Question AnsweringHaoyu Zhang, Meng Liu 0006, Zixin Liu, Xuemeng Song, Yaowei Wang 0001, Liqiang Nie. [doi]
- Active Label Correction for Semantic Segmentation with Foundation ModelsHoyoung Kim, Sehyun Hwang, Suha Kwak, Jungseul Ok. [doi]
- H-Consistency Guarantees for RegressionAnqi Mao, Mehryar Mohri, Yutao Zhong 0002. [doi]
- MF-CLR: Multi-Frequency Contrastive Learning Representation for Time SeriesJufang Duan, Wei Zheng, Yangzhou Du, Wenfa Wu, Haipeng Jiang, Hongsheng Qi. [doi]
- Vision Transformers as Probabilistic Expansion from LearngeneQiufeng Wang, Xu Yang, Haokun Chen, Xin Geng 0001. [doi]
- Identifiability Matters: Revealing the Hidden Recoverable Condition in Unbiased Learning to RankMouxiang Chen, Chenghao Liu, Zemin Liu, Zhuo Li, Jianling Sun. [doi]
- A fast algorithm to simulate nonlinear resistive networksBenjamin Scellier. [doi]
- Conformal Prediction with Learned FeaturesShayan Kiyani, George J. Pappas, Hamed Hassani. [doi]
- Major-Minor Mean Field Multi-Agent Reinforcement LearningKai Cui 0001, Christian Fabian, Anam Tahir, Heinz Koeppl. [doi]
- Agent-Specific Effects: A Causal Effect Propagation Analysis in Multi-Agent MDPsStelios Triantafyllou, Aleksa Sukovic, Debmalya Mandal, Goran Radanovic. [doi]
- Learning to Intervene on Concept BottlenecksDavid Steinmann, Wolfgang Stammer, Felix Friedrich, Kristian Kersting. [doi]
- Transformers Implement Functional Gradient Descent to Learn Non-Linear Functions In ContextXiang Cheng, Yuxin Chen, Suvrit Sra. [doi]
- Augmenting Decision with Hypothesis in Reinforcement LearningNguyen Minh Quang, Hady W. Lauw. [doi]
- Catapults in SGD: spikes in the training loss and their impact on generalization through feature learningLibin Zhu, Chaoyue Liu 0001, Adityanarayanan Radhakrishnan, Mikhail Belkin. [doi]
- Exploiting Human-AI Dependence for Learning to DeferZixi Wei, Yuzhou Cao, Lei Feng 0006. [doi]
- Graph2Tac: Online Representation Learning of Formal Math ConceptsLasse Blaauwbroek, Mirek Olsák, Jason Rute, Fidel Ivan Schaposnik Massolo, Jelle Piepenbrock, Vasily Pestun. [doi]
- Improved Generalization of Weight Space Networks via AugmentationsAviv Shamsian, Aviv Navon, David W. Zhang, Yan Zhang, Ethan Fetaya, Gal Chechik, Haggai Maron. [doi]
- SMaRt: Improving GANs with Score Matching RegularityMengfei Xia, Yujun Shen, Ceyuan Yang, Ran Yi, Wenping Wang, Yongjin Liu. [doi]
- Privacy Profiles for Private SelectionAntti Koskela, Rachel Redberg, Yu-Xiang Wang 0003. [doi]
- The Role of Learning Algorithms in Collective ActionOmri Ben-Dov, Jake Fawkes, Samira Samadi, Amartya Sanyal. [doi]
- GeoReasoner: Geo-localization with Reasoning in Street Views using a Large Vision-Language ModelLing Li, Yu Ye 0002, Bingchuan Jiang, Wei Zeng 0004. [doi]
- Configurable Mirror Descent: Towards a Unification of Decision MakingPengdeng Li, Shuxin Li, Chang Yang, Xinrun Wang, Shuyue Hu, Xiao Huang, Hau Chan, Bo An 0001. [doi]
- Position: The Causal Revolution Needs Scientific PragmatismJoshua R. Loftus. [doi]
- Provable Interactive Learning with Hindsight Instruction FeedbackDipendra Misra, Aldo Pacchiano, Robert E. Schapire. [doi]
- Generalization Analysis for Multi-Label LearningYifan Zhang, Min-Ling Zhang. [doi]
- Feature Distribution on Graph Topology Mediates the Effect of Graph Convolution: Homophily PerspectiveSoo Yong Lee, Sunwoo Kim, Fanchen Bu, Jaemin Yoo, Jiliang Tang, Kijung Shin. [doi]
- On Multi-Armed Bandit with Impatient ArmsYuming Shao, Zhixuan Fang. [doi]
- FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement LearningWenzhe Li, Zihan Ding, Seth Karten, Chi Jin 0001. [doi]
- Revealing the Dark Secrets of Extremely Large Kernel ConvNets on RobustnessHonghao Chen, Yurong Zhang, Xiaokun Feng, Xiangxiang Chu, Kaiqi Huang. [doi]
- Generalization Bounds for Causal Regression: Insights, Guarantees and Sensitivity AnalysisDaniel Csillag, Cláudio José Struchiner, Guilherme Tegoni Goedert. [doi]
- To the Max: Reinventing Reward in Reinforcement LearningGrigorii Veviurko, Wendelin Boehmer, Mathijs de Weerdt. [doi]
- StackSight: Unveiling WebAssembly through Large Language Models and Neurosymbolic Chain-of-Thought DecompilationWeike Fang, Zhejian Zhou, Junzhou He, Weihang Wang 0006. [doi]
- Causal Inference out of Control: Estimating Performativity without Treatment RandomizationGary Cheng 0004, Moritz Hardt, Celestine Mendler-Dünner. [doi]
- Verifying message-passing neural networks via topology-based bounds tighteningChristopher Hojny, Shiqiang Zhang, Juan S. Campos, Ruth Misener. [doi]
- Adaptive Observation Cost Control for Variational Quantum EigensolversChristopher J. Anders, Kim Andrea Nicoli, Bingting Wu, Naima Elosegui, Samuele Pedrielli, Lena Funcke, Karl Jansen, Stefan Kühn, Shinichi Nakajima. [doi]
- Efficient Pareto Manifold Learning with Low-Rank StructureWeiyu Chen, James T. Kwok. [doi]
- Position: On the Possibilities of AI-Generated Text DetectionSouradip Chakraborty, Amrit S. Bedi, Sicheng Zhu, Bang An, Dinesh Manocha, Furong Huang. [doi]
- Learning to Predict Mutational Effects of Protein-Protein Interactions by Microenvironment-aware Hierarchical Prompt LearningLirong Wu, Yijun Tian 0001, Haitao Lin, Yufei Huang 0002, Siyuan Li, Nitesh V. Chawla, Stan Z. Li. [doi]
- How Learning by Reconstruction Produces Uninformative Features For PerceptionRandall Balestriero, Yann LeCun. [doi]
- LCA-on-the-Line: Benchmarking Out of Distribution Generalization with Class TaxonomiesJia Shi, Gautam Rajendrakumar Gare, Jinjin Tian, Siqi Chai, Zhiqiu Lin, Arun Balajee Vasudevan, Di Feng, Francesco Ferroni, Shu Kong. [doi]
- Evaluation of Test-Time Adaptation Under Computational Time ConstraintsMotasem Alfarra, Hani Itani, Alejandro Pardo, Shyma Alhuwaider, Merey Ramazanova, Juan Camilo Pérez, Zhipeng Cai, Matthias Müller 0011, Bernard Ghanem. [doi]
- An Iterative Min-Min Optimization Method for Sparse Bayesian LearningYasen Wang, Junlin Li, Zuogong Yue, Ye Yuan 0002. [doi]
- Straight-Through Meets Sparse Recovery: the Support Exploration AlgorithmMimoun Mohamed, François Malgouyres, Valentin Emiya, Caroline Chaux. [doi]
- EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal TokensSunil Hwang, Jaehong Yoon, Youngwan Lee, Sung Ju Hwang. [doi]
- Stochastic Weakly Convex Optimization beyond Lipschitz ContinuityWenzhi Gao, Qi Deng. [doi]
- Boosting Reinforcement Learning with Strongly Delayed Feedback Through Auxiliary Short DelaysQingyuan Wu, Simon Sinong Zhan, Yixuan Wang 0001, Yuhui Wang, Chung-Wei Lin, Chen Lv, Qi Zhu 0002, Jürgen Schmidhuber, Chao Huang 0015. [doi]
- Safe and Robust Subgame Exploitation in Imperfect Information GamesZhenxing Ge, Zheng Xu, Tianyu Ding, Linjian Meng, Bo An 0001, Wenbin Li 0006, Yang Gao 0001. [doi]
- MolCRAFT: Structure-Based Drug Design in Continuous Parameter SpaceYanru Qu, Keyue Qiu, Yuxuan Song, Jingjing Gong, Jiawei Han 0001, Mingyue Zheng, Hao Zhou 0012, Wei-Ying Ma. [doi]
- Integrating Global Context Contrast and Local Sensitivity for Blind Image Quality AssessmentXudong Li, Runze Hu, Jingyuan Zheng, Yan Zhang 0002, Shengchuan Zhang, Xiawu Zheng, Ke Li 0015, Yunhang Shen, Yutao Liu 0002, Pingyang Dai, Rongrong Ji. [doi]
- Online Variational Sequential Monte CarloAlessandro Mastrototaro, Jimmy Olsson. [doi]
- Position: Why We Must Rethink Empirical Research in Machine LearningMoritz Herrmann, F. Julian D. Lange, Katharina Eggensperger, Giuseppe Casalicchio, Marcel Wever, Matthias Feurer, David Rügamer, Eyke Hüllermeier, Anne-Laure Boulesteix, Bernd Bischl. [doi]
- Contrastive Predict-and-Search for Mixed Integer Linear ProgramsTaoan Huang, Aaron M. Ferber, Arman Zharmagambetov, Yuandong Tian, Bistra Dilkina. [doi]
- Prototypical Transformer As Unified Motion LearnersCheng Han, Yawen Lu, Guohao Sun, James Chenhao Liang, Zhiwen Cao, Qifan Wang, Qiang Guan, Sohail A. Dianat, Raghuveer Rao, Tong Geng, Zhiqiang Tao, Dongfang Liu. [doi]
- PID: Prompt-Independent Data Protection Against Latent Diffusion ModelsAng Li, Yichuan Mo, Mingjie Li, Yisen Wang 0001. [doi]
- The Pitfalls of Next-Token PredictionGregor Bachmann, Vaishnavh Nagarajan. [doi]
- Non-stationary Online Convex Optimization with Arbitrary DelaysYuanyu Wan, Chang Yao, Mingli Song, Lijun Zhang 0005. [doi]
- Challenges in Training PINNs: A Loss Landscape PerspectivePratik Rathore, Weimu Lei, Zachary Frangella, Lu Lu 0015, Madeleine Udell. [doi]
- How to Make the Gradients Small Privately: Improved Rates for Differentially Private Non-Convex OptimizationAndrew Lowy, Jonathan R. Ullman, Stephen J. Wright 0001. [doi]
- On the Universality of Volume-Preserving and Coupling-Based Normalizing FlowsFelix Draxler, Stefan Wahl, Christoph Schnörr, Ullrich Köthe. [doi]
- Online Resource Allocation with Non-Stationary CustomersXiaoyue Zhang, Hanzhang Qin, Mabel C. Chou. [doi]
- BAT: Learning to Reason about Spatial Sounds with Large Language ModelsZhisheng Zheng, Puyuan Peng, Ziyang Ma, Xie Chen 0001, Eunsol Choi, David Harwath. [doi]
- NExT-Chat: An LMM for Chat, Detection and SegmentationAo Zhang, Yuan Yao 0013, Wei Ji 0008, Zhiyuan Liu 0001, Tat-Seng Chua. [doi]
- Locally Differentially Private Decentralized Stochastic Bilevel Optimization with Guaranteed Convergence AccuracyZiqin Chen, Yongqiang Wang. [doi]
- Harnessing Neural Unit Dynamics for Effective and Scalable Class-Incremental LearningDepeng Li 0001, Tianqi Wang, Junwei Chen, Wei Dai 0004, Zhigang Zeng. [doi]
- Position: Optimization in SciML Should Employ the Function Space GeometryJohannes Müller, Marius Zeinhofer. [doi]
- A Language Model's Guide Through Latent SpaceDimitri von Rütte, Sotiris Anagnostidis, Gregor Bachmann, Thomas Hofmann. [doi]
- Copula-Nested Spectral Kernel NetworkJinyue Tian, Hui Xue 0002, Yanfang Xue, Pengfei Fang. [doi]
- A Linear Time and Space Local Point Cloud Geometry Encoder via Vectorized Kernel Mixture (VecKM)Dehao Yuan, Cornelia Fermüller, Tahseen Rabbani, Furong Huang, Yiannis Aloimonos. [doi]
- Compressing Large Language Models by Joint Sparsification and QuantizationJinyang Guo, Jianyu Wu, Zining Wang, Jiaheng Liu, Ge Yang, Yifu Ding, Ruihao Gong, Haotong Qin, Xianglong Liu 0001. [doi]
- Efficient Error Certification for Physics-Informed Neural NetworksFrancisco Eiras, Adel Bibi, Rudy Bunel, Krishnamurthy Dj Dvijotham, Philip Torr 0001, M. Pawan Kumar. [doi]
- Provably Efficient Partially Observable Risk-sensitive Reinforcement Learning with Hindsight ObservationTonghe Zhang, Yu Chen, Longbo Huang. [doi]
- TabLog: Test-Time Adaptation for Tabular Data Using Logic RulesWeijieying Ren, Xiaoting Li 0001, Huiyuan Chen, Vineeth Rakesh, Zhuoyi Wang, Mahashweta Das, Vasant G. Honavar. [doi]
- Neural-Kernel Conditional Mean EmbeddingsEiki Shimizu, Kenji Fukumizu, Dino Sejdinovic. [doi]
- Probabilistic Forecasting with Stochastic Interpolants and Föllmer ProcessesYifan Chen, Mark Goldstein, Mengjian Hua, Michael S. Albergo, Nicholas Matthew Boffi, Eric Vanden-Eijnden. [doi]
- BayOTIDE: Bayesian Online Multivariate Time Series Imputation with Functional DecompositionShikai Fang, Qingsong Wen, Yingtao Luo, Shandian Zhe, Liang Sun 0001. [doi]
- Sharp Rates in Dependent Learning Theory: Avoiding Sample Size Deflation for the Square LossIngvar M. Ziemann, Stephen Tu, George J. Pappas, Nikolai Matni. [doi]
- Dissecting Multimodality in VideoQA Transformer Models by Impairing Modality FusionIshaan Singh Rawal, Alexander Matyasko, Shantanu Jaiswal, Basura Fernando, Cheston Tan. [doi]
- LAGMA: LAtent Goal-guided Multi-Agent Reinforcement LearningHyungho Na, Il-Chul Moon. [doi]
- Collaborative Heterogeneous Causal Inference Beyond Meta-analysisTianyu Guo 0004, Sai Praneeth Karimireddy, Michael I. Jordan. [doi]
- EquiPocket: an E(3)-Equivariant Geometric Graph Neural Network for Ligand Binding Site PredictionYang Zhang 0094, Zhewei Wei, Ye Yuan 0001, Chongxuan Li, Wenbing Huang 0001. [doi]
- Sparse and Structured Hopfield NetworksSaul José Rodrigues dos Santos, Vlad Niculae, Daniel C. McNamee, André F. T. Martins. [doi]
- Neural Tangent Kernels for Axis-Aligned Tree EnsemblesRyuichi Kanoh, Mahito Sugiyama. [doi]
- Rethinking Independent Cross-Entropy Loss For Graph-Structured DataRui Miao, Kaixiong Zhou, Yili Wang, Ninghao Liu, Ying Wang 0009, Xin Wang 0035. [doi]
- ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet AccuracyKirill Vishniakov, Zhiqiang Shen, Zhuang Liu 0003. [doi]
- Implicit Representations via Operator LearningSourav Pal, Harshavardhan Adepu, Clinton J. Wang, Polina Golland, Vikas Singh. [doi]
- Grokking Group Multiplication with CosetsDashiell Stander, Qinan Yu, Honglu Fan, Stella Biderman. [doi]
- Multimodal Prototyping for cancer survival predictionAndrew H. Song, Richard J. Chen, Guillaume Jaume, Anurag J. Vaidya, Alexander S. Baras, Faisal Mahmood. [doi]
- Decoupling Feature Extraction and Classification Layers for Calibrated Neural NetworksMikkel Jordahn, Pablo M. Olmos. [doi]
- Can Gaussian Sketching Converge Faster on a Preconditioned Landscape?Yilong Wang, Haishan Ye, Guang Dai, Ivor W. Tsang. [doi]
- Language Models Represent Beliefs of Self and OthersWentao Zhu, Zhining Zhang, Yizhou Wang 0001. [doi]
- Efficient Policy Evaluation with Offline Data Informed Behavior Policy DesignShuze Liu, Shangtong Zhang. [doi]
- Ditto: Quantization-aware Secure Inference of Transformers upon MPCHaoqi Wu, Wenjing Fang, Yancheng Zheng, Junming Ma, Jin Tan, Lei Wang. [doi]
- Scene Graph Generation Strategy with Co-occurrence Knowledge and Learnable Term FrequencyHyeongjin Kim, Sangwon Kim, Dasom Ahn, Jong Taek Lee, Byoung Chul Ko. [doi]
- FairProof : Confidential and Certifiable Fairness for Neural NetworksChhavi Yadav, Amrita Roy Chowdhury 0001, Dan Boneh, Kamalika Chaudhuri. [doi]
- Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative RecommendationsJiaqi Zhai, Lucy Liao, Xing Liu, Yueming Wang, Rui Li, Xuan Cao, Leon Gao, Zhaojie Gong, Fangda Gu, Jiayuan He, Yinghai Lu, Yu Shi. [doi]
- Masked Face Recognition with Generative-to-Discriminative RepresentationsShiming Ge, Weijia Guo, Chenyu Li, Junzheng Zhang, Yong Li, Dan Zeng 0001. [doi]
- Self-Alignment of Large Language Models via Monopolylogue-based Social Scene SimulationXianghe Pang, Shuo Tang, Rui Ye, Yuxin Xiong, Bolun Zhang, Yanfeng Wang, Siheng Chen. [doi]
- Learning in Deep Factor Graphs with Gaussian Belief PropagationSeth Nabarro, Mark van der Wilk, Andrew J. Davison. [doi]
- Solving Poisson Equations using Neural Walk-on-SpheresHong Chul Nam, Julius Berner, Anima Anandkumar. [doi]
- Residual Quantization with Implicit Neural CodebooksIris A. M. Huijben, Matthijs Douze, Matthew J. Muckley, Ruud van Sloun, Jakob Verbeek. [doi]
- OTMatch: Improving Semi-Supervised Learning with Optimal TransportZhiquan Tan, Kaipeng Zheng, Weiran Huang 0001. [doi]
- Stationary Latent Weight Inference for Unreliable Observations from Online Test-Time AdaptationJae Hong Lee, Joon-Hyuk Chang. [doi]
- Neurodegenerative Brain Network Classification via Adaptive Diffusion with Temporal RegularizationHyuna Cho, Jaeyoon Sim, Guorong Wu 0001, Won Hwa Kim. [doi]
- Emergent Representations of Program Semantics in Language Models Trained on ProgramsCharles Jin, Martin C. Rinard. [doi]
- Characterizing Large Language Model Geometry Helps Solve Toxicity Detection and GenerationRandall Balestriero, Romain Cosentino, Sarath Shekkizhar. [doi]
- In-Context Learning Agents Are Asymmetric Belief UpdatersJohannes A. Schubert, Akshay K. Jagadish, Marcel Binz, Eric Schulz. [doi]
- Biharmonic Distance of Graphs and its Higher-Order Variants: Theoretical Properties with Applications to Centrality and ClusteringMitchell Black, Lucy Lin, Weng-Keen Wong, Amir Nayyeri. [doi]
- Path-Guided Particle-based SamplingMingzhou Fan, Ruida Zhou, Chao Tian 0002, Xiaoning Qian. [doi]
- Wasserstein Wormhole: Scalable Optimal Transport Distance with TransformerDoron Haviv, Russell Zhang Kunes, Thomas Dougherty, Cassandra Burdziak, Tal Nawy, Anna Gilbert, Dana Pe'er. [doi]
- Position: Technical Research and Talent is Needed for Effective AI GovernanceAnka Reuel, Lisa Soder, Benjamin Bucknall, Trond Arne Undheim. [doi]
- The Merit of River Network Topology for Neural Flood ForecastingNikolas Kirschstein, Yixuan Sun. [doi]
- Don't Label Twice: Quantity Beats Quality when Comparing Binary Classifiers on a BudgetFlorian E. Dorner, Moritz Hardt. [doi]
- InterpreTabNet: Distilling Predictive Signals from Tabular Data by Salient Feature InterpretationJacob Yoke Hong Si, Wendy Yusi Cheng, Michael Cooper, Rahul G. Krishnan. [doi]
- DOGE: Domain Reweighting with Generalization EstimationSimin Fan, Matteo Pagliardini, Martin Jaggi. [doi]
- Be Your Own Neighborhood: Detecting Adversarial Examples by the Neighborhood Relations Built on Self-Supervised LearningZhiyuan He, Yijun Yang, Pin-Yu Chen, Qiang Xu 0001, Tsung-Yi Ho. [doi]
- MuxServe: Flexible Spatial-Temporal Multiplexing for Multiple LLM ServingJiangfei Duan, Runyu Lu, Haojie Duanmu, Xiuhong Li, Xingcheng Zhang, Dahua Lin, Ion Stoica, Hao Zhang 0108. [doi]
- Re-Dock: Towards Flexible and Realistic Molecular Docking with Diffusion BridgeYufei Huang 0002, Odin Zhang, Lirong Wu, Cheng Tan 0012, Haitao Lin, Zhangyang Gao, Siyuan Li, Stan Z. Li. [doi]
- Borda Regret Minimization for Generalized Linear Dueling BanditsYue Wu, Tao Jin 0002, Qiwei Di, Hao Lou, Farzad Farnoud, Quanquan Gu. [doi]
- Limited Preference Aided Imitation Learning from Imperfect DemonstrationsXingchen Cao, Fan-Ming Luo, Junyin Ye, Tian Xu, Zhilong Zhang, Yang Yu 0001. [doi]
- Dynamic Survival Analysis with Controlled Latent StatesLinus Bleistein, Van Tuan Nguyen, Adeline Fermanian, Agathe Guilloux. [doi]
- Solving Hierarchical Information-Sharing Dec-POMDPs: An Extensive-Form Game ApproachJohan Peralez, Aurélien Delage, Olivier Buffet, Jilles Steeve Dibangoye. [doi]
- StableMask: Refining Causal Masking in Decoder-only TransformerQingyu Yin, Xuzheng He, Xiang Zhuang, Yu Zhao 0009, Jianhua Yao, Xiaoyu Shen, Qiang Zhang. [doi]
- Generalization Error of Graph Neural Networks in the Mean-field RegimeGholamali Aminian, Yixuan He, Gesine Reinert, Lukasz Szpruch, Samuel N. Cohen. [doi]
- Sample-specific Masks for Visual Reprogramming-based PromptingChengyi Cai, Zesheng Ye, Lei Feng, Jianzhong Qi 0001, Feng Liu. [doi]
- Bayesian Regret Minimization in Offline BanditsMarek Petrik, Guy Tennenholtz, Mohammad Ghavamzadeh. [doi]
- Adversarially Robust Deep Multi-View Clustering: A Novel Attack and Defense FrameworkHaonan Huang, GuoXu Zhou, Yanghang Zheng, Yuning Qiu, Andong Wang, Qibin Zhao. [doi]
- Adversarially Robust Hypothesis Transfer LearningYunjuan Wang, Raman Arora. [doi]
- Out-of-Distribution Detection via Deep Multi-Comprehension EnsembleChenhui Xu, Fuxun Yu, Zirui Xu, Nathan Inkawhich, Xiang Chen 0010. [doi]
- Revitalizing Multivariate Time Series Forecasting: Learnable Decomposition with Inter-Series Dependencies and Intra-Series Variations ModelingGuoqi Yu, Jing Zou, Xiaowei Hu 0001, Angelica I. Avilés-Rivero, Jing Qin 0001, Shujun Wang. [doi]
- Density-Softmax: Efficient Test-time Model for Uncertainty Estimation and Robustness under Distribution ShiftsHa Manh Bui, Anqi Liu. [doi]
- Understanding the Impact of Introducing Constraints at Inference Time on Generalization ErrorMasaaki Nishino, Kengo Nakamura 0001, Norihito Yasuda. [doi]
- Gibbs Sampling of Continuous Potentials on a Quantum ComputerArsalan Motamedi, Pooya Ronagh. [doi]
- S3GCL: Spectral, Swift, Spatial Graph Contrastive LearningGuancheng Wan, Yijun Tian 0001, Wenke Huang, Nitesh V. Chawla, Mang Ye. [doi]
- Double Stochasticity Gazes Faster: Snap-Shot Decentralized Stochastic Gradient Tracking MethodsHao Di, Haishan Ye, Xiangyu Chang, Guang Dai, Ivor W. Tsang. [doi]
- Gated Linear Attention Transformers with Hardware-Efficient TrainingSonglin Yang, Bailin Wang, Yikang Shen, Rameswar Panda, Yoon Kim. [doi]
- MathScale: Scaling Instruction Tuning for Mathematical ReasoningZhengyang Tang, Xingxing Zhang, Benyou Wang, Furu Wei. [doi]
- Decouple then Classify: A Dynamic Multi-view Labeling Strategy with Shared and Specific InformationXinhang Wan, Jiyuan Liu 0003, Xinwang Liu 0002, Yi Wen 0001, Hao Yu, Siwei Wang 0001, Shengju Yu, Tianjiao Wan, Jun Wang, En Zhu. [doi]
- How Free is Parameter-Free Stochastic Optimization?Amit Attia, Tomer Koren. [doi]
- Hyperbolic Active Learning for Semantic Segmentation under Domain ShiftLuca Franco, Paolo Mandica, Konstantinos Kallidromitis, Devin Guillory, Yu-Teng Li, Trevor Darrell, Fabio Galasso. [doi]
- Inferring Change Points in High-Dimensional Linear Regression via Approximate Message PassingGabriel Arpino, Xiaoqi Liu, Ramji Venkataramanan. [doi]
- Reinforcement Learning and Regret Bounds for Admission ControlLucas Weber, Ana Busic, Jiamin Zhu. [doi]
- Stochastic Q-learning for Large Discrete Action SpacesFares Fourati, Vaneet Aggarwal, Mohamed-Slim Alouini. [doi]
- Don't be so Negative! Score-based Generative Modeling with Oracle-assisted GuidanceSaeid Naderiparizi, Xiaoxuan Liang 0001, Setareh Cohan, Berend Zwartsenberg, Frank Wood. [doi]
- Bayesian Design Principles for Offline-to-Online Reinforcement LearningHao Hu 0006, Yiqin Yang, Jianing Ye, Chengjie Wu, Ziqing Mai, Yujing Hu, Tangjie Lv, Changjie Fan, Qianchuan Zhao, Chongjie Zhang. [doi]
- Differentiable Distributionally Robust Optimization LayersXutao Ma, Chao Ning, Wenli Du. [doi]
- Self-Driven Entropy Aggregation for Byzantine-Robust Heterogeneous Federated LearningWenke Huang, Zekun Shi, Mang Ye, He Li, Bo Du 0001. [doi]
- Improving Equivariant Graph Neural Networks on Large Geometric Graphs via Virtual Nodes LearningYuelin Zhang, Jiacheng Cen, Jiaqi Han, Zhiqiang Zhang, Jun Zhou, Wenbing Huang 0001. [doi]
- MS-TIP: Imputation Aware Pedestrian Trajectory PredictionPranav Singh Chib, Achintya Nath, Paritosh Kabra, Ishu Gupta, Pravendra Singh. [doi]
- Task-aware Orthogonal Sparse Network for Exploring Shared Knowledge in Continual LearningYusong Hu, De Cheng, Dingwen Zhang, Nannan Wang 0001, Tongliang Liu, Xinbo Gao 0001. [doi]
- Transferring Knowledge From Large Foundation Models to Small Downstream ModelsShikai Qiu, Boran Han, Danielle C. Maddix, Shuai Zhang, Bernie Wang 0001, Andrew Gordon Wilson. [doi]
- Sparse is Enough in Fine-tuning Pre-trained Large Language ModelsWeixi Song, Zuchao Li, Lefei Zhang, Hai Zhao 0001, Bo Du 0001. [doi]
- Dynamic Anisotropic Smoothing for Noisy Derivative-Free OptimizationSam Reifenstein, Timothée G. Leleu, Yoshihisa Yamamoto. [doi]
- Improving fine-grained understanding in image-text pre-trainingIoana Bica, Anastasija Ilic, Matthias Bauer, Goker Erdogan, Matko Bosnjak, Christos Kaplanis, Alexey A. Gritsenko, Matthias Minderer, Charles Blundell, Razvan Pascanu, Jovana Mitrovic. [doi]
- Bayesian Adaptation of Network Depth and Width for Continual LearningJeevan Thapa, Rui Li. [doi]
- Layer-Aware Analysis of Catastrophic Overfitting: Revealing the Pseudo-Robust Shortcut DependencyRunqi Lin, Chaojian Yu, Bo Han 0003, Hang Su 0006, Tongliang Liu. [doi]
- EvoluNet: Advancing Dynamic Non-IID Transfer Learning on GraphsHaohui Wang, Yuzhen Mao, Yujun Yan, Yaoqing Yang, Jianhui Sun, Kevin Choi, Balaji Veeramani, Alison Hu, Edward Bowen, Tyler Cody, Dawei Zhou 0003. [doi]
- Robust Inverse Graphics via Probabilistic InferenceTuan Anh Le, Pavel Sountsov, Matthew Douglas Hoffman, Ben Lee, Brian Patton, Rif A. Saurous. [doi]
- Criterion Collapse and Loss Distribution ControlMatthew J. Holland. [doi]
- Individual Contributions as Intrinsic Exploration Scaffolds for Multi-agent Reinforcement LearningXinran Li, Zifan Liu, Shibo Chen, Jun Zhang 0004. [doi]
- Position: Near to Mid-term Risks and Opportunities of Open-Source Generative AIFrancisco Eiras, Aleksandar Petrov, Bertie Vidgen, Christian Schröder de Witt, Fabio Pizzati, Katherine Elkins, Supratik Mukhopadhyay, Adel Bibi, Botos Csaba, Fabro Steibel, Fazl Barez, Genevieve Smith, Gianluca Guadagni, Jon Chun, Jordi Cabot, Joseph Marvin Imperial, Juan A. Nolazco-Flores, Lori Landay, Matthew Thomas Jackson, Paul Röttger, Philip H. S. Torr, Trevor Darrell, Yong Suk Lee, Jakob N. Foerster. [doi]
- How Spurious Features are Memorized: Precise Analysis for Random and NTK FeaturesSimone Bombari, Marco Mondelli. [doi]
- Contrastive Learning for Clinical Outcome Prediction with Partial Data SourcesMeng Xia, Jonathan Wilson, Benjamin Goldstein 0001, Ricardo Henao. [doi]
- MOMENT: A Family of Open Time-series Foundation ModelsMononito Goswami, Konrad Szafer, Arjun Choudhry, Yifu Cai, Shuo Li, Artur Dubrawski. [doi]
- Positive and Unlabeled Learning with Controlled Probability Boundary FenceChangchun Li, Yuanchao Dai, Lei Feng 0006, Ximing Li 0002, Bing Wang, Jihong OuYang. [doi]
- Exploring the Complexity of Deep Neural Networks through Functional EquivalenceGuohao Shen. [doi]
- Reference Neural Operators: Learning the Smooth Dependence of Solutions of PDEs on Geometric DeformationsZe Cheng, Zhongkai Hao, Xiaoqiang Wang, Jianing Huang, Youjia Wu, Xudan Liu, Yiru Zhao, Songming Liu, Hang Su 0006. [doi]
- SurfPro: Functional Protein Design Based on Continuous SurfaceZhenqiao Song, Tinglin Huang, Lei Li 0005, Wengong Jin. [doi]
- A Human-Inspired Reading Agent with Gist Memory of Very Long ContextsKuang-Huei Lee, Xinyun Chen, Hiroki Furuta, John F. Canny, Ian Fischer. [doi]
- Self-Play Fine-Tuning Converts Weak Language Models to Strong Language ModelsZixiang Chen, Yihe Deng, Huizhuo Yuan, Kaixuan Ji, Quanquan Gu. [doi]
- Triple Changes Estimator for Targeted PoliciesSina Akbari, Negar Kiyavash. [doi]
- AST-T5: Structure-Aware Pretraining for Code Generation and UnderstandingLinyuan Gong, Mostafa Elhoushi, Alvin Cheung. [doi]
- Randomized Confidence Bounds for Stochastic Partial MonitoringMaxime Heuillet, Ola Ahmad, Audrey Durand. [doi]
- Adaptively Learning to Select-Rank in Online PlatformsJingyuan Wang, Perry Dong, Ying Jin, Ruohan Zhan, Zhengyuan Zhou. [doi]
- Listenable Maps for Audio ClassifiersFrancesco Paissan, Mirco Ravanelli, Cem Subakan. [doi]
- High-Probability Convergence for Composite and Distributed Stochastic Minimization and Variational Inequalities with Heavy-Tailed NoiseEduard Gorbunov, Abdurakhmon Sadiev, Marina Danilova, Samuel Horváth, Gauthier Gidel, Pavel E. Dvurechensky, Alexander V. Gasnikov, Peter Richtárik. [doi]
- Distributed Bilevel Optimization with Communication CompressionYutong He, Jie Hu, Xinmeng Huang, Songtao Lu, Bin Wang, Kun Yuan. [doi]
- Subgoal-based Demonstration Learning for Formal Theorem ProvingXueliang Zhao, Wenda Li, Lingpeng Kong. [doi]
- Geometry-Aware Instrumental Variable RegressionHeiner Kremer, Bernhard Schölkopf. [doi]
- High-Order Contrastive Learning with Fine-grained Comparative Levels for Sparse Ordinal Tensor CompletionYu Dai, Junchen Shen, Zijie Zhai, Danlin Liu, Jingyang Chen, Yu Sun, Ping Li, Jie Zhang, Kai Zhang. [doi]
- Feel-Good Thompson Sampling for Contextual Dueling BanditsXuheng Li, Heyang Zhao, Quanquan Gu. [doi]
- Surprisingly Strong Performance Prediction with Neural Graph FeaturesGabriela Kadlecová, Jovita Lukasik, Martin Pilát, Petra Vidnerová, Mahmoud Safari, Roman Neruda, Frank Hutter. [doi]
- Learning-Rate-Free Stochastic Optimization over Riemannian ManifoldsDaniel Dodd, Louis Sharrock, Christopher Nemeth. [doi]
- EAGLE: Speculative Sampling Requires Rethinking Feature UncertaintyYuhui Li, Fangyun Wei, Chao Zhang 0001, Hongyang Zhang 0001. [doi]
- Boximator: Generating Rich and Controllable Motions for Video SynthesisJiawei Wang, Yuchen Zhang, Jiaxin Zou, Yan Zeng, Guoqiang Wei, Liping Yuan, Hang Li. [doi]
- Momentum Particle Maximum LikelihoodJen Ning Lim, Juan Kuntz, Samuel Power, Adam M. Johansen. [doi]
- Can AI Assistants Know What They Don't Know?Qinyuan Cheng, Tianxiang Sun, Xiangyang Liu, Wenwei Zhang, Zhangyue Yin, Shimin Li, Linyang Li, Zhengfu He, Kai Chen 0026, Xipeng Qiu. [doi]
- CurBench: Curriculum Learning BenchmarkYuwei Zhou, Zirui Pan, Xin Wang 0019, Hong Chen, Haoyang Li, Yanwen Huang, Zhixiao Xiong, Fangzhou Xiong, Peiyang Xu, Shengnan Liu, Wenwu Zhu 0001. [doi]
- Language Agent Tree Search Unifies Reasoning, Acting, and Planning in Language ModelsAndy Zhou, Kai Yan, Michal Shlapentokh-Rothman, Haohan Wang, Yu-Xiong Wang. [doi]
- Statistical Test for Attention Maps in Vision TransformersTomohiro Shiraishi, Daiki Miwa, Teruyuki Katsuoka, Vo Nguyen Le Duy, Kouichi Taji, Ichiro Takeuchi. [doi]
- Policy Evaluation for Variance in Average Reward Reinforcement LearningShubhada Agrawal, Prashanth L. A., Siva Theja Maguluri. [doi]
- Can Mamba Learn How To Learn? A Comparative Study on In-Context Learning TasksJongho Park, JaeSeung Park, Zheyang Xiong, Nayoung Lee, Jaewoong Cho, Samet Oymak, Kangwook Lee 0001, Dimitris Papailiopoulos. [doi]
- Active Adaptive Experimental Design for Treatment Effect Estimation with Covariate ChoiceMasahiro Kato, Akihiro Oga, Wataru Komatsubara, Ryo Inokuchi. [doi]
- ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language ModelsZiniu Li, Tian Xu, Yushun Zhang, Zhihang Lin, Yang Yu 0001, Ruoyu Sun 0001, Zhi-Quan Luo. [doi]
- Beyond Sole Strength: Customized Ensembles for Generalized Vision-Language ModelsZhihe Lu, Jiawang Bai, Xin Li 0082, Zeyu Xiao, Xinchao Wang. [doi]
- Open-Domain Text Evaluation via Contrastive Distribution MethodsSidi Lu, Hongyi Liu, Asli Celikyilmaz, Tianlu Wang, Nanyun Peng. [doi]
- Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention CalibrationZhongzhi Yu, Zheng Wang, Yonggan Fu, Huihong Shi, Khalid Shaikh, Yingyan Celine Lin. [doi]
- Dense Reward for Free in Reinforcement Learning from Human FeedbackAlex James Chan, Hao Sun, Samuel Holt, Mihaela van der Schaar. [doi]
- A Unified Framework for Learning with Nonlinear Model Classes from Arbitrary Linear SamplesBen Adcock, Juan M. Cardenas, Nick C. Dexter. [doi]
- Bootstrap AutoEncoders With Contrastive Paradigm for Self-supervised Gaze EstimationYaoming Wang, Jin Li, Wenrui Dai, Bowen Shi, Xiaopeng Zhang 0008, Chenglin Li, Hongkai Xiong. [doi]
- CRUXEval: A Benchmark for Code Reasoning, Understanding and ExecutionAlex Gu, Baptiste Rozière, Hugh James Leather, Armando Solar-Lezama, Gabriel Synnaeve, Sida Wang 0001. [doi]
- SAM as the Guide: Mastering Pseudo-Label Refinement in Semi-Supervised Referring Expression SegmentationDanni Yang, Jiayi Ji, Yiwei Ma, Tianyu Guo 0005, Haowei Wang 0001, Xiaoshuai Sun, Rongrong Ji. [doi]
- Single-Trajectory Distributionally Robust Reinforcement LearningZhipeng Liang, Xiaoteng Ma, José H. Blanchet, Jun Yang 0028, Jiheng Zhang, Zhengyuan Zhou. [doi]
- Language Agents with Reinforcement Learning for Strategic Play in the Werewolf GameZelai Xu, Chao Yu 0005, Fei Fang 0001, Yu Wang 0002, Yi Wu. [doi]
- Revisiting Character-level Adversarial Attacks for Language ModelsElías Abad-Rocamora, Yongtao Wu, Fanghui Liu 0001, Grigorios Chrysos 0002, Volkan Cevher. [doi]
- Low-Rank Similarity Mining for Multimodal Dataset DistillationYue Xu, Zhilin Lin, Yusong Qiu, Cewu Lu, Yong-Lu Li 0001. [doi]
- Video-of-Thought: Step-by-Step Video Reasoning from Perception to CognitionHao Fei 0001, Shengqiong Wu, Wei Ji 0008, Hanwang Zhang, Meishan Zhang, Mong-Li Lee, Wynne Hsu. [doi]
- Towards Modular LLMs by Building and Reusing a Library of LoRAsOleksiy Ostapenko, Zhan Su, Edoardo M. Ponti, Laurent Charlin, Nicolas Le Roux, Lucas Caccia, Alessandro Sordoni. [doi]
- Timer: Generative Pre-trained Transformers Are Large Time Series ModelsYong Liu, Haoran Zhang, Chenyu Li, Xiangdong Huang, Jianmin Wang 0001, Mingsheng Long. [doi]
- FreeBind: Free Lunch in Unified Multimodal Space via Knowledge FusionZehan Wang 0001, Ziang Zhang, Xize Cheng, Rongjie Huang, Luping Liu, Zhenhui Ye, Haifeng Huang, Yang Zhao, Tao Jin 0004, Peng Gao, Zhou Zhao. [doi]
- A Near-Linear Time Approximation Algorithm for Beyond-Worst-Case Graph ClusteringVincent Cohen-Addad, Tommaso d'Orsi, Aida Mousavifar. [doi]
- High-Dimensional Kernel Methods under Covariate Shift: Data-Dependent Implicit RegularizationYihang Chen, Fanghui Liu 0001, Taiji Suzuki, Volkan Cevher. [doi]
- Translation Equivariant Transformer Neural ProcessesMatthew Ashman, Cristiana Diaconu, Junhyuck Kim, Lakee Sivaraya, Stratis Markou, James Requeima, Wessel P. Bruinsma, Richard E. Turner. [doi]
- What Improves the Generalization of Graph Transformers? A Theoretical Dive into the Self-attention and Positional EncodingHongkang Li, Meng Wang 0003, Tengfei Ma, Sijia Liu 0001, Zaixi Zhang, Pin-Yu Chen. [doi]
- Asymptotics of Learning with Deep Structured (Random) FeaturesDominik Schröder, Daniil Dmitriev, Hugo Cui, Bruno Loureiro. [doi]
- A Sober Look at LLMs for Material Discovery: Are They Actually Good for Bayesian Optimization Over Molecules?Agustinus Kristiadi, Felix Strieth-Kalthoff, Marta Skreta, Pascal Poupart, Alán Aspuru-Guzik, Geoff Pleiss. [doi]
- Rethinking Adversarial Robustness in the Context of the Right to be ForgottenChenxu Zhao, Wei Qian, Yangyi Li, Aobo Chen, Mengdi Huai. [doi]
- Learning Linear Block Error Correction CodesYoni Choukroun, Lior Wolf. [doi]
- Federated Full-Parameter Tuning of Billion-Sized Language Models with Communication Cost under 18 KilobytesZhen Qin, Daoyuan Chen, Bingchen Qian, Bolin Ding, Yaliang Li, ShuiGuang Deng. [doi]
- Simulation of Graph Algorithms with Looped TransformersArtur Back de Luca, Kimon Fountoulakis. [doi]
- Bounding the Excess Risk for Linear Models Trained on Marginal-Preserving, Differentially-Private, Synthetic DataYvonne Zhou, Mingyu Liang, Ivan Brugere, Danial Dervovic, Antigoni Polychroniadou, Min Wu 0001, Dana Dachman-Soled. [doi]
- Q-value Regularized Transformer for Offline Reinforcement LearningShengchao Hu, Ziqing Fan, Chaoqin Huang, Li Shen 0008, Ya Zhang 0002, Yanfeng Wang, Dacheng Tao. [doi]
- Position: Explain to Question not to JustifyPrzemyslaw Biecek, Wojciech Samek. [doi]
- Joint Composite Latent Space Bayesian OptimizationNatalie Maus, Zhiyuan (Jerry) Lin, Maximilian Balandat, Eytan Bakshy. [doi]
- Bidirectional Reciprocative Information Communication for Few-Shot Semantic SegmentationYuanwei Liu, Junwei Han, Xiwen Yao, Salman Khan 0001, Hisham Cholakkal, Rao Muhammad Anwer, Nian Liu, Fahad Shahbaz Khan. [doi]
- Distribution Alignment Optimization through Neural Collapse for Long-tailed ClassificationJintong Gao, He Zhao 0001, Dandan Guo, Hongyuan Zha. [doi]
- Position: On the Societal Impact of Open Foundation ModelsSayash Kapoor, Rishi Bommasani, Kevin Klyman, Shayne Longpre, Ashwin Ramaswami, Peter Cihon, Aspen K. Hopkins, Kevin Bankston, Stella Biderman, Miranda Bogen, Rumman Chowdhury, Alex Engler, Peter Henderson 0002, Yacine Jernite, Seth Lazar, Stefano Maffulli, Alondra Nelson, Joelle Pineau, Aviya Skowron, Dawn Song, Victor Storchan, Daniel Zhang, Daniel E. Ho, Percy Liang, Arvind Narayanan. [doi]
- SqueezeLLM: Dense-and-Sparse QuantizationSehoon Kim, Coleman Hooper, Amir Gholami, Zhen Dong, Xiuyu Li, Sheng Shen, Michael W. Mahoney, Kurt Keutzer. [doi]
- Position: Categorical Deep Learning is an Algebraic Theory of All ArchitecturesBruno Gavranovic, Paul Lessard, Andrew Joseph Dudzik, Tamara von Glehn, João Guilherme Madeira Araújo, Petar Velickovic. [doi]
- IM-3D: Iterative Multiview Diffusion and Reconstruction for High-Quality 3D GenerationLuke Melas-Kyriazi, Iro Laina, Christian Rupprecht 0001, Natalia Neverova, Andrea Vedaldi, Oran Gafni, Filippos Kokkinos. [doi]
- Prompting is a Double-Edged Sword: Improving Worst-Group Robustness of Foundation ModelsAmrith Setlur, Saurabh Garg, Virginia Smith, Sergey Levine. [doi]
- Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under CompressionJunyuan Hong, Jinhao Duan, Chenhui Zhang, Zhangheng Li, Chulin Xie, Kelsey Lieberman, James Diffenderfer, Brian R. Bartoldson, Ajay Kumar Jaiswal, Kaidi Xu, Bhavya Kailkhura, Dan Hendrycks, Dawn Song, Zhangyang Wang, Bo Li 0026. [doi]
- Learning Latent Space Hierarchical EBM Diffusion ModelsJiali Cui, Tian Han 0001. [doi]
- Compositional Image Decomposition with Diffusion ModelsJocelin Su, Nan Liu 0010, Yanbo Wang, Joshua B. Tenenbaum, Yilun Du. [doi]
- Random Latent Exploration for Deep Reinforcement LearningSrinath Mahankali, Zhang-Wei Hong, Ayush Sekhari, Alexander Rakhlin, Pulkit Agrawal 0001. [doi]
- LEVI: Generalizable Fine-tuning via Layer-wise Ensemble of Different ViewsYuji Roh, Qingyun Liu, Huan Gui, Zhe Yuan, Yujin Tang, Steven Euijong Whang, Liang Liu, Shuchao Bi, Lichan Hong, Ed H. Chi, Zhe Zhao 0001. [doi]
- Sparse Inducing Points in Deep Gaussian Processes: Enhancing Modeling with Denoising Diffusion Variational InferenceJian Xu, Delu Zeng, John W. Paisley. [doi]
- New Sample Complexity Bounds for Sample Average Approximation in Heavy-Tailed Stochastic ProgrammingHongcheng Liu, Jindong Tong. [doi]
- ExCP: Extreme LLM Checkpoint Compression via Weight-Momentum Joint ShrinkingWenshuo Li, Xinghao Chen 0001, Han Shu, Yehui Tang, Yunhe Wang 0001. [doi]
- LoRAP: Transformer Sub-Layers Deserve Differentiated Structured Compression for Large Language ModelsGuangyan Li, Yongqiang Tang, Wensheng Zhang 0002. [doi]
- StyDeSty: Min-Max Stylization and Destylization for Single Domain GeneralizationSonghua Liu, Xin Jin, Xingyi Yang, Jingwen Ye, Xinchao Wang. [doi]
- Quasi-Monte Carlo Features for Kernel ApproximationZhen Huang, Jiajin Sun, Yian Huang. [doi]
- Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High SparsityLu Yin 0006, You Wu, Zhenyu Zhang 0015, Cheng-Yu Hsieh, Yaqing Wang, Yiling Jia, Gen Li, Ajay Kumar Jaiswal, Mykola Pechenizkiy, Yi Liang, Michael Bendersky, Zhangyang Wang, Shiwei Liu 0003. [doi]
- EvIL: Evolution Strategies for Generalisable Imitation LearningSilvia Sapora, Gokul Swamy, Chris Lu 0001, Yee Whye Teh, Jakob Nicolaus Foerster. [doi]
- Online Matrix Completion: A Collaborative Approach with Hott ItemsDheeraj Baby, Soumyabrata Pal. [doi]
- Uniformly Stable Algorithms for Adversarial Training and BeyondJiancong Xiao, Jiawei Zhang 0007, Zhi-Quan Luo, Asuman E. Ozdaglar. [doi]
- Tilt your Head: Activating the Hidden Spatial-Invariance of ClassifiersJohann Schmidt, Sebastian Stober. [doi]
- OLLIE: Imitation Learning from Offline Pretraining to Online FinetuningSheng Yue, Xingyuan Hua, Ju Ren 0001, Sen Lin 0001, Junshan Zhang, Yaoxue Zhang. [doi]
- FedCal: Achieving Local and Global Calibration in Federated Learning via Aggregated Parameterized ScalerHongyi Peng, Han Yu 0001, Xiaoli Tang, Xiaoxiao Li. [doi]
- Magicoder: Empowering Code Generation with OSS-InstructYuxiang Wei 0003, Zhe Wang, Jiawei Liu 0004, Yifeng Ding, Lingming Zhang 0001. [doi]
- CKGConv: General Graph Convolution with Continuous KernelsLiheng Ma, Soumyasundar Pal, Yitian Zhang, Jiaming Zhou, Yingxue Zhang 0001, Mark Coates. [doi]
- A Fresh Take on Stale Embeddings: Improving Dense Retriever Training with Corrector NetworksNicholas Monath, Will Sussman Grathwohl, Michael Boratko, Rob Fergus, Andrew McCallum, Manzil Zaheer. [doi]
- How Universal Polynomial Bases Enhance Spectral Graph Neural Networks: Heterophily, Over-smoothing, and Over-squashingKeke Huang, Yu Guang Wang 0001, Ming Li 0065, Pietro Lio. [doi]
- Nesting Particle Filters for Experimental Design in Dynamical SystemsSahel Iqbal, Adrien Corenflos, Simo Särkkä, Hany Abdulsamad. [doi]
- Skill Set Optimization: Reinforcing Language Model Behavior via Transferable SkillsKolby Nottingham, Bodhisattwa Prasad Majumder, Bhavana Dalvi Mishra, Sameer Singh 0001, Peter Clark, Roy Fox. [doi]
- Learning Useful Representations of Recurrent Neural Network Weight MatricesVincent Herrmann, Francesco Faccio, Jürgen Schmidhuber. [doi]
- Identification and Estimation for Nonignorable Missing Data: A Data Fusion ApproachZixiao Wang, AmirEmad Ghassami, Ilya Shpitser. [doi]
- A Bayesian Approach to Online PlanningNir Greshler, David Ben-Eli, Carmel Rabinovitz, Gabi Guetta, Liran Gispan, Guy Zohar, Aviv Tamar. [doi]
- Unveiling Privacy, Memorization, and Input Curvature LinksDeepak Ravikumar, Efstathia Soufleri, Abolfazl Hashemi, Kaushik Roy 0001. [doi]
- Implicit Bias of AdamW: ℓ∞-Norm Constrained OptimizationShuo Xie, Zhiyuan Li. [doi]
- What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formationAaditya K. Singh, Ted Moskovitz, Felix Hill, Stephanie C. Y. Chan, Andrew M. Saxe. [doi]
- A Space Group Symmetry Informed Network for O(3) Equivariant Crystal Tensor PredictionKeqiang Yan, Alexandra Saxton, Xiaofeng Qian, Xiaoning Qian, Shuiwang Ji. [doi]
- Reshape and Adapt for Output Quantization (RAOQ): Quantization-aware Training for In-memory Computing SystemsBonan Zhang, Chia-Yu Chen, Naveen Verma. [doi]
- Probabilistic Modeling of Interpersonal Coordination ProcessesPaulo Soares, Adarsh Pyarelal, Meghavarshini Krishnaswamy, Emily Butler, Kobus Barnard. [doi]
- On the Identifiability of Switching Dynamical SystemsCarles Balsells Rodas, Yixin Wang, Yingzhen Li. [doi]
- How to Explore with Belief: State Entropy Maximization in POMDPsRiccardo Zamboni, Duilio Cirino, Marcello Restelli, Mirco Mutti. [doi]
- Caduceus: Bi-Directional Equivariant Long-Range DNA Sequence ModelingYair Schiff, Chia-Hsiang Kao, Aaron Gokaslan, Tri Dao, Albert Gu, Volodymyr Kuleshov. [doi]
- Parameter Efficient Quasi-Orthogonal Fine-Tuning via Givens RotationXinyu Ma, Xu Chu, ZhiBang Yang, Yang Lin, Xin Gao, Junfeng Zhao 0001. [doi]
- Best of Both Worlds Guarantees for Smoothed Online Quadratic OptimizationNeelkamal Bhuyan, Debankur Mukherjee, Adam Wierman. [doi]
- Bias of Stochastic Gradient Descent or the Architecture: Disentangling the Effects of Overparameterization of Neural NetworksAmit Peleg, Matthias Hein 0001. [doi]
- Universality of Linear Recurrences Followed by Non-linear Projections: Finite-Width Guarantees and Benefits of Complex EigenvaluesAntonio Orvieto, Soham De, Caglar Gulcehre, Razvan Pascanu, Samuel L. Smith. [doi]
- Consistent Long-Term Forecasting of Ergodic Dynamical SystemsVladimir R. Kostic, Karim Lounici, Prune Inzerilli, Pietro Novelli, Massimiliano Pontil. [doi]
- Energy-Guided Diffusion Sampling for Offline-to-Online Reinforcement LearningXu-Hui Liu, Tian-Shuo Liu, Shengyi Jiang, Ruifeng Chen 0003, Zhilong Zhang, Xinwei Chen, Yang Yu 0001. [doi]
- Evolution-Inspired Loss Functions for Protein Representation LearningChengYue Gong, Adam R. Klivans, James Loy, Tianlong Chen, Qiang Liu 0001, Daniel Jesus Diaz. [doi]
- Self-Attention through Kernel-Eigen Pair Sparse Variational Gaussian ProcessesYingyi Chen, Qinghua Tao, Francesco Tonin, Johan A. K. Suykens. [doi]
- Harnessing the Power of Neural Operators with Automatically Encoded Conservation LawsNing Liu, Yiming Fan, Xianyi Zeng, Milan Klöwer, Lu Zhang, Yue Yu. [doi]
- Constrained Ensemble Exploration for Unsupervised Skill DiscoveryChenjia Bai, Rushuai Yang, Qiaosheng Zhang, Kang Xu, Yi Chen, Ting Xiao, Xuelong Li 0001. [doi]
- UGrid: An Efficient-And-Rigorous Neural Multigrid Solver for Linear PDEsXi Han, Fei Hou, Hong Qin 0001. [doi]
- Inexact Newton-type Methods for Optimisation with Nonnegativity ConstraintsOscar Smee, Fred Roosta. [doi]
- Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement LearningMichael T. Matthews, Michael Beukman, Benjamin Ellis, Mikayel Samvelyan, Matthew Thomas Jackson, Samuel Coward, Jakob Nicolaus Foerster. [doi]
- Multicalibration for Confidence Scoring in LLMsGianluca Detommaso, Martin Bertran Lopez, Riccardo Fogliato, Aaron Roth 0001. [doi]
- MEMORYLLM: Towards Self-Updatable Large Language ModelsYu Wang, YiFan Gao, Xiusi Chen, Haoming Jiang, Shiyang Li, Jingfeng Yang, Qingyu Yin, Zheng Li, Xian Li, Bing Yin, Jingbo Shang, Julian J. McAuley. [doi]
- Multiply-Robust Causal Change AttributionVictor Quintas-Martinez, Mohammad Taha Bahadori, Eduardo Santiago, Jeff Mu, David Heckerman. [doi]
- Refined Coreset Selection: Towards Minimal Coreset Size under Model Performance ConstraintsXiaobo Xia, Jiale Liu, Shaokun Zhang, Qingyun Wu, Hongxin Wei, Tongliang Liu. [doi]
- Faster Adaptive Decentralized Learning AlgorithmsFeihu Huang, Jianyu Zhao. [doi]
- Improving Neural Logic Machines via Failure ReflectionZhiming Li, Yushi Cao, Yan Zheng 0002, Xu Liu, Bozhi Wu, Tianlin Li, XiuFeng Xu, Junzhe Jiang, Yon Shin Teo, Shang-Wei Lin 0001, Yang Liu 0003. [doi]
- Optimization without Retraction on the Random Generalized Stiefel ManifoldSimon Vary, Pierre Ablin, Bin Gao 0007, Pierre-Antoine Absil. [doi]
- Causal Bandits: The Pareto Optimal Frontier of Adaptivity, a Reduction to Linear Bandits, and Limitations around Unknown MarginalsZiyi Liu, Idan Attias, Daniel M. Roy 0001. [doi]
- Variational Schrödinger Diffusion ModelsWei Deng 0002, Weijian Luo, Yixin Tan, Marin Bilos, Yu Chen, Yuriy Nevmyvaka, Ricky T. Q. Chen. [doi]
- Recurrent Distance Filtering for Graph Representation LearningYuhui Ding, Antonio Orvieto, Bobby He, Thomas Hofmann. [doi]
- Position: Open-Endedness is Essential for Artificial Superhuman IntelligenceEdward Hughes 0001, Michael D. Dennis, Jack Parker-Holder, Feryal M. P. Behbahani, Aditi Mavalankar, Yuge Shi, Tom Schaul, Tim Rocktäschel. [doi]
- Uniform Memory Retrieval with Larger Capacity for Modern Hopfield ModelsDennis Wu, Jerry Yao-Chieh Hu, Teng-Yun Hsiao, Han Liu. [doi]
- SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language ModelsDongyang Liu, Renrui Zhang, Longtian Qiu, Siyuan Huang, Weifeng Lin, Shitian Zhao, Shijie Geng, Ziyi Lin, Peng Jin, Kaipeng Zhang, Wenqi Shao, Chao Xu, Conghui He, Junjun He, Hao Shao, Pan Lu, Yu Qiao 0001, Hongsheng Li 0001, Peng Gao 0007. [doi]
- Trust Regions for Explanations via Black-Box Probabilistic CertificationAmit Dhurandhar, Swagatam Haldar, Dennis Wei, Karthikeyan Natesan Ramamurthy. [doi]
- Unsupervised Evaluation of Code LLMs with Round-Trip CorrectnessMiltiadis Allamanis, Sheena Panthaplackel, Pengcheng Yin. [doi]
- APT: Adaptive Pruning and Tuning Pretrained Language Models for Efficient Training and InferenceBowen Zhao, Hannaneh Hajishirzi, Qingqing Cao. [doi]
- DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models (Exemplified as A Video Agent)Zongxin Yang, Guikun Chen, Xiaodi Li, Wenguan Wang, Yi Yang 0001. [doi]
- Barrier Algorithms for Constrained Non-Convex OptimizationPavel E. Dvurechensky, Mathias Staudigl. [doi]
- A Neural-Preconditioned Poisson Solver for Mixed Dirichlet and Neumann Boundary ConditionsKai Weixian Lan, Elias Gueidon, Ayano Kaneda, Julian Panetta, Joseph Teran. [doi]
- Finite Time Logarithmic Regret Bounds for Self-Tuning RegulationRahul Singh 0001, Akshay Mete, Avik Kar, Panganamala R. Kumar. [doi]
- Graph-enhanced Large Language Models in Asynchronous Plan ReasoningFangru Lin, Emanuele La Malfa, Valentin Hofmann, Elle Michelle Yang, Anthony G. Cohn 0001, Janet B. Pierrehumbert. [doi]
- Position: Considerations for Differentially Private Learning with Large-Scale Public PretrainingFlorian Tramèr, Gautam Kamath 0001, Nicholas Carlini. [doi]
- KernelSHAP-IQ: Weighted Least Square Optimization for Shapley InteractionsFabian Fumagalli, Maximilian Muschalik, Patrick Kolpaczki, Eyke Hüllermeier, Barbara Hammer. [doi]
- A Provably Effective Method for Pruning Experts in Fine-tuned Sparse Mixture-of-ExpertsMohammed Nowaz Rabbani Chowdhury, Meng Wang 0003, Kaoutar El Maghraoui, Naigang Wang, Pin-Yu Chen, Christopher D. Carothers. [doi]
- MH-pFLID: Model Heterogeneous personalized Federated Learning via Injection and Distillation for Medical Data AnalysisLuyuan Xie, Manqing Lin, Tianyu Luan, Cong Li, Yuejian Fang, Qingni Shen, Zhonghai Wu. [doi]
- Iterative Search Attribution for Deep Neural NetworksZhiyu Zhu, Huaming Chen, Xinyi Wang 0005, Jiayu Zhang, Zhibo Jin, Jason Xue, Jun Shen 0001. [doi]
- Unlock the Cognitive Generalization of Deep Reinforcement Learning via Granular Ball RepresentationJiashun Liu, Jianye Hao, Yi Ma, Shuyin Xia. [doi]
- Large Scale Dataset Distillation with Domain ShiftNoel Loo, Alaa Maalouf, Ramin M. Hasani, Mathias Lechner, Alexander Amini, Daniela Rus. [doi]
- Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine TranslationHaoran Xu, Amr Sharaf, Yunmo Chen, Weiting Tan, Lingfeng Shen, Benjamin Van Durme, Kenton Murray, Young-Jin Kim 0001. [doi]
- Collapse-Aware Triplet Decoupling for Adversarially Robust Image RetrievalQiwei Tian, Chenhao Lin, Zhengyu Zhao 0001, Qian Li 0024, Chao Shen 0001. [doi]
- Finite-Time Convergence and Sample Complexity of Actor-Critic Multi-Objective Reinforcement LearningTianchen Zhou, Hairi, Haibo Yang 0001, Jia Liu 0002, Tian Tong, Fan Yang, Michinari Momma, Yan Gao. [doi]
- Sample Complexity Bounds for Estimating Probability Divergences under InvariancesBehrooz Tahmasebi, Stefanie Jegelka. [doi]
- A Tensor Decomposition Perspective on Second-order RNNsMaude Lizaire, Michael Rizvi-Martel, Marawan Gamal Abdel Hameed, Guillaume Rabusseau. [doi]
- Stealthy Imitation: Reward-guided Environment-free Policy StealingZhixiong Zhuang, Maria-Irina Nicolae, Mario Fritz. [doi]
- Efficient Algorithms for Sum-Of-Minimum OptimizationLisang Ding, Ziang Chen, Xinshang Wang, Wotao Yin. [doi]
- Observable Propagation: Uncovering Feature Vectors in TransformersJacob Dunefsky, Arman Cohan. [doi]
- Partial Multi-View Multi-Label Classification via Semantic Invariance Learning and Prototype ModelingChengliang Liu 0003, Gehui Xu, Jie Wen 0001, Yabo Liu, Chao Huang 0008, Yong Xu 0001. [doi]
- Differentially Private Bias-Term Fine-tuning of Foundation ModelsZhiqi Bu, Yu-Xiang Wang 0003, Sheng Zha, George Karypis. [doi]
- Model Tailor: Mitigating Catastrophic Forgetting in Multi-modal Large Language ModelsDidi Zhu, Zhongyi Sun 0002, Zexi Li 0001, Tao Shen 0002, Ke Yan, Shouhong Ding, Chao Wu 0001, Kun Kuang. [doi]
- On Computational Limits of Modern Hopfield Models: A Fine-Grained Complexity AnalysisJerry Yao-Chieh Hu, Thomas Lin, Zhao Song 0002, Han Liu. [doi]
- Image Clustering with External GuidanceYunfan Li 0003, Peng Hu 0002, Dezhong Peng, Jiancheng Lv 0001, Jianping Fan 0007, Xi Peng 0001. [doi]
- ArtWhisperer: A Dataset for Characterizing Human-AI Interactions in Artistic CreationsKailas Vodrahalli, James Zou 0001. [doi]
- Plug-in Performative OptimizationLicong Lin, Tijana Zrnic. [doi]
- Neural Collapse in Multi-label Learning with Pick-all-label LossPengyu Li, Xiao Li, Yutong Wang, Qing Qu. [doi]
- Towards AutoAI: Optimizing a Machine Learning System with Black-box and Differentiable ComponentsZhiliang Chen, Chuan-Sheng Foo, Bryan Kian Hsiang Low. [doi]
- Causal-IQA: Towards the Generalization of Image Quality Assessment Based on Causal InferenceYan Zhong, Xingyu Wu, Li Zhang, Chenxi Yang, Tingting Jiang 0001. [doi]
- LSEnet: Lorentz Structural Entropy Neural Network for Deep Graph ClusteringLi Sun 0008, Zhenhao Huang, Hao Peng 0001, Yujie Wang, Chunyang Liu, Philip S. Yu. [doi]
- DySLIM: Dynamics Stable Learning by Invariant Measure for Chaotic SystemsYair Schiff, Zhong Yi Wan, Jeffrey B. Parker, Stephan Hoyer, Volodymyr Kuleshov, Fei Sha, Leonardo Zepeda-Núñez. [doi]
- Nash Learning from Human FeedbackRémi Munos, Michal Valko, Daniele Calandriello, Mohammad Gheshlaghi Azar, Mark Rowland, Zhaohan Daniel Guo, Yunhao Tang, Matthieu Geist, Thomas Mesnard, Côme Fiegel, Andrea Michi, Marco Selvi, Sertan Girgin, Nikola Momchev, Olivier Bachem, Daniel J. Mankowitz, Doina Precup, Bilal Piot. [doi]
- Attention Meets Post-hoc Interpretability: A Mathematical PerspectiveGianluigi Lopardo, Frédéric Precioso, Damien Garreau. [doi]
- Unsupervised Concept Discovery Mitigates Spurious CorrelationsMd Rifat Arefin, Yan Zhang, Aristide Baratin, Francesco Locatello, Irina Rish, Dianbo Liu, Kenji Kawaguchi. [doi]
- Online bipartite matching with imperfect adviceDavin Choo, Themistoklis Gouleakis, Chun Kai Ling, Arnab Bhattacharyya 0001. [doi]
- Learning from Memory: Non-Parametric Memory Augmented Self-Supervised Learning of Visual FeaturesThalles Silva 0001, Hélio Pedrini, Adín Ramírez Rivera. [doi]
- Sample Average Approximation for Conditional Stochastic Optimization with Dependent DataYafei Wang, Bo Pan, Mei Li, Jianya Lu, Lingchen Kong, Bei Jiang, Linglong Kong. [doi]
- Contextualized Policy Recovery: Modeling and Interpreting Medical Decisions with Adaptive Imitation LearningJannik Deuschel, Caleb Ellington, Yingtao Luo, Benjamin J. Lengerich, Pascal Friederich, Eric P. Xing. [doi]
- Generating Chain-of-Thoughts with a Pairwise-Comparison Approach to Searching for the Most Promising Intermediate ThoughtZhen-yu Zhang, Siwei Han, Huaxiu Yao, Gang Niu 0001, Masashi Sugiyama. [doi]
- AegisFL: Efficient and Flexible Privacy-Preserving Byzantine-Robust Cross-silo Federated LearningDong Chen, Hongyuan Qu, Guangwu Xu. [doi]
- Deep Neural Room Acoustics PrimitiveYuhang He, Anoop Cherian, Gordon Wichern, Andrew Markham. [doi]
- SiBBlInGS: Similarity-driven Building-Block Inference using Graphs across StatesNoga Mudrik, Gal Mishne, Adam S. Charles. [doi]
- Leveraging Self-Consistency for Data-Efficient Amortized Bayesian InferenceMarvin Schmitt, Desi R. Ivanova, Daniel Habermann, Ullrich Köthe, Paul-Christian Bürkner, Stefan T. Radev. [doi]
- Foundations of Testing for Finite-Sample Causal DiscoveryTom Yan, Ziyu Xu, Zachary Chase Lipton. [doi]
- Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMsLing Yang 0006, Zhaochen Yu, Chenlin Meng, Minkai Xu, Stefano Ermon, Bin Cui 0001. [doi]
- Multigroup RobustnessLunjia Hu, Charlotte Peale, Judy Hanwen Shen. [doi]
- Latent variable model for high-dimensional point process with structured missingnessMaksim Sinelnikov, Manuel Haussmann, Harri Lähdesmäki. [doi]
- ReGAL: Refactoring Programs to Discover Generalizable AbstractionsElias Stengel-Eskin, Archiki Prasad, Mohit Bansal. [doi]
- High-Dimensional Geometric Streaming for Nearly Low Rank DataHossein Esfandiari, Praneeth Kacham, Vahab Mirrokni, David P. Woodruff, Peilin Zhong. [doi]
- Exploring the Low-Pass Filtering Behavior in Image Super-ResolutionHaoyu Deng, Zijing Xu, Yule Duan, Xiao Wu, Wenjie Shu, Liang-Jian Deng. [doi]
- Prometheus: Out-of-distribution Fluid Dynamics Modeling with Disentangled Graph ODEHao Wu, Huiyuan Wang, Kun Wang, Weiyan Wang, Changan Ye, Yangyu Tao, Chong Chen, Xian-Sheng Hua, Xiao Luo 0001. [doi]
- Faster Streaming and Scalable Algorithms for Finding Directed Dense Subgraphs in Large GraphsSlobodan Mitrovic, Theodore Pan. [doi]
- ΦFlow: Differentiable Simulations for PyTorch, TensorFlow and JaxPhilipp Holl, Nils Thuerey. [doi]
- CosPGD: an efficient white-box adversarial attack for pixel-wise prediction tasksShashank Agnihotri, Steffen Jung 0001, Margret Keuper. [doi]
- SIN: Selective and Interpretable Normalization for Long-Term Time Series ForecastingLu Han, Han-Jia Ye, De-Chuan Zhan. [doi]
- Generalizing Orthogonalization for Models with Non-LinearitiesDavid Rügamer, Chris Kolb, Tobias Weber, Lucas Kook, Thomas Nagler. [doi]
- Multi-Agent Reinforcement Learning Meets Leaf Sequencing in RadiotherapyRiqiang Gao, Florin-Cristian Ghesu, Simon Arberet, Shahab Basiri, Esa Kuusela, Martin Kraus, Dorin Comaniciu, Ali Kamen. [doi]
- Position: Bayesian Deep Learning is Needed in the Age of Large-Scale AITheodore Papamarkou, Maria Skoularidou, Konstantina Palla, Laurence Aitchison, Julyan Arbel, David B. Dunson, Maurizio Filippone, Vincent Fortuin, Philipp Hennig, José Miguel Hernández-Lobato, Aliaksandr Hubin, Alexander Immer, Theofanis Karaletsos, Mohammad Emtiyaz Khan, Agustinus Kristiadi, Yingzhen Li, Stephan Mandt, Christopher Nemeth, Michael A. Osborne, Tim G. J. Rudner, David Rügamer, Yee Whye Teh, Max Welling, Andrew Gordon Wilson, Ruqi Zhang. [doi]
- SceneCraft: An LLM Agent for Synthesizing 3D Scenes as Blender CodeZiniu Hu, Ahmet Iscen, Aashi Jain, Thomas Kipf, Yisong Yue, David A. Ross, Cordelia Schmid, Alireza Fathi. [doi]
- Libra: Building Decoupled Vision System on Large Language ModelsYifan Xu 0008, Xiaoshan Yang, Yaguang Song, Changsheng Xu. [doi]
- Efficient Contrastive Learning for Fast and Accurate Inference on GraphsTeng Xiao, Huaisheng Zhu, Zhiwei Zhang, Zhimeng Guo, Charu C. Aggarwal, Suhang Wang, Vasant G. Honavar. [doi]
- SPABA: A Single-Loop and Probabilistic Stochastic Bilevel Algorithm Achieving Optimal Sample ComplexityTianshu Chu, Dachuan Xu, Wei Yao, Jin Zhang. [doi]
- Reparameterized Importance Sampling for Robust Variational Bayesian Neural NetworksYunfei Long, Zilin Tian, Liguo Zhang, Huosheng Xu. [doi]
- OpenMoE: An Early Effort on Open Mixture-of-Experts Language ModelsFuzhao Xue, Zian Zheng, Yao Fu, Jinjie Ni, Zangwei Zheng, Wangchunshu Zhou, Yang You 0001. [doi]
- The Linear Representation Hypothesis and the Geometry of Large Language ModelsKiho Park, Yo Joong Choe, Victor Veitch. [doi]
- Safe Reinforcement Learning using Finite-Horizon Gradient-based EstimationJuntao Dai, Yaodong Yang 0001, Qian Zheng, Gang Pan 0001. [doi]
- Pruned Pivot: Correlation Clustering Algorithm for Dynamic, Parallel, and Local Computation ModelsMina Dalirrooyfard, Konstantin Makarychev, Slobodan Mitrovic. [doi]
- Position: Graph Foundation Models Are Already HereHaitao Mao, Zhikai Chen, Wenzhuo Tang, Jianan Zhao 0002, Yao Ma 0001, Tong Zhao 0003, Neil Shah, Mikhail Galkin 0001, Jiliang Tang. [doi]
- ULTRAFEEDBACK: Boosting Language Models with Scaled AI FeedbackGanqu Cui, Lifan Yuan, Ning Ding 0002, Guanming Yao, Bingxiang He, Wei Zhu 0016, Yuan Ni, Guotong Xie, Ruobing Xie, Yankai Lin, Zhiyuan Liu 0001, Maosong Sun 0001. [doi]
- Case-Based or Rule-Based: How Do Transformers Do the Math?Yi Hu, Xiaojuan Tang, Haotong Yang, Muhan Zhang. [doi]
- Ambiguity-Aware Abductive LearningHao-Yuan He 0001, Hui Sun 0003, Zheng Xie 0001, Ming Li 0005. [doi]
- Naive Bayes Classifiers over Missing Data: Decision and PoisoningSong Bian 0002, Xiating Ouyang, Zhiwei Fan, Paraschos Koutris. [doi]
- Online Speculative DecodingXiaoxuan Liu, Lanxiang Hu, Peter Bailis, Alvin Cheung, Zhijie Deng, Ion Stoica, Hao Zhang 0108. [doi]
- LeaPformer: Enabling Linear Transformers for Autoregressive and Simultaneous Tasks via Learned ProportionsVictor Agostinelli, Sanghyun Hong 0001, Lizhong Chen. [doi]
- Data Poisoning Attacks against Conformal PredictionYangyi Li, Aobo Chen, Wei Qian, Chenxu Zhao, Divya Lidder, Mengdi Huai. [doi]
- MusicFlow: Cascaded Flow Matching for Text Guided Music GenerationK. R. Prajwal, Bowen Shi, Matthew Le 0001, Apoorv Vyas, Andros Tjandra, Mahi Luthra, Baishan Guo, Huiyu Wang, Triantafyllos Afouras, David Kant, Wei-Ning Hsu. [doi]
- SSL4Q: Semi-Supervised Learning of Quantum Data with Application to Quantum State ClassificationYehui Tang, Nianzu Yang, Mabiao Long, Junchi Yan. [doi]
- Differentiable Weightless Neural NetworksAlan Tendler Leibel Bacellar, Zachary Susskind, Maurício Breternitz Jr., Eugene John, Lizy Kurian John, Priscila Machado Vieira Lima, Felipe M. G. França. [doi]
- An Empirical Study Into What Matters for Calibrating Vision-Language ModelsWeijie Tu, Weijian Deng, Dylan Campbell, Stephen Gould, Tom Gedeon. [doi]
- SparseTSF: Modeling Long-term Time Series Forecasting with *1k* ParametersShengsheng Lin, Weiwei Lin 0001, Wentai Wu, Haojun Chen, Junjie Yang. [doi]
- Attribute Based Interpretable Evaluation Metrics for Generative ModelsDongkyun Kim, Mingi Kwon, Youngjung Uh. [doi]
- PASOA- PArticle baSed Bayesian Optimal Adaptive designJacopo Iollo, Christophe Heinkelé, Pierre Alliez, Florence Forbes. [doi]
- Generalization in Kernel Regression Under Realistic AssumptionsDaniel Barzilai, Ohad Shamir. [doi]
- The WMDP Benchmark: Measuring and Reducing Malicious Use with UnlearningNathaniel Li, Alexander Pan, Anjali Gopal, Summer Yue, Daniel Berrios, Alice Gatti, Justin D. Li, Ann-Kathrin Dombrowski, Shashwat Goel, Gabriel Mukobi, Nathan Helm-Burger, Rassin Lababidi, Lennart Justen, Andrew B. Liu, Michael Chen, Isabelle Barrass, Oliver Zhang, Xiaoyuan Zhu, Rishub Tamirisa, Bhrugu Bharathi, Ariel Herbert-Voss, Cort B. Breuer, Andy Zou, Mantas Mazeika, Zifan Wang 0001, Palash Oswal, Weiran Lin, Adam A. Hunt, Justin Tienken-Harder, Kevin Y. Shih, Kemper Talley, John Guan, Ian Steneker, David Campbell, Brad Jokubaitis, Steven Basart, Stephen Fitz, Ponnurangam Kumaraguru, Kallol Krishna Karmakar, Uday Kiran Tupakula, Vijay Varadharajan, Yan Shoshitaishvili, Jimmy Ba, Kevin M. Esvelt, Alexandr Wang, Dan Hendrycks. [doi]
- Improving Gradient-Guided Nested Sampling for Posterior InferencePablo Lemos, Nikolay Malkin, Will Handley, Yoshua Bengio, Yashar Hezaveh, Laurence Perreault Levasseur. [doi]
- Switchable Decision: Dynamic Neural Generation NetworksShujian Zhang, Korawat Tanwisuth, ChengYue Gong, Pengcheng He, Mingyuan Zhou. [doi]
- Open Ad Hoc Teamwork with Cooperative Game TheoryJianhong Wang, Yang Li 0116, Yuan Zhang, Wei Pan 0004, Samuel Kaski. [doi]
- Consistent Submodular MaximizationPaul Duetting, Federico Fusco, Silvio Lattanzi, Ashkan Norouzi-Fard, Morteza Zadimoghaddam. [doi]
- SimPro: A Simple Probabilistic Framework Towards Realistic Long-Tailed Semi-Supervised LearningChaoqun Du, Yizeng Han, Gao Huang 0001. [doi]
- Efficient Black-box Adversarial Attacks via Bayesian Optimization Guided by a Function PriorShuyu Cheng, Yibo Miao, Yinpeng Dong, Xiao Yang, Xiao-Shan Gao, Jun Zhu 0001. [doi]
- LLM-Empowered State Representation for Reinforcement LearningBoyuan Wang, Yun Qu 0002, Yuhang Jiang, Jianzhun Shao, Chang Liu, Wenming Yang, Xiangyang Ji. [doi]
- Operator SVD with Neural Networks via Nested Low-Rank ApproximationJongha Jon Ryu, Xiangxiang Xu 0001, Hasan Sabri Melihcan Erol, Yuheng Bu, Lizhong Zheng, Gregory W. Wornell. [doi]
- ED-Copilot: Reduce Emergency Department Wait Time with Language Model Diagnostic AssistanceLiwen Sun, Abhineet Agarwal, Aaron Kornblith, Bin Yu 0001, Chenyan Xiong. [doi]
- Stochastic Conditional Diffusion Models for Robust Semantic Image SynthesisJuyeon Ko, Inho Kong, Dogyun Park, Hyunwoo J. Kim. [doi]
- Regularizing with Pseudo-Negatives for Continual Self-Supervised LearningSungmin Cha, KyungHyun Cho, Taesup Moon. [doi]
- Enhancing Cross-Modal Fine-Tuning with Gradually Intermediate Modality GenerationLincan Cai, Shuang Li 0008, Wenxuan Ma 0001, Jingxuan Kang, Binhui Xie, Zixun Sun, Chengwei Zhu. [doi]
- A Minimaximalist Approach to Reinforcement Learning from Human FeedbackGokul Swamy, Christoph Dann, Rahul Kidambi, Steven Wu 0001, Alekh Agarwal. [doi]
- Boundary Exploration for Bayesian Optimization With Unknown Physical ConstraintsYunsheng Tian, Ane Zuniga, Xinwei Zhang 0001, Johannes P. Dürholt, Payel Das, Jie Chen 0007, Wojciech Matusik, Mina Konakovic-Lukovic. [doi]
- Is Kernel Prediction More Powerful than Gating in Convolutional Neural Networks?Lorenz K. Müller. [doi]
- Antibody Design Using a Score-based Diffusion Model Guided by Evolutionary, Physical and Geometric ConstraintsTian Zhu, Milong Ren, Haicang Zhang. [doi]
- Learning to Reach Goals via DiffusionVineet Jain, Siamak Ravanbakhsh. [doi]
- FedRC: Tackling Diverse Distribution Shifts Challenge in Federated Learning by Robust ClusteringYongxin Guo, Xiaoying Tang, Tao Lin. [doi]
- The Expressive Power of Path-Based Graph Neural NetworksCaterina Graziani, Tamara Drucks, Fabian Jogl, Monica Bianchini, Franco Scarselli, Thomas Gärtner 0001. [doi]
- Optimal bounds for ℓp sensitivity sampling via ℓ2 augmentationAlexander Munteanu, Simon Omlor. [doi]