Abstract is missing.
- FedSpeed: Larger Local Interval, Less Communication Round, and Higher Generalization AccuracyYan Sun, Li Shen 0008, Tiansheng Huang, Liang Ding 0006, Dacheng Tao. [doi]
- ContraNorm: A Contrastive Learning Perspective on Oversmoothing and BeyondXiaojun Guo, Yifei Wang 0001, Tianqi Du, Yisen Wang 0001. [doi]
- Learning ReLU networks to high uniform accuracy is intractableJulius Berner, Philipp Grohs, Felix Voigtländer. [doi]
- Transferable Unlearnable ExamplesJie Ren, Han Xu 0002, Yuxuan Wan, Xingjun Ma, Lichao Sun 0001, Jiliang Tang. [doi]
- On the Soft-Subnetwork for Few-Shot Class Incremental LearningHaeyong Kang, Jaehong Yoon, Sultan Rizky Hikmawan Madjid, Sung Ju Hwang, Chang D. Yoo. [doi]
- Achieving Sub-linear Regret in Infinite Horizon Average Reward Constrained MDP with Linear Function ApproximationArnob Ghosh, Xingyu Zhou 0001, Ness B. Shroff. [doi]
- Context-enriched molecule representations improve few-shot drug discoveryJohannes Schimunek, Philipp Seidl, Lukas Friedrich, Daniel Kuhn, Friedrich Rippmann, Sepp Hochreiter, Günter Klambauer. [doi]
- The Role of ImageNet Classes in Fréchet Inception DistanceTuomas Kynkäänniemi, Tero Karras, Miika Aittala, Timo Aila, Jaakko Lehtinen. [doi]
- PaLI: A Jointly-Scaled Multilingual Language-Image ModelXi Chen, Xiao Wang 0038, Soravit Changpinyo, A. J. Piergiovanni, Piotr Padlewski, Daniel Salz, Sebastian Goodman, Adam Grycner, Basil Mustafa, Lucas Beyer, Alexander Kolesnikov 0003, Joan Puigcerver, Nan Ding 0002, Keran Rong, Hassan Akbari, Gaurav Mishra, Linting Xue, Ashish V. Thapliyal, James Bradbury, Weicheng Kuo. [doi]
- Sparse Token Transformer with Attention Back TrackingHeejun Lee, Minki Kang, Youngwan Lee, Sung Ju Hwang. [doi]
- Embedding Fourier for Ultra-High-Definition Low-Light Image EnhancementChongyi Li, Chun-Le Guo, Man Zhou, Zhexin Liang, Shangchen Zhou, Ruicheng Feng, Chen Change Loy. [doi]
- Canary in a Coalmine: Better Membership Inference with Ensembled Adversarial QueriesYuxin Wen, Arpit Bansal, Hamid Kazemi, Eitan Borgnia, Micah Goldblum, Jonas Geiping, Tom Goldstein. [doi]
- Exploring Low-Rank Property in Multiple Instance Learning for Whole Slide Image ClassificationJinxi Xiang, Jun Zhang 0018. [doi]
- Cheap Talk Discovery and Utilization in Multi-Agent Reinforcement LearningYat Long Lo, Christian Schröder de Witt, Samuel Sokota, Jakob Nicolaus Foerster, Shimon Whiteson. [doi]
- Human alignment of neural network representationsLukas Muttenthaler, Jonas Dippel, Lorenz Linhardt, Robert A. Vandermeulen, Simon Kornblith. [doi]
- VIP: Towards Universal Visual Reward and Representation via Value-Implicit Pre-TrainingYecheng Jason Ma, Shagun Sodhani, Dinesh Jayaraman, Osbert Bastani, Vikash Kumar, Amy Zhang 0001. [doi]
- Generative Modeling Helps Weak Supervision (and Vice Versa)Benedikt Boecking, Nicholas Roberts, Willie Neiswanger, Stefano Ermon, Frederic Sala, Artur Dubrawski. [doi]
- Block and Subword-Scaling Floating-Point (BSFP) : An Efficient Non-Uniform Quantization For Low Precision InferenceYun-Chen Lo, Tse-Kuang Lee, Ren-Shuo Liu. [doi]
- Humanly Certifying Superhuman ClassifiersQiongkai Xu, Christian Walder, Chenchen Xu. [doi]
- Valid P-Value for Deep Learning-driven Salient RegionDaiki Miwa, Vo Nguyen Le Duy, Ichiro Takeuchi. [doi]
- Voxurf: Voxel-based Efficient and Accurate Neural Surface ReconstructionTong Wu, Jiaqi Wang, Xingang Pan, Xudong Xu, Christian Theobalt, Ziwei Liu, Dahua Lin. [doi]
- Massively Scaling Heteroscedastic ClassifiersMark Collier, Rodolphe Jenatton, Basil Mustafa, Neil Houlsby, Jesse Berent, Effrosyni Kokiopoulou. [doi]
- $\Lambda$-DARTS: Mitigating Performance Collapse by Harmonizing Operation Selection among CellsSajad Movahedi, Melika Adabinejad, Ayyoob Imani, Arezou Keshavarz, Mostafa Dehghani 0001, Azadeh Shakery, Babak Nadjar Araabi. [doi]
- Preference Transformer: Modeling Human Preferences using Transformers for RLChangyeon Kim, Jongjin Park, Jinwoo Shin, Honglak Lee, Pieter Abbeel, Kimin Lee. [doi]
- Brain-like representational straightening of natural movies in robust feedforward neural networksTahereh Toosi, Elias Issa. [doi]
- A Convergent Single-Loop Algorithm for Relaxation of Gromov-Wasserstein in Graph DataJiajin Li, Jianheng Tang, Lemin Kong, Huikang Liu, Jia Li, Anthony Man-Cho So, Jose H. Blanchet. [doi]
- Rethinking the Expressive Power of GNNs via Graph BiconnectivityBohang Zhang, Shengjie Luo, Liwei Wang 0001, Di He. [doi]
- A view of mini-batch SGD via generating functions: conditions of convergence, phase transitions, benefit from negative momentaMaksim Velikanov, Denis Kuznedelev, Dmitry Yarotsky. [doi]
- When Data Geometry Meets Deep Function: Generalizing Offline Reinforcement LearningJianxiong Li, Xianyuan Zhan, Haoran Xu, Xiangyu Zhu, Jingjing Liu, Ya-Qin Zhang. [doi]
- What Do Self-Supervised Vision Transformers Learn?Namuk Park, Wonjae Kim, Byeongho Heo, Taekyung Kim 0005, Sangdoo Yun. [doi]
- Diffusion Probabilistic FieldsPeiye Zhuang, Samira Abnar, Jiatao Gu, Alexander G. Schwing, Joshua M. Susskind, Miguel Ángel Bautista 0001. [doi]
- Meta Knowledge Condensation for Federated LearningPing Liu, Xin Yu, Joey Tianyi Zhou. [doi]
- Latent Variable Representation for Reinforcement LearningTongzheng Ren, Chenjun Xiao, Tianjun Zhang, Na Li 0002, Zhaoran Wang, Sujay Sanghavi, Dale Schuurmans, Bo Dai 0001. [doi]
- Distilling Cognitive Backdoor Patterns within an ImageHanxun Huang, Xingjun Ma, Sarah Monazam Erfani, James Bailey 0001. [doi]
- Statistical Theory of Differentially Private Marginal-based Data Synthesis AlgorithmsXiMing Li, Chendi Wang, Guang Cheng. [doi]
- More ConvNets in the 2020s: Scaling up Kernels Beyond 51x51 using SparsityShiwei Liu, Tianlong Chen, Xiaohan Chen, Xuxi Chen, Qiao Xiao, Boqian Wu, Tommi Kärkkäinen, Mykola Pechenizkiy, Decebal Constantin Mocanu, Zhangyang Wang. [doi]
- Imitating Graph-Based Planning with Goal-Conditioned PoliciesJunsu Kim, Younggyo Seo, Sungsoo Ahn, Kyunghwan Son, Jinwoo Shin. [doi]
- Learning Symbolic Models for Graph-structured Physical MechanismHongzhi Shi, Jingtao Ding, Yufan Cao, Quanming Yao, Li Liu, Yong Li 0008. [doi]
- A Time Series is Worth 64 Words: Long-term Forecasting with TransformersYuqi Nie, Nam H. Nguyen, Phanwadee Sinthong, Jayant Kalagnanam. [doi]
- Sample-Efficient Reinforcement Learning by Breaking the Replay Ratio BarrierPierluca D'Oro, Max Schwarzer, Evgenii Nikishin, Pierre-Luc Bacon, Marc G. Bellemare, Aaron C. Courville. [doi]
- Enhancing the Inductive Biases of Graph Neural ODE for Modeling Physical SystemsSuresh Bishnoi, Ravinder Bhattoo, Jayadeva, Sayan Ranu, N. M. Anoop Krishnan. [doi]
- Neural ePDOs: Spatially Adaptive Equivariant Partial Differential Operator Based NetworksLingshen He, Yuxuan Chen, Zhengyang Shen, Yibo Yang, Zhouchen Lin. [doi]
- Moving Forward by Moving Backward: Embedding Action Impact over Action SemanticsKuo-Hao Zeng, Luca Weihs, Roozbeh Mottaghi, Ali Farhadi. [doi]
- Replicable BanditsHossein Esfandiari, Alkis Kalavasis, Amin Karbasi, Andreas Krause 0001, Vahab Mirrokni, Grigoris Velegkas. [doi]
- TranSpeech: Speech-to-Speech Translation With Bilateral PerturbationRongjie Huang, Jinglin Liu, Huadai Liu, Yi Ren 0006, Lichao Zhang, Jinzheng He, Zhou Zhao. [doi]
- Bort: Towards Explainable Neural Networks with Bounded Orthogonal ConstraintBorui Zhang, Wenzhao Zheng, Jie Zhou 0001, Jiwen Lu. [doi]
- Understanding Edge-of-Stability Training Dynamics with a Minimalist ExampleXingyu Zhu 0003, Zixuan Wang, Xiang Wang 0011, Mo Zhou, Rong Ge 0001. [doi]
- Recon: Reducing Conflicting Gradients From the Root For Multi-Task LearningGuangyuan Shi, Qimai Li, Wenlong Zhang, Jiaxin Chen, Xiao-Ming Wu 0003. [doi]
- Transfer Learning with Deep Tabular ModelsRoman Levin, Valeriia Cherepanova, Avi Schwarzschild, Arpit Bansal, C. Bayan Bruss, Tom Goldstein, Andrew Gordon Wilson, Micah Goldblum. [doi]
- MECTA: Memory-Economic Continual Test-Time Model AdaptationJunyuan Hong, Lingjuan Lyu, Jiayu Zhou, Michael Spranger. [doi]
- Stochastic No-regret Learning for General Games with Variance ReductionYichi Zhou, Fang Kong 0002, Shuai Li 0010. [doi]
- Semi-supervised learning with a principled likelihood from a generative model of data curationStoil Ganev, Laurence Aitchison. [doi]
- Cycle-consistent Masked AutoEncoder for Unsupervised Domain GeneralizationHaiyang Yang, Xiaotong Li, Shixiang Tang, Feng Zhu 0006, Yizhou Wang, Meilin Chen, Lei Bai 0001, Rui Zhao 0001, Wanli Ouyang. [doi]
- Re-calibrating Feature Attributions for Model InterpretationPeiyu Yang, Naveed Akhtar, Zeyi Wen, Mubarak Shah, Ajmal Saeed Mian. [doi]
- Distributionally Robust Post-hoc Classifiers under Prior ShiftsJiaheng Wei, Harikrishna Narasimhan, Ehsan Amid, Wen-Sheng Chu, Yang Liu 0018, Abhishek Kumar. [doi]
- Domain Generalization via Heckman-type Selection ModelsHyungu Kahng, Hyungrok Do, Judy Zhong. [doi]
- LilNetX: Lightweight Networks with EXtreme Model Compression and Structured SparsificationSharath Girish, Kamal Gupta 0002, Saurabh Singh, Abhinav Shrivastava. [doi]
- GAMR: A Guided Attention Model for (visual) ReasoningMohit Vaishnav, Thomas Serre. [doi]
- Hierarchical Sliced Wasserstein DistanceKhai Nguyen, Tongzheng Ren, Huy Nguyen, Litu Rout, Tan Minh Nguyen, Nhat Ho. [doi]
- Solving Constrained Variational Inequalities via a First-order Interior Point-based MethodTong Yang, Michael I. Jordan, Tatjana Chavdarova. [doi]
- Policy Expansion for Bridging Offline-to-Online Reinforcement LearningHaichao Zhang, Wei Xu 0017, Haonan Yu. [doi]
- LightGCL: Simple Yet Effective Graph Contrastive Learning for RecommendationXuheng Cai, Chao Huang 0001, Lianghao Xia, Xubin Ren. [doi]
- BrainBERT: Self-supervised representation learning for intracranial recordingsChristopher Wang, Vighnesh Subramaniam, Adam Uri Yaari, Gabriel Kreiman, Boris Katz, Ignacio Cases, Andrei Barbu. [doi]
- The Role of Coverage in Online Reinforcement LearningTengyang Xie, Dylan J. Foster, Yu Bai 0017, Nan Jiang 0008, Sham M. Kakade. [doi]
- Link Prediction with Non-Contrastive LearningWilliam Shiao, Zhichun Guo, Tong Zhao 0003, Evangelos E. Papalexakis, Yozen Liu, Neil Shah. [doi]
- Robust Fair Clustering: A Novel Fairness Attack and Defense FrameworkAnshuman Chhabra, Peizhao Li, Prasant Mohapatra, Hongfu Liu. [doi]
- Learning to Generate Columns with Application to Vertex ColoringYuan Sun 0003, Andreas T. Ernst, Xiaodong Li 0001, Jake Weiner. [doi]
- FIFA: Making Fairness More Generalizable in Classifiers Trained on Imbalanced DataZhun Deng, Jiayao Zhang 0001, Linjun Zhang, Ting Ye, Yates Coley, Weijie J. Su, James Zou 0001. [doi]
- Information-Theoretic Analysis of Unsupervised Domain AdaptationZiqiao Wang, Yongyi Mao. [doi]
- Anisotropic Message Passing: Graph Neural Networks with Directional and Long-Range InteractionsMoritz Thürlemann, Sereina Riniker. [doi]
- Thalamus: a brain-inspired algorithm for biologically-plausible continual learning and disentangled representationsAli Hummos. [doi]
- Optimal Transport for Offline Imitation LearningYicheng Luo, Zhengyao Jiang, Samuel Cohen, Edward Grefenstette, Marc Peter Deisenroth. [doi]
- KnowDA: All-in-One Knowledge Mixture Model for Data Augmentation in Low-Resource NLPYufei Wang 0003, Jiayi Zheng, Can Xu, Xiubo Geng, Tao Shen 0001, Chongyang Tao, Daxin Jiang. [doi]
- Boosting Causal Discovery via Adaptive Sample ReweightingAn Zhang 0003, Fangfu Liu, Wenchang Ma, Zhibo Cai, Xiang Wang 0010, Tat-Seng Chua. [doi]
- SeaFormer: Squeeze-enhanced Axial Transformer for Mobile Semantic SegmentationQiang Wan, Zilong Huang, Jiachen Lu, Gang Yu, Li Zhang 0001. [doi]
- Mosaic Representation Learning for Self-supervised Visual Pre-trainingZhaoqing Wang, Ziyu Chen, Yaqian Li, Yandong Guo, Jun Yu 0002, Mingming Gong, Tongliang Liu. [doi]
- Fast and Precise: Adjusting Planning Horizon with Adaptive Subgoal SearchMichal Zawalski, Michal Tyrolski, Konrad Czechowski, Tomasz Odrzygózdz, Damian Stachura, Piotr Piekos, Yuhuai Wu, Lukasz Kucinski, Piotr Milos. [doi]
- Memory Gym: Partially Observable Challenges to Memory-Based AgentsMarco Pleines, Matthias Pallasch, Frank Zimmer, Mike Preuss. [doi]
- Causal Reasoning in the Presence of Latent Confounders via Neural ADMG LearningMatthew Ashman, Chao Ma 0019, Agrin Hilmkil, Joel Jennings, Cheng Zhang. [doi]
- Long-Tailed Partial Label Learning via Dynamic RebalancingFeng Hong 0004, Jiangchao Yao, Zhihan Zhou 0002, Ya Zhang 0002, Yanfeng Wang. [doi]
- The hidden uniform cluster prior in self-supervised learningMido Assran, Randall Balestriero, Quentin Duval, Florian Bordes, Ishan Misra, Piotr Bojanowski, Pascal Vincent, Michael G. Rabbat, Nicolas Ballas. [doi]
- Understanding Why Generalized Reweighting Does Not Improve Over ERMRuntian Zhai, Chen Dan 0001, J. Zico Kolter, Pradeep Kumar Ravikumar. [doi]
- Learning to Extrapolate: A Transductive ApproachAviv Netanyahu, Abhishek Gupta 0004, Max Simchowitz, Kaiqing Zhang, Pulkit Agrawal. [doi]
- Multi-task Self-supervised Graph Neural Networks Enable Stronger Task GeneralizationMingxuan Ju, Tong Zhao 0003, Qianlong Wen, Wenhao Yu 0002, Neil Shah, Yanfang Ye 0001, Chuxu Zhang. [doi]
- Diffusion Policies as an Expressive Policy Class for Offline Reinforcement LearningZhendong Wang, Jonathan J. Hunt, Mingyuan Zhou. [doi]
- Basic Binary Convolution Unit for Binarized Image Restoration NetworkBin Xia, Yulun Zhang, Yitong Wang, Yapeng Tian, Wenming Yang, Radu Timofte, Luc Van Gool. [doi]
- Expressive Monotonic Neural NetworksNiklas Nolte, Ouail Kitouni, Mike Williams. [doi]
- Revisiting the Assumption of Latent Separability for Backdoor DefensesXiangyu Qi, Tinghao Xie, Yiming Li, Saeed Mahloujifar, Prateek Mittal. [doi]
- Understanding DDPM Latent Codes Through Optimal TransportValentin Khrulkov, Gleb V. Ryzhakov, Andrei Chertkov, Ivan V. Oseledets. [doi]
- Protein Sequence and Structure Co-Design with Equivariant TranslationChence Shi, Chuanrui Wang, Jiarui Lu, Bozitao Zhong, Jian Tang 0005. [doi]
- Is Synthetic Data from Generative Models Ready for Image Recognition?Ruifei He, Shuyang Sun, Xin Yu 0004, Chuhui Xue, Wenqing Zhang, Philip H. S. Torr, Song Bai, Xiaojuan Qi. [doi]
- Complexity-Based Prompting for Multi-step ReasoningYao Fu, Hao Peng, Ashish Sabharwal, Peter Clark, Tushar Khot. [doi]
- Accelerating Guided Diffusion Sampling with Splitting Numerical MethodsSuttisak Wizadwongsa, Supasorn Suwajanakorn. [doi]
- Autoregressive Conditional Neural ProcessesWessel P. Bruinsma, Stratis Markou, James Requeima, Andrew Y. K. Foong, Tom R. Andersson, Anna Vaughan, Anthony Buonomo, J. Scott Hosking, Richard E. Turner. [doi]
- Online Low Rank Matrix CompletionSoumyabrata Pal, Prateek Jain 0002. [doi]
- Deep Ensembles for Graphs with Higher-order DependenciesSteven J. Krieg, William C. Burgis, Patrick M. Soga, Nitesh V. Chawla. [doi]
- Classically Approximating Variational Quantum Machine Learning with Random Fourier FeaturesJonas Landman, Slimane Thabet, Constantin Dalyac, Hela Mhiri, Elham Kashefi. [doi]
- BC-IRL: Learning Generalizable Reward Functions from DemonstrationsAndrew Szot, Amy Zhang 0001, Dhruv Batra, Zsolt Kira, Franziska Meier. [doi]
- Making Substitute Models More Bayesian Can Enhance Transferability of Adversarial ExamplesQizhang Li, Yiwen Guo, Wangmeng Zuo, Hao Chen 0003. [doi]
- Relational Attention: Generalizing Transformers for Graph-Structured TasksCameron Diao, Ricky Loynd. [doi]
- Online Boundary-Free Continual Learning by Scheduled Data PriorHyunseo Koh, Minhyuk Seo, Jihwan Bang, Hwanjun Song, Deokki Hong, Seulki Park, Jung-Woo Ha 0001, Jonghyun Choi. [doi]
- Part-Based Models Improve Adversarial RobustnessChawin Sitawarin, Kornrapat Pongmala, Yizheng Chen 0001, Nicholas Carlini, David A. Wagner 0001. [doi]
- FedDAR: Federated Domain-Aware Representation LearningAoxiao Zhong, Hao He, Zhaolin Ren, Na Li 0002, Quanzheng Li. [doi]
- Offline Q-learning on Diverse Multi-Task Data Both Scales And GeneralizesAviral Kumar, Rishabh Agarwal, Xinyang Geng, George Tucker, Sergey Levine. [doi]
- Revisiting Graph Adversarial Attack and Defense From a Data Distribution PerspectiveKuan Li, Yang Liu 0200, Xiang Ao 0001, Qing He 0003. [doi]
- Learning Hierarchical Protein Representations via Complete 3D Graph NetworksLimei Wang, Haoran Liu, Yi Liu, Jerry Kurtin, Shuiwang Ji. [doi]
- Matching receptor to odorant with protein language and graph neural networksMatej Hladis, Maxence Lalis, Sébastien Fiorucci, Jérémie Topin. [doi]
- Learning Simultaneous Navigation and Construction in Grid WorldsWenyu Han, Haoran Wu, Eisuke Hirota, Alexander Gao, Lerrel Pinto, Ludovic Righetti, Chen Feng 0002. [doi]
- Building Normalizing Flows with Stochastic InterpolantsMichael S. Albergo, Eric Vanden-Eijnden. [doi]
- Better Generative Replay for Continual Federated LearningDaiqing Qi, Handong Zhao, Sheng Li 0001. [doi]
- PLOT: Prompt Learning with Optimal Transport for Vision-Language ModelsGuangyi Chen 0002, Weiran Yao, Xiangchen Song, Xinyue Li, Yongming Rao, Kun Zhang 0001. [doi]
- Is Forgetting Less a Good Inductive Bias for Forward Transfer?Jiefeng Chen 0001, Timothy Nguyen, Dilan Görür, Arslan Chaudhry. [doi]
- Why adversarial training can hurt robust accuracyJacob Clarysse, Julia Hörrmann, Fanny Yang. [doi]
- DySR: Adaptive Super-Resolution via Algorithm and System Co-designSyed Zawad, Cheng Li 0001, Zhewei Yao, Elton Zheng, Yuxiong He, Feng Yan 0001. [doi]
- Re-parameterizing Your Optimizers rather than ArchitecturesXiaohan Ding, Honghao Chen, Xiangyu Zhang 0005, Kaiqi Huang, Jungong Han, Guiguang Ding. [doi]
- MocoSFL: enabling cross-client collaborative self-supervised learningJingtao Li, Lingjuan Lyu, Daisuke Iso, Chaitali Chakrabarti, Michael Spranger. [doi]
- Mole-BERT: Rethinking Pre-training Graph Neural Networks for MoleculesJun Xia, Chengshuai Zhao, Bozhen Hu, Zhangyang Gao, Cheng Tan 0012, Yue Liu, Siyuan Li, Stan Z. Li. [doi]
- Leveraging Importance Weights in Subset SelectionGui Citovsky, Giulia DeSalvo, Sanjiv Kumar, Srikumar Ramalingam, Afshin Rostamizadeh, Yunjuan Wang. [doi]
- Deep Learning on Implicit Neural Representations of ShapesLuca De Luigi, Adriano Cardace, Riccardo Spezialetti, Pierluigi Zama Ramirez, Samuele Salti, Luigi di Stefano. [doi]
- Single-shot General Hyper-parameter Optimization for Federated LearningYi Zhou 0015, Parikshit Ram, Theodoros Salonidis, Nathalie Baracaldo, Horst Samulowitz, Heiko Ludwig. [doi]
- Out-of-distribution Detection with Implicit Outlier TransformationQizhou Wang, Junjie Ye, Feng Liu 0003, Quanyu Dai, Marcus Kalander, Tongliang Liu, Jianye Hao, Bo Han 0003. [doi]
- Revisiting the Entropy Semiring for Neural Speech RecognitionOscar Chang, Dongseong Hwang, Olivier Siohan. [doi]
- What Makes Convolutional Models Great on Long Sequence Modeling?Yuhong Li, Tianle Cai, Yi Zhang, Deming Chen, Debadeepta Dey. [doi]
- Benchmarking Offline Reinforcement Learning on Real-Robot HardwareNico Gürtler, Sebastian Blaes, Pavel Kolev, Felix Widmaier, Manuel Wuthrich, Stefan Bauer, Bernhard Schölkopf, Georg Martius. [doi]
- Dataset Pruning: Reducing Training Data by Examining Generalization InfluenceShuo Yang 0006, Zeke Xie, Hanyu Peng, Min Xu 0001, Mingming Sun, Ping Li 0001. [doi]
- STREET: A Multi-Task Structured Reasoning and Explanation BenchmarkDanilo Neves Ribeiro, Shen Wang 0005, Xiaofei Ma, Henghui Zhu, Rui Dong, Deguang Kong, Juliette Burger, Anjelica Ramos, Zhiheng Huang, William Yang Wang, George Karypis, Bing Xiang, Dan Roth. [doi]
- Distributional Meta-Gradient Reinforcement LearningHaiyan Yin, Shuicheng Yan, Zhongwen Xu. [doi]
- The Provable Benefit of Unsupervised Data Sharing for Offline Reinforcement LearningHao Hu 0006, Yiqin Yang, Qianchuan Zhao, Chongjie Zhang. [doi]
- Learning Math Reasoning from Self-Sampled Correct and Partially-Correct SolutionsAnsong Ni, Jeevana Priya Inala, Chenglong Wang, Alex Polozov, Christopher Meek, Dragomir Radev, Jianfeng Gao. [doi]
- On the Sensitivity of Reward Inference to Misspecified Human ModelsJoey Hong, Kush Bhatia, Anca D. Dragan. [doi]
- Dirichlet-based Uncertainty Calibration for Active Domain AdaptationMixue Xie, Shuang Li 0008, Rui Zhang, Chi Harold Liu. [doi]
- Flow Matching for Generative ModelingYaron Lipman, Ricky T. Q. Chen, Heli Ben Hamu, Maximilian Nickel, Matthew Le. [doi]
- Neural Agents Struggle to Take Turns in Bidirectional Emergent CommunicationValentin Taillandier, Dieuwke Hupkes, Benoît Sagot, Emmanuel Dupoux, Paul Michel. [doi]
- E-CRF: Embedded Conditional Random Field for Boundary-caused Class Weights Confusion in Semantic SegmentationJie Zhu, Huabin Huang, Banghuai Li, Leye Wang. [doi]
- UL2: Unifying Language Learning ParadigmsYi Tay, Mostafa Dehghani 0001, Vinh Q. Tran 0002, Xavier Garcia, Jason Wei, Xuezhi Wang 0002, Hyung Won Chung, Dara Bahri, Tal Schuster, Huaixiu Steven Zheng, Denny Zhou, Neil Houlsby, Donald Metzler. [doi]
- Imitating Human Behaviour with Diffusion ModelsTim Pearce, Tabish Rashid, Anssi Kanervisto, David Bignell, Mingfei Sun, Raluca Georgescu, Sergio Valcarcel Macua, Shan Zheng Tan, Ida Momennejad, Katja Hofmann, Sam Devlin. [doi]
- Scaling Pareto-Efficient Decision Making via Offline Multi-Objective RLBaiting Zhu, Meihua Dang, Aditya Grover. [doi]
- Jointly Learning Visual and Auditory Speech Representations from Raw DataAlexandros Haliassos, Pingchuan Ma 0001, Rodrigo Mira, Stavros Petridis, Maja Pantic. [doi]
- Memorization Capacity of Neural Networks with Conditional ComputationErdem Koyuncu. [doi]
- Soft Neighbors are Positive Supporters in Contrastive Visual Representation LearningChongjian Ge, Jiangliu Wang, Zhan Tong, Shoufa Chen, Yibing Song, Ping Luo 0002. [doi]
- Spherical Sliced-WassersteinClément Bonet, Paul Berg, Nicolas Courty, François Septier, Lucas Drumetz, Minh-Tan Pham. [doi]
- Fairness-aware Contrastive Learning with Partially Annotated Sensitive AttributesFengda Zhang, Kun Kuang, Long Chen 0016, Yuxuan Liu, Chao Wu 0001, Jun Xiao 0001. [doi]
- BEEF: Bi-Compatible Class-Incremental Learning via Energy-Based Expansion and FusionFu-Yun Wang, Da-Wei Zhou 0001, Liu Liu, Han-Jia Ye, Yatao Bian, De-Chuan Zhan, Peilin Zhao. [doi]
- How I Learned to Stop Worrying and Love RetrainingMax Zimmer, Christoph Spiegel 0002, Sebastian Pokutta. [doi]
- Learning where and when to reason in neuro-symbolic inferenceCristina Cornelio, Jan Stuehmer, Shell Xu Hu, Timothy M. Hospedales. [doi]
- Generating Sequences by Learning to Self-CorrectSean Welleck, Ximing Lu, Peter West, Faeze Brahman, Tianxiao Shen, Daniel Khashabi, Yejin Choi 0001. [doi]
- Representation Learning for Low-rank General-sum Markov GamesChengzhuo Ni, Yuda Song 0001, Xuezhou Zhang, Zihan Ding, Chi Jin, Mengdi Wang. [doi]
- Pruning Deep Neural Networks from a Sparsity PerspectiveEnmao Diao, Ganghua Wang, Jiawei Zhang, Yuhong Yang 0002, Jie Ding 0002, Vahid Tarokh. [doi]
- Protein Representation Learning by Geometric Structure PretrainingZuobai Zhang, Minghao Xu, Arian Rokkum Jamasb, Vijil Chenthamarakshan, Aurélie C. Lozano, Payel Das, Jian Tang 0005. [doi]
- Learning Cut Selection for Mixed-Integer Linear Programming via Hierarchical Sequence ModelZhihai Wang, Xijun Li, Jie Wang 0005, Yufei Kuang, Mingxuan Yuan, Jia Zeng, Yongdong Zhang 0001, Feng Wu 0001. [doi]
- Human Motion Diffusion ModelGuy Tevet, Sigal Raab, Brian Gordon, Yonatan Shafir, Daniel Cohen-Or, Amit Haim Bermano. [doi]
- Learning Label Encodings for Deep RegressionDeval Shah, Tor M. Aamodt. [doi]
- Kernel Neural Optimal TransportAlexander Korotin, Daniil Selikhanovych, Evgeny Burnaev. [doi]
- Continuized Acceleration for Quasar Convex Functions in Non-Convex OptimizationJun-Kun Wang, Andre Wibisono. [doi]
- CLIP-ViP: Adapting Pre-trained Image-Text Model to Video-Language AlignmentHongwei Xue, Yuchong Sun, Bei Liu 0001, Jianlong Fu, Ruihua Song, Houqiang Li, Jiebo Luo. [doi]
- Is Adversarial Training Really a Silver Bullet for Mitigating Data Poisoning?Rui Wen 0002, Zhengyu Zhao 0001, Zhuoran Liu 0001, Michael Backes 0001, Tianhao Wang 0001, Yang Zhang 0016. [doi]
- A Graph Neural Network Approach to Automated Model Building in Cryo-EM MapsKiarash Jamali, Dari Kimanius, Sjors H. W. Scheres. [doi]
- Learning Harmonic Molecular Representations on Riemannian ManifoldYiqun Wang, Yuning Shen, Shi Chen, Lihao Wang, Fei Ye, Hao Zhou. [doi]
- A new characterization of the edge of stability based on a sharpness measure aware of batch gradient distributionSungyoon Lee, Cheongjae Jang. [doi]
- In-sample Actor Critic for Offline Reinforcement LearningHongchang Zhang, Yixiu Mao, Boyuan Wang, Shuncheng He, Yi Xu, Xiangyang Ji. [doi]
- Self-Supervised Category-Level Articulated Object Pose Estimation with Part-Level SE(3) EquivarianceXueyi Liu, Ji Zhang, Ruizhen Hu, Haibin Huang, He Wang 0010, Li Yi. [doi]
- Population-size-Aware Policy Optimization for Mean-Field GamesPengdeng Li, Xinrun Wang, Shuxin Li, Hau Chan, Bo An 0001. [doi]
- Approximate Vanishing Ideal Computations at ScaleElias Samuel Wirth, Hiroshi Kera, Sebastian Pokutta. [doi]
- Energy-Based Test Sample Adaptation for Domain GeneralizationZehao Xiao, Xiantong Zhen, ShengCai Liao, Cees G. M. Snoek. [doi]
- Phase2vec: dynamical systems embedding with a physics-informed convolutional networkMatthew Ricci, Noa Moriel, Zoe Piran, Mor Nitzan. [doi]
- CodeGen: An Open Large Language Model for Code with Multi-Turn Program SynthesisErik Nijkamp, Bo Pang, Hiroaki Hayashi, Lifu Tu, Huan Wang, Yingbo Zhou, Silvio Savarese, Caiming Xiong. [doi]
- Efficient Certified Training and Robustness Verification of Neural ODEsMustafa Zeqiri, Mark Niklas Müller, Marc Fischer 0002, Martin T. Vechev. [doi]
- Machine Unlearning of Federated ClustersChao Pan 0003, Jin Sima, Saurav Prakash, Vishal Rana, Olgica Milenkovic. [doi]
- Fooling SHAP with Stealthily Biased SamplingGabriel Laberge, Ulrich Aïvodji, Satoshi Hara 0001, Mario Marchand, Foutse Khomh. [doi]
- Learning with Stochastic OrdersCarles Domingo-Enrich, Yair Schiff, Youssef Mroueh. [doi]
- BEVDistill: Cross-Modal BEV Distillation for Multi-View 3D Object DetectionZehui Chen, Zhenyu Li, Shiquan Zhang, Liangji Fang, Qinhong Jiang, Feng Zhao. [doi]
- A Higher Precision Algorithm for Computing the $1$-Wasserstein DistancePankaj K. Agarwal, Sharath Raghvendra, Pouyan Shirzadian, Rachita Sowle. [doi]
- Measuring Forgetting of Memorized Training ExamplesMatthew Jagielski, Om Thakkar 0001, Florian Tramèr, Daphne Ippolito, Katherine Lee, Nicholas Carlini, Eric Wallace, Shuang Song 0001, Abhradeep Guha Thakurta, Nicolas Papernot, Chiyuan Zhang. [doi]
- Scaleformer: Iterative Multi-scale Refining Transformers for Time Series ForecastingMohammad Amin Shabani, Amir H. Abdi, Lili Meng, Tristan Sylvain. [doi]
- Towards Better Selective ClassificationLeo Feng, Mohamed Osama Ahmed, Hossein Hajimirsadeghi, Amir H. Abdi. [doi]
- A Kernel Perspective of Skip Connections in Convolutional NetworksDaniel Barzilai, Amnon Geifman, Meirav Galun, Ronen Basri. [doi]
- Semi-Parametric Inducing Point Networks and Neural ProcessesRicha Rastogi, Yair Schiff, Alon Hacohen, Zhaozhi Li, Ian Lee, Yuntian Deng, Mert R. Sabuncu, Volodymyr Kuleshov. [doi]
- A law of adversarial risk, interpolation, and label noiseDaniel Paleka, Amartya Sanyal. [doi]
- Proposal-Contrastive Pretraining for Object Detection from Fewer DataQuentin Bouniot, Romaric Audigier, Angélique Loesch, Amaury Habrard. [doi]
- Automated Data Augmentations for Graph ClassificationYouzhi Luo, Michael McThrow, Wing Yee Au, Tao Komikado, Kanji Uchino, Koji Maruhashi, Shuiwang Ji. [doi]
- Robust Algorithms on Adaptive Inputs from Bounded AdversariesYeshwanth Cherapanamjeri, Sandeep Silwal, David P. Woodruff, Fred Zhang, Qiuyi Zhang 0001, Samson Zhou. [doi]
- Diffusion Models Already Have A Semantic Latent SpaceMingi Kwon, Jaeseok Jeong, Youngjung Uh. [doi]
- Measuring axiomatic soundness of counterfactual image modelsMiguel Monteiro, Fabio De Sousa Ribeiro, Nick Pawlowski, Daniel C. Castro, Ben Glocker. [doi]
- Adaptive Robust Evidential Optimization For Open Set Detection from Imbalanced DataHitesh Sapkota, Qi Yu 0001. [doi]
- Multimodal Analogical Reasoning over Knowledge GraphsNingyu Zhang 0001, Lei Li 0040, Xiang Chen 0016, Xiaozhuan Liang, Shumin Deng, Huajun Chen. [doi]
- Sequential Attention for Feature SelectionTaisuke Yasuda 0002, Mohammad Hossein Bateni 0001, Lin Chen, Matthew Fahrbach, Gang Fu, Vahab Mirrokni. [doi]
- A System for Morphology-Task Generalization via Unified Representation and Behavior DistillationHiroki Furuta, Yusuke Iwasawa, Yutaka Matsuo, Shixiang Shane Gu. [doi]
- FairGBM: Gradient Boosting with Fairness ConstraintsAndré Ferreira Cruz, Catarina Belém, João Bravo, Pedro Saleiro, Pedro Bizarro. [doi]
- Learning to Linearize Deep Neural Networks for Secure and Efficient Private InferenceSouvik Kundu 0002, Shunlin Lu, Yuke Zhang, Jacqueline Tiffany Liu, Peter A. Beerel. [doi]
- Characterizing the spectrum of the NTK via a power series expansionMichael Murray, Hui Jin, Benjamin Bowman, Guido Montúfar. [doi]
- 3D generation on ImageNetIvan Skorokhodov, Aliaksandr Siarohin, Yinghao Xu, Jian Ren, Hsin-Ying Lee, Peter Wonka, Sergey Tulyakov. [doi]
- Rotamer Density Estimator is an Unsupervised Learner of the Effect of Mutations on Protein-Protein InteractionShitong Luo, Yufeng Su, Zuofan Wu, Chenpeng Su, Jian Peng 0001, Jianzhu Ma. [doi]
- Continual Pre-training of Language ModelsZixuan Ke, Yijia Shao, Haowei Lin, Tatsuya Konishi, Gyuhak Kim, Bing Liu 0001. [doi]
- Text Summarization with Oracle ExpectationYumo Xu, Mirella Lapata. [doi]
- CO3: Cooperative Unsupervised 3D Representation Learning for Autonomous DrivingRunjian Chen, Yao Mu, Runsen Xu, Wenqi Shao, Chenhan Jiang, Hang Xu, Yu Qiao 0001, Zhenguo Li, Ping Luo 0002. [doi]
- DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object DetectionHao Zhang 0097, Feng Li, Shilong Liu, Lei Zhang 0001, Hang Su 0006, Jun Zhu 0001, Lionel M. Ni, Heung-Yeung Shum. [doi]
- D4AM: A General Denoising Framework for Downstream Acoustic ModelsChi-Chang Lee, Yu Tsao 0001, Hsin-Min Wang, Chu-Song Chen. [doi]
- Towards convergence to Nash equilibria in two-team zero-sum gamesFivos Kalogiannis, Ioannis Panageas, Emmanouil-Vasileios Vlatakis-Gkaragkounis. [doi]
- MEDICAL IMAGE UNDERSTANDING WITH PRETRAINED VISION LANGUAGE MODELS: A COMPREHENSIVE STUDYZiyuan Qin, Huahui Yi, Qicheng Lao, Kang Li 0006. [doi]
- Does Zero-Shot Reinforcement Learning Exist?Ahmed Touati, Jérémy Rapin, Yann Ollivier. [doi]
- Impossibly Good Experts and How to Follow ThemAaron Walsman, Muru Zhang, Sanjiban Choudhury, Dieter Fox, Ali Farhadi. [doi]
- Hierarchical Abstraction for Combinatorial Generalization in Object RearrangementMichael Chang 0003, Alyssa L. Dayan, Franziska Meier, Thomas L. Griffiths 0001, Sergey Levine, Amy Zhang 0001. [doi]
- One Transformer Can Understand Both 2D & 3D Molecular DataShengjie Luo, Tianlang Chen, Yixian Xu, Shuxin Zheng, Tie-Yan Liu, Liwei Wang 0001, Di He. [doi]
- Learning to reason over visual objectsShanka Subhra Mondal, Taylor Whittington Webb, Jonathan Cohen 0003. [doi]
- GFlowNets and variational inferenceNikolay Malkin, Salem Lahlou, Tristan Deleu, Xu Ji, Edward J. Hu, Katie Everett, Dinghuai Zhang, Yoshua Bengio. [doi]
- PerFedMask: Personalized Federated Learning with Optimized Masking VectorsMehdi Setayesh, Xiaoxiao Li, Vincent W. S. Wong 0001. [doi]
- Model-based Causal Bayesian OptimizationScott Sussex, Anastasia Makarova, Andreas Krause 0001. [doi]
- Finding Actual Descent Directions for Adversarial TrainingFabian Latorre, Igor Krawczuk, Leello Tadesse Dadi, Thomas Pethick, Volkan Cevher. [doi]
- Promptagator: Few-shot Dense Retrieval From 8 ExamplesZhuyun Dai, Vincent Y. Zhao, Ji Ma, Yi Luan, Jianmo Ni, Jing Lu, Anton Bakalov, Kelvin Guu, Keith B. Hall, Ming-Wei Chang. [doi]
- LAVA: Data Valuation without Pre-Specified Learning AlgorithmsHoang Anh Just, Feiyang Kang, Tianhao Wang 0013, Yi Zeng, Myeongseob Ko, Ming Jin 0002, Ruoxi Jia. [doi]
- Rethinking Self-Supervised Visual Representation Learning in Pre-training for 3D Human Pose and Shape EstimationHongsuk Choi, Hyeongjin Nam, Taeryung Lee, Gyeongsik Moon, Kyoung Mu Lee. [doi]
- On The Inadequacy of Optimizing Alignment and Uniformity in Contrastive Learning of Sentence RepresentationsZhijie Nie, Richong Zhang, Yongyi Mao. [doi]
- Betty: An Automatic Differentiation Library for Multilevel OptimizationSang Keun Choe, Willie Neiswanger, Pengtao Xie, Eric P. Xing. [doi]
- Optimizing Bi-Encoder for Named Entity Recognition via Contrastive LearningSheng Zhang 0012, Hao Cheng 0002, Jianfeng Gao, Hoifung Poon. [doi]
- Learning Uncertainty for Unknown Domains with Zero-Target-AssumptionYu Yu 0004, Hassan Sajjad, Jia Xu 0004. [doi]
- Scale-invariant Bayesian Neural Networks with Connectivity Tangent KernelSungyub Kim, Sihwan Park, Kyung-Su Kim 0002, Eunho Yang. [doi]
- Reliability of CKA as a Similarity Measure in Deep LearningMohammadReza Davari, Stefan Horoi, Amine Natik, Guillaume Lajoie, Guy Wolf, Eugene Belilovsky. [doi]
- Continuous PDE Dynamics Forecasting with Implicit Neural RepresentationsYuan Yin, Matthieu Kirchmeyer, Jean-Yves Franceschi, Alain Rakotomamonjy, Patrick Gallinari. [doi]
- Monocular Scene Reconstruction with 3D SDF TransformersWeihao Yuan, Xiaodong Gu 0004, Heng Li, Zilong Dong, Siyu Zhu 0001. [doi]
- A Simple Approach for Visual Room Rearrangement: 3D Mapping and Semantic SearchBrandon Trabucco, Gunnar A. Sigurdsson, Robinson Piramuthu, Gaurav S. Sukhatme, Ruslan Salakhutdinov. [doi]
- Self-Distillation for Further Pre-training of TransformersSeanie Lee, Minki Kang, Juho Lee 0001, Sung Ju Hwang, Kenji Kawaguchi. [doi]
- Understanding the Generalization of Adam in Learning Neural Networks with Proper RegularizationDifan Zou, Yuan Cao 0006, Yuanzhi Li, Quanquan Gu. [doi]
- Test-Time Robust Personalization for Federated LearningLiangze Jiang, Tao Lin. [doi]
- Zeroth-Order Optimization with Trajectory-Informed Derivative EstimationYao Shu, Zhongxiang Dai, Weicong Sng, Arun Verma, Patrick Jaillet, Bryan Kian Hsiang Low. [doi]
- Self-Supervised Geometric Correspondence for Category-Level 6D Object Pose Estimation in the WildKaifeng Zhang, Yang Fu, Shubhankar Borse, Hong Cai, Fatih Porikli, Xiaolong Wang 0004. [doi]
- Implicit regularization in Heavy-ball momentum accelerated stochastic gradient descentAvrajit Ghosh, He Lyu, Xitong Zhang, Rongrong Wang. [doi]
- SE(3)-Equivariant Attention Networks for Shape Reconstruction in Function SpaceEvangelos Chatzipantazis, Stefanos Pertigkiozoglou, Edgar Dobriban, Kostas Daniilidis. [doi]
- Uniform-in-time propagation of chaos for the mean-field gradient Langevin dynamicsTaiji Suzuki, Atsushi Nitanda, Denny Wu. [doi]
- Learning Continuous Normalizing Flows For Faster Convergence To Target Distribution via Ascent RegularizationsShuangshuang Chen, Sihao Ding 0002, Yiannis Karayiannidis, Mårten Björkman. [doi]
- Learning About Progress From ExpertsJake Bruce, Ankit Anand, Bogdan Mazoure, Rob Fergus. [doi]
- Pseudoinverse-Guided Diffusion Models for Inverse ProblemsJiaming Song, Arash Vahdat, Morteza Mardani, Jan Kautz. [doi]
- SCoMoE: Efficient Mixtures of Experts with Structured CommunicationZhiyuan Zeng, Deyi Xiong. [doi]
- Emergence of Maps in the Memories of Blind Navigation AgentsErik Wijmans, Manolis Savva, Irfan Essa, Stefan Lee, Ari S. Morcos, Dhruv Batra. [doi]
- AudioGen: Textually Guided Audio GenerationFelix Kreuk, Gabriel Synnaeve, Adam Polyak, Uriel Singer, Alexandre Défossez, Jade Copet, Devi Parikh, Yaniv Taigman, Yossi Adi. [doi]
- Improved Sample Complexity for Reward-free Reinforcement Learning under Low-rank MDPsYuan Cheng, Ruiquan Huang, Yingbin Liang, Jing Yang 0002. [doi]
- Provable Defense Against Geometric TransformationsRem Yang, Jacob Laurel, Sasa Misailovic, Gagandeep Singh 0001. [doi]
- Planning with Large Language Models for Code GenerationShun Zhang, Zhenfang Chen, Yikang Shen, Mingyu Ding, Joshua B. Tenenbaum, Chuang Gan. [doi]
- Confidence Estimation Using Unlabeled DataChen Li, Xiaoling Hu, Chao Chen 0012. [doi]
- How gradient estimator variance and bias impact learning in neural networksArna Ghosh, Yuhan Helena Liu, Guillaume Lajoie, Konrad P. Körding, Blake Aaron Richards. [doi]
- BigVGAN: A Universal Neural Vocoder with Large-Scale TrainingSang Gil Lee, Wei Ping, Boris Ginsburg, Bryan Catanzaro, Sungroh Yoon. [doi]
- Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement LearningMhairi Dunion, Trevor McInroe, Kevin Sebastian Luck, Josiah P. Hanna, Stefano V. Albrecht. [doi]
- E3Bind: An End-to-End Equivariant Network for Protein-Ligand DockingYangtian Zhang, Huiyu Cai, Chence Shi, Jian Tang 0005. [doi]
- S-NeRF: Neural Radiance Fields for Street ViewsZiyang Xie, Junge Zhang, Wenye Li 0002, Feihu Zhang, Li Zhang 0001. [doi]
- Accurate Bayesian Meta-Learning by Accurate Task Posterior InferenceMichael Volpp, Philipp Dahlinger, Philipp Becker, Christian Daniel, Gerhard Neumann. [doi]
- Delving into Semantic Scale ImbalanceYanbiao Ma, Licheng Jiao, Fang Liu 0001, Yuxin Li, Shuyuan Yang, Xu Liu 0006. [doi]
- Critic Sequential Monte CarloVasileios Lioutas, Jonathan Wilder Lavington, Justice Sefas, Matthew Niedoba, Yunpeng Liu 0007, Berend Zwartsenberg, Setareh Dabiri, Frank Wood, Adam Scibior. [doi]
- Lossless Adaptation of Pretrained Vision Models For Robotic ManipulationMohit Sharma, Claudio Fantacci, Yuxiang Zhou, Skanda Koppula, Nicolas Heess, Jon Scholz, Yusuf Aytar. [doi]
- -1 Convergence of Optimistic-Follow-the-Regularized-Leader in Two-Player Zero-Sum Markov GamesYuepeng Yang, Cong Ma. [doi]
- Faster federated optimization under second-order similarityAhmed Khaled 0001, Chi Jin 0001. [doi]
- FedFA: Federated Feature AugmentationTianfei Zhou, Ender Konukoglu. [doi]
- A Control-Centric Benchmark for Video PredictionStephen Tian, Chelsea Finn, Jiajun Wu 0001. [doi]
- StyleMorph: Disentangled 3D-Aware Image Synthesis with a 3D Morphable StyleGANEric-Tuan Le, Edward Bartrum, Iasonas Kokkinos. [doi]
- Feature selection and low test error in shallow low-rotation ReLU networksMatus Telgarsky. [doi]
- Self-Guided Noise-Free Data Generation for Efficient Zero-Shot LearningJiahui Gao, Renjie Pi, Yong Lin, Hang Xu, Jiacheng Ye, Zhiyong Wu 0003, Weizhong Zhang, Xiaodan Liang, Zhenguo Li, Lingpeng Kong. [doi]
- Novel View Synthesis with Diffusion ModelsDaniel Watson, William Chan, Ricardo Martin-Brualla, Jonathan Ho, Andrea Tagliasacchi, Mohammad Norouzi 0002. [doi]
- Toeplitz Neural Network for Sequence ModelingZhen Qin, Xiaodong Han, Weixuan Sun, Bowen He, Dong Li, Dongxu Li, Yuchao Dai, Lingpeng Kong, Yiran Zhong. [doi]
- Error Sensitivity Modulation based Experience Replay: Mitigating Abrupt Representation Drift in Continual LearningFahad Sarfraz, Elahe Arani, Bahram Zonooz. [doi]
- Iterative Patch Selection for High-Resolution Image RecognitionBenjamin Bergner, Christoph Lippert, Aravindh Mahendran. [doi]
- Latent Bottlenecked Attentive Neural ProcessesLeo Feng, Hossein Hajimirsadeghi, Yoshua Bengio, Mohamed Osama Ahmed. [doi]
- Offline Congestion Games: How Feedback Type Affects Data Coverage RequirementHaozhe Jiang, Qiwen Cui, Zhihan Xiong, Maryam Fazel, Simon Shaolei Du. [doi]
- Conservative Bayesian Model-Based Value Expansion for Offline Policy OptimizationJihwan Jeong, Xiaoyu Wang, Michael Gimelfarb, Hyunwoo Kim, Baher Abdulhai, Scott Sanner. [doi]
- Treeformer: Dense Gradient Trees for Efficient Attention ComputationLovish Madaan, Srinadh Bhojanapalli, Himanshu Jain, Prateek Jain 0002. [doi]
- Approximate Bayesian Inference with Stein Functional Variational Gradient DescentTobias Pielok, Bernd Bischl, David Rügamer. [doi]
- NANSY++: Unified Voice Synthesis with Neural Analysis and SynthesisHyeong-Seok Choi, Jinhyeok Yang, Juheon Lee, Hyeongju Kim. [doi]
- Agnostic Learning of General ReLU Activation Using Gradient DescentPranjal Awasthi, Alex Tang, Aravindan Vijayaraghavan. [doi]
- RandProx: Primal-Dual Optimization Algorithms with Randomized Proximal UpdatesLaurent Condat, Peter Richtárik. [doi]
- Generate rather than Retrieve: Large Language Models are Strong Context GeneratorsWenhao Yu 0002, Dan Iter, Shuohang Wang, Yichong Xu, Mingxuan Ju, Soumya Sanyal 0001, Chenguang Zhu 0001, Michael Zeng 0001, Meng Jiang 0001. [doi]
- Decomposed Prompting: A Modular Approach for Solving Complex TasksTushar Khot, Harsh Trivedi, Matthew Finlayson, Yao Fu, Kyle Richardson 0001, Peter Clark, Ashish Sabharwal. [doi]
- Hybrid RL: Using both offline and online data can make RL efficientYuda Song 0001, Yifei Zhou, Ayush Sekhari, Drew Bagnell, Akshay Krishnamurthy, Wen Sun 0002. [doi]
- Synthetic Data Generation of Many-to-Many Datasets via Random Graph GenerationKai Xu, Georgi Ganev, Emile Joubert, Rees Davison, Olivier Van Acker, Luke Robinson. [doi]
- What shapes the loss landscape of self supervised learning?Ziyin Liu, Ekdeep Singh Lubana, Masahito Ueda, Hidenori Tanaka. [doi]
- Riemannian Metric Learning via Optimal TransportChristopher Scarvelis, Justin Solomon 0001. [doi]
- Learning to Estimate Shapley Values with Vision TransformersIan Connick Covert, Chanwoo Kim 0002, Su-In Lee. [doi]
- Human-level Atari 200x fasterSteven Kapturowski, Victor Campos 0001, Ray Jiang, Nemanja Rakicevic, Hado van Hasselt, Charles Blundell, Adrià Puigdomènech Badia. [doi]
- Equivariance-aware Architectural Optimization of Neural NetworksKaitlin Maile, Dennis George Wilson, Patrick Forré. [doi]
- Confidential-PROFITT: Confidential PROof of FaIr Training of TreesAli Shahin Shamsabadi, Sierra Calanda Wyllie, Nicholas Franzese, Natalie Dullerud, Sébastien Gambs, Nicolas Papernot, Xiao Wang 0012, Adrian Weller. [doi]
- Visual Classification via Description from Large Language ModelsSachit Menon, Carl Vondrick. [doi]
- Projective Proximal Gradient Descent for Nonconvex Nonsmooth Optimization: Fast Convergence Without Kurdyka-Lojasiewicz (KL) PropertyYingzhen Yang, Ping Li 0001. [doi]
- Multi-lingual Evaluation of Code Generation ModelsBen Athiwaratkun, Sanjay Krishna Gouda, Zijian Wang, Xiaopeng Li, Yuchen Tian, Ming Tan, Wasi Uddin Ahmad, Shiqi Wang 0002, Qing Sun, Mingyue Shang, Sujan Kumar Gonugondla, Hantian Ding, Varun Kumar, Nathan Fulton, Arash Farahani, Siddhartha Jain 0001, Robert Giaquinto, Haifeng Qian, Murali Krishna Ramanathan, Ramesh Nallapati. [doi]
- Supervision Complexity and its Role in Knowledge DistillationHrayr Harutyunyan, Ankit Singh Rawat, Aditya Krishna Menon, Seungyeon Kim, Sanjiv Kumar. [doi]
- Estimating individual treatment effects under unobserved confounding using binary instrumentsDennis Frauen, Stefan Feuerriegel. [doi]
- Dual Student Networks for Data-Free Model StealingJames Beetham, Navid Kardan, Ajmal Saeed Mian, Mubarak Shah. [doi]
- The Best of Both Worlds: Accurate Global and Personalized Models through Federated Learning with Data-Free Hyper-Knowledge DistillationHuancheng Chen, Chaining Wang, Haris Vikalo. [doi]
- A framework for benchmarking Class-out-of-distribution detection and its application to ImageNetIdo Galil, Mohammed Dabbah, Ran El-Yaniv. [doi]
- Learning Hyper Label Model for Programmatic Weak SupervisionRenzhi Wu, Shen-En Chen, Jieyu Zhang, Xu Chu. [doi]
- Associative Memory Augmented Asynchronous Spatiotemporal Representation Learning for Event-based PerceptionUday Kamal, Saurabh Dash, Saibal Mukhopadhyay. [doi]
- Truthful Self-PlayShohei Ohsawa. [doi]
- PowerQuant: Automorphism Search for Non-Uniform QuantizationEdouard Yvinec, Arnaud Dapogny, Matthieu Cord, Kevin Bailly. [doi]
- Diminishing Return of Value Expansion Methods in Model-Based Reinforcement LearningDaniel Palenicek, Michael Lutter, Joao Carvalho, Jan Peters 0001. [doi]
- TabPFN: A Transformer That Solves Small Tabular Classification Problems in a SecondNoah Hollmann, Samuel Müller 0005, Katharina Eggensperger, Frank Hutter. [doi]
- TiAda: A Time-scale Adaptive Algorithm for Nonconvex Minimax OptimizationXiang Li, Junchi Yang, Niao He. [doi]
- Graph Neural Network-Inspired Kernels for Gaussian Processes in Semi-Supervised LearningZehao Niu, Mihai Anitescu, Jie Chen 0007. [doi]
- Wasserstein Auto-encoded MDPs: Formal Verification of Efficiently Distilled RL Policies with Many-sided GuaranteesFlorent Delgrange, Ann Nowé, Guillermo A. Pérez 0001. [doi]
- A General Framework For Proving The Equivariant Strong Lottery Ticket HypothesisDamien Ferbach, Christos Tsirigotis, Gauthier Gidel, Avishek Joey Bose. [doi]
- The Onset of Variance-Limited Behavior for Networks in the Lazy and Rich RegimesAlexander Atanasov, Blake Bordelon, Sabarish Sainathan, Cengiz Pehlevan. [doi]
- DocPrompting: Generating Code by Retrieving the DocsShuyan Zhou, Uri Alon 0002, Frank F. Xu, Zhengbao Jiang, Graham Neubig. [doi]
- Sound Randomized Smoothing in Floating-Point ArithmeticVáclav Vorácek, Matthias Hein 0001. [doi]
- Preserving Pre-trained Features Helps Calibrate Fine-tuned Language ModelsGuande He, Jianfei Chen, Jun Zhu. [doi]
- GLM-130B: An Open Bilingual Pre-trained ModelAohan Zeng, Xiao Liu, Zhengxiao Du, Zihan Wang, Hanyu Lai, Ming Ding 0004, Zhuoyi Yang, Yifan Xu, Wendi Zheng, Xiao Xia, Weng Lam Tam, Zixuan Ma, Yufei Xue, Jidong Zhai, Wenguang Chen, Zhiyuan Liu, Peng Zhang, Yuxiao Dong, Jie Tang 0001. [doi]
- Disentangling Learning Representations with Density EstimationEric C. Yeats, Frank Y. Liu, Hai Li. [doi]
- Towards Understanding and Mitigating Dimensional Collapse in Heterogeneous Federated LearningYujun Shi, Jian Liang, Wenqing Zhang, Vincent Y. F. Tan, Song Bai. [doi]
- Learning topology-preserving data representationsIlya Trofimov, Daniil Cherniavskii, Eduard Tulchinskii, Nikita Balabin, Evgeny Burnaev, Serguei Barannikov. [doi]
- Fundamental Limits in Formal Verification of Message-Passing Neural NetworksMarco Sälzer, Martin Lange. [doi]
- Quasi-optimal Reinforcement Learning with Continuous ActionsYuhan Li, Wenzhuo Zhou, Ruoqing Zhu. [doi]
- Generalizing and Decoupling Neural Collapse via Hyperspherical Uniformity GapWeiyang Liu, Longhui Yu, Adrian Weller, Bernhard Schölkopf. [doi]
- Accurate Neural Training with 4-bit Matrix Multiplications at Standard FormatsBrian Chmiel, Ron Banner, Elad Hoffer, Hilla Ben-Yaacov, Daniel Soudry. [doi]
- Evidential Uncertainty and Diversity Guided Active Learning for Scene Graph GenerationShuzhou Sun, Shuaifeng Zhi, Janne Heikkilä, Li Liu. [doi]
- Dilated convolution with learnable spacingsIsmail Khalfaoui Hassani, Thomas Pellegrini, Timothée Masquelier. [doi]
- Minimum Description Length ControlTed Moskovitz, Ta-Chu Kao, Maneesh Sahani, Matt M. Botvinick. [doi]
- Interaction-Based Disentanglement of Entities for Object-Centric World ModelsAkihiro Nakano, Masahiro Suzuki, Yutaka Matsuo. [doi]
- Liquid Structural State-Space ModelsRamin M. Hasani, Mathias Lechner, Tsun-Hsuan Wang, Makram Chahine, Alexander Amini, Daniela Rus. [doi]
- Integrating Symmetry into Differentiable Planning with Steerable ConvolutionsLinfeng Zhao, Xupeng Zhu, Lingzhi Kong, Robin Walters, Lawson L. S. Wong. [doi]
- NTK-SAP: Improving neural network pruning by aligning training dynamicsYite Wang, Dawei Li, Ruoyu Sun 0001. [doi]
- Guiding continuous operator learning through Physics-based boundary constraintsNadim Saad, Gaurav Gupta, Shima Alizadeh, Danielle C. Maddix. [doi]
- Real-time variational method for learning neural trajectory and its dynamicsMatthew Dowling, Yuan Zhao 0004, Il Memming Park. [doi]
- Sequential Latent Variable Models for Few-Shot High-Dimensional Time-Series ForecastingXiajun Jiang, Ryan Missel, Zhiyuan Li, Linwei Wang. [doi]
- Spatial Attention Kinetic Networks with E(n)-EquivarianceYuanqing Wang, John D. Chodera. [doi]
- Transformer-based model for symbolic regression via joint supervised learningWenqiang Li, Weijun Li, Linjun Sun, Min Wu, Lina Yu, Jingyi Liu, Yanjie Li, Songsong Tian. [doi]
- WiNeRT: Towards Neural Ray Tracing for Wireless Channel Modelling and Differentiable SimulationsTribhuvanesh Orekondy, Kumar Pratik, Shreya Kadambi, Hao Ye, Joseph Soriaga, Arash Behboodi. [doi]
- Timing is Everything: Learning to Act Selectively with Costly Actions and Budgetary ConstraintsDavid Henry Mguni, Aivar Sootla, Juliusz Ziomek, Oliver Slumbers, Zipeng Dai, Kun Shao, Jun Wang 0012. [doi]
- Self-Stabilization: The Implicit Bias of Gradient Descent at the Edge of StabilityAlex Damian, Eshaan Nichani, Jason D. Lee. [doi]
- MICN: Multi-scale Local and Global Context Modeling for Long-term Series ForecastingHuiqiang Wang, Jian Peng 0002, Feihu Huang, Jince Wang, Junhui Chen, Yifei Xiao. [doi]
- How Informative is the Approximation Error from Tensor Decomposition for Neural Network Compression?Jetze Schuurmans, Kim Batselier, Julian F. P. Kooij. [doi]
- Nonlinear Reconstruction for Operator Learning of PDEs with DiscontinuitiesSamuel Lanthaler, Roberto Molinaro, Patrik Hadorn, Siddhartha Mishra. [doi]
- Incremental Learning of Structured Memory via Closed-Loop TranscriptionShengbang Tong, Xili Dai, Ziyang Wu, Mingyang Li, Brent Yi, Yi Ma 0001. [doi]
- TEMPERA: Test-Time Prompt Editing via Reinforcement LearningTianjun Zhang, Xuezhi Wang 0002, Denny Zhou, Dale Schuurmans, Joseph E. Gonzalez. [doi]
- PGrad: Learning Principal Gradients For Domain GeneralizationZhe Wang 0025, Jake Grigsby, Yanjun Qi. [doi]
- Hungry Hungry Hippos: Towards Language Modeling with State Space ModelsDaniel Y. Fu, Tri Dao, Khaled Kamal Saab, Armin W. Thomas, Atri Rudra, Christopher Ré. [doi]
- Real-Time Image Demoiréing on Mobile DevicesYuxin Zhang 0002, Mingbao Lin, Xunchao Li, Han Liu, Guozhi Wang, Fei Chao 0001, Shuai Ren, Yafei Wen, Xiaoxin Chen, Rongrong Ji. [doi]
- Vision Transformer Adapter for Dense PredictionsZhe Chen, Yuchen Duan, Wenhai Wang, Junjun He, Tong Lu, Jifeng Dai, Yu Qiao. [doi]
- From Play to Policy: Conditional Behavior Generation from Uncurated Robot DataZichen Jeff Cui, Yibin Wang, Nur Muhammad (Mahi) Shafiullah, Lerrel Pinto. [doi]
- Adaptive Optimization in the ∞-Width LimitEtai Littwin, Greg Yang. [doi]
- Excess Risk of Two-Layer ReLU Neural Networks in Teacher-Student Settings and its Superiority to Kernel MethodsShunta Akiyama, Taiji Suzuki. [doi]
- Multifactor Sequential Disentanglement via Structured Koopman AutoencodersNimrod Berman, Ilan Naiman, Omri Azencot. [doi]
- Spectral Decomposition Representation for Reinforcement LearningTongzheng Ren, Tianjun Zhang, Lisa Lee, Joseph E. Gonzalez, Dale Schuurmans, Bo Dai 0001. [doi]
- On The Specialization of Neural ModulesDevon Jarvis, Richard Klein, Benjamin Rosman, Andrew M. Saxe. [doi]
- TVSPrune - Pruning Non-discriminative filters via Total Variation separability of intermediate representations without fine tuningChaitanya Murti, Tanay Narshana, Chiranjib Bhattacharyya. [doi]
- Hyper-Decision Transformer for Efficient Online Policy AdaptationMengdi Xu, Yuchen Lu, Yikang Shen, Shun Zhang, Ding Zhao, Chuang Gan. [doi]
- Open-Vocabulary Object Detection upon Frozen Vision and Language ModelsWeicheng Kuo, Yin Cui, Xiuye Gu, A. J. Piergiovanni, Anelia Angelova. [doi]
- Binding Language Models in Symbolic LanguagesZhoujun Cheng, Tianbao Xie, Peng Shi 0010, Chengzu Li, Rahul Nadkarni, Yushi Hu, Caiming Xiong, Dragomir Radev, Mari Ostendorf, Luke Zettlemoyer, Noah A. Smith, Tao Yu 0009. [doi]
- Approximation and non-parametric estimation of functions over high-dimensional spheres via deep ReLU networksNamjoon Suh, Tian-Yi Zhou, Xiaoming Huo. [doi]
- SIMPLE: A Gradient Estimator for k-Subset SamplingKareem Ahmed, Zhe Zeng, Mathias Niepert, Guy Van den Broeck. [doi]
- Backstepping Temporal Difference LearningHan-Dong Lim, Donghwan Lee 0002. [doi]
- Federated Neural BanditsZhongxiang Dai, Yao Shu, Arun Verma, Flint Xiaofeng Fan, Bryan Kian Hsiang Low, Patrick Jaillet. [doi]
- On Compositional Uncertainty Quantification for Seq2seq Graph ParsingZi Lin, Du Phan, Panupong Pasupat, Jeremiah Zhe Liu, Jingbo Shang. [doi]
- Learning to Compose Soft Prompts for Compositional Zero-Shot LearningNihal V. Nayak, Peilin Yu, Stephen H. Bach. [doi]
- Bridging the Gap between ANNs and SNNs by Calibrating Offset SpikesZecheng Hao, Jianhao Ding, Tong Bu, Tiejun Huang 0001, Zhaofei Yu. [doi]
- On the Perils of Cascading Robust ClassifiersRavi Mangal, Zifan Wang, Chi Zhang, Klas Leino, Corina S. Pasareanu, Matt Fredrikson. [doi]
- Image to Sphere: Learning Equivariant Features for Efficient Pose PredictionDavid Klee, Ondrej Biza, Robert Platt, Robin Walters. [doi]
- Generalization Bounds for Federated Learning: Fast Rates, Unparticipating Clients and Unbounded LossesXiaolin Hu, Shaojie Li, Yong Liu. [doi]
- Does Deep Learning Learn to Abstract? A Systematic Probing FrameworkShengnan An, Zeqi Lin, Bei Chen, Qiang Fu, Nanning Zheng 0001, Jian-Guang Lou. [doi]
- FoSR: First-order spectral rewiring for addressing oversquashing in GNNsKedar Karhadkar, Pradeep Kr. Banerjee, Guido Montúfar. [doi]
- Graph Contrastive Learning for Skeleton-based Action RecognitionXiaohu Huang, Hao Zhou, Jian Wang, Haocheng Feng, Junyu Han, Errui Ding, Jingdong Wang 0001, Xinggang Wang, Wenyu Liu 0001, Bin Feng 0001. [doi]
- Quantifying Memorization Across Neural Language ModelsNicholas Carlini, Daphne Ippolito, Matthew Jagielski, Katherine Lee, Florian Tramèr, Chiyuan Zhang. [doi]
- Self-supervision through Random Segments with Autoregressive Coding (RandSAC)Tianyu Hua, Yonglong Tian, Sucheng Ren, Michalis Raptis, Hang Zhao, Leonid Sigal. [doi]
- Graph-based Deterministic Policy Gradient for Repetitive Combinatorial Optimization ProblemsZhongyuan Zhao 0002, Ananthram Swami, Santiago Segarra. [doi]
- Learning Probabilistic Topological Representations Using Discrete Morse TheoryXiaoling Hu 0002, Dimitris Samaras, Chao Chen 0012. [doi]
- Discovering Informative and Robust Positives for Video Domain AdaptationChang Liu 0022, Kunpeng Li, Michael Stopa, Jun Amano, Yun Fu 0001. [doi]
- Sampling-free Inference for Ab-Initio Potential Energy Surface NetworksNicholas Gao, Stephan Günnemann. [doi]
- Copy is All You NeedTian Lan, Deng Cai 0002, Yan Wang, Heyan Huang, Xian-Ling Mao. [doi]
- Performance Bounds for Model and Policy Transfer in Hidden-parameter MDPsHaotian Fu, Jiayu Yao, Omer Gottesman, Finale Doshi-Velez, George Konidaris 0001. [doi]
- DiGress: Discrete Denoising diffusion for graph generationClément Vignac, Igor Krawczuk, Antoine Siraudin, Bohan Wang, Volkan Cevher, Pascal Frossard. [doi]
- Neuromechanical Autoencoders: Learning to Couple Elastic and Neural Network NonlinearityDeniz Oktay, Mehran Mirramezani, Eder Medina, Ryan P. Adams. [doi]
- Transformer-based World Models Are Happy With 100k InteractionsJan Robine, Marc Höftmann, Tobias Uelwer, Stefan Harmeling. [doi]
- Sub-Task Decomposition Enables Learning in Sequence to Sequence TasksNoam Wies, Yoav Levine, Amnon Shashua. [doi]
- Towards Inferential Reproducibility of Machine Learning ResearchMichael Hagmann, Philipp Meier, Stefan Riezler. [doi]
- Faster Gradient-Free Methods for Escaping Saddle PointsHualin Zhang, Bin Gu 0001. [doi]
- Backpropagation through Combinatorial Algorithms: Identity with Projection WorksSubham Sekhar Sahoo, Anselm Paulus, Marin Vlastelica, Vít Musil, Volodymyr Kuleshov, Georg Martius. [doi]
- CUDA: Curriculum of Data Augmentation for Long-tailed RecognitionSumyeong Ahn, Jongwoo Ko, Se-Young Yun. [doi]
- Asymptotic Instance-Optimal Algorithms for Interactive Decision MakingKefan Dong, Tengyu Ma 0001. [doi]
- Anti-Symmetric DGN: a stable architecture for Deep Graph NetworksAlessio Gravina, Davide Bacciu, Claudio Gallicchio. [doi]
- Sequential Gradient Coding For Straggler MitigationMuralee Nikhil Krishnan, MohammadReza Ebrahimi, Ashish J. Khisti. [doi]
- Long Range Language Modeling via Gated State SpacesHarsh Mehta, Ankit Gupta 0001, Ashok Cutkosky, Behnam Neyshabur. [doi]
- Learning Diffusion Bridges on Constrained DomainsXingchao Liu, Lemeng Wu, Mao Ye 0006, Qiang Liu. [doi]
- Moderate Coreset: A Universal Method of Data Selection for Real-world Data-efficient Deep LearningXiaobo Xia, Jiale Liu, Jun Yu 0001, Xu Shen, Bo Han 0003, Tongliang Liu. [doi]
- One-Pixel Shortcut: On the Learning Preference of Deep Neural NetworksShutong Wu, Sizhe Chen, Cihang Xie, Xiaolin Huang. [doi]
- Spotlight: Mobile UI Understanding using Vision-Language Models with a FocusGang Li, Yang Li. [doi]
- First Steps Toward Understanding the Extrapolation of Nonlinear Models to Unseen DomainsKefan Dong, Tengyu Ma 0001. [doi]
- Particle-based Variational Inference with Preconditioned Functional Gradient FlowHanze Dong, Xi Wang, Yong Lin, Tong Zhang 0001. [doi]
- EPISODE: Episodic Gradient Clipping with Periodic Resampled Corrections for Federated Learning with Heterogeneous DataMichael Crawshaw, Yajie Bao, Mingrui Liu. [doi]
- Model ensemble instead of prompt fusion: a sample-specific knowledge transfer method for few-shot prompt tuningXiangyu Peng, Chen Xing, Prafulla Kumar Choubey, Chien-Sheng Wu, Caiming Xiong. [doi]
- GReTo: Remedying dynamic graph topology-task discordance via target homophilyZhengyang Zhou, Qihe Huang, Gengyu Lin, Yang Kuo, Lei Bai 0001, Yang Wang 0015. [doi]
- TrojText: Test-time Invisible Textual Trojan InsertionQian Lou, Yepeng Liu, Bo Feng. [doi]
- Understanding Influence Functions and Datamodels via Harmonic AnalysisNikunj Saunshi, Arushi Gupta, Mark Braverman, Sanjeev Arora. [doi]
- The Modality Focusing Hypothesis: Towards Understanding Crossmodal Knowledge DistillationZihui Xue, Zhengqi Gao, Sucheng Ren, Hang Zhao. [doi]
- When and Why Vision-Language Models Behave like Bags-Of-Words, and What to Do About It?Mert Yüksekgönül, Federico Bianchi 0001, Pratyusha Kalluri, Dan Jurafsky, James Zou 0001. [doi]
- Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical ReasoningPan Lu, Liang Qiu 0001, Kai-Wei Chang, Ying Nian Wu, Song Chun Zhu, Tanmay Rajpurohit, Peter Clark, Ashwin Kalyan. [doi]
- MIMT: Masked Image Modeling Transformer for Video CompressionJinxi Xiang, Kuan Tian, Jun Zhang. [doi]
- How to Train your HIPPO: State Space Models with Generalized Orthogonal Basis ProjectionsAlbert Gu, Isys Johnson, Aman Timalsina, Atri Rudra, Christopher Ré. [doi]
- Neural DAG Scheduling via One-Shot Priority SamplingWonseok Jeon, Mukul Gagrani, Burak Bartan, Weiliang Will Zeng, Harris Teague, Piero Zappi, Christopher Lott. [doi]
- Universal Vision-Language Dense Retrieval: Learning A Unified Representation Space for Multi-Modal RetrievalZhenghao Liu, Chenyan Xiong, Yuanhuiyi Lv, Zhiyuan Liu 0001, Ge Yu 0001. [doi]
- Is Conditional Generative Modeling all you need for Decision Making?Anurag Ajay, Yilun Du, Abhi Gupta, Joshua B. Tenenbaum, Tommi S. Jaakkola, Pulkit Agrawal. [doi]
- Learning without Prejudices: Continual Unbiased Learning via Benign and Malignant ForgettingMyeongho Jeon, Hyoje Lee, Yedarm Seong, Myungjoo Kang. [doi]
- Switch-NeRF: Learning Scene Decomposition with Mixture of Experts for Large-scale Neural Radiance FieldsZhenxing Mi, Dan Xu 0002. [doi]
- MultiViz: Towards Visualizing and Understanding Multimodal ModelsPaul Pu Liang, Yiwei Lyu, Gunjan Chhablani, Nihal Jain, Zihao Deng, Xingbo Wang 0001, Louis-Philippe Morency, Ruslan Salakhutdinov. [doi]
- Multi-domain image generation and translation with identifiability guaranteesShaoan Xie, Lingjing Kong, Mingming Gong, Kun Zhang 0001. [doi]
- On the Robustness of Safe Reinforcement Learning under Observational PerturbationsZuxin Liu, Zijian Guo, Zhepeng Cen, Huan Zhang, Jie Tan, Bo Li 0026, Ding Zhao. [doi]
- Neural Optimal TransportAlexander Korotin, Daniil Selikhanovych, Evgeny Burnaev. [doi]
- DecAF: Joint Decoding of Answers and Logical Forms for Question Answering over Knowledge BasesDonghan Yu, Sheng Zhang, Patrick Ng, Henghui Zhu, Alexander Hanbo Li, Jun Wang 0122, Yiqun Hu, William Yang Wang, Zhiguo Wang, Bing Xiang. [doi]
- Hyperbolic Self-paced Learning for Self-supervised Skeleton-based Action RepresentationsLuca Franco, Paolo Mandica, Bharti Munjal, Fabio Galasso. [doi]
- Learning to Grow Pretrained Models for Efficient Transformer TrainingPeihao Wang, Rameswar Panda, Lucas Torroba Hennigen, Philip Greengard, Leonid Karlinsky, Rogério Feris, David Daniel Cox, Zhangyang Wang, Yoon Kim. [doi]
- Quantized Compressed Sensing with Score-Based Generative ModelsXiangming Meng, Yoshiyuki Kabashima. [doi]
- AANG : Automating Auxiliary LearningLucio M. Dery, Paul Michel, Mikhail Khodak, Graham Neubig, Ameet Talwalkar. [doi]
- Understanding Embodied Reference with Touch-Line TransformerYang Li 0178, Xiaoxue Chen, Hao Zhao 0002, Jiangtao Gong, Guyue Zhou, Federico Rossano, Yixin Zhu. [doi]
- SP2 : A Second Order Stochastic Polyak MethodShuang Li 0003, William J. Swartworth, Martin Takác 0001, Deanna Needell, Robert M. Gower. [doi]
- LexMAE: Lexicon-Bottlenecked Pretraining for Large-Scale RetrievalTao Shen 0001, Xiubo Geng, Chongyang Tao, Can Xu, Xiaolong Huang, Binxing Jiao, Linjun Yang, Daxin Jiang. [doi]
- Consolidator: Mergable Adapter with Group Connections for Visual AdaptationTianxiang Hao, Hui Chen 0013, Yuchen Guo, Guiguang Ding. [doi]
- HiT-MDP: Learning the SMDP option framework on MDPs with Hidden Temporal EmbeddingsChang Li, Dongjin Song, Dacheng Tao. [doi]
- Improving Out-of-distribution Generalization with Indirection RepresentationsKha Pham, Hung Le, Man Ngo, Truyen Tran 0001. [doi]
- Discovering Evolution Strategies via Meta-Black-Box OptimizationRobert Tjarko Lange, Tom Schaul, Yutian Chen, Tom Zahavy, Valentin Dalibard, Chris Lu 0001, Satinder Singh 0001, Sebastian Flennerhag. [doi]
- BALTO: fast tensor program optimization with diversity-based active learningJun Bi, Xiaqing Li, Qi Guo 0001, Rui Zhang 0040, Yuanbo Wen, Xing Hu 0001, Zidong Du, Xinkai Song, Yifan Hao, Yunji Chen. [doi]
- Explaining Temporal Graph Models through an Explorer-Navigator FrameworkWenwen Xia, Mincai Lai, Caihua Shan, Yao Zhang 0009, Xinnan Dai, Xiang Li 0067, Dongsheng Li 0002. [doi]
- PV3D: A 3D Generative Model for Portrait Video GenerationEric Zhongcong Xu, Jianfeng Zhang, Jun Hao Liew, Wenqing Zhang, Song Bai, Jiashi Feng, Mike Zheng Shou. [doi]
- Energy-Inspired Self-Supervised Pretraining for Vision ModelsZe Wang, Jiang Wang, Zicheng Liu 0001, Qiang Qiu. [doi]
- Structure by Architecture: Structured Representations without RegularizationFelix Leeb, Giulia Lanzillotta, Yashas Annadani, Michel Besserve, Stefan Bauer, Bernhard Schölkopf. [doi]
- ∞-adversarial training, and its unrealized threatsRanjie Duan, Yuefeng Chen, Yao Zhu, Xiaojun Jia, Rong Zhang, Hui Xue 0001. [doi]
- Prompting GPT-3 To Be ReliableChenglei Si, Zhe Gan, Zhengyuan Yang, Shuohang Wang, Jianfeng Wang, Jordan L. Boyd-Graber, Lijuan Wang. [doi]
- Empowering Networks With Scale and Rotation Equivariance Using A Similarity ConvolutionZikai Sun, Thierry Blu. [doi]
- GAIN: On the Generalization of Instructional Action UnderstandingJunlong Li, Guangyi Chen 0002, Yansong Tang, Jinan Bao, Kun Zhang, Jie Zhou 0001, Jiwen Lu. [doi]
- Can We Faithfully Represent Absence States to Compute Shapley Values on a DNN?Jie Ren 0018, Zhanpeng Zhou, Qirui Chen, Quanshi Zhang. [doi]
- Short-Term Memory ConvolutionsGrzegorz Stefanski, Krzysztof Arendt, Pawel Daniluk, Bartlomiej Jasik, Artur Szumaczuk. [doi]
- Building a Subspace of Policies for Scalable Continual LearningJean-Baptiste Gaya, Thang Doan, Lucas Caccia, Laure Soulier, Ludovic Denoyer, Roberta Raileanu. [doi]
- RLx2: Training a Sparse Deep Reinforcement Learning Model from ScratchYiqin Tan, Pihe Hu, Ling Pan, Jiatai Huang, Longbo Huang. [doi]
- Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and PlanningAnton Bakhtin, David J. Wu 0002, Adam Lerer, Jonathan Gray, Athul Paul Jacob, Gabriele Farina, Alexander H. Miller, Noam Brown. [doi]
- Lower Bounds on the Depth of Integral ReLU Neural Networks via Lattice PolytopesChristian Haase 0001, Christoph Hertrich, Georg Loho. [doi]
- Provable Robustness against Wasserstein Distribution Shifts via Input RandomizationAounon Kumar, Alexander Levine 0001, Tom Goldstein, Soheil Feizi. [doi]
- Learning Fair Graph Representations via Automated Data AugmentationsHongyi Ling, Zhimeng Jiang, Youzhi Luo, Shuiwang Ji, Na Zou. [doi]
- Breaking Correlation Shift via Conditional Invariant RegularizerMingyang Yi, Ruoyu Wang 0016, Jiacheng Sun, Zhenguo Li, Zhi-Ming Ma. [doi]
- Diffusion Posterior Sampling for General Noisy Inverse ProblemsHyungjin Chung, Jeongsol Kim, Michael Thompson McCann, Marc Louis Klasky, Jong Chul Ye. [doi]
- Phase transition for detecting a small community in a large networkJiashun Jin, Zheng Tracy Ke, Paxton Turner, Anru Zhang. [doi]
- DiffMimic: Efficient Motion Mimicking with Differentiable PhysicsJiawei Ren, Cunjun Yu, Siwei Chen, Xiao Ma 0006, Liang Pan, Ziwei Liu 0002. [doi]
- Analog Bits: Generating Discrete Data using Diffusion Models with Self-ConditioningTing Chen, Ruixiang Zhang, Geoffrey E. Hinton. [doi]
- Adversarial Imitation Learning with PreferencesAleksandar Taranovic, Andras Gabor Kupcsik, Niklas Freymuth, Gerhard Neumann. [doi]
- Bag of Tricks for Unsupervised Text-to-SpeechYi Ren 0006, Chen Zhang 0020, Shuicheng Yan. [doi]
- Hebbian and Gradient-based Plasticity Enables Robust Memory and Rapid Learning in RNNsYu Duan, Zhongfan Jia, Qian Li, Yi Zhong, Kaisheng Ma. [doi]
- SAM as an Optimal Relaxation of BayesThomas Möllenhoff, Mohammad Emtiyaz Khan. [doi]
- Faster Last-iterate Convergence of Policy Optimization in Zero-Sum Markov GamesShicong Cen, Yuejie Chi, Simon Shaolei Du, Lin Xiao. [doi]
- Verifying the Union of Manifolds Hypothesis for Image DataBradley C. A. Brown, Anthony L. Caterini, Brendan Leigh Ross, Jesse C. Cresswell, Gabriel Loaiza-Ganem. [doi]
- Learning with Auxiliary Activation for Memory-Efficient TrainingSunghyeon Woo, Dongsuk Jeon. [doi]
- Rethinking Graph Lottery Tickets: Graph Sparsity MattersBo Hui, Da Yan 0001, Xiaolong Ma, Wei-Shinn Ku. [doi]
- Mitigating Dataset Bias by Using Per-Sample GradientSumyeong Ahn, Seongyoon Kim, Se-Young Yun. [doi]
- Learning Object-Language Alignments for Open-Vocabulary Object DetectionChuang Lin, Peize Sun, Yi Jiang, Ping Luo 0002, Lizhen Qu, Gholamreza Haffari, Zehuan Yuan, Jianfei Cai 0001. [doi]
- Improving Deep Regression with Ordinal EntropyShihao Zhang, Linlin Yang, Michael Bi Mi, Xiaoxu Zheng, Angela Yao. [doi]
- PatchDCT: Patch Refinement for High Quality Instance SegmentationQinrou Wen, Jirui Yang, Xue Yang 0005, Kewei Liang. [doi]
- Differentially Private Adaptive Optimization with Delayed PreconditionersTian Li 0005, Manzil Zaheer, Ken Liu, Sashank J. Reddi, Hugh Brendan McMahan, Virginia Smith. [doi]
- Benign Overfitting in Classification: Provably Counter Label Noise with Larger ModelsKaiyue Wen, Jiaye Teng, Jingzhao Zhang. [doi]
- Ollivier-Ricci Curvature for Hypergraphs: A Unified FrameworkCorinna Coupette, Sebastian Dalleiger, Bastian Rieck. [doi]
- Editing models with task arithmeticGabriel Ilharco, Marco Túlio Ribeiro, Mitchell Wortsman, Ludwig Schmidt, Hannaneh Hajishirzi, Ali Farhadi. [doi]
- Learning rigid dynamics with face interaction graph networksKelsey R. Allen, Yulia Rubanova, Tatiana Lopez-Guevara, William Whitney 0001, Alvaro Sanchez-Gonzalez, Peter W. Battaglia, Tobias Pfaff. [doi]
- Improving Object-centric Learning with Query OptimizationBaoxiong Jia, Yu Liu, Siyuan Huang. [doi]
- Beyond calibration: estimating the grouping loss of modern neural networksAlexandre Perez-Lebel, Marine Le Morvan, Gaël Varoquaux. [doi]
- Win: Weight-Decay-Integrated Nesterov Acceleration for Adaptive Gradient AlgorithmsPan Zhou, Xingyu Xie, Shuicheng Yan. [doi]
- Are More Layers Beneficial to Graph Transformers?Haiteng Zhao, Shuming Ma, Dongdong Zhang 0001, Zhi-Hong Deng, Furu Wei. [doi]
- Backpropagation at the Infinitesimal Inference Limit of Energy-Based Models: Unifying Predictive Coding, Equilibrium Propagation, and Contrastive Hebbian LearningBeren Millidge, Yuhang Song 0001, Tommaso Salvatori, Thomas Lukasiewicz, Rafal Bogacz. [doi]
- On the complexity of nonsmooth automatic differentiationJérôme Bolte, Ryan Boustany, Edouard Pauwels, Béatrice Pesquet-Popescu. [doi]
- TILP: Differentiable Learning of Temporal Logical Rules on Knowledge GraphsSiheng Xiong, Yuan Yang, Faramarz Fekri, James Clayton Kerce. [doi]
- On Representing Linear Programs by Graph Neural NetworksZiang Chen, Jialin Liu 0003, Xinshang Wang, Wotao Yin. [doi]
- Edge Guided GANs with Contrastive Learning for Semantic Image SynthesisHao Tang 0005, Xiaojuan Qi, Guolei Sun, Dan Xu 0002, Nicu Sebe, Radu Timofte, Luc Van Gool. [doi]
- Robust Graph Dictionary LearningWeijie Liu 0006, Jiahao Xie 0001, Chao Zhang 0029, Makoto Yamada, Nenggan Zheng, Hui Qian 0001. [doi]
- CUTS: Neural Causal Discovery from Irregular Time-Series DataYuxiao Cheng, Runzhao Yang, Tingxiong Xiao, Zongren Li, Jinli Suo, Kunlun He, Qionghai Dai. [doi]
- Sparse Distributed Memory is a Continual LearnerTrenton Bricken, Xander Davies, Deepak Singh, Dmitry Krotov, Gabriel Kreiman. [doi]
- Reversible Column NetworksYuxuan Cai, Yizhuang Zhou, Qi Han, Jianjian Sun, Xiangwen Kong, Jun Li, Xiangyu Zhang. [doi]
- Near-Optimal Deployment Efficiency in Reward-Free Reinforcement Learning with Linear Function ApproximationDan Qiao 0002, Yu-Xiang Wang 0003. [doi]
- The Surprising Effectiveness of Equivariant Models in Domains with Latent SymmetryDian Wang 0001, Jung Yeon Park, Neel Sortur, Lawson L. S. Wong, Robin Walters, Robert Platt. [doi]
- Leveraging Future Relationship Reasoning for Vehicle Trajectory PredictionDaehee Park, Hobin Ryu, Yunseo Yang, Jegyeong Cho, Jiwon Kim, Kuk-Jin Yoon. [doi]
- Self-supervised learning with rotation-invariant kernelsLéon Zheng, Gilles Puy, Elisa Riccietti, Patrick Pérez, Rémi Gribonval. [doi]
- DexDeform: Dexterous Deformable Object Manipulation with Human Demonstrations and Differentiable PhysicsSizhe Li, Zhiao Huang, Tao Chen 0046, Tao Du 0001, Hao Su 0001, Joshua B. Tenenbaum, Chuang Gan. [doi]
- A Self-Attention Ansatz for Ab-initio Quantum ChemistryIngrid von Glehn, James S. Spencer, David Pfau. [doi]
- HiViT: A Simpler and More Efficient Design of Hierarchical Vision TransformerXiaosong Zhang 0004, Yunjie Tian, Lingxi Xie, Wei Huang, Qi Dai, Qixiang Ye, Qi Tian 0001. [doi]
- Active Learning in Bayesian Neural Networks with Balanced Entropy Learning PrincipleJae Oh Woo. [doi]
- ImageNet-X: Understanding Model Mistakes with Factor of Variation AnnotationsBadr Youbi Idrissi, Diane Bouchacourt, Randall Balestriero, Ivan Evtimov, Caner Hazirbas, Nicolas Ballas, Pascal Vincent, Michal Drozdzal, David Lopez-Paz, Mark Ibrahim. [doi]
- Effective passive membership inference attacks in federated learning against overparameterized modelsJiacheng Li, Ninghui Li, Bruno Ribeiro 0001. [doi]
- Image as Set of PointsXu Ma 0005, YuQian Zhou, Huan Wang 0014, Can Qin, Bin Sun 0002, Chang Liu 0022, Yun Fu 0001. [doi]
- Differentially Private $L_2$-Heavy Hitters in the Sliding Window ModelJeremiah Blocki, Seunghoon Lee, Tamalika Mukherjee, Samson Zhou. [doi]
- Boosting the Cycle Counting Power of Graph Neural Networks with I$^2$-GNNsYinan Huang, Xingang Peng, Jianzhu Ma, Muhan Zhang. [doi]
- InPL: Pseudo-labeling the Inliers First for Imbalanced Semi-supervised LearningZhuoran Yu, Yin Li, Yong Jae Lee. [doi]
- Towards Interpretable Deep Reinforcement Learning with Human-Friendly PrototypesEoin M. Kenny, Mycal Tucker, Julie Shah. [doi]
- On the duality between contrastive and non-contrastive self-supervised learningQuentin Garrido, Yubei Chen, Adrien Bardes, Laurent Najman, Yann LeCun. [doi]
- Bit-Pruning: A Sparse Multiplication-Less Dot-ProductYusuke Sekikawa, Shingo Yashima. [doi]
- Learning to Solve Constraint Satisfaction Problems with Recurrent TransformerZhun Yang, Adam Ishay, Joohyung Lee 0002. [doi]
- Neural Bregman Divergences for Distance LearningFred Lu, Edward Raff, Francis Ferraro. [doi]
- Maximizing Spatio-Temporal Entropy of Deep 3D CNNs for Efficient Video RecognitionJunyan Wang, Zhenhong Sun, Yichen Qian, Dong Gong, Xiuyu Sun, Ming Lin, Maurice Pagnucco, Yang Song 0001. [doi]
- ESCHER: Eschewing Importance Sampling in Games by Computing a History Value Function to Estimate RegretStephen Marcus McAleer, Gabriele Farina, Marc Lanctot, Tuomas Sandholm. [doi]
- Deep Variational Implicit ProcessesLuis A. Ortega, Simón Rodríguez Santana, Daniel Hernández-Lobato. [doi]
- Nearly Minimax Optimal Offline Reinforcement Learning with Linear Function Approximation: Single-Agent MDP and Markov GameWei Xiong 0015, Han Zhong 0001, Chengshuai Shi, Cong Shen 0001, Liwei Wang 0001, Tong Zhang 0001. [doi]
- Modeling Sequential Sentence Relation to Improve Cross-lingual Dense RetrievalShunyu Zhang, Yaobo Liang, Ming Gong, Daxin Jiang, Nan Duan. [doi]
- Order Matters: Agent-by-agent Policy OptimizationXihuai Wang, Zheng Tian 0002, Ziyu Wan, Ying Wen 0001, Jun Wang 0012, Weinan Zhang 0001. [doi]
- Latent Graph Inference using Product ManifoldsHaitz Sáez de Ocáriz Borde, Anees Kazi, Federico Barbero, Pietro Liò. [doi]
- Solving stochastic weak Minty variational inequalities without increasing batch sizeThomas Pethick, Olivier Fercoq, Puya Latafat, Panagiotis Patrinos, Volkan Cevher. [doi]
- Towards Robust Object Detection Invariant to Real-World Domain ShiftsQi Fan, Mattia Segù, Yu-Wing Tai, Fisher Yu, Chi-Keung Tang, Bernt Schiele, Dengxin Dai. [doi]
- Combinatorial-Probabilistic Trade-Off: P-Values of Community Properties Test in the Stochastic Block ModelsShuting Shen, Junwei Lu. [doi]
- PAC-NeRF: Physics Augmented Continuum Neural Radiance Fields for Geometry-Agnostic System IdentificationXuan Li, Yi-Ling Qiao, Peter Yichen Chen, Krishna Murthy Jatavallabhula, Ming Lin, Chenfanfu Jiang, Chuang Gan. [doi]
- Globally Optimal Training of Neural Networks with Threshold Activation FunctionsTolga Ergen, Halil Ibrahim Gulluk, Jonathan Lacotte, Mert Pilanci. [doi]
- Tailoring Language Generation Models under Total Variation DistanceHaozhe Ji, Pei Ke, Zhipeng Hu, Rongsheng Zhang, Minlie Huang. [doi]
- Partial Label Unsupervised Domain Adaptation with Class-Prototype AlignmentYan Yan, Yuhong Guo. [doi]
- Dichotomy of Control: Separating What You Can Control from What You CannotSherry Yang, Dale Schuurmans, Pieter Abbeel, Ofir Nachum. [doi]
- Mitigating Gradient Bias in Multi-objective Learning: A Provably Convergent ApproachHeshan Devaka Fernando, Han Shen, Miao Liu, Subhajit Chaudhury, Keerthiram Murugesan, Tianyi Chen. [doi]
- Causality Compensated Attention for Contextual Biased Visual RecognitionRuyang Liu, Jingjia Huang, Thomas H. Li, Ge Li 0002. [doi]
- Stochastic Differentially Private and Fair LearningAndrew Lowy, Devansh Gupta, Meisam Razaviyayn. [doi]
- Solving Continuous Control via Q-learningTim Seyde, Peter Werner, Wilko Schwarting, Igor Gilitschenski, Martin A. Riedmiller, Daniela Rus, Markus Wulfmeier. [doi]
- The Implicit Bias of Minima Stability in Multivariate Shallow ReLU NetworksMor Shpigel Nacson, Rotem Mulayoff, Greg Ongie, Tomer Michaeli, Daniel Soudry. [doi]
- Optimistic Exploration with Learned Features Provably Solves Markov Decision Processes with Neural DynamicsSirui Zheng, Lingxiao Wang 0003, Shuang Qiu, Zuyue Fu, Zhuoran Yang, Csaba Szepesvári, Zhaoran Wang. [doi]
- Sampling is as easy as learning the score: theory for diffusion models with minimal data assumptionsSitan Chen, Sinho Chewi, Jerry Li 0001, Yuanzhi Li, Adil Salim, Anru Zhang. [doi]
- A Learning Based Hypothesis Test for Harmful Covariate ShiftTom Ginsberg, Zhongyuan Liang, Rahul G. Krishnan. [doi]
- A Theoretical Framework for Inference and Learning in Predictive Coding NetworksBeren Millidge, Yuhang Song 0001, Tommaso Salvatori, Thomas Lukasiewicz, Rafal Bogacz. [doi]
- Offline Reinforcement Learning with Differentiable Function Approximation is Provably EfficientMing Yin, Mengdi Wang, Yu-Xiang Wang. [doi]
- Neural-based classification rule learning for sequential dataMarine Collery, Philippe Bonnard, François Fages, Remy Kusters. [doi]
- SYNC: Safety-Aware Neural Control for Stabilizing Stochastic Delay-Differential EquationsJingdong Zhang, Qunxi Zhu, Wei Yang, Wei Lin 0003. [doi]
- This Looks Like It Rather Than That: ProtoKNN For Similarity-Based ClassifiersYuki Ukai, Tsubasa Hirakawa, Takayoshi Yamashita, Hironobu Fujiyoshi. [doi]
- AIM: Adapting Image Models for Efficient Video Action RecognitionTaojiannan Yang, Yi Zhu, Yusheng Xie, Aston Zhang, Chen Chen, Mu Li 0003. [doi]
- Asynchronous Distributed Bilevel OptimizationYang Jiao, Kai Yang 0001, Tiancheng Wu, Dongjin Song, Chengtao Jian. [doi]
- Revisiting Populations in multi-agent CommunicationPaul Michel, Mathieu Rita, Kory Wallace Mathewson, Olivier Tieleman, Angeliki Lazaridou. [doi]
- Winning Both the Accuracy of Floating Point Activation and the Simplicity of Integer ArithmeticYulhwa Kim, Jaeyong Jang, Jehun Lee, JiHoon Park, Jeonghoon Kim, Byeongwook Kim, Baeseong Park, Se Jung Kwon, Dongsoo Lee, Jae-Joon Kim. [doi]
- Truncated Diffusion Probabilistic Models and Diffusion-based Adversarial Auto-EncodersHuangjie Zheng, Pengcheng He, Weizhu Chen, Mingyuan Zhou. [doi]
- Generating Diverse Cooperative Agents by Learning Incompatible PoliciesRujikorn Charakorn, Poramate Manoonpong, Nat Dilokthanakul. [doi]
- Graph Signal Sampling for Inductive One-Bit Matrix Completion: a Closed-form SolutionChao Chen, Haoyu Geng, Gang Zeng, Zhaobing Han, Hua Chai, Xiaokang Yang, Junchi Yan. [doi]
- The Power of Regularization in Solving Extensive-Form GamesMingyang Liu, Asuman E. Ozdaglar, Tiancheng Yu, Kaiqing Zhang. [doi]
- Specformer: Spectral Graph Neural Networks Meet TransformersDeyu Bo, Chuan Shi, Lele Wang, Renjie Liao. [doi]
- DiffEdit: Diffusion-based semantic image editing with mask guidanceGuillaume Couairon, Jakob Verbeek, Holger Schwenk, Matthieu Cord. [doi]
- MA-BERT: Towards Matrix Arithmetic-only BERT Inference by Eliminating Complex Non-Linear FunctionsNeo Wei Ming, Zhehui Wang, Cheng Liu 0008, Rick Siow Mong Goh, Tao Luo 0014. [doi]
- Semi-supervised Community Detection via Structural Similarity MetricsYicong Jiang, Tracy Ke. [doi]
- What learning algorithm is in-context learning? Investigations with linear modelsEkin Akyürek, Dale Schuurmans, Jacob Andreas, Tengyu Ma 0001, Denny Zhou. [doi]
- Artificial Neuronal Ensembles with Learned Context Dependent GatingMatthew J. Tilley,