Abstract is missing.
- Poisoning and Backdooring Contrastive LearningNicholas Carlini, Andreas Terzis. [doi]
- ADAVI: Automatic Dual Amortized Variational Inference Applied To Pyramidal Bayesian ModelsLouis Rouillard, Demian Wassermann. [doi]
- Bayesian Modeling and Uncertainty Quantification for Learning to Optimize: What, Why, and HowYuning You, Yue Cao, Tianlong Chen, Zhangyang Wang, Yang Shen. [doi]
- L0-Sparse Canonical Correlation AnalysisOfir Lindenbaum, Moshe Salhov, Amir Averbuch, Yuval Kluger. [doi]
- Understanding and Preventing Capacity Loss in Reinforcement LearningClare Lyle, Mark Rowland, Will Dabney. [doi]
- Out-of-distribution Generalization in the Presence of Nuisance-Induced Spurious CorrelationsAahlad Manas Puli, Lily H. Zhang, Eric Karl Oermann, Rajesh Ranganath. [doi]
- Bundle Networks: Fiber Bundles, Local Trivializations, and a Generative Approach to Exploring Many-to-one MapsNico Courts, Henry Kvinge. [doi]
- Scene Transformer: A unified architecture for predicting future trajectories of multiple agentsJiquan Ngiam, Vijay Vasudevan, Benjamin Caine, Zhengdong Zhang, Hao-Tien Lewis Chiang, Jeffrey Ling, Rebecca Roelofs, Alex Bewley, Chenxi Liu, Ashish Venugopal, David J. Weiss, Ben Sapp, Zhifeng Chen, Jonathon Shlens. [doi]
- ToM2C: Target-oriented Multi-agent Communication and Cooperation with Theory of MindYuanfei Wang, Fangwei Zhong, Jing Xu, Yizhou Wang 0001. [doi]
- How Much Can CLIP Benefit Vision-and-Language Tasks?Sheng Shen, Liunian Harold Li, Hao Tan, Mohit Bansal, Anna Rohrbach, Kai-Wei Chang, Zhewei Yao, Kurt Keutzer. [doi]
- Topological Experience ReplayZhang-Wei Hong, Tao Chen 0046, Yen-Chen Lin, Joni Pajarinen, Pulkit Agrawal. [doi]
- On the Pitfalls of Heteroscedastic Uncertainty Estimation with Probabilistic Neural NetworksMaximilian Seitzer, Arash Tavakoli, Dimitrije Antic, Georg Martius. [doi]
- One After Another: Learning Incremental Skills for a Changing WorldNur Muhammad (Mahi) Shafiullah, Lerrel Pinto. [doi]
- Do Not Escape From the Manifold: Discovering the Local Coordinates on the Latent Space of GANsJaewoong Choi, Junho Lee, Changyeon Yoon, Jung-Ho Park, Geonho Hwang, Myungjoo Kang. [doi]
- Blaschke Product Neural Networks (BPNN): A Physics-Infused Neural Network for Phase Retrieval of Meromorphic FunctionsJuncheng Dong, Simiao Ren, Yang Deng, Omar Khatib, Jordan M. Malof, Mohammadreza Soltani, Willie Padilla, Vahid Tarokh. [doi]
- Learning to Remember Patterns: Pattern Matching Memory Networks for Traffic ForecastingHyunwook Lee, Seungmin Jin, Hyeshin Chu, Hongkyu Lim, Sungahn Ko. [doi]
- On the approximation properties of recurrent encoder-decoder architecturesZhong Li, Haotian Jiang, Qianxiao Li. [doi]
- Iterated Reasoning with Mutual Information in Cooperative and Byzantine Decentralized TeamingSachin G. Konan, Esmaeil Seraj, Matthew C. Gombolay. [doi]
- Should We Be Pre-training? An Argument for End-task Aware Training as an AlternativeLucio M. Dery, Paul Michel, Ameet Talwalkar, Graham Neubig. [doi]
- Non-Transferable Learning: A New Approach for Model Ownership Verification and Applicability AuthorizationLixu Wang, Shichao Xu, Ruiqi Xu, Xiao Wang, Qi Zhu 0002. [doi]
- On the Learning and Learnability of QuasimetricsTongzhou Wang 0001, Phillip Isola. [doi]
- On the Importance of Difficulty Calibration in Membership Inference AttacksLauren Watson, Chuan Guo, Graham Cormode, Alexandre Sablayrolles. [doi]
- Object Dynamics Distillation for Scene Decomposition and RepresentationQu Tang, Xiangyu Zhu, Zhen Lei 0001, Zhaoxiang Zhang. [doi]
- Possibility Before Utility: Learning And Using Hierarchical AffordancesRobby Costales, Shariq Iqbal, Fei Sha. [doi]
- Generative Models as a Data Source for Multiview Representation LearningAli Jahanian 0002, Xavier Puig, Yonglong Tian, Phillip Isola. [doi]
- Model Zoo: A Growing Brain That Learns ContinuallyRahul Ramesh, Pratik Chaudhari. [doi]
- R5: Rule Discovery with Reinforced and Recurrent Relational ReasoningShengyao Lu, Bang Liu, Keith G Mills, Shangling Jui, Di Niu. [doi]
- OntoProtein: Protein Pretraining With Gene Ontology EmbeddingNingyu Zhang, Zhen Bi, Xiaozhuan Liang, Siyuan Cheng 0008, Haosen Hong, Shumin Deng, Qiang Zhang, Jiazhang Lian, Huajun Chen. [doi]
- Communication-Efficient Actor-Critic Methods for Homogeneous Markov GamesDingyang Chen, Yile Li, Qi Zhang. [doi]
- How many degrees of freedom do we need to train deep networks: a loss landscape perspectiveBrett W. Larsen, Stanislav Fort, Nic Becker, Surya Ganguli. [doi]
- Effective Model Sparsification by Scheduled Grow-and-Prune MethodsXiaolong Ma, Minghai Qin, Fei Sun, Zejiang Hou, Kun Yuan, Yi Xu, Yanzhi Wang, Yen-Kuang Chen, Rong Jin 0001, Yuan Xie 0008. [doi]
- Triangle and Four Cycle Counting with Predictions in Graph StreamsJustin Y. Chen, Talya Eden, Piotr Indyk, Honghao Lin, Shyam Narayanan, Ronitt Rubinfeld, Sandeep Silwal, Tal Wagner, David Woodruff, Michael Zhang. [doi]
- Neural Solvers for Fast and Accurate Numerical Optimal ControlFederico Berto, Stefano Massaroli, Michael Poli, Jinkyoo Park. [doi]
- Learning to Guide and to be Guided in the Architect-Builder ProblemPaul Barde, Tristan Karch, Derek Nowrouzezahrai, Clément Moulin-Frier, Christopher Pal, Pierre-Yves Oudeyer. [doi]
- How Well Does Self-Supervised Pre-Training Perform with Streaming Data?Dapeng Hu, Shipeng Yan, Qizhengqiu Lu, Lanqing Hong, Hailin Hu, Yifan Zhang, Zhenguo Li, Xinchao Wang, Jiashi Feng. [doi]
- CoST: Contrastive Learning of Disentangled Seasonal-Trend Representations for Time Series ForecastingGerald Woo, Chenghao Liu, Doyen Sahoo, Akshat Kumar, Steven C. H. Hoi. [doi]
- Differentially Private Fine-tuning of Language ModelsDa Yu, Saurabh Naik, Arturs Backurs, Sivakanth Gopi, Huseyin A. Inan, Gautam Kamath 0001, Janardhan Kulkarni, Yin Tat Lee, Andre Manoel, Lukas Wutschitz, Sergey Yekhanin, Huishuai Zhang. [doi]
- Provable Adaptation across Multiway Domains via Representation LearningZhili Feng, Shaobo Han, Simon Shaolei Du. [doi]
- Trigger Hunting with a Topological Prior for Trojan DetectionXiaoling Hu 0002, Xiao Lin, Michael Cogswell, Yi Yao, Susmit Jha, Chao Chen 0012. [doi]
- Evaluation Metrics for Graph Generative Models: Problems, Pitfalls, and Practical SolutionsLeslie O'Bray, Max Horn, Bastian Rieck, Karsten M. Borgwardt. [doi]
- Scalable One-Pass Optimisation of High-Dimensional Weight-Update Hyperparameters by Implicit DifferentiationRoss M. Clarke, Elre Talea Oldewage, José Miguel Hernández-Lobato. [doi]
- Automatic Loss Function Search for Predict-Then-Optimize Problems with Strong Ranking PropertyBoshi Wang, Jialin Yi, Hang Dong, Bo Qiao, Chuan Luo, Qingwei Lin. [doi]
- Scarf: Self-Supervised Contrastive Learning using Random Feature CorruptionDara Bahri, Heinrich Jiang, Yi Tay, Donald Metzler. [doi]
- NASI: Label- and Data-agnostic Neural Architecture Search at InitializationYao Shu, Shaofeng Cai, Zhongxiang Dai, Beng Chin Ooi, Bryan Kian Hsiang Low. [doi]
- Generalized rectifier wavelet covariance models for texture synthesisAntoine Brochard, Sixin Zhang, Stéphane Mallat. [doi]
- Solving Inverse Problems in Medical Imaging with Score-Based Generative ModelsYang Song 0011, Liyue Shen, Lei Xing 0001, Stefano Ermon. [doi]
- Revisit Kernel Pruning with Lottery Regulated Grouped ConvolutionsShaochen Zhong, Guanqun Zhang, Ningjia Huang, Shuai Xu. [doi]
- StyleAlign: Analysis and Applications of Aligned StyleGAN ModelsZongze Wu, Yotam Nitzan, Eli Shechtman, Dani Lischinski. [doi]
- Understanding and Improving Graph Injection Attack by Promoting UnnoticeabilityYongqiang Chen 0002, Han Yang 0002, Yonggang Zhang, Kaili Ma 0001, Tongliang Liu, Bo Han 0003, James Cheng. [doi]
- Sample Efficient Deep Reinforcement Learning via Uncertainty EstimationVincent Mai, Kaustubh Mani, Liam Paull. [doi]
- Multi-Stage Episodic Control for Strategic Exploration in Text GamesJens Tuyls, Shunyu Yao, Sham M. Kakade, Karthik Narasimhan. [doi]
- X-model: Improving Data Efficiency in Deep Learning with A Minimax ModelXimei Wang, Xinyang Chen, Jianmin Wang, Mingsheng Long. [doi]
- Natural Posterior Network: Deep Bayesian Predictive Uncertainty for Exponential Family DistributionsBertrand Charpentier, Oliver Borchert, Daniel Zügner, Simon Geisler, Stephan Günnemann. [doi]
- Trivial or Impossible --- dichotomous data difficulty masks model differences (on ImageNet and beyond)Kristof Meding, Luca M. Schulze Buschoff, Robert Geirhos, Felix A. Wichmann. [doi]
- Neural Models for Output-Space Invariance in Combinatorial ProblemsYatin Nandwani, Vidit Jain, Mausam, Parag Singla. [doi]
- Amortized Implicit Differentiation for Stochastic Bilevel OptimizationMichael Arbel, Julien Mairal. [doi]
- Hyperparameter Tuning with Renyi Differential PrivacyNicolas Papernot, Thomas Steinke 0002. [doi]
- Is Homophily a Necessity for Graph Neural Networks?Yao Ma 0001, Xiaorui Liu, Neil Shah, Jiliang Tang. [doi]
- Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to PracticePeihao Wang, Wenqing Zheng, Tianlong Chen, Zhangyang Wang. [doi]
- Deep ReLU Networks Preserve Expected LengthBoris Hanin, Ryan S. Jeong, David Rolnick. [doi]
- On-Policy Model Errors in Reinforcement LearningLukas P. Fröhlich, Maksym Lefarov, Melanie N. Zeilinger, Felix Berkenkamp. [doi]
- ZeroFL: Efficient On-Device Training for Federated Learning with Local SparsityXinchi Qiu, Javier Fernández-Marqués, Pedro P. B. Gusmao, Yan Gao, Titouan Parcollet, Nicholas Donald Lane. [doi]
- Decoupled Adaptation for Cross-Domain Object DetectionJunguang Jiang, Baixu Chen, Jianmin Wang, Mingsheng Long. [doi]
- Distributionally Robust Fair Principal Components via Geodesic DescentsHieu Vu, Toan Tran, Man-Chung Yue, Viet Anh Nguyen. [doi]
- Information Prioritization through Empowerment in Visual Model-based RLHomanga Bharadhwaj, Mohammad Babaeizadeh, Dumitru Erhan, Sergey Levine. [doi]
- Efficient Self-supervised Vision Transformers for Representation LearningChunyuan Li, Jianwei Yang, Pengchuan Zhang, Mei Gao, Bin Xiao, Xiyang Dai, Lu Yuan, Jianfeng Gao. [doi]
- Generative Pseudo-Inverse MemoryKha Pham, Hung Le, Man Ngo, Truyen Tran 0001, Bao Ho, Svetha Venkatesh. [doi]
- Pretrained Language Model in Continual Learning: A Comparative StudyTongtong Wu, Massimo Caccia, Zhuang Li, Yuan-Fang Li, Guilin Qi, Gholamreza Haffari. [doi]
- Retriever: Learning Content-Style Representation as a Token-Level Bipartite GraphDacheng Yin, Xuanchi Ren, Chong Luo, Yuwang Wang, Zhiwei Xiong, Wenjun Zeng. [doi]
- Wiring Up Vision: Minimizing Supervised Synaptic Updates Needed to Produce a Primate Ventral StreamFranziska Geiger, Martin Schrimpf, Tiago Marques, James J. DiCarlo. [doi]
- Step-unrolled Denoising Autoencoders for Text GenerationNikolay Savinov, Junyoung Chung, Mikolaj Binkowski, Erich Elsen, Aäron Van Den Oord. [doi]
- A Loss Curvature Perspective on Training Instabilities of Deep Learning ModelsJustin Gilmer, Behrooz Ghorbani, Ankush Garg, Sneha Kudugunta, Behnam Neyshabur, David Cardoze, George Edward Dahl, Zachary Nado, Orhan Firat. [doi]
- Learning Strides in Convolutional Neural NetworksRachid Riad, Olivier Teboul, David Grangier, Neil Zeghidour. [doi]
- COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction EstimationJongmin Lee 0004, Cosmin Paduraru, Daniel J. Mankowitz, Nicolas Heess, Doina Precup, Kee-Eung Kim, Arthur Guez. [doi]
- Stein Latent Optimization for Generative Adversarial NetworksUiwon Hwang, Heeseung Kim, Dahuin Jung, Hyemi Jang, Hyungyu Lee, Sungroh Yoon. [doi]
- Learning Super-Features for Image RetrievalPhilippe Weinzaepfel, Thomas Lucas, Diane Larlus, Yannis Kalantidis. [doi]
- Neural Contextual Bandits with Deep Representation and Shallow ExplorationPan Xu 0002, Zheng Wen, Handong Zhao, Quanquan Gu. [doi]
- Expressivity of Emergent Languages is a Trade-off between Contextual Complexity and UnpredictabilityShangmin Guo, Yi Ren, Kory Wallace Mathewson, Simon Kirby, Stefano V. Albrecht, Kenny Smith. [doi]
- How Did the Model Change? Efficiently Assessing Machine Learning API ShiftsLingjiao Chen, Matei Zaharia, James Zou 0001. [doi]
- Weighted Training for Cross-Task LearningShuxiao Chen, Koby Crammer, Hangfeng He, Dan Roth, Weijie J. Su. [doi]
- Optimization and Adaptive Generalization of Three layer Neural NetworksKhashayar Gatmiry, Stefanie Jegelka, Jonathan A. Kelner. [doi]
- Minimax Optimality (Probably) Doesn't Imply Distribution Learning for GANsSitan Chen, Jerry Li 0001, Yuanzhi Li, Raghu Meka. [doi]
- Planning in Stochastic Environments with a Learned ModelIoannis Antonoglou, Julian Schrittwieser, Sherjil Ozair, Thomas K. Hubert, David Silver. [doi]
- Discrete Representations Strengthen Vision Transformer RobustnessChengzhi Mao, Lu Jiang, Mostafa Dehghani 0001, Carl Vondrick, Rahul Sukthankar, Irfan Essa. [doi]
- Equivariant and Stable Positional Encoding for More Powerful Graph Neural NetworksHaorui Wang, Haoteng Yin, Muhan Zhang, Pan Li 0005. [doi]
- DriPP: Driven Point Processes to Model Stimuli Induced Patterns in M/EEG SignalsCédric Allain, Alexandre Gramfort, Thomas Moreau. [doi]
- Context-Aware Sparse Deep Coordination GraphsTonghan Wang 0001, Liang Zeng 0002, Weijun Dong, Qianlan Yang, Yang Yu 0001, Chongjie Zhang. [doi]
- Transition to Linearity of Wide Neural Networks is an Emerging Property of Assembling Weak ModelsChaoyue Liu 0001, Libin Zhu, Misha Belkin. [doi]
- Autoregressive Diffusion ModelsEmiel Hoogeboom, Alexey A. Gritsenko, Jasmijn Bastings, Ben Poole, Rianne van den Berg, Tim Salimans. [doi]
- Resonance in Weight Space: Covariate Shift Can Drive Divergence of SGD with MomentumKirby Banman, Liam Peet-Pare, Nidhi Hegde 0001, Alona Fyshe, Martha White. [doi]
- Efficient Token Mixing for Transformers via Adaptive Fourier Neural OperatorsJohn Guibas, Morteza Mardani, Zongyi Li, Andrew Tao, Anima Anandkumar, Bryan Catanzaro. [doi]
- Training Structured Neural Networks Through Manifold Identification and Variance ReductionZih-Syuan Huang, Ching-Pei Lee. [doi]
- Demystifying Batch Normalization in ReLU Networks: Equivalent Convex Optimization Models and Implicit RegularizationTolga Ergen, Arda Sahiner, Batu Ozturkler, John M. Pauly, Morteza Mardani, Mert Pilanci. [doi]
- Bayesian Framework for Gradient LeakageMislav Balunovic, Dimitar Iliev Dimitrov, Robin Staab, Martin T. Vechev. [doi]
- Sequential Reptile: Inter-Task Gradient Alignment for Multilingual LearningSeanie Lee, Haebeom Lee, Juho Lee 0001, Sung Ju Hwang. [doi]
- Neural Stochastic Dual Dynamic ProgrammingHanjun Dai, Yuan Xue, Zia Syed, Dale Schuurmans, Bo Dai. [doi]
- $\beta$-Intact-VAE: Identifying and Estimating Causal Effects under Limited OverlapPengzhou Abel Wu, Kenji Fukumizu. [doi]
- Self-Supervised Graph Neural Networks for Improved Electroencephalographic Seizure AnalysisSiyi Tang, Jared Dunnmon, Khaled Kamal Saab, Xuan Zhang, Qianying Huang, Florian Dubost, Daniel Rubin, Christopher Lee-Messer. [doi]
- Evaluating Disentanglement of Structured RepresentationsRaphaël Dang-Nhu. [doi]
- Discrepancy-Based Active Learning for Domain AdaptationAntoine de Mathelin, François Deheeger, Mathilde Mougeot, Nicolas Vayatis. [doi]
- Continuous-Time Meta-Learning with Forward Mode DifferentiationTristan Deleu, David Kanaa, Leo Feng, Giancarlo Kerg, Yoshua Bengio, Guillaume Lajoie, Pierre-Luc Bacon. [doi]
- Vision-Based Manipulators Need to Also See from Their HandsKyle Hsu, Moo Jin Kim, Rafael Rafailov, Jiajun Wu 0001, Chelsea Finn. [doi]
- A Class of Short-term Recurrence Anderson Mixing Methods and Their ApplicationsFuchao Wei, Chenglong Bao, Yang Liu 0005. [doi]
- A fast and accurate splitting method for optimal transport: analysis and implementationVien V. Mai, Jacob Lindbäck, Mikael Johansson 0001. [doi]
- Frame Averaging for Invariant and Equivariant Network DesignOmri Puny, Matan Atzmon, Edward J. Smith, Ishan Misra, Aditya Grover, Heli Ben Hamu, Yaron Lipman. [doi]
- Likelihood Training of Schrödinger Bridge using Forward-Backward SDEs TheoryTianrong Chen, Guan-Horng Liu, Evangelos A. Theodorou. [doi]
- PF-GNN: Differentiable particle filtering based approximation of universal graph representationsMohammed Haroon Dupty, Yanfei Dong, Wee Sun Lee. [doi]
- Pessimistic Model-based Offline Reinforcement Learning under Partial CoverageMasatoshi Uehara, Wen Sun 0002. [doi]
- Learning Object-Oriented Dynamics for Planning from TextGuiliang Liu, Ashutosh Adhikari, Amir Massoud Farahmand, Pascal Poupart. [doi]
- Revisiting Over-smoothing in BERT from the Perspective of GraphHan Shi, Jiahui Gao, Hang Xu, Xiaodan Liang, Zhenguo Li, Lingpeng Kong, Stephen M. S. Lee, James T. Kwok. [doi]
- RISP: Rendering-Invariant State Predictor with Differentiable Simulation and Rendering for Cross-Domain Parameter EstimationPingchuan Ma 0002, Tao Du 0001, Joshua B. Tenenbaum, Wojciech Matusik, Chuang Gan. [doi]
- Group-based Interleaved Pipeline Parallelism for Large-scale DNN TrainingPengcheng Yang, Xiaoming Zhang, Wenpeng Zhang, Ming Yang, Hong Wei. [doi]
- Few-shot Learning via Dirichlet Tessellation EnsembleChunwei Ma, Ziyun Huang, Mingchen Gao, Jinhui Xu 0001. [doi]
- Explaining Point Processes by Learning Interpretable Temporal Logic RulesShuang Li, Mingquan Feng, Lu Wang, Abdelmajid Essofi, Yufeng Cao, Junchi Yan, Le Song. [doi]
- Doubly Adaptive Scaled Algorithm for Machine Learning Using Second-Order InformationMajid Jahani, Sergey Rusakov 0001, Zheng Shi, Peter Richtárik, Michael W. Mahoney, Martin Takác. [doi]
- Equivariant Transformers for Neural Network based Molecular PotentialsPhilipp Thölke, Gianni De Fabritiis. [doi]
- A Relational Intervention Approach for Unsupervised Dynamics Generalization in Model-Based Reinforcement LearningJiaxian Guo, Mingming Gong, Dacheng Tao. [doi]
- Towards General Function Approximation in Zero-Sum Markov GamesBaihe Huang, Jason D. Lee, Zhaoran Wang, Zhuoran Yang. [doi]
- Generalized Demographic Parity for Group FairnessZhimeng Jiang, Xiaotian Han, Chao Fan, Fan Yang, Ali Mostafavi, Xia Hu. [doi]
- Spike-inspired rank coding for fast and accurate recurrent neural networksAlan Jeffares, Qinghai Guo, Pontus Stenetorp, Timoleon Moraitis. [doi]
- W-CTC: a Connectionist Temporal Classification Loss with Wild CardsXingyu Cai, Jiahong Yuan, Yuchen Bian, Guangxu Xun, Jiaji Huang, Kenneth Church 0001. [doi]
- Improving Non-Autoregressive Translation Models Without DistillationXiao Shi Huang, Felipe Pérez, Maksims Volkovs. [doi]
- Distribution Compression in Near-Linear TimeAbhishek Shetty, Raaz Dwivedi, Lester Mackey. [doi]
- Attention-based Interpretability with Concept TransformersMattia Rigotti, Christoph Miksovic, Ioana Giurgiu, Thomas Gschwind, Paolo Scotton. [doi]
- Normalization of Language Embeddings for Cross-Lingual AlignmentPrince Osei Aboagye, Yan Zheng, Chin-Chia Michael Yeh, JunPeng Wang, Wei Zhang, Liang Wang, Hao Yang, Jeff M. Phillips. [doi]
- Huber Additive Models for Non-stationary Time Series AnalysisYingjie Wang, Xianrui Zhong, Fengxiang He, Hong Chen, Dacheng Tao. [doi]
- Unified Visual Transformer CompressionShixing Yu, Tianlong Chen, Jiayi Shen, Huan Yuan, Jianchao Tan, Sen Yang, Ji Liu 0002, Zhangyang Wang. [doi]
- A New Perspective on "How Graph Neural Networks Go Beyond Weisfeiler-Lehman?"Asiri Wijesinghe, Qing Wang 0002. [doi]
- Diverse Client Selection for Federated Learning via Submodular MaximizationRavikumar Balakrishnan, Tian Li 0005, Tianyi Zhou, Nageen Himayat, Virginia Smith, Jeff A. Bilmes. [doi]
- Provably Filtering Exogenous Distractors using Multistep Inverse DynamicsYonathan Efroni, Dipendra Misra, Akshay Krishnamurthy, Alekh Agarwal, John Langford 0001. [doi]
- Rethinking Network Design and Local Geometry in Point Cloud: A Simple Residual MLP FrameworkXu Ma, Can Qin, Haoxuan You, Haoxi Ran, Yun Fu 0001. [doi]
- A generalization of the randomized singular value decompositionNicolas Boullé, Alex Townsend. [doi]
- Mastering Visual Continuous Control: Improved Data-Augmented Reinforcement LearningDenis Yarats, Rob Fergus, Alessandro Lazaric, Lerrel Pinto. [doi]
- Representation Learning for Online and Offline RL in Low-rank MDPsMasatoshi Uehara, Xuezhou Zhang, Wen Sun. [doi]
- FILIP: Fine-grained Interactive Language-Image Pre-TrainingLewei Yao, Runhui Huang, Lu Hou, Guansong Lu, Minzhe Niu, Hang Xu, Xiaodan Liang, Zhenguo Li, Xin Jiang, Chunjing Xu. [doi]
- Learning with Noisy Labels Revisited: A Study Using Real-World Human AnnotationsJiaheng Wei, Zhaowei Zhu, Hao Cheng 0014, Tongliang Liu, Gang Niu 0001, Yang Liu 0018. [doi]
- Embedded-model flows: Combining the inductive biases of model-free deep learning and explicit probabilistic modelingGianluigi Silvestri, Emily Fertig, Dave Moore, Luca Ambrogioni. [doi]
- Enhancing Cross-lingual Transfer by Manifold MixupHuiyun Yang, Huadong Chen, Hao Zhou 0012, Lei Li 0005. [doi]
- Long Expressive Memory for Sequence ModelingT. Konstantin Rusch, Siddhartha Mishra, N. Benjamin Erichson, Michael W. Mahoney. [doi]
- Fast Generic Interaction Detection for Model Interpretability and CompressionTianjian Zhang, Feng Yin, Zhi-Quan Luo. [doi]
- Symbolic Learning to Optimize: Towards Interpretability and ScalabilityWenqing Zheng, Tianlong Chen, Ting-Kuei Hu, Zhangyang Wang. [doi]
- Skill-based Meta-Reinforcement LearningTaewook Nam, Shao-Hua Sun, Karl Pertsch, Sung Ju Hwang, Joseph J. Lim. [doi]
- The Three Stages of Learning Dynamics in High-dimensional Kernel MethodsNikhil Ghosh, Song Mei, Bin Yu. [doi]
- Energy-Inspired Molecular Conformation OptimizationJiaqi Guan, Wesley Wei Qian, Qiang Liu 0001, Wei-Ying Ma, Jianzhu Ma, Jian Peng 0001. [doi]
- Back2Future: Leveraging Backfill Dynamics for Improving Real-time Predictions in FutureHarshavardhan Kamarthi, Alexander Rodríguez, B. Aditya Prakash. [doi]
- FedBABU: Toward Enhanced Representation for Federated Image ClassificationJaehoon Oh, Sangmook Kim, Se-Young Yun. [doi]
- Evolutionary Diversity Optimization with Clustering-based Selection for Reinforcement LearningYutong Wang, Ke Xue, Chao Qian 0001. [doi]
- Adaptive Wavelet Transformer Network for 3D Shape Representation LearningHao Huang, Yi Fang 0006. [doi]
- Does your graph need a confidence boost? Convergent boosted smoothing on graphs with tabular node featuresJiuhai Chen, Jonas Mueller, Vassilis N. Ioannidis, Soji Adeshina, Yangkun Wang, Tom Goldstein, David Wipf. [doi]
- PAC-Bayes Information BottleneckZifeng Wang 0008, Shao-Lun Huang, Ercan Engin Kuruoglu, Jimeng Sun, Xi Chen, Yefeng Zheng 0001. [doi]
- On Improving Adversarial Transferability of Vision TransformersMuzammal Naseer, Kanchana Ranasinghe, Salman Khan 0001, Fahad Shahbaz Khan, Fatih Porikli. [doi]
- Equivariant Subgraph Aggregation NetworksBeatrice Bevilacqua, Fabrizio Frasca, Derek Lim, Balasubramaniam Srinivasan, Chen Cai, Gopinath Balamurugan, Michael M. Bronstein, Haggai Maron. [doi]
- FP-DETR: Detection Transformer Advanced by Fully Pre-trainingWen Wang, Yang Cao, Jing Zhang, Dacheng Tao. [doi]
- Handling Distribution Shifts on Graphs: An Invariance PerspectiveQitian Wu, Hengrui Zhang, Junchi Yan, David Wipf. [doi]
- Taming Sparsely Activated Transformer with Stochastic ExpertsSimiao Zuo, Xiaodong Liu 0003, Jian Jiao 0007, Young-Jin Kim, Hany Hassan, Ruofei Zhang, Jianfeng Gao, Tuo Zhao. [doi]
- Generalization Through the Lens of Leave-One-Out ErrorGregor Bachmann, Thomas Hofmann, Aurélien Lucchi. [doi]
- StyleNeRF: A Style-based 3D Aware Generator for High-resolution Image SynthesisJiatao Gu, Lingjie Liu, Peng Wang 0099, Christian Theobalt. [doi]
- Illiterate DALL-E Learns to ComposeGautam Singh, Fei Deng, Sungjin Ahn. [doi]
- NodePiece: Compositional and Parameter-Efficient Representations of Large Knowledge GraphsMikhail Galkin 0001, Etienne G. Denis, Jiapeng Wu, William L. Hamilton. [doi]
- Energy-Based Learning for Cooperative Games, with Applications to Valuation Problems in Machine LearningYatao Bian, Yu Rong, Tingyang Xu, Jiaxiang Wu, Andreas Krause 0001, JunZhou Huang. [doi]
- SHINE: SHaring the INverse Estimate from the forward pass for bi-level optimization and implicit modelsZaccharie Ramzi, Florian Mannel, Shaojie Bai, Jean-Luc Starck, Philippe Ciuciu, Thomas Moreau. [doi]
- FILM: Following Instructions in Language with Modular MethodsSo Yeon Min, Devendra Singh Chaplot, Pradeep Kumar Ravikumar, Yonatan Bisk, Ruslan Salakhutdinov. [doi]
- A Reduction-Based Framework for Conservative Bandits and Reinforcement LearningYunchang Yang, Tianhao Wu, Han Zhong, Evrard Garcelon, Matteo Pirotta, Alessandro Lazaric, Liwei Wang, Simon Shaolei Du. [doi]
- Environment Predictive Coding for Visual NavigationSanthosh Kumar Ramakrishnan, Tushar Nagarajan, Ziad Al-Halah, Kristen Grauman. [doi]
- Using Graph Representation Learning with Schema Encoders to Measure the Severity of Depressive SymptomsSimin Hong, Anthony G. Cohn, David Crossland Hogg. [doi]
- Differentiable Prompt Makes Pre-trained Language Models Better Few-shot LearnersNingyu Zhang, Luoqiu Li, Xiang Chen, Shumin Deng, Zhen Bi, Chuanqi Tan, Fei Huang, Huajun Chen. [doi]
- Controlling Directions Orthogonal to a ClassifierYilun Xu, Hao He, Tianxiao Shen, Tommi S. Jaakkola. [doi]
- ViDT: An Efficient and Effective Fully Transformer-based Object DetectorHwanjun Song, Deqing Sun, Sanghyuk Chun, Varun Jampani, Dongyoon Han, Byeongho Heo, Wonjae Kim, Ming-Hsuan Yang 0001. [doi]
- Online Ad Hoc Teamwork under Partial ObservabilityPengjie Gu, Mengchen Zhao, Jianye Hao, Bo An 0001. [doi]
- Reinforcement Learning in Presence of Discrete Markovian Context EvolutionHang Ren, Aivar Sootla, Taher Jafferjee, Junxiao Shen, Jun Wang, Haitham Bou-Ammar. [doi]
- FairCal: Fairness Calibration for Face VerificationTiago Salvador, Stephanie Cairns, Vikram Voleti, Noah Marshall, Adam M. Oberman. [doi]
- Equivariant Self-Supervised Learning: Encouraging Equivariance in RepresentationsRumen Dangovski, Li Jing, Charlotte Loh, Seungwook Han, Akash Srivastava, Brian Cheung, Pulkit Agrawal, Marin Soljacic. [doi]
- Training invariances and the low-rank phenomenon: beyond linear networksThien Le, Stefanie Jegelka. [doi]
- Multi-Task ProcessesDonggyun Kim, Seongwoong Cho, Wonkwang Lee, Seunghoon Hong. [doi]
- Contact Points Discovery for Soft-Body Manipulations with Differentiable PhysicsSizhe Li, Zhiao Huang, Tao Du 0001, Hao Su 0001, Joshua B. Tenenbaum, Chuang Gan. [doi]
- Differentiable Gradient Sampling for Learning Implicit 3D Scene Reconstructions from a Single ImageShizhan Zhu, Sayna Ebrahimi, Angjoo Kanazawa, Trevor Darrell. [doi]
- Score-Based Generative Modeling with Critically-Damped Langevin DiffusionTim Dockhorn, Arash Vahdat, Karsten Kreis. [doi]
- Unrolling PALM for Sparse Semi-Blind Source SeparationMohammad Fahes, Christophe Kervazo, Jérôme Bobin, Florence Tupin. [doi]
- Mapping conditional distributions for domain adaptation under generalized target shiftMatthieu Kirchmeyer, Alain Rakotomamonjy, Emmanuel de Bézenac, Patrick Gallinari. [doi]
- DiffSkill: Skill Abstraction from Differentiable Physics for Deformable Object Manipulations with ToolsXingyu Lin, Zhiao Huang, Yunzhu Li, Joshua B. Tenenbaum, David Held, Chuang Gan. [doi]
- Cross-Lingual Transfer with Class-Weighted Language-Invariant RepresentationsRuicheng Xian, Heng Ji, Han Zhao 0002. [doi]
- LoRA: Low-Rank Adaptation of Large Language ModelsEdward J. Hu, Yelong Shen, Phillip Wallis, Zeyuan Allen Zhu, Yuanzhi Li, Shean Wang, Lu Wang, Weizhu Chen. [doi]
- Fair Normalizing FlowsMislav Balunovic, Anian Ruoss, Martin T. Vechev. [doi]
- The Rich Get Richer: Disparate Impact of Semi-Supervised LearningZhaowei Zhu, Tianyi Luo, Yang Liu. [doi]
- Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and TransferableShaojin Ding, Tianlong Chen, Zhangyang Wang. [doi]
- Map Induction: Compositional spatial submap learning for efficient exploration in novel environmentsSugandha Sharma, Aidan Curtis, Marta Kryven, Joshua B. Tenenbaum, Ila R. Fiete. [doi]
- D-CODE: Discovering Closed-form ODEs from Observed TrajectoriesZhaozhi Qian, Krzysztof Kacprzyk, Mihaela van der Schaar. [doi]
- What Happens after SGD Reaches Zero Loss? --A Mathematical FrameworkZhiyuan Li 0005, Tianhao Wang 0017, Sanjeev Arora. [doi]
- Hierarchical Few-Shot Imitation with Skill Transition ModelsKourosh Hakhamaneshi, Ruihan Zhao 0001, Albert Zhan, Pieter Abbeel, Michael Laskin. [doi]
- Anisotropic Random Feature Regression in High DimensionsGabriel Mel, Jeffrey Pennington. [doi]
- Optimal Representations for Covariate ShiftYangjun Ruan, Yann Dubois, Chris J. Maddison. [doi]
- Discovering Nonlinear PDEs from Scarce Data with Physics-encoded LearningChengping Rao, Pu Ren, Yang Liu, Hao Sun. [doi]
- VC dimension of partially quantized neural networks in the overparametrized regimeYutong Wang, Clayton Scott. [doi]
- The Spectral Bias of Polynomial Neural NetworksMoulik Choraria, Leello Tadesse Dadi, Grigorios Chrysos 0002, Julien Mairal, Volkan Cevher. [doi]
- Looking Back on Learned Experiences For Class/task Incremental LearningMozhgan PourKeshavarz, Guoying Zhao, Mohammad Sabokrou. [doi]
- Multi-objective Optimization by Learning Space PartitionYiyang Zhao, Linnan Wang, Kevin Yang, Tianjun Zhang, Tian Guo 0001, Yuandong Tian. [doi]
- Transfer RL across Observation Feature Spaces via Model-Based RegularizationYanchao Sun, Ruijie Zheng, Xiyao Wang, Andrew E. Cohen, Furong Huang. [doi]
- Phenomenology of Double Descent in Finite-Width Neural NetworksSidak Pal Singh, Aurélien Lucchi, Thomas Hofmann, Bernhard Schölkopf. [doi]
- Superclass-Conditional Gaussian Mixture Model For Learning Fine-Grained EmbeddingsJingchao Ni, Wei Cheng 0002, Zhengzhang Chen, Takayoshi Asakura, Tomoya Soma, Sho Kato, Haifeng Chen. [doi]
- Proof Artifact Co-Training for Theorem Proving with Language ModelsJesse Michael Han, Jason Rute, Yuhuai Wu, Edward W. Ayers, Stanislas Polu. [doi]
- Zero Pixel Directional Boundary by Vector TransformEdoardo Mello Rella, Ajad Chhatkuli, Yun Liu, Ender Konukoglu, Luc Van Gool. [doi]
- MonoDistill: Learning Spatial Features for Monocular 3D Object DetectionZhiyu Chong, Xinzhu Ma, Hong Zhang, Yuxin Yue, Haojie Li, Zhihui Wang, Wanli Ouyang. [doi]
- Procedural generalization by planning with self-supervised world modelsAnkesh Anand, Jacob C. Walker, Yazhe Li, Eszter Vértes, Julian Schrittwieser, Sherjil Ozair, Theophane Weber, Jessica B. Hamrick. [doi]
- The Role of Pretrained Representations for the OOD Generalization of RL AgentsFrederik Träuble, Andrea Dittadi, Manuel Wuthrich, Felix Widmaier, Peter Vincent Gehler, Ole Winther, Francesco Locatello, Olivier Bachem, Bernhard Schölkopf, Stefan Bauer. [doi]
- Conditional Contrastive Learning with KernelYao-Hung Hubert Tsai, Tianqin Li, Martin Q. Ma, Han Zhao 0002, Kun Zhang 0001, Louis-Philippe Morency, Ruslan Salakhutdinov. [doi]
- ClimateGAN: Raising Climate Change Awareness by Generating Images of FloodsVictor Schmidt, Alexandra Luccioni, Mélisande Teng, Tianyu Zhang, Alexia Reynaud, Sunand Raghupathi, Gautier Cosne, Adrien Juraver, Vahe Vardanyan, Alex Hernández-García, Yoshua Bengio. [doi]
- Pretraining Text Encoders with Adversarial Mixture of Training Signal GeneratorsYu Meng 0001, Chenyan Xiong, Payal Bajaj, Saurabh Tiwary, Paul N. Bennett, Jiawei Han 0001, Xia Song. [doi]
- Sample Efficient Stochastic Policy Extragradient Algorithm for Zero-Sum Markov GameZiyi Chen 0002, Shaocong Ma, Yi Zhou 0017. [doi]
- Generalizing Few-Shot NAS with Gradient MatchingShoukang Hu, Ruochen Wang, Lanqing Hong, Zhenguo Li, Cho-Jui Hsieh, Jiashi Feng. [doi]
- Proving the Lottery Ticket Hypothesis for Convolutional Neural NetworksArthur da Cunha, Emanuele Natale, Laurent Viennot. [doi]
- AS-MLP: An Axial Shifted MLP Architecture for VisionDongze Lian, Zehao Yu, Xing Sun, Shenghua Gao. [doi]
- Geometry-Consistent Neural Shape Representation with Implicit Displacement FieldsYifan Wang, Lukas Rahmann, Olga Sorkine-Hornung. [doi]
- Understanding Dimensional Collapse in Contrastive Self-supervised LearningLi Jing, Pascal Vincent, Yann LeCun, Yuandong Tian. [doi]
- Shallow and Deep Networks are Near-Optimal Approximators of Korobov FunctionsMoïse Blanchard, Mohammed Amine Bennouna. [doi]
- Explanations of Black-Box Models based on Directional Feature InteractionsAria Masoomi, Davin Hill, Zhonghui Xu, Craig P. Hersh, Edwin K. Silverman, Peter J. Castaldi, Stratis Ioannidis, Jennifer G. Dy. [doi]
- Associated Learning: an Alternative to End-to-End Backpropagation that Works on CNN, RNN, and TransformerDennis Y. H. Wu, Dinan Lin, Vincent Chen, Hung-Hsuan Chen. [doi]
- Bandit Learning with Joint Effect of Incentivized Sampling, Delayed Sampling Feedback, and Self-Reinforcing User PreferencesTianchen Zhou, Jia Liu, Chaosheng Dong, Yi Sun. [doi]
- EViT: Expediting Vision Transformers via Token ReorganizationsYouwei Liang, Chongjian Ge, Zhan Tong, Yibing Song, Jue Wang 0001, Pengtao Xie. [doi]
- BAM: Bayes with Adaptive MemoryJosue Nassar, Jennifer Rogers Brennan, Ben Evans, Kendall Lowrey. [doi]
- GradSign: Model Performance Inference with Theoretical InsightsZhihao Zhang, Zhihao Jia. [doi]
- A Fine-Grained Analysis on Distribution ShiftOlivia Wiles, Sven Gowal, Florian Stimberg, Sylvestre-Alvise Rebuffi, Ira Ktena, Krishnamurthy Dvijotham, Ali Taylan Cemgil. [doi]
- POETREE: Interpretable Policy Learning with Adaptive Decision TreesAlizée Pace, Alex Chan, Mihaela van der Schaar. [doi]
- How Attentive are Graph Attention Networks?Shaked Brody, Uri Alon 0002, Eran Yahav. [doi]
- Is Importance Weighting Incompatible with Interpolating Classifiers?Ke Alexander Wang, Niladri Shekhar Chatterji, Saminul Haque, Tatsunori Hashimoto. [doi]
- A Conditional Point Diffusion-Refinement Paradigm for 3D Point Cloud CompletionZhaoyang Lyu, Zhifeng Kong, Xudong Xu, Liang Pan, Dahua Lin. [doi]
- Convergent and Efficient Deep Q Learning AlgorithmZhikang T. Wang, Masahito Ueda. [doi]
- Constrained Physical-Statistics Models for Dynamical System Identification and PredictionJérémie Donà, Marie Déchelle, Patrick Gallinari, Marina Levy. [doi]
- GeoDiff: A Geometric Diffusion Model for Molecular Conformation GenerationMinkai Xu, Lantao Yu, Yang Song 0011, Chence Shi, Stefano Ermon, Jian Tang 0005. [doi]
- Learning Audio-Visual Speech Representation by Masked Multimodal Cluster PredictionBowen Shi, Wei-Ning Hsu, Kushal Lakhotia, Abdelrahman Mohamed. [doi]
- Clean Images are Hard to Reblur: Exploiting the Ill-Posed Inverse Task for Dynamic Scene DeblurringSeungjun Nah, Sanghyun Son, Jaerin Lee, Kyoung Mu Lee. [doi]
- Distributionally Robust Models with Parametric Likelihood RatiosPaul Michel, Tatsunori Hashimoto, Graham Neubig. [doi]
- Differentially Private Fractional Frequency Moments Estimation with Polylogarithmic SpaceLun Wang, Iosif Pinelis, Dawn Song. [doi]
- Learning Guarantees for Graph Convolutional Networks on the Stochastic Block ModelWei Lu. [doi]
- Reducing Excessive Margin to Achieve a Better Accuracy vs. Robustness Trade-offRahul Rade, Seyed-Mohsen Moosavi-Dezfooli. [doi]
- Post-Training Detection of Backdoor Attacks for Two-Class and Multi-Attack ScenariosZhen Xiang, David J. Miller 0001, George Kesidis. [doi]
- GRAND++: Graph Neural Diffusion with A Source TermMatthew Thorpe, Tan Minh Nguyen, Hedi Xia, Thomas Strohmer, Andrea L. Bertozzi, Stanley J. Osher, Bao Wang. [doi]
- Hidden Parameter Recurrent State Space Models For Changing Dynamics ScenariosVaisakh Shaj, Dieter Büchler, Rohit Sonker, Philipp Becker, Gerhard Neumann. [doi]
- Non-Linear Operator Approximations for Initial Value ProblemsGaurav Gupta, Xiongye Xiao, Radu Balan, Paul Bogdan. [doi]
- Learning Temporally Causal Latent Processes from General Temporal DataWeiran Yao, Yuewen Sun, Alex Ho, Changyin Sun, Kun Zhang 0001. [doi]
- Language modeling via stochastic processesRose E. Wang, Esin Durmus, Noah D. Goodman, Tatsunori Hashimoto. [doi]
- Salient ImageNet: How to discover spurious features in Deep Learning?Sahil Singla 0002, Soheil Feizi. [doi]
- Responsible Disclosure of Generative Models Using Scalable FingerprintingNing Yu, Vladislav Skripniuk, Dingfan Chen, Larry S. Davis, Mario Fritz. [doi]
- A First-Occupancy Representation for Reinforcement LearningTed Moskovitz, Spencer R. Wilson, Maneesh Sahani. [doi]
- FlexConv: Continuous Kernel Convolutions With Differentiable Kernel SizesDavid W. Romero, Robert-Jan Bruintjes, Jakub Mikolaj Tomczak, Erik J. Bekkers, Mark Hoogendoorn, Jan van Gemert. [doi]
- Expressiveness and Approximation Properties of Graph Neural NetworksFloris Geerts, Juan L. Reutter. [doi]
- Finding Biological Plausibility for Adversarially Robust Features via Metameric TasksAnne Harrington, Arturo Deza. [doi]
- Deep Learning without Shortcuts: Shaping the Kernel with Tailored RectifiersGuodong Zhang, Aleksandar Botev, James Martens. [doi]
- Neural Processes with Stochastic Attention: Paying more attention to the context datasetMingyu Kim, Kyeongryeol Go, Se-Young Yun. [doi]
- FastSHAP: Real-Time Shapley Value EstimationNeil Jethani, Mukund Sudarshan, Ian Connick Covert, Su-In Lee, Rajesh Ranganath. [doi]
- Evidential Turing ProcessesMelih Kandemir, Abdullah Akgül, Manuel Haußmann, Gozde Unal. [doi]
- GDA-AM: On the Effectiveness of Solving Min-Imax Optimization via Anderson MixingHuan He, Shifan Zhao, Yuanzhe Xi, Joyce C. Ho, Yousef Saad. [doi]
- Generative Planning for Temporally Coordinated Exploration in Reinforcement LearningHaichao Zhang, Wei Xu, Haonan Yu. [doi]
- On the Existence of Universal Lottery TicketsRebekka Burkholz, Nilanjana Laha, Rajarshi Mukherjee, Alkis Gotovos. [doi]
- An Explanation of In-context Learning as Implicit Bayesian InferenceSang Michael Xie, Aditi Raghunathan, Percy Liang, Tengyu Ma 0001. [doi]
- Topological Graph Neural NetworksMax Horn, Edward De Brouwer, Michael Moor, Yves Moreau, Bastian Rieck, Karsten M. Borgwardt. [doi]
- $\mathrm{SO}(2)$-Equivariant Reinforcement LearningDian Wang, Robin Walters, Robert Platt. [doi]
- Bootstrapped Meta-LearningSebastian Flennerhag, Yannick Schroecker, Tom Zahavy, Hado van Hasselt, David Silver, Satinder Singh 0001. [doi]
- Anomaly Transformer: Time Series Anomaly Detection with Association DiscrepancyJiehui Xu, Haixu Wu, Jianmin Wang, Mingsheng Long. [doi]
- Implicit Bias of Adversarial Training for Deep Neural NetworksBochen Lv, Zhanxing Zhu. [doi]
- PipeGCN: Efficient Full-Graph Training of Graph Convolutional Networks with Pipelined Feature CommunicationCheng Wan, Youjie Li, Cameron R. Wolfe, Anastasios Kyrillidis, Nam Sung Kim, Yingyan Lin. [doi]
- Hybrid Local SGD for Federated Learning with Heterogeneous CommunicationsYuanxiong Guo, Ying Sun, Rui Hu 0005, Yanmin Gong 0001. [doi]
- Fast Differentiable Matrix Square RootYue Song, Nicu Sebe, Wei Wang 0108. [doi]
- Anti-Concentrated Confidence Bonuses For Scalable ExplorationJordan T. Ash, Cyril Zhang, Surbhi Goel, Akshay Krishnamurthy, Sham M. Kakade. [doi]
- Asymmetry Learning for Counterfactually-invariant Classification in OOD TasksS Chandra Mouli, Bruno Ribeiro. [doi]
- Hindsight Foresight Relabeling for Meta-Reinforcement LearningMichael Wan, Jian Peng 0001, Tanmay Gangwani. [doi]
- Top-N: Equivariant Set and Graph Generation without ExchangeabilityClément Vignac, Pascal Frossard. [doi]
- Learning to Annotate Part Segmentation with Gradient MatchingYu Yang, Xiaotian Cheng, Hakan Bilen, Xiangyang Ji. [doi]
- Learning Curves for SGD on Structured FeaturesBlake Bordelon, Cengiz Pehlevan. [doi]
- FedPara: Low-rank Hadamard Product for Communication-Efficient Federated LearningNam Hyeon-Woo, Moon Ye-Bin, Tae Hyun Oh. [doi]
- It Takes Two to Tango: Mixup for Deep Metric LearningShashanka Venkataramanan, Bill Psomas, Ewa Kijak, Laurent Amsaleg, Konstantinos Karantzalos, Yannis Avrithis. [doi]
- Sparse Communication via Mixed DistributionsAntónio Farinhas, Wilker Aziz, Vlad Niculae, André F. T. Martins. [doi]
- DISSECT: Disentangled Simultaneous Explanations via Concept TraversalsAsma Ghandeharioun, Been Kim, Chun-Liang Li, Brendan Jou, Brian Eoff, Rosalind W. Picard. [doi]
- Practical Integration via Separable Bijective NetworksChristopher M. Bender, Patrick Emmanuel, Michael K. Reiter, Junier Oliva. [doi]
- Robust Unlearnable Examples: Protecting Data Privacy Against Adversarial LearningShaopeng Fu, Fengxiang He, Yang Liu, Li Shen, Dacheng Tao. [doi]
- FedChain: Chained Algorithms for Near-optimal Communication Cost in Federated LearningCharlie Hou, Kiran Koshy Thekumparampil, Giulia Fanti, Sewoong Oh. [doi]
- When should agents explore?Miruna Pislar, David Szepesvari, Georg Ostrovski, Diana L. Borsa, Tom Schaul. [doi]
- Differentiable Scaffolding Tree for Molecule OptimizationTianfan Fu, Wenhao Gao, Cao Xiao, Jacob Yasonik, Connor W. Coley, Jimeng Sun. [doi]
- Reverse Engineering of Imperceptible Adversarial Image PerturbationsYifan Gong 0004, Yuguang Yao, Yize Li, Yimeng Zhang, Xiaoming Liu, Xue Lin, Sijia Liu 0001. [doi]
- Image BERT Pre-training with Online TokenizerJinghao Zhou, Chen Wei 0005, Huiyu Wang, Wei Shen 0002, Cihang Xie, Alan L. Yuille, Tao Kong. [doi]
- PSA-GAN: Progressive Self Attention GANs for Synthetic Time SeriesPaul Jeha, Michael Bohlke-Schneider, Pedro Mercado, Shubham Kapoor, Rajbir-Singh Nirwan, Valentin Flunkert, Jan Gasthaus, Tim Januschowski. [doi]
- Filling the G_ap_s: Multivariate Time Series Imputation by Graph Neural NetworksAndrea Cini, Ivan Marisca, Cesare Alippi. [doi]
- EigenGame Unloaded: When playing games is better than optimizingIan M. Gemp, Brian McWilliams, Claire Vernade, Thore Graepel. [doi]
- Fast Model Editing at ScaleEric Mitchell, Charles Lin, Antoine Bosselut, Chelsea Finn, Christopher D. Manning. [doi]
- Meta-Learning with Fewer Tasks through Task InterpolationHuaxiu Yao, Linjun Zhang, Chelsea Finn. [doi]
- CDTrans: Cross-domain Transformer for Unsupervised Domain AdaptationTongkun Xu, Weihua Chen, Pichao Wang, Fan Wang, Hao Li, Rong Jin 0001. [doi]
- Efficient Learning of Safe Driving Policy via Human-AI Copilot OptimizationQuanyi Li, Zhenghao Peng, Bolei Zhou. [doi]
- Hindsight is 20/20: Leveraging Past Traversals to Aid 3D PerceptionYurong You, Katie Z. Luo, Xiangyu Chen, Junan Chen, Wei-Lun Chao, Wen Sun, Bharath Hariharan, Mark E. Campbell, Kilian Q. Weinberger. [doi]
- Phase Collapse in Neural NetworksFlorentin Guth, John Zarka, Stéphane Mallat. [doi]
- Gradient Information Matters in Policy Optimization by Back-propagating through ModelChongchong Li, Yue Wang 0017, Wei Chen, Yuting Liu, Zhi-Ming Ma, Tie-Yan Liu. [doi]
- Connectome-constrained Latent Variable Model of Whole-Brain Neural ActivityLu Mi, Richard Xu, Sridhama Prakhya, Albert Lin, Nir Shavit, Aravinthan D. T. Samuel, Srinivas C. Turaga. [doi]
- GNN is a Counter? Revisiting GNN for Question AnsweringKuan Wang, Yuyu Zhang, Diyi Yang, Le Song, Tao Qin. [doi]
- Understanding Latent Correlation-Based Multiview Learning and Self-Supervision: An Identifiability PerspectiveQi Lyu, Xiao Fu 0001, Weiran Wang, Songtao Lu. [doi]
- Eliminating Sharp Minima from SGD with Truncated Heavy-tailed NoiseXingyu Wang, Sewoong Oh, Chang-han Rhee. [doi]
- No One Representation to Rule Them All: Overlapping Features of Training MethodsRaphael Gontijo Lopes, Yann Dauphin, Ekin Dogus Cubuk. [doi]
- Rethinking Class-Prior Estimation for Positive-Unlabeled LearningYu Yao, Tongliang Liu, Bo Han 0003, Mingming Gong, Gang Niu 0001, Masashi Sugiyama, Dacheng Tao. [doi]
- Crystal Diffusion Variational Autoencoder for Periodic Material GenerationTian Xie, Xiang Fu, Octavian-Eugen Ganea, Regina Barzilay, Tommi S. Jaakkola. [doi]
- Spanning Tree-based Graph Generation for MoleculesSungsoo Ahn, Binghong Chen, Tianzhe Wang, Le Song. [doi]
- Deconstructing the Inductive Biases of Hamiltonian Neural NetworksNate Gruver, Marc Anton Finzi, Samuel Don Stanton, Andrew Gordon Wilson. [doi]
- Memorizing TransformersYuhuai Wu, Markus Norman Rabe, DeLesley Hutchins, Christian Szegedy. [doi]
- Task Affinity with Maximum Bipartite Matching in Few-Shot LearningCat Phuoc Le, Juncheng Dong, Mohammadreza Soltani, Vahid Tarokh. [doi]
- CrowdPlay: Crowdsourcing Human Demonstrations for Offline LearningMatthias Gerstgrasser, Rakshit Trivedi, David C. Parkes. [doi]
- Language-driven Semantic SegmentationBoyi Li, Kilian Q. Weinberger, Serge J. Belongie, Vladlen Koltun, René Ranftl. [doi]
- A Non-Parametric Regression Viewpoint : Generalization of Overparametrized Deep RELU Network Under Noisy ObservationsNamjoon Suh, Hyunouk Ko, Xiaoming Huo. [doi]
- Neural Structured Prediction for Inductive Node ClassificationMeng Qu, Huiyu Cai, Jian Tang 0005. [doi]
- Distributional Reinforcement Learning with Monotonic SplinesYudong Luo, Guiliang Liu, Haonan Duan, Oliver Schulte, Pascal Poupart. [doi]
- Online Coreset Selection for Rehearsal-based Continual LearningJaehong Yoon, Divyam Madaan, Eunho Yang, Sung Ju Hwang. [doi]
- Imbedding Deep Neural NetworksAndrew Corbett, Dmitry Kangin. [doi]
- Learning Fast, Learning Slow: A General Continual Learning Method based on Complementary Learning SystemElahe Arani, Fahad Sarfraz, Bahram Zonooz. [doi]
- Sound and Complete Neural Network Repair with Minimality and Locality GuaranteesFeisi Fu, Wenchao Li. [doi]
- Finite-Time Convergence and Sample Complexity of Multi-Agent Actor-Critic Reinforcement Learning with Average RewardHairi, Jia Liu 0002, Songtao Lu. [doi]
- Self-Supervision Enhanced Feature Selection with Correlated GatesChangHee Lee, Fergus Imrie, Mihaela van der Schaar. [doi]
- How Do Vision Transformers Work?Namuk Park, Songkuk Kim. [doi]
- Tackling the Generative Learning Trilemma with Denoising Diffusion GANsZhisheng Xiao, Karsten Kreis, Arash Vahdat. [doi]
- Information Bottleneck: Exact Analysis of (Quantized) Neural NetworksStephan Sloth Lorenzen, Christian Igel, Mads Nielsen. [doi]
- Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon ReasoningDhruv Shah, Peng Xu, Yao Lu, Ted Xiao, Alexander Toshev, Sergey Levine, Brian Ichter. [doi]
- A Theory of Tournament RepresentationsArun Rajkumar, Vishnu Veerathu, Abdul Bakey Mir. [doi]
- Maximizing Ensemble Diversity in Deep Reinforcement LearningHassam Sheikh, Mariano Phielipp, Ladislau Bölöni. [doi]
- Surreal-GAN: Semi-Supervised Representation Learning via GAN for uncovering heterogeneous disease-related imaging patternsZhijian Yang, Junhao Wen, Christos Davatzikos. [doi]
- Maximum Entropy RL (Provably) Solves Some Robust RL ProblemsBenjamin Eysenbach, Sergey Levine. [doi]
- iFlood: A Stable and Effective RegularizerYuexiang Xie, Zhen Wang, Yaliang Li, Ce Zhang, Jingren Zhou, Bolin Ding. [doi]
- RotoGrad: Gradient Homogenization in Multitask LearningAdrián Javaloy, Isabel Valera. [doi]
- Spatial Graph Attention and Curiosity-driven Policy for Antiviral Drug DiscoveryYulun Wu, Nicholas Choma, Andrew Deru Chen, Mikaela Cashman, Érica Teixeira Prates, Verónica G. Melesse Vergara, Manesh Shah, Austin Clyde, Thomas S. Brettin, Wibe Albert de Jong, Neeraj Kumar, Martha S. Head, Rick L. Stevens, Peter Nugent, Daniel A. Jacobson, James B. Brown. [doi]
- Conditioning Sequence-to-sequence Networks with Learned ActivationsAlberto Gil Couto Pimentel Ramos, Abhinav Mehrotra, Nicholas Donald Lane, Sourav Bhattacharya. [doi]
- Inverse Online Learning: Understanding Non-Stationary and Reactionary PoliciesAlex J. Chan, Alicia Curth, Mihaela van der Schaar. [doi]
- Safe Neurosymbolic Learning with Differentiable Symbolic ExecutionChenxi Yang, Swarat Chaudhuri. [doi]
- Learning Value Functions from Undirected State-only ExperienceMatthew Chang, Arjun Gupta, Saurabh Gupta 0001. [doi]
- Fooling Explanations in Text ClassifiersAdam Ivankay, Ivan Girardi, Chiara Marchiori, Pascal Frossard. [doi]
- Who Is Your Right Mixup Partner in Positive and Unlabeled LearningChangchun Li, XiMing Li, Lei Feng, Jihong OuYang. [doi]
- An Autoregressive Flow Model for 3D Molecular Geometry Generation from ScratchYouzhi Luo, Shuiwang Ji. [doi]
- Reward Uncertainty for Exploration in Preference-based Reinforcement LearningXinran Liang, Katherine Shu, Kimin Lee, Pieter Abbeel. [doi]
- Spherical Message Passing for 3D Molecular GraphsYi Liu 0059, Limei Wang, Meng Liu, Yuchao Lin, Xuan Zhang, Bora Oztekin, Shuiwang Ji. [doi]
- CrossFormer: A Versatile Vision Transformer Hinging on Cross-scale AttentionWenxiao Wang 0001, Lu Yao, Long Chen 0016, Binbin Lin, Deng Cai 0001, Xiaofei He 0001, Wei Liu. [doi]
- Unsupervised Disentanglement with Tensor Product Representations on the TorusMichael Rotman, Amit Dekel, Shir Gur, Yaron Oz, Lior Wolf. [doi]
- MT3: Multi-Task Multitrack Music TranscriptionJosh Gardner, Ian Simon, Ethan Manilow, Curtis Hawthorne, Jesse H. Engel. [doi]
- When Can We Learn General-Sum Markov Games with a Large Number of Players Sample-Efficiently?Ziang Song, Song Mei, Yu Bai. [doi]
- A Unified Contrastive Energy-based Model for Understanding the Generative Ability of Adversarial TrainingYifei Wang 0001, Yisen Wang 0001, Jiansheng Yang, Zhouchen Lin. [doi]
- Autonomous Learning of Object-Centric Abstractions for High-Level PlanningSteven James, Benjamin Rosman, George Konidaris 0001. [doi]
- Approximation and Learning with Deep Convolutional Models: a Kernel PerspectiveAlberto Bietti. [doi]
- What Makes Better Augmentation Strategies? Augment Difficult but Not too DifferentJaehyung Kim, Dongyeop Kang, Sungsoo Ahn, Jinwoo Shin. [doi]
- CROP: Certifying Robust Policies for Reinforcement Learning through Functional SmoothingFan Wu, Linyi Li, Zijian Huang, Yevgeniy Vorobeychik, Ding Zhao, Bo Li 0026. [doi]
- A Comparison of Hamming Errors of Representative Variable Selection MethodsTracy Ke, Longlin Wang. [doi]
- GradMax: Growing Neural Networks using Gradient InformationUtku Evci, Bart van Merrienboer, Thomas Unterthiner, Fabian Pedregosa, Max Vladymyrov. [doi]
- Generating Videos with Dynamics-aware Implicit Generative Adversarial NetworksSihyun Yu, Jihoon Tack, Sangwoo Mo, Hyunsu Kim, Junho Kim, Jung-Woo Ha 0001, Jinwoo Shin. [doi]
- A Biologically Interpretable Graph Convolutional Network to Link Genetic Risk Pathways and Imaging Phenotypes of DiseaseSayan Ghosal, Qiang Chen, Giulio Pergola, Aaron L. Goldman, William Ulrich, Daniel R. Weinberger, Archana Venkataraman. [doi]
- Who Is the Strongest Enemy? Towards Optimal and Efficient Evasion Attacks in Deep RLYanchao Sun, Ruijie Zheng, Yongyuan Liang, Furong Huang. [doi]
- Dropout Q-Functions for Doubly Efficient Reinforcement LearningTakuya Hiraoka, Takahisa Imagawa, Taisei Hashimoto, Takashi Onishi, Yoshimasa Tsuruoka. [doi]
- Learning Prototype-oriented Set Representations for Meta-LearningDandan Guo, Long Tian, Minghe Zhang, Mingyuan Zhou, Hongyuan Zha. [doi]
- Permutation Compressors for Provably Faster Distributed Nonconvex OptimizationRafal Szlendak, Alexander Tyurin, Peter Richtárik. [doi]
- Learning State Representations via Retracing in Reinforcement LearningChangmin Yu, Dong Li, Jianye Hao, Jun Wang, Neil Burgess. [doi]
- Provable Learning-based Algorithm For Sparse RecoveryXinshi Chen, Haoran Sun, Le Song. [doi]
- BadPre: Task-agnostic Backdoor Attacks to Pre-trained NLP Foundation ModelsKangjie Chen, Yuxian Meng, Xiaofei Sun, Shangwei Guo, Tianwei Zhang 0004, Jiwei Li, Chun Fan. [doi]
- Latent Image Animator: Learning to Animate Images via Latent Space NavigationYaohui Wang, Di Yang, François Brémond, Antitza Dantcheva. [doi]
- Assessing Generalization of SGD via DisagreementYiding Jiang, Vaishnavh Nagarajan, Christina Baek, J. Zico Kolter. [doi]
- An Agnostic Approach to Federated Learning with Class ImbalanceZebang Shen, Juan Cerviño, Hamed Hassani, Alejandro Ribeiro. [doi]
- Group equivariant neural posterior estimationMaximilian Dax, Stephen R. Green, Jonathan Gair, Michael Deistler, Bernhard Schölkopf, Jakob H. Macke. [doi]
- CoMPS: Continual Meta Policy SearchGlen Berseth, Zhiwei Zhang, Grace Zhang, Chelsea Finn, Sergey Levine. [doi]
- Critical Points in Quantum Generative ModelsEric Ricardo Anschütz. [doi]
- Multimeasurement Generative ModelsSaeed Saremi, Rupesh Kumar Srivastava. [doi]
- Hidden Convexity of Wasserstein GANs: Interpretable Generative Models with Closed-Form SolutionsArda Sahiner, Tolga Ergen, Batu Ozturkler, Burak Bartan, John M. Pauly, Morteza Mardani, Mert Pilanci. [doi]
- Recycling Model Updates in Federated Learning: Are Gradient Subspaces Low-Rank?Sheikh Shams Azam, Seyyedali Hosseinalipour, Qiang Qiu, Christopher G. Brinton. [doi]
- Self-Joint Supervised LearningNavid Kardan, Mubarak Shah, Mitch Hill. [doi]
- Pixelated Butterfly: Simple and Efficient Sparse training for Neural Network ModelsBeidi Chen, Tri Dao, Kaizhao Liang, Jiaming Yang, Zhao Song 0002, Atri Rudra, Christopher Ré. [doi]
- Minibatch vs Local SGD with Shuffling: Tight Convergence Bounds and BeyondChulhee Yun, Shashank Rajput, Suvrit Sra. [doi]
- NASPY: Automated Extraction of Automated Machine Learning ModelsXiaoxuan Lou, Shangwei Guo, Jiwei Li, Yaoxin Wu, Tianwei Zhang 0004. [doi]
- Task Relatedness-Based Generalization Bounds for Meta LearningJiechao Guan, Zhiwu Lu 0001. [doi]
- Gradient Step Denoiser for convergent Plug-and-PlaySamuel Hurault, Arthur Leclaire, Nicolas Papadakis. [doi]
- ProtoRes: Proto-Residual Network for Pose Authoring via Learned Inverse KinematicsBoris N. Oreshkin, Florent Bocquelet, Félix G. Harvey, Bay Raitt, Dominic Laflamme. [doi]
- Learning Efficient Image Super-Resolution Networks via Structure-Regularized PruningYulun Zhang, Huan Wang, Can Qin, Yun Fu. [doi]
- Learning Distributionally Robust Models at Scale via Composite OptimizationFarzin Haddadpour, Mohammad Mahdi Kamani, Mehrdad Mahdavi, Amin Karbasi. [doi]
- VICReg: Variance-Invariance-Covariance Regularization for Self-Supervised LearningAdrien Bardes, Jean Ponce, Yann LeCun. [doi]
- QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training QuantizationXiuying Wei, Ruihao Gong, Yuhang Li, Xianglong Liu, Fengwei Yu. [doi]
- Scale Mixtures of Neural Network Gaussian ProcessesHyungi Lee, Eunggu Yun, Hongseok Yang, Juho Lee 0001. [doi]
- KL Guided Domain AdaptationA. Tuan Nguyen, Toan Tran, Yarin Gal, Philip H. S. Torr, Atilim Gunes Baydin. [doi]
- Tuformer: Data-driven Design of Transformers for Improved Generalization or EfficiencyXiaoyu Liu, Jiahao Su, Furong Huang. [doi]
- Can an Image Classifier Suffice For Action Recognition?Quanfu Fan, Chun-Fu Chen 0001, Rameswar Panda. [doi]
- Coordination Among Neural Modules Through a Shared Global WorkspaceAnirudh Goyal, Aniket Rajiv Didolkar, Alex Lamb, Kartikeya Badola, Nan Rosemary Ke, Nasim Rahaman, Jonathan Binas, Charles Blundell, Michael Curtis Mozer, Yoshua Bengio. [doi]
- Neural Methods for Logical Reasoning over Knowledge GraphsAlfonso Amayuelas, Shuai Zhang, Susie Xi Rao, Ce Zhang 0001. [doi]
- How to Train Your MAML to Excel in Few-Shot ClassificationHan-Jia Ye, Wei-Lun Chao. [doi]
- Learning Vision-Guided Quadrupedal Locomotion End-to-End with Cross-Modal TransformersRuihan Yang, Minghao Zhang, Nicklas Hansen, Huazhe Xu, Xiaolong Wang. [doi]
- Uncertainty Modeling for Out-of-Distribution GeneralizationXiaotong Li, Yongxing Dai, Yixiao Ge, Jun Liu, Ying Shan, Lingyu Duan. [doi]
- Continual Normalization: Rethinking Batch Normalization for Online Continual LearningQuang Pham, Chenghao Liu, Steven C. H. Hoi. [doi]
- PI3NN: Out-of-distribution-aware Prediction Intervals from Three Neural NetworksSiyan Liu, Pei Zhang, Dan Lu 0001, Guannan Zhang. [doi]
- Towards Deployment-Efficient Reinforcement Learning: Lower Bound and OptimalityJiawei Huang, Jinglin Chen, Li Zhao, Tao Qin, Nan Jiang, Tie-Yan Liu. [doi]
- Open-Set Recognition: A Good Closed-Set Classifier is All You NeedSagar Vaze, Kai Han 0001, Andrea Vedaldi, Andrew Zisserman. [doi]
- Exploiting Class Activation Value for Partial-Label LearningFei Zhang, Lei Feng, Bo Han 0003, Tongliang Liu, Gang Niu 0001, Tao Qin, Masashi Sugiyama. [doi]
- On Bridging Generic and Personalized Federated Learning for Image ClassificationHong-You Chen, Wei-Lun Chao. [doi]
- ExT5: Towards Extreme Multi-Task Scaling for Transfer LearningVamsi Aribandi, Yi Tay, Tal Schuster, Jinfeng Rao, Huaixiu Steven Zheng, Sanket Vaibhav Mehta, Honglei Zhuang, Vinh Q. Tran 0002, Dara Bahri, Jianmo Ni, Jai Prakash Gupta, Kai Hui 0001, Sebastian Ruder, Donald Metzler. [doi]
- Discriminative Similarity for Data ClusteringYingzhen Yang, Ping Li. [doi]
- How Low Can We Go: Trading Memory for Error in Low-Precision TrainingChengrun Yang, Ziyang Wu, Jerry Chee, Christopher De Sa, Madeleine Udell. [doi]
- Self-supervised Learning is More Robust to Dataset ImbalanceHong Liu, Jeff Z. HaoChen, Adrien Gaidon, Tengyu Ma 0001. [doi]
- MoReL: Multi-omics Relational LearningArman Hasanzadeh, Ehsan Hajiramezanali, Nick Duffield, Xiaoning Qian. [doi]
- Measuring the Interpretability of Unsupervised Representations via Quantized Reversed ProbingIro Laina, Yuki M. Asano, Andrea Vedaldi. [doi]
- Neural Variational Dropout ProcessesInsu Jeon, Youngjin Park, Gunhee Kim. [doi]
- Understanding Intrinsic Robustness Using Label UncertaintyXiao Zhang, David Evans. [doi]
- Declarative nets that are equilibrium modelsRussell Tsuchida, Suk Yee Yong, Mohammad Ali Armin, Lars Petersson, Cheng Soon Ong. [doi]
- Sqrt(d) Dimension Dependence of Langevin Monte CarloRuilin Li, Hongyuan Zha, Molei Tao. [doi]
- On Non-Random Missing Labels in Semi-Supervised LearningXinting Hu, Yulei Niu, Chunyan Miao, Xian-Sheng Hua 0001, Hanwang Zhang. [doi]
- On Redundancy and Diversity in Cell-based Neural Architecture SearchXingchen Wan, Binxin Ru, Pedro M. Esperança, Zhenguo Li. [doi]
- Learning more skills through optimistic explorationDJ Strouse, Kate Baumli, David Warde-Farley, Volodymyr Mnih, Steven Stenberg Hansen. [doi]
- Learning to Dequantise with Truncated FlowsShawn Tan, Chin-Wei Huang, Alessandro Sordoni, Aaron C. Courville. [doi]
- On the Convergence of Certified Robust Training with Interval Bound PropagationYihan Wang, Zhouxing Shi, Quanquan Gu, Cho-Jui Hsieh. [doi]
- Leveraging Automated Unit Tests for Unsupervised Code TranslationBaptiste Rozière, Jie Zhang, François Charton, Mark Harman, Gabriel Synnaeve, Guillaume Lample. [doi]
- Active Hierarchical Exploration with Stable Subgoal Representation LearningSiyuan Li, Jin Zhang, Jianhao Wang, Yang Yu, Chongjie Zhang. [doi]
- Relational Learning with Variational BayesKuang-Hung Liu. [doi]
- cosFormer: Rethinking Softmax In AttentionZhen Qin, Weixuan Sun, Hui Deng, Dongxu Li, Yunshen Wei, Baohong Lv, Junjie Yan, Lingpeng Kong, Yiran Zhong. [doi]
- Cross-Trajectory Representation Learning for Zero-Shot Generalization in RLBogdan Mazoure, Ahmed M Ahmed, R. Devon Hjelm, Andrey Kolobov, Patrick MacAlpine. [doi]
- LORD: Lower-Dimensional Embedding of Log-Signature in Neural Rough Differential EquationsJaehoon Lee 0002, Jinsung Jeon, Sheo Yon Jhin, Jihyeon Hyeong, Jayoung Kim 0002, Minju Jo, Kook Seungji, Noseong Park. [doi]
- Language-biased image classification: evaluation based on semantic representationsYoann Lemesle, Masataka Sawayama, Guillermo Valle Pérez, Maxime Adolphe, Hélène Sauzéon, Pierre-Yves Oudeyer. [doi]
- The Role of Permutation Invariance in Linear Mode Connectivity of Neural NetworksRahim Entezari, Hanie Sedghi, Olga Saukh, Behnam Neyshabur. [doi]
- Properties from mechanisms: an equivariance perspective on identifiable representation learningKartik Ahuja, Jason Hartford, Yoshua Bengio. [doi]
- Synchromesh: Reliable Code Generation from Pre-trained Language ModelsGabriel Poesia, Alex Polozov, Vu Le 0002, Ashish Tiwari 0001, Gustavo Soares, Christopher Meek, Sumit Gulwani. [doi]
- Feature Kernel DistillationBobby He, Mete Ozay. [doi]
- Rethinking Adversarial Transferability from a Data Distribution PerspectiveYao Zhu, Jiacheng Sun, Zhenguo Li. [doi]
- Robbing the Fed: Directly Obtaining Private Data in Federated Learning with Modified ModelsLiam H. Fowl, Jonas Geiping, Wojciech Czaja, Micah Goldblum, Tom Goldstein. [doi]
- R4D: Utilizing Reference Objects for Long-Range Distance EstimationYingwei Li, Tiffany Chen, Maya Kabkab, Ruichi Yu, Longlong Jing, Yurong You, Hang Zhao. [doi]
- IntSGD: Adaptive Floatless Compression of Stochastic GradientsKonstantin Mishchenko, Bokun Wang, Dmitry Kovalev, Peter Richtárik. [doi]
- Neural graphical modelling in continuous-time: consistency guarantees and algorithmsAlexis Bellot, Kim Branson, Mihaela van der Schaar. [doi]
- Wish you were here: Hindsight Goal Selection for long-horizon dexterous manipulationTodor Davchev, Oleg Olegovich Sushkov, Jean-Baptiste Regli, Stefan Schaal, Yusuf Aytar, Markus Wulfmeier, Jon Scholz. [doi]
- Cross-Domain Imitation Learning via Optimal TransportArnaud Fickinger, Samuel Cohen, Stuart Russell 0001, Brandon Amos. [doi]
- Graph-Relational Domain AdaptationZihao Xu, Hao He 0011, Guang-He Lee, Bernie Wang, Hao Wang. [doi]
- End-to-End Learning of Probabilistic Hierarchies on GraphsDaniel Zügner, Bertrand Charpentier, Morgane Ayle, Sascha Geringer, Stephan Günnemann. [doi]
- Progressive Distillation for Fast Sampling of Diffusion ModelsTim Salimans, Jonathan Ho. [doi]
- Transferable Adversarial Attack based on Integrated GradientsYi Huang, Adams Wai-Kin Kong. [doi]
- Generalized Natural Gradient Flows in Hidden Convex-Concave Games and GANsAndjela Mladenovic, Iosif Sakos, Gauthier Gidel, Georgios Piliouras. [doi]
- Regularized Autoencoders for Isometric Representation LearningYonghyeon Lee, Sangwoong Yoon, MinJun Son, Frank Chongwoo Park. [doi]
- Givens Coordinate Descent Methods for Rotation Matrix Learning in Trainable Embedding IndexesYunjiang Jiang, Han Zhang, Yiming Qiu, Yun Xiao, Bo Long, Wen-Yun Yang. [doi]
- Optimal ANN-SNN Conversion for High-accuracy and Ultra-low-latency Spiking Neural NetworksTong Bu, Wei Fang, Jianhao Ding, Penglin Dai, Zhaofei Yu, Tiejun Huang 0001. [doi]
- Exposing the Implicit Energy Networks behind Masked Language Models via Metropolis--HastingsKartik Goyal, Chris Dyer, Taylor Berg-Kirkpatrick. [doi]
- Understanding approximate and unrolled dictionary learning for pattern recoveryBenoît Malézieux, Thomas Moreau, Matthieu Kowalski. [doi]
- Coherence-based Label Propagation over Time Series for Accelerated Active LearningYooju Shin, Susik Yoon, Sundong Kim, Hwanjun Song, Jae-Gil Lee 0001, Byung Suk Lee. [doi]
- LFPT5: A Unified Framework for Lifelong Few-shot Language Learning Based on Prompt Tuning of T5Chengwei Qin, Shafiq Joty. [doi]
- Learning Towards The Largest MarginsXiong Zhou, Xianming Liu, Deming Zhai, Junjun Jiang, Xin Gao, Xiangyang Ji. [doi]
- AdaAug: Learning Class- and Instance-adaptive Data Augmentation PoliciesTsz-Him Cheung, Dit-Yan Yeung. [doi]
- A Theoretical Analysis on Feature Learning in Neural Networks: Emergence from Inputs and Advantage over Fixed FeaturesZhenmei Shi, Junyi Wei, Yingyu Liang. [doi]
- Practical Conditional Neural Process Via Tractable Dependent PredictionsStratis Markou, James Requeima, Wessel P. Bruinsma, Anna Vaughan, Richard E. Turner. [doi]
- Large-Scale Representation Learning on Graphs via BootstrappingShantanu Thakoor, Corentin Tallec, Mohammad Gheshlaghi Azar, Mehdi Azabou, Eva L. Dyer, Rémi Munos, Petar Velickovic, Michal Valko. [doi]
- The Boltzmann Policy Distribution: Accounting for Systematic Suboptimality in Human ModelsCassidy Laidlaw, Anca D. Dragan. [doi]
- GATSBI: Generative Adversarial Training for Simulation-Based InferencePoornima Ramesh, Jan-Matthis Lueckmann, Jan Boelts, Álvaro Tejero-Cantero, David S. Greenberg, Pedro J. Gonçalves, Jakob H. Macke. [doi]
- How Does SimSiam Avoid Collapse Without Negative Samples? A Unified Understanding with Self-supervised Contrastive LearningChaoning Zhang, Kang Zhang, Chenshuang Zhang, Trung X. Pham, Chang D. Yoo, In-So Kweon. [doi]
- Learn Locally, Correct Globally: A Distributed Algorithm for Training Graph Neural NetworksMorteza Ramezani, Weilin Cong, Mehrdad Mahdavi, Mahmut T. Kandemir, Anand Sivasubramaniam. [doi]
- SURF: Semi-supervised Reward Learning with Data Augmentation for Feedback-efficient Preference-based Reinforcement LearningJongjin Park, Younggyo Seo, Jinwoo Shin, Honglak Lee, Pieter Abbeel, Kimin Lee. [doi]
- Label-Efficient Semantic Segmentation with Diffusion ModelsDmitry Baranchuk, Andrey Voynov, Ivan Rubachev, Valentin Khrulkov, Artem Babenko. [doi]
- Differentiable DAG SamplingBertrand Charpentier, Simon Kibler, Stephan Günnemann. [doi]
- Near-optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with PessimismMing Yin, Yaqi Duan, Mengdi Wang, Yu-Xiang Wang 0003. [doi]
- Implicit Bias of Projected Subgradient Method Gives Provable Robust Recovery of Subspaces of Unknown CodimensionParis Giampouras, Benjamin David Haeffele, René Vidal. [doi]
- Towards Understanding the Robustness Against Evasion Attack on Categorical DataHongyan Bao, Yufei Han, Yujun Zhou, Yun Shen, Xiangliang Zhang 0001. [doi]
- Charformer: Fast Character Transformers via Gradient-based Subword TokenizationYi Tay, Vinh Q. Tran 0002, Sebastian Ruder, Jai Prakash Gupta, Hyung Won Chung, Dara Bahri, Zhen Qin 0001, Simon Baumgartner, Cong Yu 0001, Donald Metzler. [doi]
- Encoding Weights of Irregular Sparsity for Fixed-to-Fixed Model CompressionBaeseong Park, Se Jung Kwon, Daehwan Oh, Byeongwook Kim, Dongsoo Lee. [doi]
- Increasing the Cost of Model Extraction with Calibrated Proof of WorkAdam Dziedzic, Muhammad Ahmad Kaleem, Yu Shen Lu, Nicolas Papernot. [doi]
- Contrastive Fine-grained Class Clustering via Generative Adversarial NetworksYunji Kim, Jung-Woo Ha. [doi]
- Continuously Discovering Novel Strategies via Reward-Switching Policy OptimizationZihan Zhou 0002, Wei Fu, Bingliang Zhang, Yi Wu. [doi]
- Machine Learning For Elliptic PDEs: Fast Rate Generalization Bound, Neural Scaling Law and Minimax OptimalityYiping Lu 0001, Haoxuan Chen, Jianfeng Lu 0001, Lexing Ying, Jose H. Blanchet. [doi]
- Multiset-Equivariant Set Prediction with Approximate Implicit DifferentiationYan Zhang, David W. Zhang, Simon Lacoste-Julien, Gertjan J. Burghouts, Cees G. M. Snoek. [doi]
- Filtered-CoPhy: Unsupervised Learning of Counterfactual Physics in Pixel SpaceSteeven Janny, Fabien Baradel, Natalia Neverova, Madiha Nadri, Greg Mori, Christian Wolf 0001. [doi]
- Focus on the Common Good: Group Distributional Robustness FollowsVihari Piratla, Praneeth Netrapalli, Sunita Sarawagi. [doi]
- Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training ParadigmYangguang Li, Feng Liang, Lichen Zhao, Yufeng Cui, Wanli Ouyang, Jing Shao, Fengwei Yu, Junjie Yan. [doi]
- Imitation Learning by Reinforcement LearningKamil Ciosek. [doi]
- Do We Need Anisotropic Graph Neural Networks?Shyam A. Tailor, Felix L. Opolka, Pietro Liò, Nicholas Donald Lane. [doi]
- Benchmarking the Spectrum of Agent CapabilitiesDanijar Hafner. [doi]
- Toward Efficient Low-Precision Training: Data Format Optimization and Hysteresis QuantizationSunWoo Lee, Jeongwoo Park, Dongsuk Jeon. [doi]
- Hybrid Random FeaturesKrzysztof Marcin Choromanski, Han Lin, Haoxian Chen, Arijit Sehanobish, Yuanzhe Ma, Deepali Jain, Jake Varley, Andy Zeng, Michael S. Ryoo, Valerii Likhosherstov, Dmitry Kalashnikov, Vikas Sindhwani, Adrian Weller. [doi]
- Towards a Unified View of Parameter-Efficient Transfer LearningJunxian He, Chunting Zhou, Xuezhe Ma, Taylor Berg-Kirkpatrick, Graham Neubig. [doi]
- MIDI-DDSP: Detailed Control of Musical Performance via Hierarchical ModelingYusong Wu, Ethan Manilow, Yi Deng, Rigel Swavely, Kyle Kastner, Tim Cooijmans, Aaron C. Courville, Cheng-Zhi Anna Huang, Jesse H. Engel. [doi]
- Anomaly Detection for Tabular Data with Internal Contrastive LearningTom Shenkar, Lior Wolf. [doi]
- Dynamic Token Normalization improves Vision TransformersWenqi Shao, Yixiao Ge, Zhaoyang Zhang, Xuyuan Xu, Xiaogang Wang, Ying Shan, Ping Luo. [doi]
- Extending the WILDS Benchmark for Unsupervised AdaptationShiori Sagawa, Pang Wei Koh, Tony Lee, Irena Gao, Sang Michael Xie, Kendrick Shen, Ananya Kumar, Weihua Hu, Michihiro Yasunaga, Henrik Marklund, Sara Beery, Etienne David, Ian Stavness, Wei Guo, Jure Leskovec, Kate Saenko, Tatsunori Hashimoto, Sergey Levine, Chelsea Finn, Percy Liang. [doi]
- Diurnal or Nocturnal? Federated Learning of Multi-branch Networks from Periodically Shifting DistributionsChen Zhu, Zheng Xu 0002, Mingqing Chen, Jakub Konecný, Andrew Hard, Tom Goldstein. [doi]
- Case-based reasoning for better generalization in textual reinforcement learningMattia Atzeni, Shehzaad Zuzar Dhuliawala, Keerthiram Murugesan, Mrinmaya Sachan. [doi]
- UniFormer: Unified Transformer for Efficient Spatial-Temporal Representation LearningKunchang Li, Yali Wang 0001, Peng Gao 0007, Guanglu Song, Yu Liu 0015, Hongsheng Li 0001, Yu Qiao 0001. [doi]
- Learning Graphon Mean Field Games and Approximate Nash EquilibriaKai Cui 0001, Heinz Koeppl. [doi]
- Know Thyself: Transferable Visual Control Policies Through Robot-AwarenessEdward S. Hu, Kun Huang, Oleh Rybkin, Dinesh Jayaraman. [doi]
- How to Inject Backdoors with Better Consistency: Logit Anchoring on Clean DataZhiyuan Zhang, Lingjuan Lyu, Weiqiang Wang, Lichao Sun, Xu Sun 0001. [doi]
- Linking Emergent and Natural Languages via Corpus TransferShunyu Yao, Mo Yu, Yang Zhang, Karthik R. Narasimhan, Joshua B. Tenenbaum, Chuang Gan. [doi]
- Diffusion-Based Voice Conversion with Fast Maximum Likelihood Sampling SchemeVadim Popov, Ivan Vovk, Vladimir Gogoryan, Tasnima Sadekova, Mikhail Sergeevich Kudinov, Jiansheng Wei. [doi]
- Neural Program Synthesis with QueryDi Huang, Rui Zhang 0040, Xing Hu 0001, Xishan Zhang, Pengwei Jin, Nan Li, Zidong Du, Qi Guo, Yunji Chen. [doi]
- Domain Adversarial Training: A Game PerspectiveDavid Acuna, Marc T. Law, Guojun Zhang, Sanja Fidler. [doi]
- Selective Ensembles for Consistent PredictionsEmily Black, Klas Leino, Matt Fredrikson. [doi]
- The Close Relationship Between Contrastive Learning and Meta-LearningRenkun Ni, Manli Shu, Hossein Souri, Micah Goldblum, Tom Goldstein. [doi]
- Omni-Dimensional Dynamic ConvolutionChao Li, Aojun Zhou, Anbang Yao. [doi]
- Sparse DETR: Efficient End-to-End Object Detection with Learnable SparsityByungseok Roh, Jaewoong Shin, Wuhyun Shin, Saehoon Kim. [doi]
- Continual Learning with Filter Atom SwappingZichen Miao, Ze Wang, Wei Chen, Qiang Qiu. [doi]
- Variational Predictive Routing with Nested Subjective TimescalesAlexey Zakharov, Qinghai Guo, Zafeirios Fountas. [doi]
- Resolving Training Biases via Influence-based Data RelabelingShuming Kong, Yanyan Shen, Linpeng Huang. [doi]
- Constructing Orthogonal Convolutions in an Explicit MannerTan Yu, Jun Li, Yunfeng Cai, Ping Li. [doi]
- Data Efficient Language-Supervised Zero-Shot Recognition with Optimal Transport DistillationBichen Wu, Ruizhe Cheng, Peizhao Zhang, Tianren Gao, Joseph E. Gonzalez, Peter Vajda. [doi]
- On the role of population heterogeneity in emergent communicationMathieu Rita, Florian Strub, Jean-Bastien grill, Olivier Pietquin, Emmanuel Dupoux. [doi]
- THOMAS: Trajectory Heatmap Output with learned Multi-Agent SamplingThomas Gilles, Stefano Sabatini, Dzmitry Tsishkou, Bogdan Stanciulescu, Fabien Moutarde. [doi]
- Ada-NETS: Face Clustering via Adaptive Neighbour Discovery in the Structure SpaceYaohua Wang, Yaobin Zhang, Fangyi Zhang, Senzhang Wang, Ming Lin, Yuqi Zhang, Xiuyu Sun. [doi]
- Churn Reduction via DistillationHeinrich Jiang, Harikrishna Narasimhan, Dara Bahri, Andrew Cotter, Afshin Rostamizadeh. [doi]
- Byzantine-Robust Learning on Heterogeneous Datasets via BucketingSai Praneeth Karimireddy, Lie He, Martin Jaggi. [doi]
- Measuring CLEVRness: Black-box Testing of Visual Reasoning ModelsSpyridon Mouselinos, Henryk Michalewski, Mateusz Malinowski. [doi]
- ComPhy: Compositional Physical Reasoning of Objects and Events from VideosZhenfang Chen, Kexin Yi, Yunzhu Li, Mingyu Ding, Antonio Torralba 0001, Joshua B. Tenenbaum, Chuang Gan. [doi]
- Delaunay Component Analysis for Evaluation of Data RepresentationsPetra Poklukar, Vladislav Polianskii, Anastasiia Varava, Florian T. Pokorny, Danica Kragic Jensfelt. [doi]
- Self-ensemble Adversarial Training for Improved RobustnessHongjun Wang, Yisen Wang 0001. [doi]
- Adversarially Robust Conformal PredictionAsaf Gendler, Tsui-Wei Weng, Luca Daniel, Yaniv Romano. [doi]
- Transformer-based Transform CodingYinhao Zhu, Yang Yang 0010, Taco Cohen. [doi]
- DEPTS: Deep Expansion Learning for Periodic Time Series ForecastingWei Fan 0010, Shun Zheng, Xiaohan Yi, Wei Cao, Yanjie Fu, Jiang Bian 0002, Tie-Yan Liu. [doi]
- On Lottery Tickets and Minimal Task Representations in Deep Reinforcement LearningMarc Aurel Vischer, Robert Tjarko Lange, Henning Sprekeler. [doi]
- Nonlinear ICA Using Volume-Preserving TransformationsXiaojiang Yang, Yi Wang, Jiacheng Sun, Xing Zhang, Shifeng Zhang, Zhenguo Li, Junchi Yan. [doi]
- Learning transferable motor skills with hierarchical latent mixture policiesDushyant Rao, Fereshteh Sadeghi, Leonard Hasenclever, Markus Wulfmeier, Martina Zambelli, Giulia Vezzani, Dhruva Tirumala, Yusuf Aytar, Josh Merel, Nicolas Heess, Raia Hadsell. [doi]
- Controlling the Complexity and Lipschitz Constant improves Polynomial NetsZhenyu Zhu, Fabian Latorre, Grigorios Chrysos 0002, Volkan Cevher. [doi]
- CoordX: Accelerating Implicit Neural Representation with a Split MLP ArchitectureRuofan Liang, Hongyi Sun, Nandita Vijaykumar. [doi]
- Is Fairness Only Metric Deep? Evaluating and Addressing Subgroup Gaps in Deep Metric LearningNatalie Dullerud, Karsten Roth, Kimia Hamidieh, Nicolas Papernot, Marzyeh Ghassemi. [doi]
- Optimal Transport for Causal DiscoveryRuibo Tu, Kun Zhang, Hedvig Kjellström, Cheng Zhang 0005. [doi]
- Comparing Distributions by Measuring Differences that Affect Decision MakingShengjia Zhao, Abhishek Sinha, Yutong He, Aidan Perreault, Jiaming Song, Stefano Ermon. [doi]
- Mention Memory: incorporating textual knowledge into Transformers through entity mention attentionMichiel de Jong, Yury Zemlyanskiy, Nicholas FitzGerald, Fei Sha, William W. Cohen. [doi]
- Interacting Contour Stochastic Gradient Langevin DynamicsWei Deng 0002, Siqi Liang, Botao Hao, Guang Lin, Faming Liang. [doi]
- Lossless Compression with Probabilistic CircuitsAnji Liu, Stephan Mandt, Guy Van den Broeck. [doi]
- Query Embedding on Hyper-Relational Knowledge GraphsDimitrios Alivanistos, Max Berrendorf, Michael Cochez, Mikhail Galkin 0001. [doi]
- SimVLM: Simple Visual Language Model Pretraining with Weak SupervisionZirui Wang, Jiahui Yu, Adams Wei Yu, Zihang Dai, Yulia Tsvetkov, Yuan Cao 0007. [doi]
- Fairness in Representation for Multilingual NLP: Insights from Controlled Experiments on Conditional Language ModelingAda Wan. [doi]
- Missingness Bias in Model DebuggingSaachi Jain, Hadi Salman, Eric Wong, Pengchuan Zhang, Vibhav Vineet, Sai Vemprala, Aleksander Madry. [doi]
- Learning Optimal Conformal ClassifiersDavid Stutz, Krishnamurthy Dvijotham, Ali Taylan Cemgil, Arnaud Doucet. [doi]
- Dive Deeper Into Integral Pose RegressionKerui Gu, Linlin Yang, Angela Yao. [doi]
- Variational Neural Cellular AutomataRasmus Berg Palm, Miguel González Duque, Shyam Sudhakaran, Sebastian Risi. [doi]
- BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech SynthesisMax W. Y. Lam, Jun Wang 0091, Dan Su 0002, Dong Yu 0001. [doi]
- Towards Training Billion Parameter Graph Neural Networks for Atomic SimulationsAnuroop Sriram, Abhishek Das, Brandon M. Wood, Siddharth Goyal, C. Lawrence Zitnick. [doi]
- Towards Deepening Graph Neural Networks: A GNTK-based Optimization PerspectiveWei Huang, Yayong Li, Weitao Du, Richard Y. D. Xu, Jie Yin, Ling Chen, Miao Zhang. [doi]
- Signing the Supermask: Keep, Hide, InvertNils Koster, Oliver Grothe, Achim Rettinger. [doi]
- Learning Weakly-supervised Contrastive RepresentationsYao-Hung Hubert Tsai, Tianqin Li, Weixin Liu, Peiyuan Liao, Ruslan Salakhutdinov, Louis-Philippe Morency. [doi]
- EntQA: Entity Linking as Question AnsweringWenzheng Zhang, Wenyue Hua, Karl Stratos. [doi]
- Recursive Disentanglement NetworkYixuan Chen 0003, Yubin Shi, Dongsheng Li 0002, Yujiang Wang 0001, Mingzhi Dong, Yingying Zhao, Robert P. Dick, Qin Lv, Fan Yang, Li Shang. [doi]
- MetaMorph: Learning Universal Controllers with TransformersAgrim Gupta, Linxi Fan, Surya Ganguli, Li Fei-Fei 0001. [doi]
- Hybrid Memoised Wake-Sleep: Approximate Inference at the Discrete-Continuous InterfaceTuan Anh Le 0001, Katherine M. Collins, Luke Hewitt, Kevin Ellis, Siddharth Narayanaswamy, Samuel Gershman, Joshua B. Tenenbaum. [doi]
- Path Auxiliary Proposal for MCMC in Discrete SpaceHaoran Sun, Hanjun Dai, Wei Xia, Arun Ramamurthy. [doi]
- Steerable Partial Differential Operators for Equivariant Neural NetworksErik Jenner, Maurice Weiler. [doi]
- The Hidden Convex Optimization Landscape of Regularized Two-Layer ReLU Networks: an Exact Characterization of Optimal SolutionsYifei Wang, Jonathan Lacotte, Mert Pilanci. [doi]
- COPA: Certifying Robust Policies for Offline Reinforcement Learning against Poisoning AttacksFan Wu, Linyi Li, Huan Zhang 0001, Bhavya Kailkhura, Krishnaram Kenthapadi, Ding Zhao, Bo Li 0026. [doi]
- Prototypical Contrastive Predictive CodingKyungmin Lee. [doi]
- A Statistical Framework for Efficient Out of Distribution Detection in Deep Neural NetworksMatan Haroush, Tzviel Frostig, Ruth Heller, Daniel Soudry. [doi]
- Closed-form Sample Probing for Learning Generative Models in Zero-shot LearningSamet Çetin, Orhun Bugra Baran, Ramazan Gokberk Cinbis. [doi]
- Learning-Augmented $k$-means ClusteringJon C. Ergun, Zhili Feng, Sandeep Silwal, David P. Woodruff, Samson Zhou. [doi]
- Conditional Object-Centric Learning from VideoThomas Kipf, Gamaleldin Fathy Elsayed, Aravindh Mahendran, Austin Stone, Sara Sabour, Georg Heigold, Rico Jonschkowski, Alexey Dosovitskiy, Klaus Greff. [doi]
- Evaluating Distributional Distortion in Neural Language ModelingBenjamin LeBrun, Alessandro Sordoni, Timothy J. O'Donnell. [doi]
- Analyzing and Improving the Optimization Landscape of Noise-Contrastive EstimationBingbin Liu, Elan Rosenfeld, Pradeep Kumar Ravikumar, Andrej Risteski. [doi]
- A Fine-Tuning Approach to Belief State ModelingSamuel Sokota, Hengyuan Hu, David J. Wu, J. Zico Kolter, Jakob Nicolaus Foerster, Noam Brown. [doi]
- Node Feature Extraction by Self-Supervised Multi-scale Neighborhood PredictionEli Chien, Wei-Cheng Chang, Cho-Jui Hsieh, Hsiang-Fu Yu, Jiong Zhang, Olgica Milenkovic, Inderjit S. Dhillon. [doi]
- $\pi$BO: Augmenting Acquisition Functions with User Beliefs for Bayesian OptimizationCarl Hvarfner, Danny Stoll, Artur L. F. Souza, Marius Lindauer, Frank Hutter, Luigi Nardi. [doi]
- Knowledge Removal in Sampling-based Bayesian InferenceShaopeng Fu, Fengxiang He, Dacheng Tao. [doi]
- Capacity of Group-invariant Linear Readouts from Equivariant Representations: How Many Objects can be Linearly Classified Under All Possible Views?Matthew Farrell, Blake Bordelon, Shubhendu Trivedi, Cengiz Pehlevan. [doi]
- Counterfactual Plans under Distributional AmbiguityNgoc Bui, Duy Nguyen, Viet Anh Nguyen. [doi]
- Learnability of convolutional neural networks for infinite dimensional input via mixed and anisotropic smoothnessSho Okumoto, Taiji Suzuki. [doi]
- Learning Continuous Environment Fields via Implicit FunctionsXueting Li, Shalini De Mello, Xiaolong Wang, Ming-Hsuan Yang 0001, Jan Kautz, Sifei Liu. [doi]
- Ab-Initio Potential Energy Surfaces by Pairing GNNs with Neural Wave FunctionsNicholas Gao, Stephan Günnemann. [doi]
- SDEdit: Guided Image Synthesis and Editing with Stochastic Differential EquationsChenlin Meng, Yutong He, Yang Song, Jiaming Song, Jiajun Wu 0001, Jun-Yan Zhu, Stefano Ermon. [doi]
- Few-Shot Backdoor Attacks on Visual Object TrackingYiming Li 0004, Haoxiang Zhong, Xingjun Ma, Yong Jiang, Shu-Tao Xia. [doi]
- Post hoc Explanations may be Ineffective for Detecting Unknown Spurious CorrelationJulius Adebayo, Michael Muelly, Harold Abelson, Been Kim. [doi]
- EXACT: Scalable Graph Neural Networks Training via Extreme Activation CompressionZirui Liu, Kaixiong Zhou, Fan Yang, Li Li, Rui Chen, Xia Hu. [doi]
- Vitruvion: A Generative Model of Parametric CAD SketchesAri Seff, Wenda Zhou, Nick Richardson, Ryan P. Adams. [doi]
- TRGP: Trust Region Gradient Projection for Continual LearningSen Lin, Li Yang, Deliang Fan, Junshan Zhang. [doi]
- On the Limitations of Multimodal VAEsImant Daunhawer, Thomas M. Sutter, Kieran Chin-Cheong, Emanuele Palumbo, Julia E. Vogt. [doi]
- Objects in Semantic TopologyShuo Yang, Peize Sun, Yi Jiang, Xiaobo Xia, Ruiheng Zhang, Zehuan Yuan, Changhu Wang, Ping Luo, Min Xu. [doi]
- miniF2F: a cross-system benchmark for formal Olympiad-level mathematicsKunhao Zheng, Jesse Michael Han, Stanislas Polu. [doi]
- Sparse Attention with Learning to HashZhiqing Sun, Yiming Yang, Shinjae Yoo. [doi]
- Particle Stochastic Dual Coordinate Ascent: Exponential convergent algorithm for mean field neural network optimizationKazusato Oko, Taiji Suzuki, Atsushi Nitanda, Denny Wu. [doi]
- Learning Disentangled Representation by Exploiting Pretrained Generative Models: A Contrastive Learning ViewXuanchi Ren, Tao Yang, Yuwang Wang, Wenjun Zeng. [doi]
- Should I Run Offline Reinforcement Learning or Behavioral Cloning?Aviral Kumar, Joey Hong, Anikait Singh, Sergey Levine. [doi]
- Query Efficient Decision Based Sparse Attacks Against Black-Box Deep Learning ModelsViet Quoc Vo, Ehsan Abbasnejad, Damith Ranasinghe. [doi]
- Structure-Aware Transformer Policy for Inhomogeneous Multi-Task Reinforcement LearningSunghoon Hong, Deunsol Yoon, Kee-Eung Kim. [doi]
- Self-Supervised Inference in State-Space ModelsDavid Ruhe, Patrick Forré. [doi]
- DIVA: Dataset Derivative of a Learning TaskYonatan Dukler, Alessandro Achille, Giovanni Paolini, Avinash Ravichandran, Marzia Polito, Stefano Soatto. [doi]
- Divisive Feature Normalization Improves Image Recognition Performance in AlexNetMichelle Miller, SueYeon Chung, Kenneth D. Miller. [doi]
- On the Pitfalls of Analyzing Individual Neurons in Language ModelsOmer Antverg, Yonatan Belinkov. [doi]
- DemoDICE: Offline Imitation Learning with Supplementary Imperfect DemonstrationsGeon-hyeong Kim, Seokin Seo, Jongmin Lee 0004, Wonseok Jeon, HyeongJoo Hwang, Hongseok Yang, Kee-Eung Kim. [doi]
- A Tale of Two Flows: Cooperative Learning of Langevin Flow and Normalizing Flow Toward Energy-Based ModelJianwen Xie, Yaxuan Zhu, Jun Li, Ping Li. [doi]
- Adversarial Support AlignmentShangyuan Tong, Timur Garipov, Yang Zhang, Shiyu Chang, Tommi S. Jaakkola. [doi]
- Quantitative Performance Assessment of CNN Units via Topological Entropy CalculationYang Zhao, Hao Zhang. [doi]
- Learning a subspace of policies for online adaptation in Reinforcement LearningJean-Baptiste Gaya, Laure Soulier, Ludovic Denoyer. [doi]
- Complete Verification via Multi-Neuron Relaxation Guided Branch-and-BoundClaudio Ferrari, Mark Niklas Müller, Nikola Jovanovic, Martin T. Vechev. [doi]
- On the Connection between Local Attention and Dynamic Depth-wise ConvolutionQi Han, Zejia Fan, Qi Dai, Lei Sun, Ming-Ming Cheng, Jiaying Liu 0001, Jingdong Wang 0001. [doi]
- Privacy Implications of ShufflingCasey Meehan, Amrita Roy Chowdhury 0001, Kamalika Chaudhuri, Somesh Jha. [doi]
- How to Robustify Black-Box ML Models? A Zeroth-Order Optimization PerspectiveYimeng Zhang, Yuguang Yao, Jinghan Jia, Jinfeng Yi, Mingyi Hong, Shiyu Chang, Sijia Liu 0001. [doi]
- Unsupervised Semantic Segmentation by Distilling Feature CorrespondencesMark Hamilton, Zhoutong Zhang, Bharath Hariharan, Noah Snavely, William T. Freeman. [doi]
- Sample and Computation Redistribution for Efficient Face DetectionJia Guo, Jiankang deng, Alexandros Lattas, Stefanos Zafeiriou. [doi]
- Learning Scenario Representation for Solving Two-stage Stochastic Integer ProgramsYaoxin Wu, Wen Song, Zhiguang Cao, Jie Zhang. [doi]
- AdaMatch: A Unified Approach to Semi-Supervised Learning and Domain AdaptationDavid Berthelot, Rebecca Roelofs, Kihyuk Sohn, Nicholas Carlini, Alexey Kurakin. [doi]
- Adversarial Robustness Through the Lens of CausalityYonggang Zhang, Mingming Gong, Tongliang Liu, Gang Niu 0001, Xinmei Tian 0001, Bo Han 0003, Bernhard Schölkopf, Kun Zhang 0001. [doi]
- Offline Reinforcement Learning with Implicit Q-LearningIlya Kostrikov, Ashvin Nair, Sergey Levine. [doi]
- RegionViT: Regional-to-Local Attention for Vision TransformersChun-Fu Chen 0001, Rameswar Panda, Quanfu Fan. [doi]
- High Probability Generalization Bounds with Fast Rates for Minimax ProblemsShaojie Li, Yong Liu 0018. [doi]
- What Do We Mean by Generalization in Federated Learning?Honglin Yuan 0002, Warren Richard Morningstar, Lin Ning, Karan Singhal. [doi]
- Iterative Refinement Graph Neural Network for Antibody Sequence-Structure Co-designWengong Jin, Jeremy Wohlwend, Regina Barzilay, Tommi S. Jaakkola. [doi]
- NASViT: Neural Architecture Search for Efficient Vision Transformers with Gradient Conflict aware Supernet TrainingChengYue Gong, Dilin Wang, Meng Li, Xinlei Chen, Zhicheng Yan, Yuandong Tian, Qiang Liu 0001, Vikas Chandra. [doi]
- Unifying Likelihood-free Inference with Black-box Optimization and BeyondDinghuai Zhang, Jie Fu, Yoshua Bengio, Aaron C. Courville. [doi]
- Sparsity Winning Twice: Better Robust Generalization from More Efficient TrainingTianlong Chen, Zhenyu Zhang, Pengjun Wang, Santosh Balachandra, Haoyu Ma, Zehao Wang, Zhangyang Wang. [doi]
- TAMP-S2GCNets: Coupling Time-Aware Multipersistence Knowledge Representation with Spatio-Supra Graph Convolutional Networks for Time-Series ForecastingYuzhou Chen, Ignacio Segovia-Dominguez, Baris Coskunuzer, Yulia R. Gel. [doi]
- Towards Understanding the Data Dependency of Mixup-style TrainingMuthu Chidambaram, Xiang Wang 0011, Yuzheng Hu, Chenwei Wu 0002, Rong Ge 0001. [doi]
- Language model compression with weighted low-rank factorizationYen-Chang Hsu, Ting Hua, Sungen Chang, Qian Lou, Yilin Shen, Hongxia Jin. [doi]
- A Neural Tangent Kernel Perspective of Infinite Tree EnsemblesRyuichi Kanoh, Mahito Sugiyama. [doi]
- Deep Ensembling with No Overhead for either Training or Testing: The All-Round Blessings of Dynamic SparsityShiwei Liu, Tianlong Chen, Zahra Atashgahi, Xiaohan Chen, Ghada Sokar, Elena Mocanu, Mykola Pechenizkiy, Zhangyang Wang, Decebal Constantin Mocanu. [doi]
- A Zest of LIME: Towards Architecture-Independent Model DistancesHengrui Jia, Hongyu Chen, Jonas Guan, Ali Shahin Shamsabadi, Nicolas Papernot. [doi]
- Relational Surrogate Loss LearningTao Huang 0020, Zekang Li, Hua Lu 0017, Yong Shan, Shusheng Yang, Yang Feng, Fei Wang 0032, Shan You, Chang Xu 0002. [doi]
- Reliable Adversarial Distillation with Unreliable TeachersJianing Zhu, Jiangchao Yao, Bo Han 0003, Jingfeng Zhang, Tongliang Liu, Gang Niu 0001, Jingren Zhou, Jianliang Xu, Hongxia Yang. [doi]
- HyperDQN: A Randomized Exploration Method for Deep Reinforcement LearningZiniu Li, Yingru Li, Yushun Zhang, Tong Zhang, Zhi-Quan Luo. [doi]
- Message Passing Neural PDE SolversJohannes Brandstetter, Daniel E. Worrall, Max Welling. [doi]
- Learning meta-features for AutoMLHerilalaina Rakotoarison, Louisot Milijaona, Andry Rasoanaivo, Michèle Sebag, Marc Schoenauer. [doi]
- CoBERL: Contrastive BERT for Reinforcement LearningAndrea Banino, Adrià Puigdomènech Badia, Jacob C. Walker, Tim Scholtes, Jovana Mitrovic, Charles Blundell. [doi]
- On the Convergence of mSGD and AdaGrad for Stochastic OptimizationRuinan Jin, Yu Xing, Xingkang He. [doi]
- Towards Better Understanding and Better Generalization of Low-shot Classification in Histology Images with Contrastive LearningJiawei Yang, Hanbo Chen, Jiangpeng Yan, Xiaoyu Chen, Jianhua Yao. [doi]
- Igeood: An Information Geometry Approach to Out-of-Distribution DetectionEduardo Dadalto Câmara Gomes, Florence Alberge, Pierre Duhamel, Pablo Piantanida. [doi]
- Neural Markov Controlled SDE: Stochastic Optimization for Continuous-Time DataSung-Woo Park, Kyungjae Lee, Junseok Kwon. [doi]
- Learning Long-Term Reward Redistribution via Randomized Return DecompositionZhizhou Ren, Ruihan Guo, Yuan Zhou 0007, Jian Peng 0001. [doi]
- Sampling with Mirrored Stein OperatorsJiaxin Shi, Chang Liu, Lester Mackey. [doi]
- A General Analysis of Example-Selection for Stochastic Gradient DescentYucheng Lu, Si Yi Meng, Christopher De Sa. [doi]
- Towards Understanding Generalization via Decomposing Excess Risk DynamicsJiaye Teng, Jianhao Ma, Yang Yuan. [doi]
- Pareto Policy AdaptationPanagiotis Kyriakis, Jyotirmoy Deshmukh, Paul Bogdan. [doi]
- Dual Lottery Ticket HypothesisYue Bai, Huan Wang 0014, Zhiqiang Tao, Kunpeng Li, Yun Fu 0001. [doi]
- Revisiting flow generative models for Out-of-distribution detectionDihong Jiang, Sun Sun, Yaoliang Yu. [doi]
- Almost Tight L0-norm Certified Robustness of Top-k Predictions against Adversarial PerturbationsJinyuan Jia, Binghui Wang, Xiaoyu Cao, Hongbin Liu, Neil Zhenqiang Gong. [doi]
- Temporal Alignment Prediction for Supervised Representation Learning and Few-Shot Sequence ClassificationBing Su 0001, Ji-Rong Wen. [doi]
- Multi-Mode Deep Matrix and Tensor FactorizationJicong Fan. [doi]
- Interpretable Unsupervised Diversity Denoising and Artefact RemovalMangal Prakash, Mauricio Delbracio, Peyman Milanfar, Florian Jug. [doi]
- Distilling GANs with Style-Mixed Triplets for X2I Translation with Limited DataYaxing Wang, Joost van de Weijer 0001, Lu Yu 0004, Shangling Jui. [doi]
- Generative Modeling with Optimal Transport MapsLitu Rout, Alexander Korotin, Evgeny Burnaev. [doi]
- ConFeSS: A Framework for Single Source Cross-Domain Few-Shot LearningDebasmit Das, Sungrack Yun, Fatih Porikli. [doi]
- Towards Empirical Sandwich Bounds on the Rate-Distortion FunctionYibo Yang, Stephan Mandt. [doi]
- Transform2Act: Learning a Transform-and-Control Policy for Efficient Agent DesignYe Yuan 0007, Yuda Song 0001, Zhengyi Luo 0002, Wen Sun 0002, Kris M. Kitani. [doi]
- Strength of Minibatch Noise in SGDZiyin Liu, Kangqiao Liu, Takashi Mori, Masahito Ueda. [doi]
- High Probability Bounds for a Class of Nonconvex Algorithms with AdaGrad StepsizeAli Kavis, Kfir Yehuda Levy, Volkan Cevher. [doi]
- Scaling Laws for Neural Machine TranslationBehrooz Ghorbani, Orhan Firat, Markus Freitag, Ankur Bapna, Maxim Krikun, Xavier Garcia, Ciprian Chelba, Colin Cherry. [doi]
- On the Certified Robustness for Ensemble Models and BeyondZhuolin Yang, Linyi Li, Xiaojun Xu, Bhavya Kailkhura, Tao Xie 0001, Bo Li. [doi]
- Efficient and Differentiable Conformal Prediction with General Function ClassesYu Bai, Song Mei, Huan Wang, Yingbo Zhou, Caiming Xiong. [doi]
- Universalizing Weak SupervisionChangho Shin, Winfred Li, Harit Vishwakarma, Nicholas Carl Roberts, Frederic Sala. [doi]
- Bregman Gradient Policy OptimizationFeihu Huang, Shangqian Gao, Heng Huang. [doi]
- Graph-based Nearest Neighbor Search in Hyperbolic SpacesLiudmila Prokhorenkova, Dmitry Baranchuk, Nikolay Bogachev, Yury Demidovich, Alexander Kolpakov. [doi]
- Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement LearningChenjia Bai, Lingxiao Wang 0003, Zhuoran Yang, Zhi-Hong Deng, Animesh Garg, Peng Liu 0008, Zhaoran Wang. [doi]
- Bootstrapping Semantic Segmentation with Regional ContrastShikun Liu, Shuaifeng Zhi, Edward Johns, Andrew J. Davison. [doi]
- Modular Lifelong Reinforcement Learning via Neural CompositionJorge A. Mendez, Harm van Seijen, Eric Eaton. [doi]
- Object Pursuit: Building a Space of Objects via Discriminative Weight GenerationChuanyu Pan, Yanchao Yang, Kaichun Mo, Yueqi Duan, Leonidas J. Guibas. [doi]
- Evading Adversarial Example Detection Defenses with Orthogonal Projected Gradient DescentOliver Bryniarski, Nabeel Hingun, Pedro Pachuca, Vincent Wang, Nicholas Carlini. [doi]
- Variational autoencoders in the presence of low-dimensional data: landscape and implicit biasFrederic Koehler, Viraj Mehta, Chenghui Zhou, Andrej Risteski. [doi]
- Train Short, Test Long: Attention with Linear Biases Enables Input Length ExtrapolationOfir Press, Noah Smith, Mike Lewis. [doi]
- Explainable GNN-Based Models over Knowledge GraphsDavid Jaime Tena Cucala, Bernardo Cuenca Grau, Egor V. Kostylev, Boris Motik. [doi]
- Joint Shapley values: a measure of joint feature importanceChris Harris, Richard Pymar, Colin Rowat. [doi]
- Curriculum learning as a tool to uncover learning principles in the brainDaniel R. Kepple, Rainer Engelken, Kanaka Rajan. [doi]
- Axiomatic Explanations for Visual Search, Retrieval, and Similarity LearningMark Hamilton, Scott M. Lundberg, Stephanie Fu, Lei Zhang, William T. Freeman. [doi]
- C-Planning: An Automatic Curriculum for Learning Goal-Reaching TasksTianjun Zhang, Benjamin Eysenbach, Ruslan Salakhutdinov, Sergey Levine, Joseph E. Gonzalez. [doi]
- Learning to Complete Code with SketchesDaya Guo, Alexey Svyatkovskiy, Jian Yin 0001, Nan Duan, Marc Brockschmidt, Miltiadis Allamanis. [doi]
- TRAIL: Near-Optimal Imitation Learning with Suboptimal DataMengjiao Yang, Sergey Levine, Ofir Nachum. [doi]
- Visual Representation Learning Does Not Generalize Strongly Within the Same DomainLukas Schott, Julius von Kügelgen, Frederik Träuble, Peter Vincent Gehler, Chris Russell 0001, Matthias Bethge, Bernhard Schölkopf, Francesco Locatello, Wieland Brendel. [doi]
- You are AllSet: A Multiset Function Framework for Hypergraph Neural NetworksEli Chien, Chao Pan, Jianhao Peng, Olgica Milenkovic. [doi]
- Equivariant Graph Mechanics Networks with ConstraintsWenbing Huang 0001, Jiaqi Han, Yu Rong, Tingyang Xu, Fuchun Sun 0001, JunZhou Huang. [doi]
- GLASS: GNN with Labeling Tricks for Subgraph Representation LearningXiyuan Wang, Muhan Zhang. [doi]
- Generative Principal Component AnalysisZhaoqiang Liu, Jiulong Liu, Subhroshekhar Ghosh, Jun Han, Jonathan Scarlett. [doi]
- Topologically Regularized Data EmbeddingsRobin Vandaele, Bo Kang, Jefrey Lijffijt, Tijl De Bie, Yvan Saeys. [doi]
- DKM: Differentiable k-Means Clustering Layer for Neural Network CompressionMinsik Cho, Keivan Alizadeh-Vahid, Saurabh Adya, Mohammad Rastegari. [doi]
- Large Language Models Can Be Strong Differentially Private LearnersXuechen Li, Florian Tramèr, Percy Liang, Tatsunori Hashimoto. [doi]
- Zero-CL: Instance and Feature decorrelation for negative-free symmetric contrastive learningShaofeng Zhang, Feng Zhu, Junchi Yan, Rui Zhao, Xiaokang Yang. [doi]
- Creating Training Sets via Weak Indirect SupervisionJieyu Zhang, Bohan Wang, Xiangchen Song, Yujing Wang, Yaming Yang 0001, Jing Bai 0010, Alexander Ratner. [doi]
- Demystifying Limited Adversarial Transferability in Automatic Speech Recognition SystemsHadi Abdullah, Aditya Karlekar, Vincent Bindschaedler, Patrick Traynor. [doi]
- Discovering Invariant Rationales for Graph Neural NetworksYingxin Wu, Xiang Wang, An Zhang, Xiangnan He 0001, Tat-Seng Chua. [doi]
- Continual Learning with Recursive Gradient OptimizationHao Liu, Huaping Liu. [doi]
- Visual Correspondence HallucinationHugo Germain, Vincent Lepetit, Guillaume Bourmaud. [doi]
- Parallel Training of GRU Networks with a Multi-Grid Solver for Long SequencesEuhyun Moon, Eric C. Cyr. [doi]
- Transformer Embeddings of Irregularly Spaced Events and Their ParticipantsHongyuan Mei, Chenghao Yang, Jason Eisner. [doi]
- DEGREE: Decomposition Based Explanation for Graph Neural NetworksQizhang Feng, Ninghao Liu, Fan Yang, Ruixiang Tang, Mengnan Du, Xia Hu. [doi]
- Boosting Randomized Smoothing with Variance Reduced ClassifiersMiklós Z. Horváth, Mark Niklas Müller, Marc Fischer 0002, Martin T. Vechev. [doi]
- Robust Learning Meets Generative Models: Can Proxy Distributions Improve Adversarial Robustness?Vikash Sehwag, Saeed Mahloujifar, Tinashe Handina, Sihui Dai, Chong Xiang 0001, Mung Chiang, Prateek Mittal. [doi]
- Learned Simulators for TurbulenceKimberly Stachenfeld, Drummond Buschman Fielding, Dmitrii Kochkov, Miles D. Cranmer, Tobias Pfaff, Jonathan Godwin, Can Cui, Shirley Ho, Peter W. Battaglia, Alvaro Sanchez-Gonzalez. [doi]
- Learning Hierarchical Structures with Differentiable Nondeterministic StacksBrian DuSell, David Chiang 0001. [doi]
- Amortized Tree Generation for Bottom-up Synthesis Planning and Synthesizable Molecular DesignWenhao Gao, Rocío Mercado, Connor W. Coley. [doi]
- Sample Selection with Uncertainty of Losses for Learning with Noisy LabelsXiaobo Xia, Tongliang Liu, Bo Han 0003, Mingming Gong, Jun Yu, Gang Niu 0001, Masashi Sugiyama. [doi]
- Curvature-Guided Dynamic Scale Networks for Multi-View StereoKhang Truong Giang, Soohwan Song, Sungho Jo. [doi]
- MetaShift: A Dataset of Datasets for Evaluating Contextual Distribution Shifts and Training ConflictsWeixin Liang, James Zou 0001. [doi]
- Low-Budget Active Learning via Wasserstein Distance: An Integer Programming ApproachRafid Mahmood, Sanja Fidler, Marc T. Law. [doi]
- Enabling Arbitrary Translation Objectives with Adaptive Tree SearchWang Ling, Wojciech Stokowiec, Domenic Donato, Chris Dyer, Lei Yu 0008, Laurent Sartran, Austin Matthews. [doi]
- Constraining Linear-chain CRFs to Regular LanguagesSean Papay, Roman Klinger, Sebastian Padó. [doi]
- Label Encoding for Regression NetworksDeval Shah, Zi Yu Xue, Tor M. Aamodt. [doi]
- CrossBeam: Learning to Search in Bottom-Up Program SynthesisKensen Shi, Hanjun Dai, Kevin Ellis, Charles Sutton. [doi]
- The Geometry of Memoryless Stochastic Policy Optimization in Infinite-Horizon POMDPsJohannes Müller, Guido Montúfar. [doi]
- Learnability Lock: Authorized Learnability Control Through Adversarial Invertible TransformationsWeiqi Peng, Jinghui Chen. [doi]
- Online Continual Learning on Class Incremental Blurry Task Configuration with Anytime InferenceHyunseo Koh, DahYun Kim, Jung-Woo Ha 0001, Jonghyun Choi. [doi]
- Autoregressive Quantile Flows for Predictive Uncertainty EstimationPhillip Si, Allan Bishop, Volodymyr Kuleshov. [doi]
- In a Nutshell, the Human Asked for This: Latent Goals for Following Temporal SpecificationsBorja G. León, Murray Shanahan, Francesco Belardinelli. [doi]
- Dynamics-Aware Comparison of Learned Reward FunctionsBlake Wulfe, Logan Michael Ellis, Jean Mercat, Rowan Thomas McAllister, Adrien Gaidon. [doi]
- On Robust Prefix-Tuning for Text ClassificationZonghan Yang, Yang Liu. [doi]
- The Uncanny Similarity of Recurrence and DepthAvi Schwarzschild, Arjun Gupta, Amin Ghiasi, Micah Goldblum, Tom Goldstein. [doi]
- The Information Geometry of Unsupervised Reinforcement LearningBenjamin Eysenbach, Ruslan Salakhutdinov, Sergey Levine. [doi]
- On feature learning in neural networks with global convergence guaranteesZhengdao Chen, Eric Vanden-Eijnden, Joan Bruna. [doi]
- DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETRShilong Liu, Feng Li, Hao Zhang 0097, Xiao Yang, Xianbiao Qi, Hang Su, Jun Zhu, Lei Zhang. [doi]
- Bayesian Neural Network Priors RevisitedVincent Fortuin, Adrià Garriga-Alonso, Sebastian W. Ober, Florian Wenzel, Gunnar Rätsch, Richard E. Turner, Mark van der Wilk, Laurence Aitchison. [doi]
- Discovering and Explaining the Representation Bottleneck of DNNSHuiqi Deng, Qihan Ren, Hao Zhang 0063, Quanshi Zhang. [doi]
- Pseudo Numerical Methods for Diffusion Models on ManifoldsLuping Liu, Yi Ren 0006, Zhijie Lin, Zhou Zhao. [doi]
- Frequency-aware SGD for Efficient Embedding Learning with Provable BenefitsYan Li, Dhruv Choudhary, Xiaohan Wei, Baichuan Yuan, Bhargav Bhushanam, Tuo Zhao, Guanghui Lan. [doi]
- Causal Contextual Bandits with Targeted InterventionsChandrasekar Subramanian, Balaraman Ravindran. [doi]
- Unsupervised Learning of Full-Waveform Inversion: Connecting CNN and Partial Differential Equation in a LoopPeng Jin, Xitong Zhang, Yinpeng Chen, Sharon Xiaolei Huang, Zicheng Liu 0001, Youzuo Lin. [doi]
- Target-Side Input Augmentation for Sequence to Sequence GenerationShufang Xie 0003, Ang Lv, Yingce Xia, Lijun Wu, Tao Qin, Tie-Yan Liu, Rui Yan 0001. [doi]
- Consistent Counterfactuals for Deep ModelsEmily Black, Zifan Wang, Matt Fredrikson. [doi]
- Adversarial Unlearning of Backdoors via Implicit HypergradientYi Zeng, Si Chen, won Park, Zhuoqing Mao, Ming Jin 0002, Ruoxi Jia. [doi]
- Universal Approximation Under Constraints is Possible with TransformersAnastasis Kratsios, Behnoosh Zamanlooy, Tianlin Liu, Ivan Dokmanic. [doi]
- ViTGAN: Training GANs with Vision TransformersKwonjoon Lee, Huiwen Chang, Lu Jiang, Han Zhang, Zhuowen Tu, Ce Liu. [doi]
- Half-Inverse Gradients for Physical Deep LearningPatrick Schnell, Philipp Holl, Nils Thuerey. [doi]
- Policy Gradients Incorporating the FutureDavid Venuto, Elaine Lau, Doina Precup, Ofir Nachum. [doi]
- Learning Transferable Reward for Query Object Localization with Policy AdaptationTingfeng Li, Shaobo Han, Martin Renqiang Min, Dimitris N. Metaxas. [doi]
- Graph Neural Network Guided Local Search for the Traveling Salesperson ProblemBenjamin Hudson, Qingbiao Li, Matthew Malencia, Amanda Prorok. [doi]
- Learning to Generalize across Domains on Single Test SamplesZehao Xiao, Xiantong Zhen, Ling Shao 0001, Cees G. M. Snoek. [doi]
- Omni-Scale CNNs: a simple and effective kernel size configuration for time series classificationWensi Tang, Guodong Long, Lu Liu, Tianyi Zhou, Michael Blumenstein, Jing Jiang 0002. [doi]
- On the relation between statistical learning and perceptual distancesAlexander Hepburn, Valero Laparra, Raúl Santos-Rodríguez, Johannes Ballé, Jesus Malo. [doi]
- Scale Efficiently: Insights from Pretraining and Finetuning TransformersYi Tay, Mostafa Dehghani 0001, Jinfeng Rao, William Fedus, Samira Abnar, Hyung Won Chung, Sharan Narang, Dani Yogatama, Ashish Vaswani, Donald Metzler. [doi]
- Variational Inference for Discriminative Learning with Generative Modeling of Feature IncompletionKohei Miyaguchi, Takayuki Katsuki, Akira Koseki, Toshiya Iwamori. [doi]
- Non-Parallel Text Style Transfer with Self-Parallel SupervisionRuibo Liu, Chongyang Gao, Chenyan Jia, Guangxuan Xu, Soroush Vosoughi. [doi]
- No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer ModelsChen Liang, Haoming Jiang, Simiao Zuo, Pengcheng He, Xiaodong Liu 0003, Jianfeng Gao, Weizhu Chen, Tuo Zhao. [doi]
- Analytic-DPM: an Analytic Estimate of the Optimal Reverse Variance in Diffusion Probabilistic ModelsFan Bao, Chongxuan Li, Jun Zhu, Bo Zhang. [doi]
- Discovering Latent Concepts Learned in BERTFahim Dalvi, Abdul Rafae Khan, Firoj Alam, Nadir Durrani, Jia Xu 0004, Hassan Sajjad. [doi]
- Efficient Active Search for Combinatorial Optimization ProblemsAndré Hottung, Yeong-Dae Kwon, Kevin Tierney. [doi]
- Lipschitz-constrained Unsupervised Skill DiscoverySeohong Park, Jongwook Choi, Jaekyeom Kim, Honglak Lee, Gunhee Kim. [doi]
- Peek-a-Boo: What (More) is Disguised in a Randomly Weighted Neural Network, and How to Find It EfficientlyXiaohan Chen, Jason Zhang, Zhangyang Wang. [doi]
- Federated Learning from Only Unlabeled Data with Class-conditional-sharing ClientsNan Lu, Zhao Wang, Xiaoxiao Li, Gang Niu 0001, Qi Dou, Masashi Sugiyama. [doi]
- Effect of scale on catastrophic forgetting in neural networksVinay Venkatesh Ramasesh, Aitor Lewkowycz, Ethan Dyer. [doi]
- Actor-Critic Policy Optimization in a Large-Scale Imperfect-Information GameHaobo Fu, Weiming Liu, Shuang Wu, Yijia Wang, Tao Yang, Kai Li, Junliang Xing, Bin Li, Bo Ma, Qiang Fu, Wei Yang 0032. [doi]
- Understanding the Variance Collapse of SVGD in High DimensionsJimmy Ba, Murat A. Erdogdu, Marzyeh Ghassemi, Shengyang Sun, Taiji Suzuki, Denny Wu, Tianzong Zhang. [doi]
- Wisdom of Committees: An Overlooked Approach To Faster and More Accurate ModelsXiaofang Wang, Dan Kondratyuk, Eric Christiansen, Kris M. Kitani, Yair Movshovitz-Attias, Elad Eban. [doi]
- VOS: Learning What You Don't Know by Virtual Outlier SynthesisXuefeng Du, Zhaoning Wang, Mu Cai, Yixuan Li. [doi]
- On the Importance of Firth Bias Reduction in Few-Shot ClassificationSaba Ghaffari, Ehsan Saleh, David A. Forsyth, Yu-Xiong Wang. [doi]
- Entroformer: A Transformer-based Entropy Model for Learned Image CompressionYichen Qian, Xiuyu Sun, Ming Lin, Zhiyu Tan, Rong Jin 0001. [doi]
- Real-Time Neural Voice CamouflageMia Chiquier, Chengzhi Mao, Carl Vondrick. [doi]
- When, Why, and Which Pretrained GANs Are Useful?Timofey Grigoryev, Andrey Voynov, Artem Babenko. [doi]
- Learning Versatile Neural Architectures by Propagating Network CodesMingyu Ding, Yuqi Huo, Haoyu Lu, Linjie Yang, Zhe Wang, Zhiwu Lu 0001, Jingdong Wang 0001, Ping Luo. [doi]
- Fortuitous Forgetting in Connectionist NetworksHattie Zhou, Ankit Vani, Hugo Larochelle, Aaron C. Courville. [doi]
- Conditional Image Generation by Conditioning Variational Auto-EncodersWilliam Harvey 0002, Saeid Naderiparizi, Frank Wood. [doi]
- Learning Curves for Gaussian Process Regression with Power-Law Priors and TargetsHui Jin, Pradeep Kr. Banerjee, Guido Montúfar. [doi]
- Permutation-Based SGD: Is Random Optimal?Shashank Rajput, Kangwook Lee 0001, Dimitris S. Papailiopoulos. [doi]
- AlphaZero-based Proof Cost Network to Aid Game SolvingTi-Rong Wu, Chung-Chin Shih, Ting-Han Wei, Meng-Yu Tsai, Wei-Yuan Hsu, I-Chen Wu. [doi]
- Prospect Pruning: Finding Trainable Weights at Initialization using Meta-GradientsMilad Alizadeh, Shyam A. Tailor, Luisa M. Zintgraf, Joost van Amersfoort, Sebastian Farquhar, Nicholas Donald Lane, Yarin Gal. [doi]
- Bridging the Gap: Providing Post-Hoc Symbolic Explanations for Sequential Decision-Making Problems with Inscrutable RepresentationsSarath Sreedharan, Utkarsh Soni, Mudit Verma, Siddharth Srivastava 0001, Subbarao Kambhampati. [doi]
- Graph Auto-Encoder via Neighborhood Wasserstein ReconstructionMingyue Tang, Pan Li 0005, Carl Yang. [doi]
- The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic GeneralizationRóbert Csordás, Kazuki Irie, Jürgen Schmidhuber. [doi]
- Geometric and Physical Quantities improve E(3) Equivariant Message PassingJohannes Brandstetter, Rob Hesselink, Elise van der Pol, Erik J. Bekkers, Max Welling. [doi]
- Patch-Fool: Are Vision Transformers Always Robust Against Adversarial Perturbations?Yonggan Fu, Shunyao Zhang, Shang Wu, Cheng Wan, Yingyan Lin. [doi]
- Learning by Directional Gradient DescentDavid Silver, Anirudh Goyal, Ivo Danihelka, Matteo Hessel, Hado van Hasselt. [doi]
- Improving Federated Learning Face Recognition via Privacy-Agnostic ClustersQiang Meng, Feng Zhou 0002, Hainan Ren, Tianshu Feng, Guochao Liu, Yuanqing Lin. [doi]
- Overcoming The Spectral Bias of Neural Value ApproximationGe Yang, Anurag Ajay, Pulkit Agrawal. [doi]
- PAC Prediction Sets Under Covariate ShiftSangdon Park, Edgar Dobriban, Insup Lee, Osbert Bastani. [doi]
- GreaseLM: Graph REASoning Enhanced Language ModelsXikun Zhang 0001, Antoine Bosselut, Michihiro Yasunaga, Hongyu Ren, Percy Liang, Christopher D. Manning, Jure Leskovec. [doi]
- InfinityGAN: Towards Infinite-Pixel Image SynthesisChieh Hubert Lin, Hsin-Ying Lee, Yen-Chi Cheng, Sergey Tulyakov, Ming-Hsuan Yang 0001. [doi]
- Unsupervised Vision-Language Grammar Induction with Shared Structure ModelingBo Wan, Wenjuan Han, Zilong Zheng, Tinne Tuytelaars. [doi]
- Independent SE(3)-Equivariant Models for End-to-End Rigid Protein DockingOctavian-Eugen Ganea, Xinyuan Huang, Charlotte Bunne, Yatao Bian, Regina Barzilay, Tommi S. Jaakkola, Andreas Krause 0001. [doi]
- Tracking the risk of a deployed model and detecting harmful distribution shiftsAleksandr Podkopaev, Aaditya Ramdas. [doi]
- Training Data Generating Networks: Shape Reconstruction via Bi-level OptimizationBiao Zhang, Peter Wonka. [doi]
- Towards Evaluating the Robustness of Neural Networks Learned by TransductionJiefeng Chen 0001, Xi Wu 0001, Yang Guo, Yingyu Liang, Somesh Jha. [doi]
- Inductive Relation Prediction Using Analogy Subgraph EmbeddingsJiarui Jin, Yangkun Wang, Kounianhua Du, Weinan Zhang 0001, Zheng Zhang, David Wipf, Yong Yu 0001, Quan Gan. [doi]
- FALCON: Fast Visual Concept Learning by Integrating Images, Linguistic descriptions, and Conceptual RelationsLingjie Mei, Jiayuan Mao, Ziqi Wang, Chuang Gan, Joshua B. Tenenbaum. [doi]
- Certified Robustness for Deep Equilibrium Models via Interval Bound PropagationColin Wei, J. Zico Kolter. [doi]
- Acceleration of Federated Learning with Alleviated Forgetting in Local TrainingChencheng Xu, Zhiwei Hong, Minlie Huang, Tao Jiang. [doi]
- Why Propagate Alone? Parallel Use of Labels and Features on GraphsYangkun Wang, Jiarui Jin, Weinan Zhang 0001, Yongyi Yang, Jiuhai Chen, Quan Gan, Yong Yu 0001, Zheng Zhang, Zengfeng Huang, David Wipf. [doi]
- DeSKO: Stability-Assured Robust Control with a Deep Stochastic Koopman OperatorMinghao Han, Jacob Euler-Rolle, Robert K. Katzschmann. [doi]
- Temporal Efficient Training of Spiking Neural Network via Gradient Re-weightingShikuang Deng, Yuhang Li, Shanghang Zhang, Shi Gu. [doi]
- Minimax Optimization with Smooth Algorithmic AdversariesTanner Fiez, Chi Jin, Praneeth Netrapalli, Lillian J. Ratliff. [doi]
- Compositional Attention: Disentangling Search and RetrievalSarthak Mittal, Sharath Chandra Raparthy, Irina Rish, Yoshua Bengio, Guillaume Lajoie. [doi]
- Mapping Language Models to Grounded Conceptual SpacesRoma Patel, Ellie Pavlick. [doi]
- Learning Neural Contextual Bandits through Perturbed RewardsYiling Jia, Weitong Zhang, Dongruo Zhou, Quanquan Gu, Hongning Wang. [doi]
- Semi-relaxed Gromov-Wasserstein divergence and applications on graphsCédric Vincent-Cuaz, Rémi Flamary, Marco Corneli, Titouan Vayer, Nicolas Courty. [doi]
- Provably convergent quasistatic dynamics for mean-field two-player zero-sum gamesChao Ma 0012, Lexing Ying. [doi]
- Incremental False Negative Detection for Contrastive LearningTsai-Shien Chen, Wei-Chih Hung, Hung-Yu Tseng, Shao-Yi Chien, Ming-Hsuan Yang 0001. [doi]
- Learning to Map for Active Semantic Goal NavigationGeorgios Georgakis, Bernadette Bucher, Karl Schmeckpeper, Siddharth Singh, Kostas Daniilidis. [doi]
- An Operator Theoretic View On Pruning Deep Neural NetworksWilliam T. Redman, Maria Fonoberova, Ryan Mohr, Yannis G. Kevrekidis, Igor Mezic. [doi]
- On Incorporating Inductive Biases into VAEsNing Miao, Emile Mathieu, Siddharth N, Yee Whye Teh, Tom Rainforth. [doi]
- Lossy Compression with Distribution Shift as Entropy Constrained Optimal TransportHuan Liu, George Zhang, Jun Chen, Ashish J. Khisti. [doi]
- Fine-grained Differentiable Physics: A Yarn-level Model for FabricsDeshan Gong, Zhanxing Zhu, Andrew J. Bulpitt, He Wang 0002. [doi]
- Scalable Sampling for Nonsymmetric Determinantal Point ProcessesInsu Han, Mike Gartrell, Jennifer Gillenwater, Elvis Dohmatob, Amin Karbasi. [doi]
- Value Gradient weighted Model-Based Reinforcement LearningClaas Voelcker, Victor Liao, Animesh Garg, Amir Massoud Farahmand. [doi]
- PoNet: Pooling Network for Efficient Token Mixing in Long SequencesChao-Hong Tan, Qian Chen, Wen Wang, Qinglin Zhang, Siqi Zheng, Zhen-Hua Ling. [doi]
- Do deep networks transfer invariances across classes?Allan Zhou, Fahim Tajwar, Alexander Robey, Tom Knowles, George J. Pappas, Hamed Hassani, Chelsea Finn. [doi]
- Learning Discrete Structured Variational Auto-Encoder using Natural Evolution StrategiesAlon Berliner, Guy Rotman, Yossi Adi, Roi Reichart, Tamir Hazan. [doi]
- Differentiable Expectation-Maximization for Set Representation LearningMinyoung Kim. [doi]
- Chemical-Reaction-Aware Molecule Representation LearningHongwei Wang, Weijiang Li, Xiaomeng Jin, KyungHyun Cho, Heng Ji, Jiawei Han 0001, Martin D. Burke. [doi]
- Neural Link Prediction with Walk PoolingLiming Pan, Cheng Shi, Ivan Dokmanic. [doi]
- Memory Replay with Data Compression for Continual LearningLiyuan Wang, Xingxing Zhang, Kuo Yang, Longhui Yu, Chongxuan Li, Lanqing Hong, Shifeng Zhang, Zhenguo Li, Yi Zhong, Jun Zhu. [doi]
- Visual hyperacuity with moving sensor and recurrent neural computationsAlexander Rivkind, Or Ram, Eldad Assa, Michael Kreiserman, Ehud Ahissar. [doi]
- switch-GLAT: Multilingual Parallel Machine Translation Via Code-Switch DecoderZhenqiao Song, Hao Zhou, Lihua Qian, Jingjing Xu, Shanbo Cheng, Mingxuan Wang, Lei Li. [doi]
- Rethinking Supervised Pre-Training for Better Downstream TransferringYutong Feng, Jianwen Jiang, Mingqian Tang, Rong Jin 0001, Yue Gao 0002. [doi]
- Toward Faithful Case-based Reasoning through Learning Prototypes in a Nearest Neighbor-friendly SpaceSeyed Omid Davoudi, Majid Komeili. [doi]
- Bi-linear Value Networks for Multi-goal Reinforcement LearningZhang-Wei Hong, Ge Yang, Pulkit Agrawal. [doi]
- Near-Optimal Reward-Free Exploration for Linear Mixture MDPs with Plug-in SolverXiaoyu Chen, Jiachen Hu, Lin Yang 0011, Liwei Wang. [doi]
- Open-vocabulary Object Detection via Vision and Language Knowledge DistillationXiuye Gu, Tsung-Yi Lin, Weicheng Kuo, Yin Cui. [doi]
- IGLU: Efficient GCN Training via Lazy UpdatesS. Deepak Narayanan, Aditya Sinha, Prateek Jain 0002, Purushottam Kar, Sundararajan Sellamanickam. [doi]
- Stiffness-aware neural network for learning Hamiltonian systemsSenwei Liang, Zhongzhan Huang, Hong Zhang. [doi]
- Learning Generalizable Representations for Reinforcement Learning via Adaptive Meta-learner of Behavioral SimilaritiesJianda Chen, Sinno Jialin Pan. [doi]
- Reinforcement Learning under a Multi-agent Predictive State Representation Model: Method and TheoryZhi Zhang, Zhuoran Yang, Han Liu, Pratap Tokekar, Furong Huang. [doi]
- GPT-Critic: Offline Reinforcement Learning for End-to-End Task-Oriented Dialogue SystemsYoungsoo Jang, Jongmin Lee 0004, Kee-Eung Kim. [doi]
- Source-Free Adaptation to Measurement Shift via Bottom-Up Feature RestorationCian Eastwood, Ian Mason, Christopher K. I. Williams, Bernhard Schölkopf. [doi]
- Generalization of Neural Combinatorial Solvers Through the Lens of Adversarial RobustnessSimon Geisler, Johanna Sommer, Jan Schuchardt, Aleksandar Bojchevski, Stephan Günnemann. [doi]
- Bag of Instances Aggregation Boosts Self-supervised DistillationHaohang Xu, Jiemin Fang, Xiaopeng Zhang 0008, Lingxi Xie, Xinggang Wang, Wenrui Dai, Hongkai Xiong, Qi Tian 0001. [doi]
- Path Integral Sampler: A Stochastic Control Approach For SamplingQinsheng Zhang, Yongxin Chen. [doi]
- Fast Regression for Structured InputsRaphael A. Meyer, Cameron Musco, Christopher Musco, David P. Woodruff, Samson Zhou. [doi]
- Stochastic Training is Not Necessary for GeneralizationJonas Geiping, Micah Goldblum, Phillip Pope, Michael Moeller 0001, Tom Goldstein. [doi]
- Meta Discovery: Learning to Discover Novel Classes given Very Limited DataHaoang Chi, Feng Liu, Wenjing Yang, Long Lan, Tongliang Liu, Bo Han 0003, Gang Niu 0001, Mingyuan Zhou, Masashi Sugiyama. [doi]
- Shuffle Private Stochastic Convex OptimizationAlbert Cheu, Matthew Joseph, Jieming Mao, Binghui Peng. [doi]
- When Vision Transformers Outperform ResNets without Pre-training or Strong Data AugmentationsXiangning Chen, Cho-Jui Hsieh, Boqing Gong. [doi]
- Neural Spectral Marked Point ProcessesShixiang Zhu, Haoyun Wang, Zheng Dong, Xiuyuan Cheng, Yao Xie 0002. [doi]
- Model-augmented Prioritized Experience ReplayYoungmin Oh, Jinwoo Shin, Eunho Yang, Sung Ju Hwang. [doi]
- PolyLoss: A Polynomial Expansion Perspective of Classification Loss FunctionsZhaoqi Leng, Mingxing Tan, Chenxi Liu, Ekin Dogus Cubuk, Jay Shi, Shuyang Cheng, Dragomir Anguelov. [doi]
- Zero-Shot Self-Supervised Learning for MRI ReconstructionBurhaneddin Yaman, Seyed Amir Hossein Hosseini, Mehmet Akçakaya. [doi]
- Attacking deep networks with surrogate-based adversarial black-box methods is easyNicholas A. Lord, Romain Müller, Luca Bertinetto. [doi]
- The Effects of Invertibility on the Representational Complexity of Encoders in Variational AutoencodersDivyansh Pareek, Andrej Risteski. [doi]
- Neural Collapse Under MSE Loss: Proximity to and Dynamics on the Central PathX. Y. Han, Vardan Papyan, David L. Donoho. [doi]
- Monotonic Differentiable Sorting NetworksFelix Petersen, Christian Borgelt, Hilde Kuehne, Oliver Deussen. [doi]
- Network Insensitivity to Parameter Noise via Parameter Attack During TrainingJulian Büchel, Fynn Firouz Faber, Dylan Richard Muir. [doi]
- Online Hyperparameter Meta-Learning with Hypergradient DistillationHaebeom Lee, Hayeon Lee, Jaewoong Shin, Eunho Yang, Timothy M. Hospedales, Sung Ju Hwang. [doi]
- Improved deterministic l2 robustness on CIFAR-10 and CIFAR-100Sahil Singla 0002, Surbhi Singla, Soheil Feizi. [doi]
- Noisy Feature MixupSoon Hoe Lim, N. Benjamin Erichson, Francisco Utrera, Winnie Xu, Michael W. Mahoney. [doi]
- Compositional Training for End-to-End Deep AUC MaximizationZhuoning Yuan, Zhishuai Guo, Nitesh Chawla, Tianbao Yang. [doi]
- Pareto Set Learning for Neural Multi-Objective Combinatorial OptimizationXi Lin 0001, Zhiyuan Yang, Qingfu Zhang 0001. [doi]
- Simple GNN Regularisation for 3D Molecular Property Prediction and BeyondJonathan Godwin, Michael Schaarschmidt, Alexander L. Gaunt, Alvaro Sanchez-Gonzalez, Yulia Rubanova, Petar Velickovic, James Kirkpatrick, Peter W. Battaglia. [doi]
- Denoising Likelihood Score Matching for Conditional Score-based Data GenerationChen-Hao Chao, Wei-Fang Sun, Bo-Wun Cheng, Yi-Chen Lo, Chia-Che Chang, Yu-Lun Liu, Yu-Lin Chang, Chia-Ping Chen, Chun-Yi Lee. [doi]
- Do Users Benefit From Interpretable Vision? A User Study, Baseline, And DatasetLeon Sixt, Martin Schuessler, Oana-Iuliana Popescu, Philipp Weiß, Tim Landgraf. [doi]
- Online Adversarial AttacksAndjela Mladenovic, Avishek Joey Bose, Hugo Berard, William L. Hamilton, Simon Lacoste-Julien, Pascal Vincent, Gauthier Gidel. [doi]
- Finding an Unsupervised Image Segmenter in each of your Deep Generative ModelsLuke Melas-Kyriazi, Christian Rupprecht 0001, Iro Laina, Andrea Vedaldi. [doi]
- Imitation Learning from Observations under Transition Model DisparityTanmay Gangwani, Yuan Zhou 0007, Jian Peng 0001. [doi]
- BiBERT: Accurate Fully Binarized BERTHaotong Qin, Yifu Ding, Mingyuan Zhang, Qinghua Yan, Aishan Liu, Qingqing Dang, Ziwei Liu, Xianglong Liu. [doi]
- Evaluating Model-Based Planning and Planner Amortization for Continuous ControlArunkumar Byravan, Leonard Hasenclever, Piotr Trochim, Mehdi Mirza, Alessandro Davide Ialongo, Yuval Tassa, Jost Tobias Springenberg, Abbas Abdolmaleki, Nicolas Heess, Josh Merel, Martin A. Riedmiller. [doi]
- Contextualized Scene Imagination for Generative Commonsense ReasoningPeiFeng Wang, Jonathan Zamora, Junfeng Liu, Filip Ilievski, Muhao Chen, Xiang Ren 0001. [doi]
- Fixed Neural Network Steganography: Train the images, not the networkVarsha Kishore, Xiangyu Chen, Yan Wang, Boyi Li, Kilian Q. Weinberger. [doi]
- Bridging Recommendation and Marketing via Recurrent Intensity ModelingYifei Ma, Ge Liu, Anoop Deoras. [doi]
- HyAR: Addressing Discrete-Continuous Action Reinforcement Learning via Hybrid Action RepresentationBoyan Li, Hongyao Tang, Yan Zheng, Jianye Hao, Pengyi Li, Zhen Wang, Zhaopeng Meng, Li Wang. [doi]
- Spread Spurious Attribute: Improving Worst-group Accuracy with Spurious Attribute EstimationJun Hyun Nam, Jaehyung Kim, Jaeho Lee, Jinwoo Shin. [doi]
- Decentralized Learning for Overparameterized Problems: A Multi-Agent Kernel Approximation ApproachPrashant Khanduri, Haibo Yang, Mingyi Hong, Jia Liu, Hoi-To Wai, Sijia Liu 0001. [doi]
- MCMC Should Mix: Learning Energy-Based Model with Neural Transport Latent Space MCMCErik Nijkamp, RuiQi Gao, Pavel Sountsov, Srinivas Vasudevan, Bo Pang, Song Chun Zhu, Ying Nian Wu. [doi]
- Automated Self-Supervised Learning for GraphsWei Jin, Xiaorui Liu, Xiangyu Zhao, Yao Ma 0001, Neil Shah, Jiliang Tang. [doi]
- Pareto Policy Pool for Model-based Offline Reinforcement LearningYijun Yang, Jing Jiang, Tianyi Zhou, Jie Ma, Yuhui Shi. [doi]
- 8-bit Optimizers via Block-wise QuantizationTim Dettmers, Mike Lewis, Sam Shleifer, Luke Zettlemoyer. [doi]
- Contrastive Clustering to Mine Pseudo Parallel Data for Unsupervised TranslationXuan-Phi Nguyen, Hongyu Gong, Yun Tang, Changhan Wang, Philipp Koehn, Shafiq R. Joty. [doi]
- Improving the Accuracy of Learning Example Weights for Imbalance ClassificationYuqi Liu, Bin Cao 0004, Jing Fan. [doi]
- A Unified Wasserstein Distributional Robustness Framework for Adversarial TrainingAnh Tuan Bui, Trung Le, Quan Hung Tran, He Zhao 0001, Dinh Q. Phung. [doi]
- SketchODE: Learning neural sketch representation in continuous timeAyan Das 0003, Yongxin Yang, Timothy M. Hospedales, Tao Xiang, Yi-Zhe Song. [doi]
- Label Leakage and Protection in Two-party Split LearningOscar Li, Jiankai Sun, Xin Yang 0017, Weihao Gao, Hongyi Zhang, Junyuan Xie, Virginia Smith, Chong Wang. [doi]
- Learning 3D Representations of Molecular Chirality with Invariance to Bond RotationsKeir Adams, Lagnajit Pattanaik, Connor W. Coley. [doi]
- MaGNET: Uniform Sampling from Deep Generative Network Manifolds Without RetrainingAhmed Imtiaz Humayun, Randall Balestriero, Richard G. Baraniuk. [doi]
- Concurrent Adversarial Learning for Large-Batch TrainingYong Liu, Xiangning Chen, Minhao Cheng, Cho-Jui Hsieh, Yang You. [doi]
- The MultiBERTs: BERT Reproductions for Robustness AnalysisThibault Sellam, Steve Yadlowsky, Ian Tenney, Jason Wei, Naomi Saphra, Alexander D'Amour, Tal Linzen, Jasmijn Bastings, Iulia Raluca Turc, Jacob Eisenstein, Dipanjan Das 0001, Ellie Pavlick. [doi]
- Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RLRui Yang, Yiming Lu, Wenzhe Li, Hao Sun, Meng Fang, Yali Du, Xiu Li, Lei Han, Chongjie Zhang. [doi]
- Implicit Bias of MSE Gradient Optimization in Underparameterized Neural NetworksBenjamin Bowman, Guido Montúfar. [doi]
- Autonomous Reinforcement Learning: Formalism and BenchmarkingArchit Sharma, Kelvin Xu, Nikhil Sardana, Abhishek Gupta 0004, Karol Hausman, Sergey Levine, Chelsea Finn. [doi]
- Network Augmentation for Tiny Deep LearningHan Cai, Chuang Gan, Ji Lin 0002, Song Han 0003. [doi]
- Auto-Transfer: Learning to Route Transferable RepresentationsKeerthiram Murugesan, Vijay Sadashivaiah, Ronny Luss, Karthikeyan Shanmugam, Pin-Yu Chen, Amit Dhurandhar. [doi]
- Perceiver IO: A General Architecture for Structured Inputs & OutputsAndrew Jaegle, Sebastian Borgeaud, Jean-Baptiste Alayrac, Carl Doersch, Catalin Ionescu, David Ding, Skanda Koppula, Daniel Zoran, Andrew Brock, Evan Shelhamer, Olivier J. Hénaff, Matthew M. Botvinick, Andrew Zisserman, Oriol Vinyals, João Carreira. [doi]
- The Convex Geometry of Backpropagation: Neural Network Gradient Flows Converge to Extreme Points of the Dual Convex ProgramYifei Wang, Mert Pilanci. [doi]
- CrossMatch: Cross-Classifier Consistency Regularization for Open-Set Single Domain GeneralizationRonghang Zhu, Sheng Li 0001. [doi]
- Programmatic Reinforcement Learning without OraclesWenjie Qiu, He Zhu. [doi]
- Einops: Clear and Reliable Tensor Manipulations with Einstein-like NotationAlex Rogozhnikov. [doi]
- Fine-Tuning can Distort Pretrained Features and Underperform Out-of-DistributionAnanya Kumar, Aditi Raghunathan, Robbie Matthew Jones, Tengyu Ma 0001, Percy Liang. [doi]
- Learning Efficient Online 3D Bin Packing on Packing Configuration TreesHang Zhao, Yang Yu, Kai Xu 0004. [doi]
- Deep AutoAugmentYu Zheng, Zhi Zhang, Shen Yan, Mi Zhang. [doi]
- Model-Based Offline Meta-Reinforcement Learning with RegularizationSen Lin, Jialin Wan, Tengyu Xu, Yingbin Liang, Junshan Zhang. [doi]
- Variational oracle guiding for reinforcement learningDongqi Han, Tadashi Kozuno, Xufang Luo, Zhao-Yun Chen, Kenji Doya, YuQing Yang, Dongsheng Li. [doi]
- PER-ETD: A Polynomially Efficient Emphatic Temporal Difference Learning MethodZiwei Guan, Tengyu Xu, Yingbin Liang. [doi]
- Sound Adversarial Audio-Visual NavigationYinfeng Yu, Wenbing Huang 0001, Fuchun Sun 0001, Changan Chen, Yikai Wang 0001, Xiaohong Liu. [doi]
- Granger causal inference on DAGs identifies genomic loci regulating transcriptionAlexander P. Wu, Rohit Singh 0001, Bonnie Berger. [doi]
- Generalized Kernel ThinningRaaz Dwivedi, Lester Mackey. [doi]
- Representation-Agnostic Shape FieldsXiaoyang Huang, Jiancheng Yang, Yanjun Wang, Ziyu Chen, Linguo Li, Teng Li 0001, Bingbing Ni, Wenjun Zhang 0001. [doi]
- Fast AdvPropJieru Mei, Yucheng Han, Yutong Bai, Yixiao Zhang 0001, Yingwei Li, Xianhang Li, Alan L. Yuille, Cihang Xie. [doi]
- DR3: Value-Based Deep Reinforcement Learning Requires Explicit RegularizationAviral Kumar, Rishabh Agarwal, Tengyu Ma 0001, Aaron C. Courville, George Tucker, Sergey Levine. [doi]
- Augmented Sliced Wasserstein DistancesXiongjie Chen, Yongxin Yang, Yunpeng Li. [doi]
- Constrained Policy Optimization via Bayesian World ModelsYarden As, Ilnura Usmanova, Sebastian Curi, Andreas Krause 0001. [doi]
- SUMNAS: Supernet with Unbiased Meta-Features for Neural Architecture SearchHyeonmin Ha, Ji-Hoon Kim, Semin Park, Byung-Gon Chun. [doi]
- TAda! Temporally-Adaptive Convolutions for Video UnderstandingZiyuan Huang, Shiwei Zhang, Liang Pan, Zhiwu Qing, Mingqian Tang, Ziwei Liu 0002, Marcelo H. Ang Jr.. [doi]
- GraphENS: Neighbor-Aware Ego Network Synthesis for Class-Imbalanced Node ClassificationJoonhyung Park, Jaeyun Song, Eunho Yang. [doi]
- Hierarchical Variational Memory for Few-shot Learning Across DomainsYing-jun Du, Xiantong Zhen, Ling Shao 0001, Cees G. M. Snoek. [doi]
- Predicting Physics in Mesh-reduced Space with Temporal AttentionXu Han, Han Gao, Tobias Pfaff, Jian Xun Wang, Liping Liu. [doi]
- Global Convergence of Multi-Agent Policy Gradient in Markov Potential GamesStefanos Leonardos, Will Overman, Ioannis Panageas, Georgios Piliouras. [doi]
- iLQR-VAE : control-based learning of input-driven dynamics with applications to neural dataMarine Schimel, Ta-Chu Kao, Kristopher T. Jensen, Guillaume Hennequin. [doi]
- Fast topological clustering with Wasserstein distanceTananun Songdechakraiwut, Bryan M. Krause, Matthew I. Banks, Kirill V. Nourski, Barry D. Van Veen. [doi]
- The Unreasonable Effectiveness of Random Pruning: Return of the Most Naive Baseline for Sparse TrainingShiwei Liu, Tianlong Chen, Xiaohan Chen, Li Shen 0008, Decebal Constantin Mocanu, Zhangyang Wang, Mykola Pechenizkiy. [doi]
- Eigencurve: Optimal Learning Rate Schedule for SGD on Quadratic Objectives with Skewed Hessian SpectrumsRui Pan, Haishan Ye, Tong Zhang 0001. [doi]
- Efficient Split-Mix Federated Learning for On-Demand and In-Situ CustomizationJunyuan Hong, Haotao Wang, Zhangyang Wang, Jiayu Zhou. [doi]
- Unsupervised Discovery of Object Radiance FieldsHong-Xing Yu, Leonidas J. Guibas, Jiajun Wu 0001. [doi]
- Fairness Guarantees under Demographic ShiftStephen Giguere 0001, Blossom Metevier, Bruno Castro da Silva, Yuriy Brun, Philip S. Thomas, Scott Niekum. [doi]
- AdaRL: What, Where, and How to Adapt in Transfer Reinforcement LearningBiwei Huang, Fan Feng, Chaochao Lu, Sara Magliacane, Kun Zhang 0001. [doi]
- Boosted Curriculum Reinforcement LearningPascal Klink, Carlo D'Eramo, Jan Peters 0001, Joni Pajarinen. [doi]
- Optimization inspired Multi-Branch Equilibrium ModelsMingjie Li, Yisen Wang 0001, Xingyu Xie, Zhouchen Lin. [doi]
- Offline Neural Contextual Bandits: Pessimism, Optimization and GeneralizationThanh Nguyen-Tang, Sunil Gupta 0001, A. Tuan Nguyen, Svetha Venkatesh. [doi]
- SGD Can Converge to Local MaximaZiyin Liu, Botao Li, James B Simon, Masahito Ueda. [doi]
- Local Feature Swapping for Generalization in Reinforcement LearningDavid Bertoin, Emmanuel Rachelson. [doi]
- Graph Condensation for Graph Neural NetworksWei Jin, Lingxiao Zhao, Shichang Zhang, Yozen Liu, Jiliang Tang, Neil Shah. [doi]
- Transformers Can Do Bayesian InferenceSamuel Müller 0005, Noah Hollmann, Sebastian Pineda-Arango, Josif Grabocka, Frank Hutter. [doi]
- Gradient Matching for Domain GeneralizationYuge Shi, Jeffrey Seely, Philip H. S. Torr, Siddharth Narayanaswamy, Awni Y. Hannun, Nicolas Usunier, Gabriel Synnaeve. [doi]
- SOSP: Efficiently Capturing Global Correlations by Second-Order Structured PruningManuel Nonnenmacher, Thomas Pfeil, Ingo Steinwart, David Reeb. [doi]
- Graph Neural Networks with Learnable Structural and Positional RepresentationsVijay Prakash Dwivedi, Anh Tuan Luu, Thomas Laurent 0001, Yoshua Bengio, Xavier Bresson. [doi]
- How to deal with missing data in supervised deep learning?Niels Bruun Ipsen, Pierre-Alexandre Mattei, Jes Frellsen. [doi]
- Relating transformers to models and neural representations of the hippocampal formationJames C. R. Whittington, Joseph Warren, Tim E. J. Behrens. [doi]
- Pseudo-Labeled Auto-Curriculum Learning for Semi-Supervised Keypoint LocalizationCan Wang, Sheng Jin 0007, Yingda Guan, Wentao Liu 0002, Chen Qian 0006, Ping Luo 0002, Wanli Ouyang. [doi]
- Anytime Dense Prediction with Confidence AdaptivityZhuang Liu 0003, Zhiqiu Xu, Hung-Ju Wang, Trevor Darrell, Evan Shelhamer. [doi]
- Latent Variable Sequential Set Transformers for Joint Multi-Agent Motion PredictionRoger Girgis, Florian Golemo, Felipe Codevilla, Martin Weiss, Jim Aldon D'Souza, Samira Ebrahimi Kahou, Felix Heide, Christopher Pal. [doi]
- Dealing with Non-Stationarity in MARL via Trust-Region DecompositionWenhao Li, Xiangfeng Wang, Bo Jin 0003, Junjie Sheng, Hongyuan Zha. [doi]
- Learning Multimodal VAEs through Mutual SupervisionTom Joy, Yuge Shi, Philip H. S. Torr, Tom Rainforth, Sebastian M. Schmon, Siddharth Narayanaswamy. [doi]
- The Inductive Bias of In-Context Learning: Rethinking Pretraining Example DesignYoav Levine, Noam Wies, Daniel Jannai, Dan Navon, Yedid Hoshen, Amnon Shashua. [doi]
- VAE Approximation Error: ELBO and Exponential FamiliesAlexander Shekhovtsov, Dmitrij Schlesinger, Boris Flach. [doi]
- Hindsight: Posterior-guided training of retrievers for improved open-ended generationAshwin Paranjape, Omar Khattab, Christopher Potts, Matei Zaharia, Christopher D. Manning. [doi]
- Efficient Sharpness-aware Minimization for Improved Training of Neural NetworksJiawei Du, Hanshu Yan, Jiashi Feng, Joey Tianyi Zhou, Liangli Zhen, Rick Siow Mong Goh, Vincent Y. F. Tan. [doi]
- Data Poisoning Won't Save You From Facial RecognitionEvani Radiya-Dixit, Sanghyun Hong 0001, Nicholas Carlini, Florian Tramèr. [doi]
- BEiT: BERT Pre-Training of Image TransformersHangbo Bao, Li Dong 0004, Songhao Piao, Furu Wei. [doi]
- How unlabeled data improve generalization in self-training? A one-hidden-layer theoretical analysisShuai Zhang 0015, Meng Wang 0003, Sijia Liu, Pin-Yu Chen, Jinjun Xiong. [doi]
- Data-Driven Offline Optimization for Architecting Hardware AcceleratorsAviral Kumar, Amir Yazdanbakhsh, Milad Hashemi, Kevin Swersky, Sergey Levine. [doi]
- Disentanglement Analysis with Partial Information DecompositionSeiya Tokui, Issei Sato. [doi]
- New Insights on Reducing Abrupt Representation Change in Online Continual LearningLucas Caccia, Rahaf Aljundi, Nader Asadi, Tinne Tuytelaars, Joelle Pineau, Eugene Belilovsky. [doi]
- Capturing Structural Locality in Non-parametric Language ModelsFrank F. Xu, Junxian He, Graham Neubig, Vincent Josua Hellendoorn. [doi]
- Maximum n-times Coverage for Vaccine DesignGe Liu, Alexander Dimitrakakis, Brandon Carter 0001, David K. Gifford. [doi]
- Offline Reinforcement Learning with Value-based Episodic MemoryXiaoteng Ma, Yiqin Yang, Hao Hu, Jun Yang 0028, Chongjie Zhang, Qianchuan Zhao, Bin Liang, Qihan Liu. [doi]
- Learning Representation from Neural Fisher Kernel with Low-rank ApproximationRuixiang Zhang, Shuangfei Zhai, Etai Littwin, Joshua M. Susskind. [doi]
- On Evaluation Metrics for Graph Generative ModelsRylee Thompson, Boris Knyazev 0001, Elahe Ghalebi, Jungtaek Kim 0001, Graham W. Taylor. [doi]
- IFR-Explore: Learning Inter-object Functional Relationships in 3D Indoor ScenesQi Li, Kaichun Mo, Yanchao Yang, Hang Zhao, Leonidas J. Guibas. [doi]
- Geometric Transformers for Protein Interface Contact PredictionAlex Morehead, Chen Chen, Jianlin Cheng. [doi]
- Graph-Augmented Normalizing Flows for Anomaly Detection of Multiple Time SeriesEnyan Dai, Jie Chen. [doi]
- Model Agnostic Interpretability for Multiple Instance LearningJoseph Early, Christine Evers, Sarvapali Ramchurn. [doi]
- GiraffeDet: A Heavy-Neck Paradigm for Object DetectionYiqi Jiang, Zhiyu Tan, Junyan Wang, Xiuyu Sun, Ming Lin, Hao Li 0030. [doi]
- Optimizer AmalgamationTianshu Huang, Tianlong Chen, Sijia Liu 0001, Shiyu Chang, Lisa Amini, Zhangyang Wang. [doi]
- Backdoor Defense via Decoupling the Training ProcessKunzhe Huang, Yiming Li 0004, Baoyuan Wu, Zhan Qin, Kui Ren 0001. [doi]
- Tighter Sparse Approximation Bounds for ReLU Neural NetworksCarles Domingo-Enrich, Youssef Mroueh. [doi]
- Learning Altruistic Behaviours in Reinforcement Learning without External RewardsTim Franzmeyer, Mateusz Malinowski, João F. Henriques. [doi]
- On Distributed Adaptive Optimization with Gradient CompressionXiaoyun Li, Belhal Karimi, Ping Li. [doi]
- MAML is a Noisy Contrastive Learner in ClassificationChia-Hsiang Kao, Wei-chen Chiu, Pin-Yu Chen. [doi]
- Understanding the Role of Self Attention for Efficient Speech RecognitionKyuhong Shim, Jungwook Choi, Wonyong Sung. [doi]
- Surrogate Gap Minimization Improves Sharpness-Aware TrainingJuntang Zhuang, Boqing Gong, Liangzhe Yuan, Yin Cui, Hartwig Adam, Nicha C. Dvornek, Sekhar Tatikonda, James S. Duncan, Ting Liu 0005. [doi]
- NeuPL: Neural Population LearningSiqi Liu, Luke Marris, Daniel Hennes, Josh Merel, Nicolas Heess, Thore Graepel. [doi]
- CLEVA-Compass: A Continual Learning Evaluation Assessment Compass to Promote Research Transparency and ComparabilityMartin Mundt, Steven Lang, Quentin Delfosse, Kristian Kersting. [doi]
- Optimizing Neural Networks with Gradient Lexicase SelectionLi Ding, Lee Spector. [doi]
- Visual Representation Learning over Latent DomainsLucas Deecke, Timothy M. Hospedales, Hakan Bilen. [doi]
- Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillationsFangyu Liu 0001, Yunlong Jiao, Jordan Massiah, Emine Yilmaz, Serhii Havrylov. [doi]
- CycleMLP: A MLP-like Architecture for Dense PredictionShoufa Chen, Enze Xie, Chongjian Ge, Runjian Chen, Ding Liang, Ping Luo. [doi]
- Large Learning Rate Tames Homogeneity: Convergence and Balancing EffectYuqing Wang, Minshuo Chen, Tuo Zhao, Molei Tao. [doi]
- Neural Networks as Kernel Learners: The Silent Alignment EffectAlexander Atanasov, Blake Bordelon, Cengiz Pehlevan. [doi]
- From Intervention to Domain Transportation: A Novel Perspective to Optimize RecommendationDa Xu, Yuting Ye, Chuanwei Ruan, Evren Körpeoglu, Sushant Kumar, Kannan Achan. [doi]
- Learning to Extend Molecular Scaffolds with Structural MotifsKrzysztof Maziarz, Henry Richard Jackson-Flux, Pashmina Cameron, Finton Sirockin, Nadine Schneider, Nikolaus Stiefl, Marwin H. S. Segler, Marc Brockschmidt. [doi]
- Variational methods for simulation-based inferenceManuel Glöckler, Michael Deistler, Jakob H. Macke. [doi]
- F8Net: Fixed-Point 8-bit Only Multiplication for Network QuantizationQing Jin, Jian Ren, Richard Zhuang, Sumant Hanumante, Zhengang Li, ZhiYu Chen, Yanzhi Wang, Kaiyuan Yang 0001, Sergey Tulyakov. [doi]
- Generalisation in Lifelong Reinforcement Learning through Logical CompositionGeraud Nangue Tasse, Steven James, Benjamin Rosman. [doi]
- Scattering Networks on the Sphere for Scalable and Rotationally Equivariant Spherical CNNsJason D. McEwen, Christopher G. R. Wallis, Augustine N. Mavor-Parker. [doi]
- Vector-quantized Image Modeling with Improved VQGANJiahui Yu, Xin Li, Jing Yu Koh, Han Zhang, Ruoming Pang, James Qin, Alexander Ku, Yuanzhong Xu, Jason Baldridge, Yonghui Wu. [doi]
- Chaos is a Ladder: A New Theoretical Understanding of Contrastive Learning via Augmentation OverlapYifei Wang 0001, Qi Zhang, Yisen Wang 0001, Jiansheng Yang, Zhouchen Lin. [doi]
- Auto-scaling Vision Transformers without TrainingWuyang Chen, Wei Huang, Xianzhi Du, Xiaodan Song, Zhangyang Wang, Denny Zhou. [doi]
- DARA: Dynamics-Aware Reward Augmentation in Offline Reinforcement LearningJinxin Liu, Hongyin Zhang, Donglin Wang. [doi]
- A Program to Build E(N)-Equivariant Steerable CNNsGabriele Cesa, Leon Lang, Maurice Weiler. [doi]
- Surrogate NAS Benchmarks: Going Beyond the Limited Search Spaces of Tabular NAS BenchmarksArber Zela, Julien Niklas Siems, Lucas Zimmer, Jovita Lukasik, Margret Keuper, Frank Hutter. [doi]
- Mind the Gap: Domain Gap Control for Single Shot Domain Adaptation for Generative Adversarial NetworksPeihao Zhu, Rameen Abdal, John Femiani 0001, Peter Wonka. [doi]
- Pix2seq: A Language Modeling Framework for Object DetectionTing Chen, Saurabh Saxena, Lala Li, David J. Fleet, Geoffrey E. Hinton. [doi]
- Information-theoretic Online Memory Selection for Continual LearningShengyang Sun, Daniele Calandriello, Huiyi Hu, Ang Li, Michalis K. Titsias. [doi]
- NAS-Bench-Suite: NAS Evaluation is (Now) Surprisingly EasyYash Mehta, Colin White, Arber Zela, Arjun Krishnakumar, Guri Zabergja, Shakiba Moradian, Mahmoud Safari, Kaicheng Yu, Frank Hutter. [doi]
- AEVA: Black-box Backdoor Detection Using Adversarial Extreme Value AnalysisJunfeng Guo, Ang Li, Cong Liu. [doi]
- Data-Efficient Graph Grammar Learning for Molecular GenerationMinghao Guo, Veronika Thost, Beichen Li, Payel Das, Jie Chen 0007, Wojciech Matusik. [doi]
- Revisiting Design Choices in Offline Model Based Reinforcement LearningCong Lu, Philip J. Ball, Jack Parker-Holder, Michael A. Osborne, Stephen J. Roberts. [doi]
- On Predicting Generalization using GANsYi Zhang 0074, Arushi Gupta, Nikunj Saunshi, Sanjeev Arora. [doi]
- Neural Relational Inference with Node-Specific InformationErshad Banijamali. [doi]
- NODE-GAM: Neural Generalized Additive Model for Interpretable Deep LearningChun-Hao Chang, Rich Caruana 0001, Anna Goldenberg. [doi]
- Robust and Scalable SDE Learning: A Functional PerspectiveScott Alexander Cameron, Tyron Luke Cameron, Arnu Pretorius, Stephen J. Roberts. [doi]
- Exploring Memorization in Adversarial TrainingYinpeng Dong, Ke Xu, Xiao Yang, Tianyu Pang, Zhijie Deng, Hang Su, Jun Zhu. [doi]
- Optimal Transport for Long-Tailed Recognition with Learnable Cost MatrixHanyu Peng, Mingming Sun, Ping Li. [doi]
- Mirror Descent Policy OptimizationManan Tomar, Lior Shani, Yonathan Efroni, Mohammad Ghavamzadeh. [doi]
- Adversarial Retriever-Ranker for Dense Text RetrievalHang Zhang, Yeyun Gong, Yelong Shen, Jiancheng Lv 0001, Nan Duan, Weizhu Chen. [doi]
- Learning Features with Parameter-Free LayersDongyoon Han, Young Joon Yoo, Beomyoung Kim, Byeongho Heo. [doi]
- VAT-Mart: Learning Visual Action Trajectory Proposals for Manipulating 3D ARTiculated ObjectsRuihai Wu, Yan Zhao, Kaichun Mo, Zizheng Guo, Yian Wang, Tianhao Wu, Qingnan Fan, Xuelin Chen, Leonidas J. Guibas, Hao Dong 0003. [doi]
- Relational Multi-Task Learning: Modeling Relations between Data and TasksKaidi Cao, Jiaxuan You, Jure Leskovec. [doi]
- Learning Causal Models from Conditional Moment Restrictions by Importance WeightingMasahiro Kato, Masaaki Imaizumi, Kenichiro McAlinn, Shota Yasui, Haruo Kakehi. [doi]
- A global convergence theory for deep ReLU implicit networks via over-parameterizationTianxiang Gao, Hailiang Liu, Jia Liu, Hridesh Rajan, Hongyang Gao. [doi]
- Learning Pruning-Friendly Networks via Frank-Wolfe: One-Shot, Any-Sparsity, And No RetrainingLu Miao, Xiaolong Luo, Tianlong Chen, Wuyang Chen, Dong Liu, Zhangyang Wang. [doi]
- Gradient Importance Learning for Incomplete ObservationsQitong Gao, Dong Wang 0037, Joshua David Amason, Siyang Yuan, Chenyang Tao, Ricardo Henao, Majda Hadziahmetovic, Lawrence Carin, Miroslav Pajic. [doi]
- ARTEMIS: Attention-based Retrieval with Text-Explicit Matching and Implicit SimilarityGinger Delmas, Rafael Sampaio de Rezende, Gabriela Csurka, Diane Larlus. [doi]
- Escaping limit cycles: Global convergence for constrained nonconvex-nonconcave minimax problemsThomas Pethick, Puya Latafat, Panos Patrinos, Olivier Fercoq, Volkan Cevher. [doi]
- GeneDisco: A Benchmark for Experimental Design in Drug DiscoveryArash Mehrjou, Ashkan Soleymani, Andrew Jesson, Pascal Notin, Yarin Gal, Stefan Bauer, Patrick Schwab. [doi]
- Modeling Label Space Interactions in Multi-label Classification using Box EmbeddingsDhruvesh Patel, Pavitra Dangati, Jay Yoon Lee, Michael Boratko, Andrew McCallum. [doi]
- Space-Time Graph Neural NetworksSamar Hadou, Charilaos I. Kanatsoulis, Alejandro Ribeiro. [doi]
- A Johnson-Lindenstrauss Framework for Randomly Initialized CNNsIdo Nachum, Jan Hazla, Michael Gastpar, Anatoly Khina. [doi]
- Multi-Critic Actor Learning: Teaching RL Policies to Act with StyleSiddharth Mysore, George Cheng, Yunqi Zhao, Kate Saenko, Meng Wu. [doi]
- Multi-Agent MDP Homomorphic NetworksElise van der Pol, Herke van Hoof, Frans A. Oliehoek, Max Welling. [doi]
- Know Your Action Set: Learning Action Relations for Reinforcement LearningAyush Jain, Norio Kosaka, Kyung Min Kim, Joseph J. Lim. [doi]
- An Unconstrained Layer-Peeled Perspective on Neural CollapseWenlong Ji, Yiping Lu, Yiliang Zhang, Zhun Deng, Weijie J. Su. [doi]
- Efficiently Modeling Long Sequences with Structured State SpacesAlbert Gu, Karan Goel, Christopher Ré. [doi]
- Accelerated Policy Learning with Parallel Differentiable SimulationJie Xu 0028, Viktor Makoviychuk, Yashraj S. Narang, Fabio Ramos, Wojciech Matusik, Animesh Garg, Miles Macklin. [doi]
- EE-Net: Exploitation-Exploration Neural Networks in Contextual BanditsYikun Ban, Yuchen Yan, Arindam Banerjee, Jingrui He. [doi]
- Policy Smoothing for Provably Robust Reinforcement LearningAounon Kumar, Alexander Levine 0001, Soheil Feizi. [doi]
- T-WaveNet: A Tree-Structured Wavelet Neural Network for Time Series Signal AnalysisMinhao Liu, Ailing Zeng, Qiuxia Lai, Ruiyuan Gao 0001, Min Li, Jing Qin 0001, Qiang Xu 0001. [doi]
- Neural Network Approximation based on Hausdorff distance of Tropical ZonotopesPanagiotis Misiakos, Georgios Smyrnis, George Retsinas, Petros Maragos. [doi]
- Which Shortcut Cues Will DNNs Choose? A Study from the Parameter-Space PerspectiveLuca Scimeca, Seong Joon Oh, Sanghyuk Chun, Michael Poli, Sangdoo Yun. [doi]
- Quadtree Attention for Vision TransformersShitao Tang, Jiahui Zhang, Siyu Zhu, Ping Tan. [doi]
- SphereFace2: Binary Classification is All You Need for Deep Face RecognitionYanDong Wen, Weiyang Liu, Adrian Weller, Bhiksha Raj, Rita Singh. [doi]
- PiCO: Contrastive Label Disambiguation for Partial Label LearningHaobo Wang, Ruixuan Xiao, Yixuan Li, Lei Feng, Gang Niu 0001, Gang Chen, Junbo Zhao. [doi]
- Knowledge Infused DecodingRuibo Liu, Guoqing Zheng, Shashank Gupta, Radhika Gaonkar, Chongyang Gao, Soroush Vosoughi, Milad Shokouhi, Ahmed Hassan Awadallah. [doi]
- A Generalized Weighted Optimization Method for Computational Learning and InversionKui Ren 0002, Yunan Yang, Björn Engquist. [doi]
- PEARL: Data Synthesis via Private Embeddings and Adversarial Reconstruction LearningSeng Pei Liew, Tsubasa Takahashi 0001, Michihiko Ueno. [doi]
- SPIRAL: Self-supervised Perturbation-Invariant Representation Learning for Speech Pre-TrainingWenyong Huang, Zhenhe Zhang, Yu Ting Yeung, Xin Jiang 0002, Qun Liu 0001. [doi]
- CodeTrek: Flexible Modeling of Code using an Extensible Relational RepresentationPardis Pashakhanloo, Aaditya Naik, Yuepeng Wang 0001, Hanjun Dai, Petros Maniatis, Mayur Naik. [doi]
- An Experimental Design Perspective on Model-Based Reinforcement LearningViraj Mehta, Biswajit Paria, Jeff Schneider, Stefano Ermon, Willie Neiswanger. [doi]
- Towards Model Agnostic Federated Learning Using Knowledge DistillationAndrei Afonin, Sai Praneeth Karimireddy. [doi]
- Unraveling Model-Agnostic Meta-Learning via The Adaptation Learning RateYingtian Zou, Fusheng Liu, Qianxiao Li. [doi]
- Provably Robust Adversarial ExamplesDimitar Iliev Dimitrov, Gagandeep Singh 0001, Timon Gehr, Martin T. Vechev. [doi]
- Exploring the Limits of Large Scale Pre-trainingSamira Abnar, Mostafa Dehghani 0001, Behnam Neyshabur, Hanie Sedghi. [doi]
- Stability Regularization for Discrete Representation LearningAdeel Pervez, Efstratios Gavves. [doi]
- Understanding Domain Randomization for Sim-to-real TransferXiaoyu Chen, Jiachen Hu, Chi Jin, Lihong Li, Liwei Wang. [doi]
- Efficient Computation of Deep Nonlinear Infinite-Width Neural Networks that Learn FeaturesGreg Yang, Michael Santacroce, Edward J. Hu. [doi]
- Representing Mixtures of Word Embeddings with Mixtures of Topic EmbeddingsDongsheng Wang 0003, Dandan Guo, He Zhao 0001, Huangjie Zheng, Korawat Tanwisuth, Bo Chen, Mingyuan Zhou. [doi]
- The Evolution of Uncertainty of Learning in GamesYun Kuen Cheung, Georgios Piliouras, Yixin Tao. [doi]
- On the Optimal Memorization Power of ReLU Neural NetworksGal Vardi, Gilad Yehudai, Ohad Shamir. [doi]
- Learning Fast Samplers for Diffusion Models by Differentiating Through Sample QualityDaniel Watson, William Chan, Jonathan Ho, Mohammad Norouzi 0002. [doi]
- Towards Building A Group-based Unsupervised Representation Disentanglement FrameworkTao Yang, Xuanchi Ren, Yuwang Wang, Wenjun Zeng, Nanning Zheng 0001. [doi]
- Online Facility Location with PredictionsShaofeng H.-C. Jiang, Erzhi Liu, You Lyu, Zhihao Gavin Tang, Yubo Zhang. [doi]
- Boosting the Certified Robustness of L-infinity Distance NetsBohang Zhang, Du Jiang, Di He, Liwei Wang 0001. [doi]
- On the Uncomputability of Partition Functions in Energy-Based Sequence ModelsChu-Cheng Lin, Arya D. McCarthy. [doi]
- Learning Synthetic Environments and Reward Networks for Reinforcement LearningFabio Ferreira, Thomas Nierhoff, Andreas Sälinger, Frank Hutter. [doi]
- Convergent Graph SolversJunyoung Park, Jinhyun Choo, Jinkyoo Park. [doi]
- Graph-less Neural Networks: Teaching Old MLPs New Tricks Via DistillationShichang Zhang, Yozen Liu, Yizhou Sun, Neil Shah. [doi]
- Partial Wasserstein Adversarial Network for Non-rigid Point Set RegistrationZiming Wang, Nan Xue 0001, Ling Lei, Gui-Song Xia. [doi]
- Learning to Schedule Learning rate with Graph Neural NetworksYuanhao Xiong, Li-Cheng Lan, Xiangning Chen, Ruochen Wang, Cho-Jui Hsieh. [doi]
- Training Transition Policies via Distribution Matching for Complex TasksJu-Seung Byun, Andrew Perrault. [doi]
- Memory Augmented Optimizers for Deep LearningPaul-Aymeric Martin McRae, Prasanna Parthasarathi, Mido Assran, Sarath Chandar. [doi]
- Gaussian Mixture Convolution NetworksAdam Celarek, Pedro Hermosilla, Bernhard Kerbl, Timo Ropinski, Michael Wimmer 0001. [doi]
- Meta Learning Low Rank Covariance Factors for Energy Based Deterministic UncertaintyJeffrey Ryan Willette, Hae Beom Lee, Juho Lee 0001, Sung Ju Hwang. [doi]
- The Effects of Reward Misspecification: Mapping and Mitigating Misaligned ModelsAlexander Pan, Kush Bhatia, Jacob Steinhardt. [doi]
- Learning the Dynamics of Physical Systems from Sparse Observations with Finite Element NetworksMarten Lienen, Stephan Günnemann. [doi]
- RvS: What is Essential for Offline RL via Supervised Learning?Scott Emmons, Benjamin Eysenbach, Ilya Kostrikov, Sergey Levine. [doi]
- Meta-Imitation Learning by Watching Video DemonstrationsJiayi Li, Tao Lu, Xiaoge Cao, Yinghao Cai, Shuo Wang. [doi]
- Leveraging unlabeled data to predict out-of-distribution performanceSaurabh Garg, Sivaraman Balakrishnan, Zachary Chase Lipton, Behnam Neyshabur, Hanie Sedghi. [doi]
- Direct then Diffuse: Incremental Unsupervised Skill Discovery for State Covering and Goal ReachingPierre-Alexandre Kamienny, Jean Tarbouriech, Sylvain Lamprier, Alessandro Lazaric, Ludovic Denoyer. [doi]
- Switch to Generalize: Domain-Switch Learning for Cross-Domain Few-Shot ClassificationZhengdong Hu, Yifan Sun, Yi Yang. [doi]
- Promoting Saliency From Depth: Deep Unsupervised RGB-D Saliency DetectionWei Ji, Jingjing Li, Qi Bi, Chuan Guo, Jie Liu, Li Cheng. [doi]
- CKConv: Continuous Kernel Convolution For Sequential DataDavid W. Romero, Anna Kuzina, Erik J. Bekkers, Jakub Mikolaj Tomczak, Mark Hoogendoorn. [doi]
- Natural Language Descriptions of Deep Visual FeaturesEvan Hernandez, Sarah Schwettmann, David Bau, Teona Bagashvili, Antonio Torralba 0001, Jacob Andreas. [doi]
- Beyond ImageNet Attack: Towards Crafting Adversarial Examples for Black-box DomainsQilong Zhang, Xiaodan Li, Yuefeng Chen, Jingkuan Song, Lianli Gao, Yuan He, Hui Xue'. [doi]
- Defending Against Image Corruptions Through Adversarial AugmentationsDan Andrei Calian, Florian Stimberg, Olivia Wiles, Sylvestre-Alvise Rebuffi, András György 0001, Timothy A. Mann, Sven Gowal. [doi]
- Sequence Approximation using Feedforward Spiking Neural Network for Spatiotemporal Learning: Theory and Optimization MethodsXueyuan She, Saurabh Dash, Saibal Mukhopadhyay. [doi]
- GNN-LM: Language Modeling based on Global Contexts via GNNYuxian Meng, Shi Zong, Xiaoya Li, Xiaofei Sun, Tianwei Zhang 0004, Fei Wu, Jiwei Li. [doi]
- What's Wrong with Deep Learning in Tree Search for Combinatorial OptimizationMaximilian Böther, Otto Kißig, Martin Taraz, Sarel Cohen, Karen Seidel 0001, Tobias Friedrich 0001. [doi]
- Pre-training Molecular Graph Representation with 3D GeometryShengchao Liu, Hanchen Wang, Weiyang Liu, Joan Lasenby, Hongyu Guo, Jian Tang. [doi]
- Learning to Downsample for Segmentation of Ultra-High Resolution ImagesChen Jin, Ryutaro Tanno, Thomy Mertzanidou, Eleftheria Panagiotaki, Daniel C. Alexander. [doi]
- Better Supervisory Signals by Observing Learning PathsYi Ren, Shangmin Guo, Danica J. Sutherland. [doi]
- Graphon based Clustering and Testing of Networks: Algorithms and TheoryMahalakshmi Sabanayagam, Leena Chennuru Vankadara, Debarghya Ghoshdastidar. [doi]
- Efficient Neural Causal Discovery without Acyclicity ConstraintsPhillip Lippe, Taco Cohen, Efstratios Gavves. [doi]
- Invariant Causal Representation Learning for Out-of-Distribution GeneralizationChaochao Lu, Yuhuai Wu, José Miguel Hernández-Lobato, Bernhard Schölkopf. [doi]
- Top-label calibration and multiclass-to-binary reductionsChirag Gupta, Aaditya Ramdas. [doi]
- The Efficiency MisnomerMostafa Dehghani 0001, Yi Tay, Anurag Arnab, Lucas Beyer, Ashish Vaswani. [doi]
- Collapse by Conditioning: Training Class-conditional GANs with Limited DataMohamad Shahbazi, Martin Danelljan, Danda Pani Paudel, Luc Van Gool. [doi]
- Actor-critic is implicitly biased towards high entropy optimal policiesYuzheng Hu, Ziwei Ji, Matus Telgarsky. [doi]
- It Takes Four to Tango: Multiagent Self Play for Automatic Curriculum GenerationYuqing Du, Pieter Abbeel, Aditya Grover. [doi]
- On the Role of Neural Collapse in Transfer LearningTomer Galanti, András György 0001, Marcus Hutter. [doi]
- Is High Variance Unavoidable in RL? A Case Study in Continuous ControlJohan Bjorck, Carla P. Gomes, Kilian Q. Weinberger. [doi]
- Reversible Instance Normalization for Accurate Time-Series Forecasting against Distribution ShiftTaesung Kim, Jinhee Kim, Yunwon Tae, Cheonbok Park, Jang-Ho Choi, Jaegul Choo. [doi]
- Plant 'n' Seek: Can You Find the Winning Ticket?Jonas Fischer, Rebekka Burkholz. [doi]
- TAPEX: Table Pre-training via Learning a Neural SQL ExecutorQian Liu, Bei Chen, Jiaqi Guo, Morteza Ziyadi, Zeqi Lin, Weizhu Chen, Jian-Guang Lou. [doi]
- Task-Induced Representation LearningJun Yamada, Karl Pertsch, Anisha Gunjal, Joseph J. Lim. [doi]
- Neural Deep Equilibrium SolversShaojie Bai, Vladlen Koltun, J. Zico Kolter. [doi]
- PriorGrad: Improving Conditional Denoising Diffusion Models with Data-Dependent Adaptive PriorSang Gil Lee, Heeseung Kim, Chaehun Shin, Xu Tan 0003, Chang Liu, Qi Meng, Tao Qin, Wei Chen 0034, Sungroh Yoon, Tie-Yan Liu. [doi]
- A Deep Variational Approach to Clustering Survival DataLaura Manduchi, Ricards Marcinkevics, Michela Carlotta Massi, Thomas J. Weikert, Alexander Sauter, Verena Gotta, Timothy Müller, Flavio Vasella, Marian C. Neidert, Marc Pfister, Bram Stieltjes, Julia E. Vogt. [doi]
- Divergence-aware Federated Self-Supervised LearningWeiming Zhuang, Yonggang Wen 0001, Shuai Zhang. [doi]
- On the benefits of maximum likelihood estimation for Regression and ForecastingPranjal Awasthi, Abhimanyu Das, Rajat Sen, Ananda Theertha Suresh. [doi]
- RelViT: Concept-guided Vision Transformer for Visual Relational ReasoningXiaojian Ma, Weili Nie, Zhiding Yu, Huaizu Jiang, Chaowei Xiao, Yuke Zhu, Song Chun Zhu, Anima Anandkumar. [doi]
- MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision TransformerSachin Mehta, Mohammad Rastegari. [doi]
- Understanding over-squashing and bottlenecks on graphs via curvatureJake Topping, Francesco Di Giovanni, Benjamin Paul Chamberlain, Xiaowen Dong 0001, Michael M. Bronstein. [doi]
- Towards Continual Knowledge Learning of Language ModelsJoel Jang, Seonghyeon Ye, Sohee Yang, Joongbo Shin, Janghoon Han, Gyeonghun Kim, Stanley Jungkyu Choi, Minjoon Seo. [doi]
- On the Generalization of Models Trained with SGD: Information-Theoretic Bounds and ImplicationsZiqiao Wang, Yongyi Mao. [doi]
- Finetuned Language Models are Zero-Shot LearnersJason Wei, Maarten Bosma, Vincent Y. Zhao, Kelvin Guu, Adams Wei Yu, Brian Lester, Nan Du, Andrew M. Dai, Quoc V. Le. [doi]
- Pyraformer: Low-Complexity Pyramidal Attention for Long-Range Time Series Modeling and ForecastingShizhan Liu, Hang Yu, Cong Liao, Jianguo Li, Weiyao Lin, Alex X. Liu, Schahram Dustdar. [doi]
- Understanding and Leveraging Overparameterization in Recursive Value EstimationChenjun Xiao, Bo Dai, Jincheng Mei, Oscar A Ramirez, Ramki Gummadi, Chris Harris, Dale Schuurmans. [doi]
- Graph-Guided Network for Irregularly Sampled Multivariate Time SeriesXiang Zhang, Marko Zeman, Theodoros Tsiligkaridis, Marinka Zitnik. [doi]
- LIGS: Learnable Intrinsic-Reward Generation Selection for Multi-Agent LearningDavid Henry Mguni, Taher Jafferjee, Jianhong Wang, Nicolas Perez Nieves, Oliver Slumbers, Feifei Tong, Yang Li, Jiangcheng Zhu, Yaodong Yang 0001, Jun Wang. [doi]
- You Mostly Walk Alone: Analyzing Feature Attribution in Trajectory PredictionOsama Makansi, Julius von Kügelgen, Francesco Locatello, Peter Vincent Gehler, Dominik Janzing, Thomas Brox, Bernhard Schölkopf. [doi]
- Learning curves for continual learning in neural networks: Self-knowledge transfer and forgettingRyo Karakida, Shotaro Akaho. [doi]
- Constructing a Good Behavior Basis for Transfer using Generalized Policy UpdatesSafa Alver, Doina Precup. [doi]
- Emergent Communication at ScaleRahma Chaabouni, Florian Strub, Florent Altché, Eugene Tarassov, Corentin Tallec, Elnaz Davoodi, Kory Wallace Mathewson, Olivier Tieleman, Angeliki Lazaridou, Bilal Piot. [doi]
- An Information Fusion Approach to Learning with Instance-Dependent Label NoiseZhimeng Jiang, Kaixiong Zhou, Zirui Liu, Li Li, Rui Chen, Soo Hyun Choi, Xia Hu. [doi]
- Exploring extreme parameter compression for pre-trained language modelsBenyou Wang, Yuxin Ren, Lifeng Shang, Xin Jiang 0002, Qun Liu 0001. [doi]
- Cold Brew: Distilling Graph Node Representations with Incomplete or Missing NeighborhoodsWenqing Zheng, Edward W. Huang, Nikhil Rao 0001, Sumeet Katariya, Zhangyang Wang, Karthik Subbian. [doi]
- Prototype memory and attention mechanisms for few shot image generationTianqin Li, Zijie Li, Andrew Luo, Harold Rockwell, Amir Barati Farimani, Tai Sing Lee. [doi]
- Ancestral protein sequence reconstruction using a tree-structured Ornstein-Uhlenbeck variational autoencoderLys Sanz Moreta, Ola Rønning, Ahmad Salim Al-Sibahi, Jotun Hein, Douglas L. Theobald, Thomas Hamelryck. [doi]
- Open-World Semi-Supervised LearningKaidi Cao, Maria Brbic, Jure Leskovec. [doi]
- Generalized Decision Transformer for Offline Hindsight Information MatchingHiroki Furuta, Yutaka Matsuo, Shixiang Shane Gu. [doi]
- Heteroscedastic Temporal Variational Autoencoder For Irregularly Sampled Time SeriesSatya Narayan Shukla, Benjamin M. Marlin. [doi]
- DictFormer: Tiny Transformer with Shared DictionaryQian Lou, Ting Hua, Yen-Chang Hsu, Yilin Shen, Hongxia Jin. [doi]
- Hot-Refresh Model Upgrades with Regression-Free Compatible Training in Image RetrievalBinjie Zhang, Yixiao Ge, Yantao Shen 0003, Yu Li, Chun Yuan, Xuyuan Xu, Yexin Wang, Ying Shan. [doi]
- On the Convergence of the Monte Carlo Exploring Starts Algorithm for Reinforcement LearningChe Wang, Shuhan Yuan, Kai Shao, Keith W. Ross. [doi]
- Policy improvement by planning with GumbelIvo Danihelka, Arthur Guez, Julian Schrittwieser, David Silver. [doi]
- CADDA: Class-wise Automatic Differentiable Data Augmentation for EEG SignalsCédric Rommel, Thomas Moreau, Joseph Paillard, Alexandre Gramfort. [doi]
- From Stars to Subgraphs: Uplifting Any GNN with Local Structure AwarenessLingxiao Zhao, Wei Jin, Leman Akoglu, Neil Shah. [doi]
- SQuant: On-the-Fly Data-Free Quantization via Diagonal Hessian ApproximationCong Guo 0003, Yuxian Qiu, Jingwen Leng, Xiaotian Gao, Chen Zhang, Yunxin Liu, Fan Yang, Yuhao Zhu 0001, Minyi Guo. [doi]
- Domino: Discovering Systematic Errors with Cross-Modal EmbeddingsSabri Eyuboglu, Maya Varma, Khaled Kamal Saab, Jean-Benoit Delbrouck, Christopher Lee-Messer, Jared Dunnmon, James Zou 0001, Christopher Ré. [doi]
- HTLM: Hyper-Text Pre-Training and Prompting of Language ModelsArmen Aghajanyan, Dmytro Okhonko, Mike Lewis, Mandar Joshi, Hu Xu 0001, Gargi Ghosh, Luke Zettlemoyer. [doi]
- Reinforcement Learning with Sparse Rewards using Guidance from Offline DemonstrationDesik Rengarajan, Gargi Vaidya, Akshay Sarvesh, Dileep M. Kalathil, Srinivas Shakkottai. [doi]
- Information Gain Propagation: a New Way to Graph Active Learning with Soft LabelsWentao Zhang, Yexin Wang, Zhenbang You, Meng Cao, Ping Huang, Jiulong Shan, Zhi Yang 0001, Bin Cui 0001. [doi]
- TPU-GAN: Learning temporal coherence from dynamic point cloud sequencesZijie Li, Tianqin Li, Amir Barati Farimani. [doi]
- RelaxLoss: Defending Membership Inference Attacks without Losing UtilityDingfan Chen, Ning Yu, Mario Fritz. [doi]
- On Covariate Shift of Latent Confounders in Imitation and Reinforcement LearningGuy Tennenholtz, Assaf Hallak, Gal Dalal, Shie Mannor, Gal Chechik, Uri Shalit. [doi]
- Deep Point Cloud ReconstructionJaesung Choe, Byeongin Joung, François Rameau, Jaesik Park, In-So Kweon. [doi]
- Neural Parameter Allocation SearchBryan A. Plummer, Nikoli Dryden, Julius Frost, Torsten Hoefler, Kate Saenko. [doi]
- P-Adapters: Robustly Extracting Factual Information from Language Models with Diverse PromptsBenjamin Newman, Prafulla Kumar Choubey, Nazneen Rajani. [doi]
- Goal-Directed Planning via Hindsight Experience ReplayLorenzo Moro, Amarildo Likmeta, Enrico Prati, Marcello Restelli. [doi]
- Orchestrated Value Mapping for Reinforcement LearningMehdi Fatemi, Arash Tavakoli. [doi]
- Online Target Q-learning with Reverse Experience Replay: Efficiently finding the Optimal Policy for Linear MDPsNaman Agarwal, Syomantak Chaudhuri, Prateek Jain 0002, Dheeraj Mysore Nagaraj, Praneeth Netrapalli. [doi]
- Representational Continuity for Unsupervised Continual LearningDivyam Madaan, Jaehong Yoon, Yuanchun Li, Yunxin Liu, Sung Ju Hwang. [doi]
- Probabilistic Implicit Scene CompletionDongsu Zhang, Changwoon Choi, Inbum Park, Young Min Kim 0001. [doi]
- Multitask Prompted Training Enables Zero-Shot Task GeneralizationVictor Sanh, Albert Webson, Colin Raffel, Stephen Bach, Lintang Sutawika, Zaid Alyafeai, Antoine Chaffin, Arnaud Stiegler, Arun Raja, Manan Dey, M Saiful Bari, Canwen Xu, Urmish Thakker, Shanya Sharma Sharma, Eliza Szczechla, Taewoon Kim 0002, Gunjan Chhablani, Nihal V. Nayak, Debajyoti Datta, Jonathan Chang, Mike Tian-Jian Jiang, Han Wang, Matteo Manica, Sheng Shen, Zheng Xin Yong, Harshit Pandey, Rachel Bawden, Thomas Wang, Trishala Neeraj, Jos Rozen, Abheesht Sharma, Andrea Santilli, Thibault Févry, Jason Alan Fries, Ryan Teehan, Teven Le Scao, Stella Biderman, Leo Gao, Thomas Wolf 0008, Alexander M. Rush. [doi]
- Improving Mutual Information Estimation with Annealed and Energy-Based BoundsRob Brekelmans, Sicong Huang, Marzyeh Ghassemi, Greg Ver Steeg, Roger Baker Grosse, Alireza Makhzani. [doi]
- Trust Region Policy Optimisation in Multi-Agent Reinforcement LearningJakub Grudzien Kuba, Ruiqing Chen, Muning Wen, Ying Wen 0001, Fanglei Sun, Jun Wang 0012, Yaodong Yang 0001. [doi]
- Chunked Autoregressive GAN for Conditional Waveform SynthesisMax Morrison, Rithesh Kumar, Kundan Kumar, Prem Seetharaman, Aaron C. Courville, Yoshua Bengio. [doi]
- Deep Attentive Variational InferenceIfigeneia Apostolopoulou, Ian Char, Elan Rosenfeld, Artur Dubrawski. [doi]
- Random matrices in service of ML footprint: ternary random features with no performance lossHafiz Tiomoko Ali, Zhenyu Liao 0001, Romain Couillet. [doi]
- WeakM3D: Towards Weakly Supervised Monocular 3D Object DetectionLiang Peng, Senbo Yan, Boxi Wu, Zheng Yang, Xiaofei He 0001, Deng Cai 0001. [doi]
- Subspace Regularizers for Few-Shot Class Incremental LearningAfra Feyza Akyürek, Ekin Akyürek, Derry Wijaya, Jacob Andreas. [doi]