Abstract is missing.
- Poisoning and Backdooring Contrastive LearningNicholas Carlini, Andreas Terzis. [doi]
- ADAVI: Automatic Dual Amortized Variational Inference Applied To Pyramidal Bayesian ModelsLouis Rouillard, Demian Wassermann. [doi]
- Bayesian Modeling and Uncertainty Quantification for Learning to Optimize: What, Why, and HowYuning You, Yue Cao, Tianlong Chen, Zhangyang Wang, Yang Shen. [doi]
- L0-Sparse Canonical Correlation AnalysisOfir Lindenbaum, Moshe Salhov, Amir Averbuch, Yuval Kluger. [doi]
- Understanding and Preventing Capacity Loss in Reinforcement LearningClare Lyle, Mark Rowland, Will Dabney. [doi]
- Out-of-distribution Generalization in the Presence of Nuisance-Induced Spurious CorrelationsAahlad Manas Puli, Lily H. Zhang, Eric Karl Oermann, Rajesh Ranganath. [doi]
- Bundle Networks: Fiber Bundles, Local Trivializations, and a Generative Approach to Exploring Many-to-one MapsNico Courts, Henry Kvinge. [doi]
- Scene Transformer: A unified architecture for predicting future trajectories of multiple agentsJiquan Ngiam, Vijay Vasudevan, Benjamin Caine, Zhengdong Zhang, Hao-Tien Lewis Chiang, Jeffrey Ling, Rebecca Roelofs, Alex Bewley, Chenxi Liu, Ashish Venugopal, David J. Weiss, Ben Sapp, Zhifeng Chen, Jonathon Shlens. [doi]
- ToM2C: Target-oriented Multi-agent Communication and Cooperation with Theory of MindYuanfei Wang, Fangwei Zhong, Jing Xu, Yizhou Wang 0001. [doi]
- How Much Can CLIP Benefit Vision-and-Language Tasks?Sheng Shen, Liunian Harold Li, Hao Tan, Mohit Bansal, Anna Rohrbach, Kai-Wei Chang, Zhewei Yao, Kurt Keutzer. [doi]
- Topological Experience ReplayZhang-Wei Hong, Tao Chen 0046, Yen-Chen Lin, Joni Pajarinen, Pulkit Agrawal. [doi]
- On the Pitfalls of Heteroscedastic Uncertainty Estimation with Probabilistic Neural NetworksMaximilian Seitzer, Arash Tavakoli, Dimitrije Antic, Georg Martius. [doi]
- One After Another: Learning Incremental Skills for a Changing WorldNur Muhammad (Mahi) Shafiullah, Lerrel Pinto. [doi]
- Do Not Escape From the Manifold: Discovering the Local Coordinates on the Latent Space of GANsJaewoong Choi, Junho Lee, Changyeon Yoon, Jung-Ho Park, Geonho Hwang, Myungjoo Kang. [doi]
- Blaschke Product Neural Networks (BPNN): A Physics-Infused Neural Network for Phase Retrieval of Meromorphic FunctionsJuncheng Dong, Simiao Ren, Yang Deng, Omar Khatib, Jordan M. Malof, Mohammadreza Soltani, Willie Padilla, Vahid Tarokh. [doi]
- Learning to Remember Patterns: Pattern Matching Memory Networks for Traffic ForecastingHyunwook Lee, Seungmin Jin, Hyeshin Chu, Hongkyu Lim, Sungahn Ko. [doi]
- On the approximation properties of recurrent encoder-decoder architecturesZhong Li, Haotian Jiang, Qianxiao Li. [doi]
- Iterated Reasoning with Mutual Information in Cooperative and Byzantine Decentralized TeamingSachin G. Konan, Esmaeil Seraj, Matthew C. Gombolay. [doi]
- Should We Be Pre-training? An Argument for End-task Aware Training as an AlternativeLucio M. Dery, Paul Michel, Ameet Talwalkar, Graham Neubig. [doi]
- Non-Transferable Learning: A New Approach for Model Ownership Verification and Applicability AuthorizationLixu Wang, Shichao Xu, Ruiqi Xu, Xiao Wang, Qi Zhu 0002. [doi]
- On the Learning and Learnability of QuasimetricsTongzhou Wang 0001, Phillip Isola. [doi]
- On the Importance of Difficulty Calibration in Membership Inference AttacksLauren Watson, Chuan Guo, Graham Cormode, Alexandre Sablayrolles. [doi]
- Object Dynamics Distillation for Scene Decomposition and RepresentationQu Tang, Xiangyu Zhu, Zhen Lei 0001, Zhaoxiang Zhang. [doi]
- Possibility Before Utility: Learning And Using Hierarchical AffordancesRobby Costales, Shariq Iqbal, Fei Sha. [doi]
- Generative Models as a Data Source for Multiview Representation LearningAli Jahanian 0002, Xavier Puig, Yonglong Tian, Phillip Isola. [doi]
- Model Zoo: A Growing Brain That Learns ContinuallyRahul Ramesh, Pratik Chaudhari. [doi]
- R5: Rule Discovery with Reinforced and Recurrent Relational ReasoningShengyao Lu, Bang Liu, Keith G Mills, Shangling Jui, Di Niu. [doi]
- OntoProtein: Protein Pretraining With Gene Ontology EmbeddingNingyu Zhang, Zhen Bi, Xiaozhuan Liang, Siyuan Cheng 0008, Haosen Hong, Shumin Deng, Qiang Zhang, Jiazhang Lian, Huajun Chen. [doi]
- Communication-Efficient Actor-Critic Methods for Homogeneous Markov GamesDingyang Chen, Yile Li, Qi Zhang. [doi]
- How many degrees of freedom do we need to train deep networks: a loss landscape perspectiveBrett W. Larsen, Stanislav Fort, Nic Becker, Surya Ganguli. [doi]
- Effective Model Sparsification by Scheduled Grow-and-Prune MethodsXiaolong Ma, Minghai Qin, Fei Sun, Zejiang Hou, Kun Yuan, Yi Xu, Yanzhi Wang, Yen-Kuang Chen, Rong Jin 0001, Yuan Xie 0008. [doi]
- Triangle and Four Cycle Counting with Predictions in Graph StreamsJustin Y. Chen, Talya Eden, Piotr Indyk, Honghao Lin, Shyam Narayanan, Ronitt Rubinfeld, Sandeep Silwal, Tal Wagner, David Woodruff, Michael Zhang. [doi]
- Neural Solvers for Fast and Accurate Numerical Optimal ControlFederico Berto, Stefano Massaroli, Michael Poli, Jinkyoo Park. [doi]
- Learning to Guide and to be Guided in the Architect-Builder ProblemPaul Barde, Tristan Karch, Derek Nowrouzezahrai, Clément Moulin-Frier, Christopher Pal, Pierre-Yves Oudeyer. [doi]
- How Well Does Self-Supervised Pre-Training Perform with Streaming Data?Dapeng Hu, Shipeng Yan, Qizhengqiu Lu, Lanqing Hong, Hailin Hu, Yifan Zhang, Zhenguo Li, Xinchao Wang, Jiashi Feng. [doi]
- CoST: Contrastive Learning of Disentangled Seasonal-Trend Representations for Time Series ForecastingGerald Woo, Chenghao Liu, Doyen Sahoo, Akshat Kumar, Steven C. H. Hoi. [doi]
- Differentially Private Fine-tuning of Language ModelsDa Yu, Saurabh Naik, Arturs Backurs, Sivakanth Gopi, Huseyin A. Inan, Gautam Kamath 0001, Janardhan Kulkarni, Yin Tat Lee, Andre Manoel, Lukas Wutschitz, Sergey Yekhanin, Huishuai Zhang. [doi]
- Provable Adaptation across Multiway Domains via Representation LearningZhili Feng, Shaobo Han, Simon Shaolei Du. [doi]
- Trigger Hunting with a Topological Prior for Trojan DetectionXiaoling Hu 0002, Xiao Lin, Michael Cogswell, Yi Yao, Susmit Jha, Chao Chen 0012. [doi]
- Evaluation Metrics for Graph Generative Models: Problems, Pitfalls, and Practical SolutionsLeslie O'Bray, Max Horn, Bastian Rieck, Karsten M. Borgwardt. [doi]
- Scalable One-Pass Optimisation of High-Dimensional Weight-Update Hyperparameters by Implicit DifferentiationRoss M. Clarke, Elre Talea Oldewage, José Miguel Hernández-Lobato. [doi]
- Automatic Loss Function Search for Predict-Then-Optimize Problems with Strong Ranking PropertyBoshi Wang, Jialin Yi, Hang Dong, Bo Qiao, Chuan Luo, Qingwei Lin. [doi]
- Scarf: Self-Supervised Contrastive Learning using Random Feature CorruptionDara Bahri, Heinrich Jiang, Yi Tay, Donald Metzler. [doi]
- NASI: Label- and Data-agnostic Neural Architecture Search at InitializationYao Shu, Shaofeng Cai, Zhongxiang Dai, Beng Chin Ooi, Bryan Kian Hsiang Low. [doi]
- Generalized rectifier wavelet covariance models for texture synthesisAntoine Brochard, Sixin Zhang, Stéphane Mallat. [doi]
- Solving Inverse Problems in Medical Imaging with Score-Based Generative ModelsYang Song 0011, Liyue Shen, Lei Xing 0001, Stefano Ermon. [doi]
- Revisit Kernel Pruning with Lottery Regulated Grouped ConvolutionsShaochen Zhong, Guanqun Zhang, Ningjia Huang, Shuai Xu. [doi]
- StyleAlign: Analysis and Applications of Aligned StyleGAN ModelsZongze Wu, Yotam Nitzan, Eli Shechtman, Dani Lischinski. [doi]
- Understanding and Improving Graph Injection Attack by Promoting UnnoticeabilityYongqiang Chen 0002, Han Yang 0002, Yonggang Zhang, Kaili Ma 0001, Tongliang Liu, Bo Han 0003, James Cheng. [doi]
- Sample Efficient Deep Reinforcement Learning via Uncertainty EstimationVincent Mai, Kaustubh Mani, Liam Paull. [doi]
- Multi-Stage Episodic Control for Strategic Exploration in Text GamesJens Tuyls, Shunyu Yao, Sham M. Kakade, Karthik Narasimhan. [doi]
- X-model: Improving Data Efficiency in Deep Learning with A Minimax ModelXimei Wang, Xinyang Chen, Jianmin Wang, Mingsheng Long. [doi]
- Natural Posterior Network: Deep Bayesian Predictive Uncertainty for Exponential Family DistributionsBertrand Charpentier, Oliver Borchert, Daniel Zügner, Simon Geisler, Stephan Günnemann. [doi]
- Trivial or Impossible --- dichotomous data difficulty masks model differences (on ImageNet and beyond)Kristof Meding, Luca M. Schulze Buschoff, Robert Geirhos, Felix A. Wichmann. [doi]
- Neural Models for Output-Space Invariance in Combinatorial ProblemsYatin Nandwani, Vidit Jain, Mausam, Parag Singla. [doi]
- Amortized Implicit Differentiation for Stochastic Bilevel OptimizationMichael Arbel, Julien Mairal. [doi]
- Hyperparameter Tuning with Renyi Differential PrivacyNicolas Papernot, Thomas Steinke 0002. [doi]
- Is Homophily a Necessity for Graph Neural Networks?Yao Ma 0001, Xiaorui Liu, Neil Shah, Jiliang Tang. [doi]
- Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to PracticePeihao Wang, Wenqing Zheng, Tianlong Chen, Zhangyang Wang. [doi]
- Deep ReLU Networks Preserve Expected LengthBoris Hanin, Ryan S. Jeong, David Rolnick. [doi]
- On-Policy Model Errors in Reinforcement LearningLukas P. Fröhlich, Maksym Lefarov, Melanie N. Zeilinger, Felix Berkenkamp. [doi]
- ZeroFL: Efficient On-Device Training for Federated Learning with Local SparsityXinchi Qiu, Javier Fernández-Marqués, Pedro P. B. Gusmao, Yan Gao, Titouan Parcollet, Nicholas Donald Lane. [doi]
- Decoupled Adaptation for Cross-Domain Object DetectionJunguang Jiang, Baixu Chen, Jianmin Wang, Mingsheng Long. [doi]
- Distributionally Robust Fair Principal Components via Geodesic DescentsHieu Vu, Toan Tran, Man-Chung Yue, Viet Anh Nguyen. [doi]
- Information Prioritization through Empowerment in Visual Model-based RLHomanga Bharadhwaj, Mohammad Babaeizadeh, Dumitru Erhan, Sergey Levine. [doi]
- Efficient Self-supervised Vision Transformers for Representation LearningChunyuan Li, Jianwei Yang, Pengchuan Zhang, Mei Gao, Bin Xiao, Xiyang Dai, Lu Yuan, Jianfeng Gao. [doi]
- Generative Pseudo-Inverse MemoryKha Pham, Hung Le, Man Ngo, Truyen Tran 0001, Bao Ho, Svetha Venkatesh. [doi]
- Pretrained Language Model in Continual Learning: A Comparative StudyTongtong Wu, Massimo Caccia, Zhuang Li, Yuan-Fang Li, Guilin Qi, Gholamreza Haffari. [doi]
- Retriever: Learning Content-Style Representation as a Token-Level Bipartite GraphDacheng Yin, Xuanchi Ren, Chong Luo, Yuwang Wang, Zhiwei Xiong, Wenjun Zeng. [doi]
- Wiring Up Vision: Minimizing Supervised Synaptic Updates Needed to Produce a Primate Ventral StreamFranziska Geiger, Martin Schrimpf, Tiago Marques, James J. DiCarlo. [doi]
- Step-unrolled Denoising Autoencoders for Text GenerationNikolay Savinov, Junyoung Chung, Mikolaj Binkowski, Erich Elsen, Aäron Van Den Oord. [doi]
- A Loss Curvature Perspective on Training Instabilities of Deep Learning ModelsJustin Gilmer, Behrooz Ghorbani, Ankush Garg, Sneha Kudugunta, Behnam Neyshabur, David Cardoze, George Edward Dahl, Zachary Nado, Orhan Firat. [doi]
- Learning Strides in Convolutional Neural NetworksRachid Riad, Olivier Teboul, David Grangier, Neil Zeghidour. [doi]
- COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction EstimationJongmin Lee 0004, Cosmin Paduraru, Daniel J. Mankowitz, Nicolas Heess, Doina Precup, Kee-Eung Kim, Arthur Guez. [doi]
- Stein Latent Optimization for Generative Adversarial NetworksUiwon Hwang, Heeseung Kim, Dahuin Jung, Hyemi Jang, Hyungyu Lee, Sungroh Yoon. [doi]
- Learning Super-Features for Image RetrievalPhilippe Weinzaepfel, Thomas Lucas, Diane Larlus, Yannis Kalantidis. [doi]
- Neural Contextual Bandits with Deep Representation and Shallow ExplorationPan Xu 0002, Zheng Wen, Handong Zhao, Quanquan Gu. [doi]
- Expressivity of Emergent Languages is a Trade-off between Contextual Complexity and UnpredictabilityShangmin Guo, Yi Ren, Kory Wallace Mathewson, Simon Kirby, Stefano V. Albrecht, Kenny Smith. [doi]
- How Did the Model Change? Efficiently Assessing Machine Learning API ShiftsLingjiao Chen, Matei Zaharia, James Zou 0001. [doi]
- Weighted Training for Cross-Task LearningShuxiao Chen, Koby Crammer, Hangfeng He, Dan Roth, Weijie J. Su. [doi]
- Optimization and Adaptive Generalization of Three layer Neural NetworksKhashayar Gatmiry, Stefanie Jegelka, Jonathan A. Kelner. [doi]
- Minimax Optimality (Probably) Doesn't Imply Distribution Learning for GANsSitan Chen, Jerry Li 0001, Yuanzhi Li, Raghu Meka. [doi]
- Planning in Stochastic Environments with a Learned ModelIoannis Antonoglou, Julian Schrittwieser, Sherjil Ozair, Thomas K. Hubert, David Silver. [doi]
- Discrete Representations Strengthen Vision Transformer RobustnessChengzhi Mao, Lu Jiang, Mostafa Dehghani 0001, Carl Vondrick, Rahul Sukthankar, Irfan Essa. [doi]
- Equivariant and Stable Positional Encoding for More Powerful Graph Neural NetworksHaorui Wang, Haoteng Yin, Muhan Zhang, Pan Li 0005. [doi]
- DriPP: Driven Point Processes to Model Stimuli Induced Patterns in M/EEG SignalsCédric Allain, Alexandre Gramfort, Thomas Moreau. [doi]
- Context-Aware Sparse Deep Coordination GraphsTonghan Wang 0001, Liang Zeng 0002, Weijun Dong, Qianlan Yang, Yang Yu 0001, Chongjie Zhang. [doi]
- Transition to Linearity of Wide Neural Networks is an Emerging Property of Assembling Weak ModelsChaoyue Liu 0001, Libin Zhu, Misha Belkin. [doi]
- Autoregressive Diffusion ModelsEmiel Hoogeboom, Alexey A. Gritsenko, Jasmijn Bastings, Ben Poole, Rianne van den Berg, Tim Salimans. [doi]
- Resonance in Weight Space: Covariate Shift Can Drive Divergence of SGD with MomentumKirby Banman, Liam Peet-Pare, Nidhi Hegde 0001, Alona Fyshe, Martha White. [doi]
- Efficient Token Mixing for Transformers via Adaptive Fourier Neural OperatorsJohn Guibas, Morteza Mardani, Zongyi Li, Andrew Tao, Anima Anandkumar, Bryan Catanzaro. [doi]
- Training Structured Neural Networks Through Manifold Identification and Variance ReductionZih-Syuan Huang, Ching-Pei Lee. [doi]
- Demystifying Batch Normalization in ReLU Networks: Equivalent Convex Optimization Models and Implicit RegularizationTolga Ergen, Arda Sahiner, Batu Ozturkler, John M. Pauly, Morteza Mardani, Mert Pilanci. [doi]
- Bayesian Framework for Gradient LeakageMislav Balunovic, Dimitar Iliev Dimitrov, Robin Staab, Martin T. Vechev. [doi]
- Sequential Reptile: Inter-Task Gradient Alignment for Multilingual LearningSeanie Lee, Haebeom Lee, Juho Lee 0001, Sung Ju Hwang. [doi]
- Neural Stochastic Dual Dynamic ProgrammingHanjun Dai, Yuan Xue, Zia Syed, Dale Schuurmans, Bo Dai. [doi]
- $\beta$-Intact-VAE: Identifying and Estimating Causal Effects under Limited OverlapPengzhou Abel Wu, Kenji Fukumizu. [doi]
- Self-Supervised Graph Neural Networks for Improved Electroencephalographic Seizure AnalysisSiyi Tang, Jared Dunnmon, Khaled Kamal Saab, Xuan Zhang, Qianying Huang, Florian Dubost, Daniel Rubin, Christopher Lee-Messer. [doi]
- Evaluating Disentanglement of Structured RepresentationsRaphaël Dang-Nhu. [doi]
- Discrepancy-Based Active Learning for Domain AdaptationAntoine de Mathelin, François Deheeger, Mathilde Mougeot, Nicolas Vayatis. [doi]
- Continuous-Time Meta-Learning with Forward Mode DifferentiationTristan Deleu, David Kanaa, Leo Feng, Giancarlo Kerg, Yoshua Bengio, Guillaume Lajoie, Pierre-Luc Bacon. [doi]
- Vision-Based Manipulators Need to Also See from Their HandsKyle Hsu, Moo Jin Kim, Rafael Rafailov, Jiajun Wu 0001, Chelsea Finn. [doi]
- A Class of Short-term Recurrence Anderson Mixing Methods and Their ApplicationsFuchao Wei, Chenglong Bao, Yang Liu 0005. [doi]
- A fast and accurate splitting method for optimal transport: analysis and implementationVien V. Mai, Jacob Lindbäck, Mikael Johansson 0001. [doi]
- Frame Averaging for Invariant and Equivariant Network DesignOmri Puny, Matan Atzmon, Edward J. Smith, Ishan Misra, Aditya Grover, Heli Ben Hamu, Yaron Lipman. [doi]
- Likelihood Training of Schrödinger Bridge using Forward-Backward SDEs TheoryTianrong Chen, Guan-Horng Liu, Evangelos A. Theodorou. [doi]
- PF-GNN: Differentiable particle filtering based approximation of universal graph representationsMohammed Haroon Dupty, Yanfei Dong, Wee Sun Lee. [doi]
- Pessimistic Model-based Offline Reinforcement Learning under Partial CoverageMasatoshi Uehara, Wen Sun 0002. [doi]
- Learning Object-Oriented Dynamics for Planning from TextGuiliang Liu, Ashutosh Adhikari, Amir Massoud Farahmand, Pascal Poupart. [doi]
- Revisiting Over-smoothing in BERT from the Perspective of GraphHan Shi, Jiahui Gao, Hang Xu, Xiaodan Liang, Zhenguo Li, Lingpeng Kong, Stephen M. S. Lee, James T. Kwok. [doi]
- RISP: Rendering-Invariant State Predictor with Differentiable Simulation and Rendering for Cross-Domain Parameter EstimationPingchuan Ma 0002, Tao Du 0001, Joshua B. Tenenbaum, Wojciech Matusik, Chuang Gan. [doi]
- Group-based Interleaved Pipeline Parallelism for Large-scale DNN TrainingPengcheng Yang, Xiaoming Zhang, Wenpeng Zhang, Ming Yang, Hong Wei. [doi]
- Few-shot Learning via Dirichlet Tessellation EnsembleChunwei Ma, Ziyun Huang, Mingchen Gao, Jinhui Xu 0001. [doi]
- Explaining Point Processes by Learning Interpretable Temporal Logic RulesShuang Li, Mingquan Feng, Lu Wang, Abdelmajid Essofi, Yufeng Cao, Junchi Yan, Le Song. [doi]
- Doubly Adaptive Scaled Algorithm for Machine Learning Using Second-Order InformationMajid Jahani, Sergey Rusakov 0001, Zheng Shi, Peter Richtárik, Michael W. Mahoney, Martin Takác. [doi]
- Equivariant Transformers for Neural Network based Molecular PotentialsPhilipp Thölke, Gianni De Fabritiis. [doi]
- A Relational Intervention Approach for Unsupervised Dynamics Generalization in Model-Based Reinforcement LearningJiaxian Guo, Mingming Gong, Dacheng Tao. [doi]
- Towards General Function Approximation in Zero-Sum Markov GamesBaihe Huang, Jason D. Lee, Zhaoran Wang, Zhuoran Yang. [doi]
- Generalized Demographic Parity for Group FairnessZhimeng Jiang, Xiaotian Han, Chao Fan, Fan Yang, Ali Mostafavi, Xia Hu. [doi]
- Spike-inspired rank coding for fast and accurate recurrent neural networksAlan Jeffares, Qinghai Guo, Pontus Stenetorp, Timoleon Moraitis. [doi]
- W-CTC: a Connectionist Temporal Classification Loss with Wild CardsXingyu Cai, Jiahong Yuan, Yuchen Bian, Guangxu Xun, Jiaji Huang, Kenneth Church 0001. [doi]
- Improving Non-Autoregressive Translation Models Without DistillationXiao Shi Huang, Felipe Pérez, Maksims Volkovs. [doi]
- Distribution Compression in Near-Linear TimeAbhishek Shetty, Raaz Dwivedi, Lester Mackey. [doi]
- Attention-based Interpretability with Concept TransformersMattia Rigotti, Christoph Miksovic, Ioana Giurgiu, Thomas Gschwind, Paolo Scotton. [doi]
- Normalization of Language Embeddings for Cross-Lingual AlignmentPrince Osei Aboagye, Yan Zheng, Chin-Chia Michael Yeh, JunPeng Wang, Wei Zhang, Liang Wang, Hao Yang, Jeff M. Phillips. [doi]
- Huber Additive Models for Non-stationary Time Series AnalysisYingjie Wang, Xianrui Zhong, Fengxiang He, Hong Chen, Dacheng Tao. [doi]
- Unified Visual Transformer CompressionShixing Yu, Tianlong Chen, Jiayi Shen, Huan Yuan, Jianchao Tan, Sen Yang, Ji Liu 0002, Zhangyang Wang. [doi]
- A New Perspective on "How Graph Neural Networks Go Beyond Weisfeiler-Lehman?"Asiri Wijesinghe, Qing Wang 0002. [doi]
- Diverse Client Selection for Federated Learning via Submodular MaximizationRavikumar Balakrishnan, Tian Li 0005, Tianyi Zhou, Nageen Himayat, Virginia Smith, Jeff A. Bilmes. [doi]
- Provably Filtering Exogenous Distractors using Multistep Inverse DynamicsYonathan Efroni, Dipendra Misra, Akshay Krishnamurthy, Alekh Agarwal, John Langford 0001. [doi]
- Rethinking Network Design and Local Geometry in Point Cloud: A Simple Residual MLP FrameworkXu Ma, Can Qin, Haoxuan You, Haoxi Ran, Yun Fu 0001. [doi]
- A generalization of the randomized singular value decompositionNicolas Boullé, Alex Townsend. [doi]
- Mastering Visual Continuous Control: Improved Data-Augmented Reinforcement LearningDenis Yarats, Rob Fergus, Alessandro Lazaric, Lerrel Pinto. [doi]
- Representation Learning for Online and Offline RL in Low-rank MDPsMasatoshi Uehara, Xuezhou Zhang, Wen Sun. [doi]
- FILIP: Fine-grained Interactive Language-Image Pre-TrainingLewei Yao, Runhui Huang, Lu Hou, Guansong Lu, Minzhe Niu, Hang Xu, Xiaodan Liang, Zhenguo Li, Xin Jiang, Chunjing Xu. [doi]
- Learning with Noisy Labels Revisited: A Study Using Real-World Human AnnotationsJiaheng Wei, Zhaowei Zhu, Hao Cheng 0014, Tongliang Liu, Gang Niu 0001, Yang Liu 0018. [doi]
- Embedded-model flows: Combining the inductive biases of model-free deep learning and explicit probabilistic modelingGianluigi Silvestri, Emily Fertig, Dave Moore, Luca Ambrogioni. [doi]
- Enhancing Cross-lingual Transfer by Manifold MixupHuiyun Yang, Huadong Chen, Hao Zhou 0012, Lei Li 0005. [doi]
- Long Expressive Memory for Sequence ModelingT. Konstantin Rusch, Siddhartha Mishra, N. Benjamin Erichson, Michael W. Mahoney. [doi]
- Fast Generic Interaction Detection for Model Interpretability and CompressionTianjian Zhang, Feng Yin, Zhi-Quan Luo. [doi]
- Symbolic Learning to Optimize: Towards Interpretability and ScalabilityWenqing Zheng, Tianlong Chen, Ting-Kuei Hu, Zhangyang Wang. [doi]
- Skill-based Meta-Reinforcement LearningTaewook Nam, Shao-Hua Sun, Karl Pertsch, Sung Ju Hwang, Joseph J. Lim. [doi]
- The Three Stages of Learning Dynamics in High-dimensional Kernel MethodsNikhil Ghosh, Song Mei, Bin Yu. [doi]
- Energy-Inspired Molecular Conformation OptimizationJiaqi Guan, Wesley Wei Qian, Qiang Liu 0001, Wei-Ying Ma, Jianzhu Ma, Jian Peng 0001. [doi]
- Back2Future: Leveraging Backfill Dynamics for Improving Real-time Predictions in FutureHarshavardhan Kamarthi, Alexander Rodríguez, B. Aditya Prakash. [doi]
- FedBABU: Toward Enhanced Representation for Federated Image ClassificationJaehoon Oh, Sangmook Kim, Se-Young Yun. [doi]
- Evolutionary Diversity Optimization with Clustering-based Selection for Reinforcement LearningYutong Wang, Ke Xue, Chao Qian 0001. [doi]
- Adaptive Wavelet Transformer Network for 3D Shape Representation LearningHao Huang, Yi Fang 0006. [doi]
- Does your graph need a confidence boost? Convergent boosted smoothing on graphs with tabular node featuresJiuhai Chen, Jonas Mueller, Vassilis N. Ioannidis, Soji Adeshina, Yangkun Wang, Tom Goldstein, David Wipf. [doi]
- PAC-Bayes Information BottleneckZifeng Wang 0008, Shao-Lun Huang, Ercan Engin Kuruoglu, Jimeng Sun, Xi Chen, Yefeng Zheng 0001. [doi]
- On Improving Adversarial Transferability of Vision TransformersMuzammal Naseer, Kanchana Ranasinghe, Salman Khan 0001, Fahad Shahbaz Khan, Fatih Porikli. [doi]
- Equivariant Subgraph Aggregation NetworksBeatrice Bevilacqua, Fabrizio Frasca, Derek Lim, Balasubramaniam Srinivasan, Chen Cai, Gopinath Balamurugan, Michael M. Bronstein, Haggai Maron. [doi]
- FP-DETR: Detection Transformer Advanced by Fully Pre-trainingWen Wang, Yang Cao, Jing Zhang, Dacheng Tao. [doi]
- Handling Distribution Shifts on Graphs: An Invariance PerspectiveQitian Wu, Hengrui Zhang, Junchi Yan, David Wipf. [doi]
- Taming Sparsely Activated Transformer with Stochastic ExpertsSimiao Zuo, Xiaodong Liu 0003, Jian Jiao 0007, Young-Jin Kim, Hany Hassan, Ruofei Zhang, Jianfeng Gao, Tuo Zhao. [doi]
- Generalization Through the Lens of Leave-One-Out ErrorGregor Bachmann, Thomas Hofmann, Aurélien Lucchi. [doi]
- StyleNeRF: A Style-based 3D Aware Generator for High-resolution Image SynthesisJiatao Gu, Lingjie Liu, Peng Wang 0099, Christian Theobalt. [doi]
- Illiterate DALL-E Learns to ComposeGautam Singh, Fei Deng, Sungjin Ahn. [doi]
- NodePiece: Compositional and Parameter-Efficient Representations of Large Knowledge GraphsMikhail Galkin 0001, Etienne G. Denis, Jiapeng Wu, William L. Hamilton. [doi]
- Energy-Based Learning for Cooperative Games, with Applications to Valuation Problems in Machine LearningYatao Bian, Yu Rong, Tingyang Xu, Jiaxiang Wu, Andreas Krause 0001, JunZhou Huang. [doi]
- SHINE: SHaring the INverse Estimate from the forward pass for bi-level optimization and implicit modelsZaccharie Ramzi, Florian Mannel, Shaojie Bai, Jean-Luc Starck, Philippe Ciuciu, Thomas Moreau. [doi]
- FILM: Following Instructions in Language with Modular MethodsSo Yeon Min, Devendra Singh Chaplot, Pradeep Kumar Ravikumar, Yonatan Bisk, Ruslan Salakhutdinov. [doi]
- A Reduction-Based Framework for Conservative Bandits and Reinforcement LearningYunchang Yang, Tianhao Wu, Han Zhong, Evrard Garcelon, Matteo Pirotta, Alessandro Lazaric, Liwei Wang, Simon Shaolei Du. [doi]
- Environment Predictive Coding for Visual NavigationSanthosh Kumar Ramakrishnan, Tushar Nagarajan, Ziad Al-Halah, Kristen Grauman. [doi]
- Using Graph Representation Learning with Schema Encoders to Measure the Severity of Depressive SymptomsSimin Hong, Anthony G. Cohn, David Crossland Hogg. [doi]
- Differentiable Prompt Makes Pre-trained Language Models Better Few-shot LearnersNingyu Zhang, Luoqiu Li, Xiang Chen, Shumin Deng, Zhen Bi, Chuanqi Tan, Fei Huang, Huajun Chen. [doi]
- Controlling Directions Orthogonal to a ClassifierYilun Xu, Hao He, Tianxiao Shen, Tommi S. Jaakkola. [doi]
- ViDT: An Efficient and Effective Fully Transformer-based Object DetectorHwanjun Song, Deqing Sun, Sanghyuk Chun, Varun Jampani, Dongyoon Han, Byeongho Heo, Wonjae Kim, Ming-Hsuan Yang 0001. [doi]
- Online Ad Hoc Teamwork under Partial ObservabilityPengjie Gu, Mengchen Zhao, Jianye Hao, Bo An 0001. [doi]
- Reinforcement Learning in Presence of Discrete Markovian Context EvolutionHang Ren, Aivar Sootla, Taher Jafferjee, Junxiao Shen, Jun Wang, Haitham Bou-Ammar. [doi]
- FairCal: Fairness Calibration for Face VerificationTiago Salvador, Stephanie Cairns, Vikram Voleti, Noah Marshall, Adam M. Oberman. [doi]
- Equivariant Self-Supervised Learning: Encouraging Equivariance in RepresentationsRumen Dangovski, Li Jing, Charlotte Loh, Seungwook Han, Akash Srivastava, Brian Cheung, Pulkit Agrawal, Marin Soljacic. [doi]
- Training invariances and the low-rank phenomenon: beyond linear networksThien Le, Stefanie Jegelka. [doi]
- Multi-Task ProcessesDonggyun Kim, Seongwoong Cho, Wonkwang Lee, Seunghoon Hong. [doi]
- Contact Points Discovery for Soft-Body Manipulations with Differentiable PhysicsSizhe Li, Zhiao Huang, Tao Du 0001, Hao Su 0001, Joshua B. Tenenbaum, Chuang Gan. [doi]
- Differentiable Gradient Sampling for Learning Implicit 3D Scene Reconstructions from a Single ImageShizhan Zhu, Sayna Ebrahimi, Angjoo Kanazawa, Trevor Darrell. [doi]
- Score-Based Generative Modeling with Critically-Damped Langevin DiffusionTim Dockhorn, Arash Vahdat, Karsten Kreis. [doi]
- Unrolling PALM for Sparse Semi-Blind Source SeparationMohammad Fahes, Christophe Kervazo, Jérôme Bobin, Florence Tupin. [doi]
- Mapping conditional distributions for domain adaptation under generalized target shiftMatthieu Kirchmeyer, Alain Rakotomamonjy, Emmanuel de Bézenac, Patrick Gallinari. [doi]
- DiffSkill: Skill Abstraction from Differentiable Physics for Deformable Object Manipulations with ToolsXingyu Lin, Zhiao Huang, Yunzhu Li, Joshua B. Tenenbaum, David Held, Chuang Gan. [doi]
- Cross-Lingual Transfer with Class-Weighted Language-Invariant RepresentationsRuicheng Xian, Heng Ji, Han Zhao 0002. [doi]
- LoRA: Low-Rank Adaptation of Large Language ModelsEdward J. Hu, Yelong Shen, Phillip Wallis, Zeyuan Allen Zhu, Yuanzhi Li, Shean Wang, Lu Wang, Weizhu Chen. [doi]
- Fair Normalizing FlowsMislav Balunovic, Anian Ruoss, Martin T. Vechev. [doi]
- The Rich Get Richer: Disparate Impact of Semi-Supervised LearningZhaowei Zhu, Tianyi Luo, Yang Liu. [doi]
- Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and TransferableShaojin Ding, Tianlong Chen, Zhangyang Wang. [doi]
- Map Induction: Compositional spatial submap learning for efficient exploration in novel environmentsSugandha Sharma, Aidan Curtis, Marta Kryven, Joshua B. Tenenbaum, Ila R. Fiete. [doi]
- D-CODE: Discovering Closed-form ODEs from Observed TrajectoriesZhaozhi Qian, Krzysztof Kacprzyk, Mihaela van der Schaar. [doi]
- What Happens after SGD Reaches Zero Loss? --A Mathematical FrameworkZhiyuan Li 0005, Tianhao Wang 0017, Sanjeev Arora. [doi]
- Hierarchical Few-Shot Imitation with Skill Transition ModelsKourosh Hakhamaneshi, Ruihan Zhao 0001, Albert Zhan, Pieter Abbeel, Michael Laskin. [doi]
- Anisotropic Random Feature Regression in High DimensionsGabriel Mel, Jeffrey Pennington. [doi]
- Optimal Representations for Covariate ShiftYangjun Ruan, Yann Dubois, Chris J. Maddison. [doi]
- Discovering Nonlinear PDEs from Scarce Data with Physics-encoded LearningChengping Rao, Pu Ren, Yang Liu, Hao Sun. [doi]
- VC dimension of partially quantized neural networks in the overparametrized regimeYutong Wang, Clayton Scott. [doi]
- The Spectral Bias of Polynomial Neural NetworksMoulik Choraria, Leello Tadesse Dadi, Grigorios Chrysos 0002, Julien Mairal, Volkan Cevher. [doi]
- Looking Back on Learned Experiences For Class/task Incremental LearningMozhgan PourKeshavarz, Guoying Zhao, Mohammad Sabokrou. [doi]
- Multi-objective Optimization by Learning Space PartitionYiyang Zhao, Linnan Wang, Kevin Yang, Tianjun Zhang, Tian Guo 0001, Yuandong Tian. [doi]
- Transfer RL across Observation Feature Spaces via Model-Based RegularizationYanchao Sun, Ruijie Zheng, Xiyao Wang, Andrew E. Cohen, Furong Huang. [doi]
- Phenomenology of Double Descent in Finite-Width Neural NetworksSidak Pal Singh, Aurélien Lucchi, Thomas Hofmann, Bernhard Schölkopf. [doi]
- Superclass-Conditional Gaussian Mixture Model For Learning Fine-Grained EmbeddingsJingchao Ni, Wei Cheng 0002, Zhengzhang Chen, Takayoshi Asakura, Tomoya Soma, Sho Kato, Haifeng Chen. [doi]
- Proof Artifact Co-Training for Theorem Proving with Language ModelsJesse Michael Han, Jason Rute, Yuhuai Wu, Edward W. Ayers, Stanislas Polu. [doi]
- Zero Pixel Directional Boundary by Vector TransformEdoardo Mello Rella, Ajad Chhatkuli, Yun Liu, Ender Konukoglu, Luc Van Gool. [doi]
- MonoDistill: Learning Spatial Features for Monocular 3D Object DetectionZhiyu Chong, Xinzhu Ma, Hong Zhang, Yuxin Yue, Haojie Li, Zhihui Wang, Wanli Ouyang. [doi]
- Procedural generalization by planning with self-supervised world modelsAnkesh Anand, Jacob C. Walker, Yazhe Li, Eszter Vértes, Julian Schrittwieser, Sherjil Ozair, Theophane Weber, Jessica B. Hamrick. [doi]
- The Role of Pretrained Representations for the OOD Generalization of RL AgentsFrederik Träuble, Andrea Dittadi, Manuel Wuthrich, Felix Widmaier, Peter Vincent Gehler, Ole Winther, Francesco Locatello, Olivier Bachem, Bernhard Schölkopf, Stefan Bauer. [doi]
- Conditional Contrastive Learning with KernelYao-Hung Hubert Tsai, Tianqin Li, Martin Q. Ma, Han Zhao 0002, Kun Zhang 0001, Louis-Philippe Morency, Ruslan Salakhutdinov. [doi]
- ClimateGAN: Raising Climate Change Awareness by Generating Images of FloodsVictor Schmidt, Alexandra Luccioni, Mélisande Teng, Tianyu Zhang, Alexia Reynaud, Sunand Raghupathi, Gautier Cosne, Adrien Juraver, Vahe Vardanyan, Alex Hernández-García, Yoshua Bengio. [doi]
- Pretraining Text Encoders with Adversarial Mixture of Training Signal GeneratorsYu Meng 0001, Chenyan Xiong, Payal Bajaj, Saurabh Tiwary, Paul N. Bennett, Jiawei Han 0001, Xia Song. [doi]
- Sample Efficient Stochastic Policy Extragradient Algorithm for Zero-Sum Markov GameZiyi Chen 0002, Shaocong Ma, Yi Zhou 0017. [doi]
- Generalizing Few-Shot NAS with Gradient MatchingShoukang Hu, Ruochen Wang, Lanqing Hong, Zhenguo Li, Cho-Jui Hsieh, Jiashi Feng. [doi]
- Proving the Lottery Ticket Hypothesis for Convolutional Neural NetworksArthur da Cunha, Emanuele Natale, Laurent Viennot. [doi]
- AS-MLP: An Axial Shifted MLP Architecture for VisionDongze Lian, Zehao Yu, Xing Sun, Shenghua Gao. [doi]
- Geometry-Consistent Neural Shape Representation with Implicit Displacement FieldsYifan Wang, Lukas Rahmann, Olga Sorkine-Hornung. [doi]
- Understanding Dimensional Collapse in Contrastive Self-supervised LearningLi Jing, Pascal Vincent, Yann LeCun, Yuandong Tian. [doi]
- Shallow and Deep Networks are Near-Optimal Approximators of Korobov FunctionsMoïse Blanchard, Mohammed Amine Bennouna. [doi]
- Explanations of Black-Box Models based on Directional Feature InteractionsAria Masoomi, Davin Hill, Zhonghui Xu, Craig P. Hersh, Edwin K. Silverman, Peter J. Castaldi, Stratis Ioannidis, Jennifer G. Dy. [doi]
- Associated Learning: an Alternative to End-to-End Backpropagation that Works on CNN, RNN, and TransformerDennis Y. H. Wu, Dinan Lin, Vincent Chen, Hung-Hsuan Chen. [doi]
- Bandit Learning with Joint Effect of Incentivized Sampling, Delayed Sampling Feedback, and Self-Reinforcing User PreferencesTianchen Zhou, Jia Liu, Chaosheng Dong, Yi Sun. [doi]
- EViT: Expediting Vision Transformers via Token ReorganizationsYouwei Liang, Chongjian Ge, Zhan Tong, Yibing Song, Jue Wang 0001, Pengtao Xie. [doi]
- BAM: Bayes with Adaptive MemoryJosue Nassar, Jennifer Rogers Brennan, Ben Evans, Kendall Lowrey. [doi]
- GradSign: Model Performance Inference with Theoretical InsightsZhihao Zhang, Zhihao Jia. [doi]
- A Fine-Grained Analysis on Distribution ShiftOlivia Wiles, Sven Gowal, Florian Stimberg, Sylvestre-Alvise Rebuffi, Ira Ktena, Krishnamurthy Dvijotham, Ali Taylan Cemgil. [doi]
- POETREE: Interpretable Policy Learning with Adaptive Decision TreesAlizée Pace, Alex Chan, Mihaela van der Schaar. [doi]
- How Attentive are Graph Attention Networks?Shaked Brody, Uri Alon 0002, Eran Yahav. [doi]
- Is Importance Weighting Incompatible with Interpolating Classifiers?Ke Alexander Wang, Niladri Shekhar Chatterji, Saminul Haque, Tatsunori Hashimoto. [doi]
- A Conditional Point Diffusion-Refinement Paradigm for 3D Point Cloud CompletionZhaoyang Lyu, Zhifeng Kong, Xudong Xu, Liang Pan, Dahua Lin. [doi]
- Convergent and Efficient Deep Q Learning AlgorithmZhikang T. Wang, Masahito Ueda. [doi]
- Constrained Physical-Statistics Models for Dynamical System Identification and PredictionJérémie Donà, Marie Déchelle, Patrick Gallinari, Marina Levy. [doi]
- GeoDiff: A Geometric Diffusion Model for Molecular Conformation GenerationMinkai Xu, Lantao Yu, Yang Song 0011, Chence Shi, Stefano Ermon, Jian Tang 0005. [doi]
- Learning Audio-Visual Speech Representation by Masked Multimodal Cluster PredictionBowen Shi, Wei-Ning Hsu, Kushal Lakhotia, Abdelrahman Mohamed. [doi]
- Clean Images are Hard to Reblur: Exploiting the Ill-Posed Inverse Task for Dynamic Scene DeblurringSeungjun Nah, Sanghyun Son, Jaerin Lee, Kyoung Mu Lee. [doi]
- Distributionally Robust Models with Parametric Likelihood RatiosPaul Michel, Tatsunori Hashimoto, Graham Neubig. [doi]
- Differentially Private Fractional Frequency Moments Estimation with Polylogarithmic SpaceLun Wang, Iosif Pinelis, Dawn Song. [doi]
- Learning Guarantees for Graph Convolutional Networks on the Stochastic Block ModelWei Lu. [doi]
- Reducing Excessive Margin to Achieve a Better Accuracy vs. Robustness Trade-offRahul Rade, Seyed-Mohsen Moosavi-Dezfooli. [doi]
- Post-Training Detection of Backdoor Attacks for Two-Class and Multi-Attack ScenariosZhen Xiang, David J. Miller 0001, George Kesidis. [doi]
- GRAND++: Graph Neural Diffusion with A Source TermMatthew Thorpe, Tan Minh Nguyen, Hedi Xia, Thomas Strohmer, Andrea L. Bertozzi, Stanley J. Osher, Bao Wang. [doi]
- Hidden Parameter Recurrent State Space Models For Changing Dynamics ScenariosVaisakh Shaj, Dieter Büchler, Rohit Sonker, Philipp Becker, Gerhard Neumann. [doi]
- Non-Linear Operator Approximations for Initial Value ProblemsGaurav Gupta, Xiongye Xiao, Radu Balan, Paul Bogdan. [doi]
- Learning Temporally Causal Latent Processes from General Temporal DataWeiran Yao, Yuewen Sun, Alex Ho, Changyin Sun, Kun Zhang 0001. [doi]
- Language modeling via stochastic processesRose E. Wang, Esin Durmus, Noah D. Goodman, Tatsunori Hashimoto. [doi]
- Salient ImageNet: How to discover spurious features in Deep Learning?Sahil Singla 0002, Soheil Feizi. [doi]
- Responsible Disclosure of Generative Models Using Scalable FingerprintingNing Yu, Vladislav Skripniuk, Dingfan Chen, Larry S. Davis, Mario Fritz. [doi]
- A First-Occupancy Representation for Reinforcement LearningTed Moskovitz, Spencer R. Wilson, Maneesh Sahani. [doi]
- FlexConv: Continuous Kernel Convolutions With Differentiable Kernel SizesDavid W. Romero, Robert-Jan Bruintjes, Jakub Mikolaj Tomczak, Erik J. Bekkers, Mark Hoogendoorn, Jan van Gemert. [doi]
- Expressiveness and Approximation Properties of Graph Neural NetworksFloris Geerts, Juan L. Reutter. [doi]
- Finding Biological Plausibility for Adversarially Robust Features via Metameric TasksAnne Harrington, Arturo Deza. [doi]
- Deep Learning without Shortcuts: Shaping the Kernel with Tailored RectifiersGuodong Zhang, Aleksandar Botev, James Martens. [doi]
- Neural Processes with Stochastic Attention: Paying more attention to the context datasetMingyu Kim, Kyeongryeol Go, Se-Young Yun. [doi]
- FastSHAP: Real-Time Shapley Value EstimationNeil Jethani, Mukund Sudarshan, Ian Connick Covert, Su-In Lee, Rajesh Ranganath. [doi]
- Evidential Turing ProcessesMelih Kandemir, Abdullah Akgül, Manuel Haußmann, Gozde Unal. [doi]
- GDA-AM: On the Effectiveness of Solving Min-Imax Optimization via Anderson MixingHuan He, Shifan Zhao, Yuanzhe Xi, Joyce C. Ho, Yousef Saad. [doi]
- Generative Planning for Temporally Coordinated Exploration in Reinforcement LearningHaichao Zhang, Wei Xu, Haonan Yu. [doi]
- On the Existence of Universal Lottery TicketsRebekka Burkholz, Nilanjana Laha, Rajarshi Mukherjee, Alkis Gotovos. [doi]
- An Explanation of In-context Learning as Implicit Bayesian InferenceSang Michael Xie, Aditi Raghunathan, Percy Liang, Tengyu Ma 0001. [doi]
- Topological Graph Neural NetworksMax Horn, Edward De Brouwer, Michael Moor, Yves Moreau, Bastian Rieck, Karsten M. Borgwardt. [doi]
- $\mathrm{SO}(2)$-Equivariant Reinforcement LearningDian Wang, Robin Walters, Robert Platt. [doi]
- Bootstrapped Meta-LearningSebastian Flennerhag, Yannick Schroecker, Tom Zahavy, Hado van Hasselt, David Silver, Satinder Singh 0001. [doi]
- Anomaly Transformer: Time Series Anomaly Detection with Association DiscrepancyJiehui Xu, Haixu Wu, Jianmin Wang, Mingsheng Long. [doi]
- Implicit Bias of Adversarial Training for Deep Neural NetworksBochen Lv, Zhanxing Zhu. [doi]
- PipeGCN: Efficient Full-Graph Training of Graph Convolutional Networks with Pipelined Feature CommunicationCheng Wan, Youjie Li, Cameron R. Wolfe, Anastasios Kyrillidis, Nam Sung Kim, Yingyan Lin. [doi]
- Hybrid Local SGD for Federated Learning with Heterogeneous CommunicationsYuanxiong Guo, Ying Sun, Rui Hu 0005, Yanmin Gong 0001. [doi]
- Fast Differentiable Matrix Square RootYue Song, Nicu Sebe, Wei Wang 0108. [doi]
- Anti-Concentrated Confidence Bonuses For Scalable ExplorationJordan T. Ash, Cyril Zhang, Surbhi Goel, Akshay Krishnamurthy, Sham M. Kakade. [doi]
- Asymmetry Learning for Counterfactually-invariant Classification in OOD TasksS Chandra Mouli, Bruno Ribeiro. [doi]
- Hindsight Foresight Relabeling for Meta-Reinforcement LearningMichael Wan, Jian Peng 0001, Tanmay Gangwani. [doi]
- Top-N: Equivariant Set and Graph Generation without ExchangeabilityClément Vignac, Pascal Frossard. [doi]
- Learning to Annotate Part Segmentation with Gradient MatchingYu Yang, Xiaotian Cheng, Hakan Bilen, Xiangyang Ji. [doi]
- Learning Curves for SGD on Structured FeaturesBlake Bordelon, Cengiz Pehlevan. [doi]
- FedPara: Low-rank Hadamard Product for Communication-Efficient Federated LearningNam Hyeon-Woo, Moon Ye-Bin, Tae Hyun Oh. [doi]
- It Takes Two to Tango: Mixup for Deep Metric LearningShashanka Venkataramanan, Bill Psomas, Ewa Kijak, Laurent Amsaleg, Konstantinos Karantzalos, Yannis Avrithis. [doi]
- Sparse Communication via Mixed DistributionsAntónio Farinhas, Wilker Aziz, Vlad Niculae, André F. T. Martins. [doi]
- DISSECT: Disentangled Simultaneous Explanations via Concept TraversalsAsma Ghandeharioun, Been Kim, Chun-Liang Li, Brendan Jou, Brian Eoff, Rosalind W. Picard. [doi]
- Practical Integration via Separable Bijective NetworksChristopher M. Bender, Patrick Emmanuel, Michael K. Reiter, Junier Oliva. [doi]
- Robust Unlearnable Examples: Protecting Data Privacy Against Adversarial LearningShaopeng Fu, Fengxiang He, Yang Liu, Li Shen, Dacheng Tao. [doi]
- FedChain: Chained Algorithms for Near-optimal Communication Cost in Federated LearningCharlie Hou, Kiran Koshy Thekumparampil, Giulia Fanti, Sewoong Oh. [doi]
- When should agents explore?Miruna Pislar, David Szepesvari, Georg Ostrovski, Diana L. Borsa, Tom Schaul. [doi]
- Differentiable Scaffolding Tree for Molecule OptimizationTianfan Fu, Wenhao Gao, Cao Xiao, Jacob Yasonik, Connor W. Coley, Jimeng Sun. [doi]
- Reverse Engineering of Imperceptible Adversarial Image PerturbationsYifan Gong 0004, Yuguang Yao, Yize Li, Yimeng Zhang, Xiaoming Liu, Xue Lin, Sijia Liu 0001. [doi]
- Image BERT Pre-training with Online TokenizerJinghao Zhou, Chen Wei 0005, Huiyu Wang, Wei Shen 0002, Cihang Xie, Alan L. Yuille, Tao Kong. [doi]
- PSA-GAN: Progressive Self Attention GANs for Synthetic Time SeriesPaul Jeha, Michael Bohlke-Schneider, Pedro Mercado, Shubham Kapoor, Rajbir-Singh Nirwan, Valentin Flunkert, Jan Gasthaus, Tim Januschowski. [doi]
- Filling the G_ap_s: Multivariate Time Series Imputation by Graph Neural NetworksAndrea Cini, Ivan Marisca, Cesare Alippi. [doi]
- EigenGame Unloaded: When playing games is better than optimizingIan M. Gemp, Brian McWilliams, Claire Vernade, Thore Graepel. [doi]
- Fast Model Editing at ScaleEric Mitchell, Charles Lin, Antoine Bosselut, Chelsea Finn, Christopher D. Manning. [doi]
- Meta-Learning with Fewer Tasks through Task InterpolationHuaxiu Yao, Linjun Zhang, Chelsea Finn. [doi]
- CDTrans: Cross-domain Transformer for Unsupervised Domain AdaptationTongkun Xu, Weihua Chen, Pichao Wang, Fan Wang, Hao Li, Rong Jin 0001. [doi]
- Efficient Learning of Safe Driving Policy via Human-AI Copilot OptimizationQuanyi Li, Zhenghao Peng, Bolei Zhou. [doi]
- Hindsight is 20/20: Leveraging Past Traversals to Aid 3D PerceptionYurong You, Katie Z. Luo, Xiangyu Chen, Junan Chen, Wei-Lun Chao, Wen Sun, Bharath Hariharan, Mark E. Campbell, Kilian Q. Weinberger. [doi]
- Phase Collapse in Neural NetworksFlorentin Guth, John Zarka, Stéphane Mallat. [doi]
- Gradient Information Matters in Policy Optimization by Back-propagating through ModelChongchong Li, Yue Wang 0017, Wei Chen, Yuting Liu, Zhi-Ming Ma, Tie-Yan Liu. [doi]
- Connectome-constrained Latent Variable Model of Whole-Brain Neural ActivityLu Mi, Richard Xu, Sridhama Prakhya, Albert Lin, Nir Shavit, Aravinthan D. T. Samuel, Srinivas C. Turaga. [doi]
- GNN is a Counter? Revisiting GNN for Question AnsweringKuan Wang, Yuyu Zhang, Diyi Yang, Le Song, Tao Qin. [doi]
- Understanding Latent Correlation-Based Multiview Learning and Self-Supervision: An Identifiability PerspectiveQi Lyu, Xiao Fu 0001, Weiran Wang, Songtao Lu. [doi]
- Eliminating Sharp Minima from SGD with Truncated Heavy-tailed NoiseXingyu Wang, Sewoong Oh, Chang-han Rhee. [doi]
- No One Representation to Rule Them All: Overlapping Features of Training MethodsRaphael Gontijo Lopes, Yann Dauphin, Ekin Dogus Cubuk. [doi]
- Rethinking Class-Prior Estimation for Positive-Unlabeled LearningYu Yao, Tongliang Liu, Bo Han 0003, Mingming Gong, Gang Niu 0001, Masashi Sugiyama, Dacheng Tao. [doi]
- Crystal Diffusion Variational Autoencoder for Periodic Material GenerationTian Xie, Xiang Fu, Octavian-Eugen Ganea, Regina Barzilay, Tommi S. Jaakkola. [doi]
- Spanning Tree-based Graph Generation for MoleculesSungsoo Ahn, Binghong Chen, Tianzhe Wang, Le Song. [doi]
- Deconstructing the Inductive Biases of Hamiltonian Neural NetworksNate Gruver, Marc Anton Finzi, Samuel Don Stanton, Andrew Gordon Wilson. [doi]
- Memorizing TransformersYuhuai Wu, Markus Norman Rabe, DeLesley Hutchins, Christian Szegedy. [doi]
- Task Affinity with Maximum Bipartite Matching in Few-Shot LearningCat Phuoc Le, Juncheng Dong, Mohammadreza Soltani, Vahid Tarokh. [doi]
- CrowdPlay: Crowdsourcing Human Demonstrations for Offline LearningMatthias Gerstgrasser, Rakshit Trivedi, David C. Parkes. [doi]
- Language-driven Semantic SegmentationBoyi Li, Kilian Q. Weinberger, Serge J. Belongie, Vladlen Koltun, René Ranftl. [doi]
- A Non-Parametric Regression Viewpoint : Generalization of Overparametrized Deep RELU Network Under Noisy ObservationsNamjoon Suh, Hyunouk Ko, Xiaoming Huo. [doi]
- Neural Structured Prediction for Inductive Node ClassificationMeng Qu, Huiyu Cai, Jian Tang 0005. [doi]
- Distributional Reinforcement Learning with Monotonic SplinesYudong Luo, Guiliang Liu, Haonan Duan, Oliver Schulte, Pascal Poupart. [doi]
- Online Coreset Selection for Rehearsal-based Continual LearningJaehong Yoon, Divyam Madaan, Eunho Yang, Sung Ju Hwang. [doi]
- Imbedding Deep Neural NetworksAndrew Corbett, Dmitry Kangin. [doi]
- Learning Fast, Learning Slow: A General Continual Learning Method based on Complementary Learning SystemElahe Arani, Fahad Sarfraz, Bahram Zonooz. [doi]
- Sound and Complete Neural Network Repair with Minimality and Locality GuaranteesFeisi Fu, Wenchao Li. [doi]
- Finite-Time Convergence and Sample Complexity of Multi-Agent Actor-Critic Reinforcement Learning with Average RewardHairi, Jia Liu 0002, Songtao Lu. [doi]
- Self-Supervision Enhanced Feature Selection with Correlated GatesChangHee Lee, Fergus Imrie, Mihaela van der Schaar. [doi]
- How Do Vision Transformers Work?Namuk Park, Songkuk Kim. [doi]
- Tackling the Generative Learning Trilemma with Denoising Diffusion GANsZhisheng Xiao, Karsten Kreis, Arash Vahdat. [doi]
- Information Bottleneck: Exact Analysis of (Quantized) Neural NetworksStephan Sloth Lorenzen, Christian Igel, Mads Nielsen. [doi]
- Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon ReasoningDhruv Shah, Peng Xu, Yao Lu, Ted Xiao, Alexander Toshev, Sergey Levine, Brian Ichter. [doi]
- A Theory of Tournament RepresentationsArun Rajkumar, Vishnu Veerathu, Abdul Bakey Mir. [doi]
- Maximizing Ensemble Diversity in Deep Reinforcement LearningHassam Sheikh, Mariano Phielipp, Ladislau Bölöni. [doi]
- Surreal-GAN: Semi-Supervised Representation Learning via GAN for uncovering heterogeneous disease-related imaging patternsZhijian Yang, Junhao Wen, Christos Davatzikos. [doi]
- Maximum Entropy RL (Provably) Solves Some Robust RL ProblemsBenjamin Eysenbach, Sergey Levine. [doi]
- iFlood: A Stable and Effective RegularizerYuexiang Xie, Zhen Wang, Yaliang Li, Ce Zhang, Jingren Zhou, Bolin Ding. [doi]
- RotoGrad: Gradient Homogenization in Multitask LearningAdrián Javaloy, Isabel Valera. [doi]
- Spatial Graph Attention and Curiosity-driven Policy for Antiviral Drug DiscoveryYulun Wu, Nicholas Choma, Andrew Deru Chen, Mikaela Cashman, Érica Teixeira Prates, Verónica G. Melesse Vergara, Manesh Shah, Austin Clyde, Thomas S. Brettin, Wibe Albert de Jong, Neeraj Kumar, Martha S. Head, Rick L. Stevens, Peter Nugent, Daniel A. Jacobson, James B. Brown. [doi]
- Conditioning Sequence-to-sequence Networks with Learned ActivationsAlberto Gil Couto Pimentel Ramos, Abhinav Mehrotra, Nicholas Donald Lane, Sourav Bhattacharya. [doi]
- Inverse Online Learning: Understanding Non-Stationary and Reactionary PoliciesAlex J. Chan, Alicia Curth, Mihaela van der Schaar. [doi]
- Safe Neurosymbolic Learning with Differentiable Symbolic ExecutionChenxi Yang, Swarat Chaudhuri. [doi]
- Learning Value Functions from Undirected State-only ExperienceMatthew Chang, Arjun Gupta, Saurabh Gupta 0001. [doi]
- Fooling Explanations in Text ClassifiersAdam Ivankay, Ivan Girardi, Chiara Marchiori, Pascal Frossard. [doi]
- Who Is Your Right Mixup Partner in Positive and Unlabeled LearningChangchun Li, XiMing Li, Lei Feng, Jihong OuYang. [doi]
- An Autoregressive Flow Model for 3D Molecular Geometry Generation from ScratchYouzhi Luo, Shuiwang Ji. [doi]
- Reward Uncertainty for Exploration in Preference-based Reinforcement LearningXinran Liang, Katherine Shu, Kimin Lee, Pieter Abbeel. [doi]
- Spherical Message Passing for 3D Molecular GraphsYi Liu 0059, Limei Wang, Meng Liu, Yuchao Lin, Xuan Zhang, Bora Oztekin, Shuiwang Ji. [doi]
- CrossFormer: A Versatile Vision Transformer Hinging on Cross-scale AttentionWenxiao Wang 0001, Lu Yao, Long Chen 0016, Binbin Lin, Deng Cai 0001, Xiaofei He 0001, Wei Liu. [doi]
- Unsupervised Disentanglement with Tensor Product Representations on the TorusMichael Rotman, Amit Dekel, Shir Gur, Yaron Oz, Lior Wolf. [doi]
- MT3: Multi-Task Multitrack Music TranscriptionJosh Gardner, Ian Simon, Ethan Manilow, Curtis Hawthorne, Jesse H. Engel. [doi]
- When Can We Learn General-Sum Markov Games with a Large Number of Players Sample-Efficiently?Ziang Song, Song Mei, Yu Bai. [doi]
- A Unified Contrastive Energy-based Model for Understanding the Generative Ability of Adversarial TrainingYifei Wang 0001, Yisen Wang 0001, Jiansheng Yang, Zhouchen Lin. [doi]
- Autonomous Learning of Object-Centric Abstractions for High-Level PlanningSteven James, Benjamin Rosman, George Konidaris 0001. [doi]
- Approximation and Learning with Deep Convolutional Models: a Kernel PerspectiveAlberto Bietti. [doi]
- What Makes Better Augmentation Strategies? Augment Difficult but Not too DifferentJaehyung Kim, Dongyeop Kang, Sungsoo Ahn, Jinwoo Shin. [doi]
- CROP: Certifying Robust Policies for Reinforcement Learning through Functional SmoothingFan Wu, Linyi Li, Zijian Huang, Yevgeniy Vorobeychik, Ding Zhao, Bo Li 0026. [doi]
- A Comparison of Hamming Errors of Representative Variable Selection MethodsTracy Ke, Longlin Wang. [doi]
- GradMax: Growing Neural Networks using Gradient InformationUtku Evci, Bart van Merrienboer, Thomas Unterthiner, Fabian Pedregosa, Max Vladymyrov. [doi]
- Generating Videos with Dynamics-aware Implicit Generative Adversarial NetworksSihyun Yu, Jihoon Tack, Sangwoo Mo, Hyunsu Kim, Junho Kim, Jung-Woo Ha 0001, Jinwoo Shin. [doi]
- A Biologically Interpretable Graph Convolutional Network to Link Genetic Risk Pathways and Imaging Phenotypes of DiseaseSayan Ghosal, Qiang Chen, Giulio Pergola, Aaron L. Goldman, William Ulrich, Daniel R. Weinberger, Archana Venkataraman. [doi]
- Who Is the Strongest Enemy? Towards Optimal and Efficient Evasion Attacks in Deep RLYanchao Sun, Ruijie Zheng, Yongyuan Liang, Furong Huang. [doi]
- Dropout Q-Functions for Doubly Efficient Reinforcement LearningTakuya Hiraoka, Takahisa Imagawa, Taisei Hashimoto, Takashi Onishi, Yoshimasa Tsuruoka. [doi]
- Learning Prototype-oriented Set Representations for Meta-LearningDandan Guo, Long Tian, Minghe Zhang, Mingyuan Zhou, Hongyuan Zha. [doi]
- Permutation Compressors for Provably Faster Distributed Nonconvex OptimizationRafal Szlendak, Alexander Tyurin, Peter Richtárik. [doi]
- Learning State Representations via Retracing in Reinforcement LearningChangmin Yu, Dong Li, Jianye Hao, Jun Wang, Neil Burgess. [doi]
- Provable Learning-based Algorithm For Sparse RecoveryXinshi Chen, Haoran Sun, Le Song. [doi]
- BadPre: Task-agnostic Backdoor Attacks to Pre-trained NLP Foundation ModelsKangjie Chen, Yuxian Meng, Xiaofei Sun, Shangwei Guo, Tianwei Zhang 0004, Jiwei Li, Chun Fan. [doi]
- Latent Image Animator: Learning to Animate Images via Latent Space NavigationYaohui Wang, Di Yang, François Brémond, Antitza Dantcheva. [doi]
- Assessing Generalization of SGD via DisagreementYiding Jiang, Vaishnavh Nagarajan, Christina Baek, J. Zico Kolter. [doi]
- An Agnostic Approach to Federated Learning with Class ImbalanceZebang Shen, Juan Cerviño, Hamed Hassani, Alejandro Ribeiro. [doi]
- Group equivariant neural posterior estimationMaximilian Dax, Stephen R. Green, Jonathan Gair, Michael Deistler, Bernhard Schölkopf, Jakob H. Macke. [doi]
- CoMPS: Continual Meta Policy SearchGlen Berseth, Zhiwei Zhang, Grace Zhang, Chelsea Finn, Sergey Levine. [doi]
- Critical Points in Quantum Generative ModelsEric Ricardo Anschütz. [doi]
- Multimeasurement Generative ModelsSaeed Saremi, Rupesh Kumar Srivastava. [doi]
- Hidden Convexity of Wasserstein GANs: Interpretable Generative Models with Closed-Form SolutionsArda Sahiner, Tolga Ergen, Batu Ozturkler, Burak Bartan, John M. Pauly, Morteza Mardani, Mert Pilanci. [doi]
- Recycling Model Updates in Federated Learning: Are Gradient Subspaces Low-Rank?Sheikh Shams Azam, Seyyedali Hosseinalipour, Qiang Qiu, Christopher G. Brinton. [doi]
- Self-Joint Supervised LearningNavid Kardan, Mubarak Shah, Mitch Hill. [doi]
- Pixelated Butterfly: Simple and Efficient Sparse training for Neural Network ModelsBeidi Chen, Tri Dao, Kaizhao Liang, Jiaming Yang, Zhao Song 0002, Atri Rudra, Christopher Ré. [doi]
- Minibatch vs Local SGD with Shuffling: Tight Convergence Bounds and BeyondChulhee Yun, Shashank Rajput, Suvrit Sra. [doi]
- NASPY: Automated Extraction of Automated Machine Learning ModelsXiaoxuan Lou, Shangwei Guo, Jiwei Li, Yaoxin Wu, Tianwei Zhang 0004. [doi]
- Task Relatedness-Based Generalization Bounds for Meta LearningJiechao Guan, Zhiwu Lu 0001. [doi]
- Gradient Step Denoiser for convergent Plug-and-PlaySamuel Hurault, Arthur Leclaire, Nicolas Papadakis. [doi]
- ProtoRes: Proto-Residual Network for Pose Authoring via Learned Inverse KinematicsBoris N. Oreshkin, Florent Bocquelet, Félix G. Harvey, Bay Raitt, Dominic Laflamme. [doi]
- Learning Efficient Image Super-Resolution Networks via Structure-Regularized PruningYulun Zhang, Huan Wang, Can Qin, Yun Fu. [doi]
- Learning Distributionally Robust Models at Scale via Composite OptimizationFarzin Haddadpour, Mohammad Mahdi Kamani, Mehrdad Mahdavi, Amin Karbasi. [doi]
- VICReg: Variance-Invariance-Covariance Regularization for Self-Supervised LearningAdrien Bardes, Jean Ponce, Yann LeCun. [doi]
- QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training QuantizationXiuying Wei, Ruihao Gong, Yuhang Li, Xianglong Liu, Fengwei Yu. [doi]
- Scale Mixtures of Neural Network Gaussian ProcessesHyungi Lee, Eunggu Yun, Hongseok Yang, Juho Lee 0001. [doi]
- KL Guided Domain AdaptationA. Tuan Nguyen, Toan Tran, Yarin Gal, Philip H. S. Torr, Atilim Gunes Baydin. [doi]
- Tuformer: Data-driven Design of Transformers for Improved Generalization or EfficiencyXiaoyu Liu, Jiahao Su, Furong Huang. [doi]
- Can an Image Classifier Suffice For Action Recognition?Quanfu Fan, Chun-Fu Chen 0001, Rameswar Panda. [doi]
- Coordination Among Neural Modules Through a Shared Global WorkspaceAnirudh Goyal, Aniket Rajiv Didolkar, Alex Lamb, Kartikeya Badola, Nan Rosemary Ke, Nasim Rahaman, Jonathan Binas, Charles Blundell, Michael Curtis Mozer, Yoshua Bengio. [doi]
- Neural Methods for Logical Reasoning over Knowledge GraphsAlfonso Amayuelas, Shuai Zhang, Susie Xi Rao, Ce Zhang 0001. [doi]
- How to Train Your MAML to Excel in Few-Shot ClassificationHan-Jia Ye, Wei-Lun Chao. [doi]
- Learning Vision-Guided Quadrupedal Locomotion End-to-End with Cross-Modal TransformersRuihan Yang, Minghao Zhang, Nicklas Hansen, Huazhe Xu, Xiaolong Wang. [doi]
- Uncertainty Modeling for Out-of-Distribution GeneralizationXiaotong Li, Yongxing Dai, Yixiao Ge, Jun Liu, Ying Shan, Lingyu Duan. [doi]
- Continual Normalization: Rethinking Batch Normalization for Online Continual LearningQuang Pham, Chenghao Liu, Steven C. H. Hoi. [doi]
- PI3NN: Out-of-distribution-aware Prediction Intervals from Three Neural NetworksSiyan Liu, Pei Zhang, Dan Lu 0001, Guannan Zhang. [doi]
- Towards Deployment-Efficient Reinforcement Learning: Lower Bound and OptimalityJiawei Huang, Jinglin Chen, Li Zhao, Tao Qin, Nan Jiang, Tie-Yan Liu. [doi]
- Open-Set Recognition: A Good Closed-Set Classifier is All You NeedSagar Vaze, Kai Han 0001, Andrea Vedaldi, Andrew Zisserman. [doi]
- Exploiting Class Activation Value for Partial-Label LearningFei Zhang, Lei Feng, Bo Han 0003, Tongliang Liu, Gang Niu 0001, Tao Qin, Masashi Sugiyama. [doi]
- On Bridging Generic and Personalized Federated Learning for Image ClassificationHong-You Chen, Wei-Lun Chao. [doi]
- ExT5: Towards Extreme Multi-Task Scaling for Transfer LearningVamsi Aribandi, Yi Tay, Tal Schuster, Jinfeng Rao, Huaixiu Steven Zheng, Sanket Vaibhav Mehta, Honglei Zhuang, Vinh Q. Tran 0002, Dara Bahri, Jianmo Ni, Jai Prakash Gupta, Kai Hui 0001, Sebastian Ruder, Donald Metzler. [doi]
- Discriminative Similarity for Data ClusteringYingzhen Yang, Ping Li. [doi]
- How Low Can We Go: Trading Memory for Error in Low-Precision TrainingChengrun Yang, Ziyang Wu, Jerry Chee, Christopher De Sa, Madeleine Udell. [doi]
- Self-supervised Learning is More Robust to Dataset ImbalanceHong Liu, Jeff Z. HaoChen, Adrien Gaidon, Tengyu Ma 0001. [doi]
- MoReL: Multi-omics Relational LearningArman Hasanzadeh, Ehsan Hajiramezanali, Nick Duffield, Xiaoning Qian. [doi]
- Measuring the Interpretability of Unsupervised Representations via Quantized Reversed ProbingIro Laina, Yuki M. Asano, Andrea Vedaldi. [doi]
- Neural Variational Dropout ProcessesInsu Jeon, Youngjin Park, Gunhee Kim. [doi]
- Understanding Intrinsic Robustness Using Label UncertaintyXiao Zhang, David Evans. [doi]
- Declarative nets that are equilibrium modelsRussell Tsuchida, Suk Yee Yong, Mohammad Ali Armin, Lars Petersson, Cheng Soon Ong. [doi]
- Sqrt(d) Dimension Dependence of Langevin Monte CarloRuilin Li, Hongyuan Zha, Molei Tao. [doi]
- On Non-Random Missing Labels in Semi-Supervised LearningXinting Hu, Yulei Niu, Chunyan Miao, Xian-Sheng Hua 0001, Hanwang Zhang. [doi]
- On Redundancy and Diversity in Cell-based Neural Architecture SearchXingchen Wan, Binxin Ru, Pedro M. Esperança, Zhenguo Li. [doi]
- Learning more skills through optimistic explorationDJ Strouse, Kate Baumli, David Warde-Farley, Volodymyr Mnih, Steven Stenberg Hansen. [doi]
- Learning to Dequantise with Truncated FlowsShawn Tan, Chin-Wei Huang, Alessandro Sordoni, Aaron C. Courville. [doi]
- On the Convergence of Certified Robust Training with Interval Bound PropagationYihan Wang, Zhouxing Shi, Quanquan Gu, Cho-Jui Hsieh. [doi]
- Leveraging Automated Unit Tests for Unsupervised Code TranslationBaptiste Rozière, Jie Zhang, François Charton, Mark Harman, Gabriel Synnaeve, Guillaume Lample. [doi]
- Active Hierarchical Exploration with Stable Subgoal Representation LearningSiyuan Li, Jin Zhang, Jianhao Wang, Yang Yu, Chongjie Zhang. [doi]
- Relational Learning with Variational BayesKuang-Hung Liu. [doi]
- cosFormer: Rethinking Softmax In AttentionZhen Qin, Weixuan Sun, Hui Deng, Dongxu Li, Yunshen Wei, Baohong Lv, Junjie Yan, Lingpeng Kong, Yiran Zhong. [doi]
- Cross-Trajectory Representation Learning for Zero-Shot Generalization in RLBogdan Mazoure, Ahmed M Ahmed, R. Devon Hjelm, Andrey Kolobov, Patrick MacAlpine. [doi]
- LORD: Lower-Dimensional Embedding of Log-Signature in Neural Rough Differential EquationsJaehoon Lee 0002, Jinsung Jeon, Sheo Yon Jhin, Jihyeon Hyeong, Jayoung Kim 0002, Minju Jo, Kook Seungji, Noseong Park. [doi]
- Language-biased image classification: evaluation based on semantic representationsYoann Lemesle, Masataka Sawayama, Guillermo Valle Pérez, Maxime Adolphe, Hélène Sauzéon, Pierre-Yves Oudeyer. [doi]
- The Role of Permutation Invariance in Linear Mode Connectivity of Neural NetworksRahim Entezari, Hanie Sedghi, Olga Saukh, Behnam Neyshabur. [doi]
- Properties from mechanisms: an equivariance perspective on identifiable representation learningKartik Ahuja, Jason Hartford, Yoshua Bengio. [doi]
- Synchromesh: Reliable Code Generation from Pre-trained Language ModelsGabriel Poesia, Alex Polozov, Vu Le 0002, Ashish Tiwari 0001, Gustavo Soares, Christopher Meek, Sumit Gulwani. [doi]
- Feature Kernel DistillationBobby He, Mete Ozay. [doi]
- Rethinking Adversarial Transferability from a Data Distribution PerspectiveYao Zhu, Jiacheng Sun, Zhenguo Li. [doi]
- Robbing the Fed: Directly Obtaining Private Data in Federated Learning with Modified ModelsLiam H. Fowl, Jonas Geiping, Wojciech Czaja, Micah Goldblum, Tom Goldstein. [doi]
- R4D: Utilizing Reference Objects for Long-Range Distance EstimationYingwei Li, Tiffany Chen, Maya Kabkab, Ruichi Yu, Longlong Jing, Yurong You, Hang Zhao. [doi]
- IntSGD: Adaptive Floatless Compression of Stochastic GradientsKonstantin Mishchenko, Bokun Wang, Dmitry Kovalev, Peter Richtárik. [doi]
- Neural graphical modelling in continuous-time: consistency guarantees and algorithmsAlexis Bellot, Kim Branson, Mihaela van der Schaar. [doi]
- Wish you were here: Hindsight Goal Selection for long-horizon dexterous manipulationTodor Davchev, Oleg Olegovich Sushkov, Jean-Baptiste Regli, Stefan Schaal, Yusuf Aytar, Markus Wulfmeier, Jon Scholz. [doi]
- Cross-Domain Imitation Learning via Optimal TransportArnaud Fickinger, Samuel Cohen, Stuart Russell 0001, Brandon Amos. [doi]
- Graph-Relational Domain AdaptationZihao Xu, Hao He 0011, Guang-He Lee, Bernie Wang, Hao Wang. [doi]
- End-to-End Learning of Probabilistic Hierarchies on GraphsDaniel Zügner, Bertrand Charpentier, Morgane Ayle, Sascha Geringer, Stephan Günnemann. [doi]
- Progressive Distillation for Fast Sampling of Diffusion ModelsTim Salimans, Jonathan Ho. [doi]
- Transferable Adversarial Attack based on Integrated GradientsYi Huang, Adams Wai-Kin Kong. [doi]
- Generalized Natural Gradient Flows in Hidden Convex-Concave Games and GANsAndjela Mladenovic, Iosif Sakos, Gauthier Gidel, Georgios Piliouras. [doi]
- Regularized Autoencoders for Isometric Representation LearningYonghyeon Lee, Sangwoong Yoon, MinJun Son, Frank Chongwoo Park. [doi]
- Givens Coordinate Descent Methods for Rotation Matrix Learning in Trainable Embedding IndexesYunjiang Jiang, Han Zhang, Yiming Qiu, Yun Xiao, Bo Long, Wen-Yun Yang. [doi]
- Optimal ANN-SNN Conversion for High-accuracy and Ultra-low-latency Spiking Neural NetworksTong Bu, Wei Fang, Jianhao Ding, Penglin Dai, Zhaofei Yu, Tiejun Huang 0001. [doi]
- Exposing the Implicit Energy Networks behind Masked Language Models via Metropolis--HastingsKartik Goyal, Chris Dyer, Taylor Berg-Kirkpatrick. [doi]
- Understanding approximate and unrolled dictionary learning for pattern recoveryBenoît Malézieux, Thomas Moreau, Matthieu Kowalski. [doi]
- Coherence-based Label Propagation over Time Series for Accelerated Active LearningYooju Shin, Susik Yoon, Sundong Kim, Hwanjun Song, Jae-Gil Lee 0001, Byung Suk Lee. [doi]
- LFPT5: A Unified Framework for Lifelong Few-shot Language Learning Based on Prompt Tuning of T5Chengwei Qin, Shafiq Joty. [doi]
- Learning Towards The Largest MarginsXiong Zhou, Xianming Liu, Deming Zhai, Junjun Jiang, Xin Gao, Xiangyang Ji. [doi]
- AdaAug: Learning Class- and Instance-adaptive Data Augmentation PoliciesTsz-Him Cheung, Dit-Yan Yeung. [doi]
- A Theoretical Analysis on Feature Learning in Neural Networks: Emergence from Inputs and Advantage over Fixed FeaturesZhenmei Shi, Junyi Wei, Yingyu Liang. [doi]
- Practical Conditional Neural Process Via Tractable Dependent PredictionsStratis Markou, James Requeima, Wessel P. Bruinsma, Anna Vaughan, Richard E. Turner. [doi]
- Large-Scale Representation Learning on Graphs via BootstrappingShantanu Thakoor, Corentin Tallec, Mohammad Gheshlaghi Azar, Mehdi Azabou, Eva L. Dyer, Rémi Munos, Petar Velickovic, Michal Valko. [doi]
- The Boltzmann Policy Distribution: Accounting for Systematic Suboptimality in Human ModelsCassidy Laidlaw, Anca D. Dragan. [doi]
- GATSBI: Generative Adversarial Training for Simulation-Based InferencePoornima Ramesh, Jan-Matthis Lueckmann, Jan Boelts, Álvaro Tejero-Cantero, David S. Greenberg, Pedro J. Gonçalves, Jakob H. Macke. [doi]
- How Does SimSiam Avoid Collapse Without Negative Samples? A Unified Understanding with Self-supervised Contrastive LearningChaoning Zhang, Kang Zhang, Chenshuang Zhang, Trung X. Pham, Chang D. Yoo, In-So Kweon. [doi]
- Learn Locally, Correct Globally: A Distributed Algorithm for Training Graph Neural NetworksMorteza Ramezani, Weilin Cong, Mehrdad Mahdavi, Mahmut T. Kandemir, Anand Sivasubramaniam. [doi]
- SURF: Semi-supervised Reward Learning with Data Augmentation for Feedback-efficient Preference-based Reinforcement LearningJongjin Park, Younggyo Seo, Jinwoo Shin, Honglak Lee, Pieter Abbeel, Kimin Lee. [doi]
- Label-Efficient Semantic Segmentation with Diffusion ModelsDmitry Baranchuk, Andrey Voynov, Ivan Rubachev, Valentin Khrulkov, Artem Babenko. [doi]
- Differentiable DAG SamplingBertrand Charpentier, Simon Kibler, Stephan Günnemann. [doi]
- Near-optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with PessimismMing Yin, Yaqi Duan, Mengdi Wang, Yu-Xiang Wang 0003. [doi]
- Implicit Bias of Projected Subgradient Method Gives Provable Robust Recovery of Subspaces of Unknown CodimensionParis Giampouras, Benjamin David Haeffele, René Vidal. [doi]
- Towards Understanding the Robustness Against Evasion Attack on Categorical DataHongyan Bao, Yufei Han, Yujun Zhou, Yun Shen, Xiangliang Zhang 0001. [doi]
- Charformer: Fast Character Transformers via Gradient-based Subword TokenizationYi Tay, Vinh Q. Tran 0002, Sebastian Ruder, Jai Prakash Gupta, Hyung Won Chung, Dara Bahri, Zhen Qin 0001, Simon Baumgartner, Cong Yu 0001, Donald Metzler. [doi]
- Encoding Weights of Irregular Sparsity for Fixed-to-Fixed Model CompressionBaeseong Park, Se Jung Kwon, Daehwan Oh, Byeongwook Kim, Dongsoo Lee. [doi]
- Increasing the Cost of Model Extraction with Calibrated Proof of WorkAdam Dziedzic, Muhammad Ahmad Kaleem, Yu Shen Lu, Nicolas Papernot. [doi]
- Contrastive Fine-grained Class Clustering via Generative Adversarial NetworksYunji Kim, Jung-Woo Ha. [doi]
- Continuously Discovering Novel Strategies via Reward-Switching Policy OptimizationZihan Zhou 0002, Wei Fu, Bingliang Zhang, Yi Wu. [doi]
- Machine Learning For Elliptic PDEs: Fast Rate Generalization Bound, Neural Scaling Law and Minimax OptimalityYiping Lu 0001, Haoxuan Chen, Jianfeng Lu 0001, Lexing Ying, Jose H. Blanchet. [doi]
- Multiset-Equivariant Set Prediction with Approximate Implicit DifferentiationYan Zhang, David W. Zhang, Simon Lacoste-Julien, Gertjan J. Burghouts, Cees G. M. Snoek. [doi]
- Filtered-CoPhy: Unsupervised Learning of Counterfactual Physics in Pixel SpaceSteeven Janny, Fabien Baradel, Natalia Neverova, Madiha Nadri, Greg Mori, Christian Wolf 0001. [doi]
- Focus on the Common Good: Group Distributional Robustness FollowsVihari Piratla, Praneeth Netrapalli, Sunita Sarawagi. [doi]
- Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training ParadigmYangguang Li, Feng Liang, Lichen Zhao, Yufeng Cui, Wanli Ouyang, Jing Shao, Fengwei Yu, Junjie Yan. [doi]
- Imitation Learning by Reinforcement LearningKamil Ciosek. [doi]
- Do We Need Anisotropic Graph Neural Networks?Shyam A. Tailor, Felix L. Opolka, Pietro Liò, Nicholas Donald Lane. [doi]
- Benchmarking the Spectrum of Agent CapabilitiesDanijar Hafner. [doi]
- Toward Efficient Low-Precision Training: Data Format Optimization and Hysteresis QuantizationSunWoo Lee, Jeongwoo Park, Dongsuk Jeon. [doi]
- Hybrid Random FeaturesKrzysztof Marcin Choromanski, Han Lin, Haoxian Chen, Arijit Sehanobish, Yuanzhe Ma, Deepali Jain, Jake Varley, Andy Zeng, Michael S. Ryoo, Valerii Likhosherstov, Dmitry Kalashnikov, Vikas Sindhwani, Adrian Weller. [doi]
- Towards a Unified View of Parameter-Efficient Transfer LearningJunxian He, Chunting Zhou, Xuezhe Ma, Taylor Berg-Kirkpatrick, Graham Neubig. [doi]
- MIDI-DDSP: Detailed Control of Musical Performance via Hierarchical ModelingYusong Wu, Ethan Manilow, Yi Deng, Rigel Swavely, Kyle Kastner, Tim Cooijmans, Aaron C. Courville, Cheng-Zhi Anna Huang, Jesse H. Engel. [doi]
- Anomaly Detection for Tabular Data with Internal Contrastive LearningTom Shenkar, Lior Wolf. [doi]
- Dynamic Token Normalization improves Vision TransformersWenqi Shao, Yixiao Ge, Zhaoyang Zhang, Xuyuan Xu, Xiaogang Wang, Ying Shan, Ping Luo. [doi]
- Extending the WILDS Benchmark for Unsupervised AdaptationShiori Sagawa, Pang Wei Koh, Tony Lee, Irena Gao, Sang Michael Xie, Kendrick Shen, Ananya Kumar, Weihua Hu, Michihiro Yasunaga, Henrik Marklund, Sara Beery, Etienne David, Ian Stavness, Wei Guo, Jure Leskovec, Kate Saenko, Tatsunori Hashimoto, Sergey Levine, Chelsea Finn, Percy Liang. [doi]
- Diurnal or Nocturnal? Federated Learning of Multi-branch Networks from Periodically Shifting DistributionsChen Zhu, Zheng Xu 0002, Mingqing Chen, Jakub Konecný, Andrew Hard, Tom Goldstein. [doi]
- Case-based reasoning for better generalization in textual reinforcement learningMattia Atzeni, Shehzaad Zuzar Dhuliawala, Keerthiram Murugesan, Mrinmaya Sachan. [doi]
- UniFormer: Unified Transformer for Efficient Spatial-Temporal Representation LearningKunchang Li, Yali Wang 0001, Peng Gao 0007, Guanglu Song, Yu Liu 0015, Hongsheng Li 0001, Yu Qiao 0001. [doi]
- Learning Graphon Mean Field Games and Approximate Nash EquilibriaKai Cui 0001, Heinz Koeppl. [doi]
- Know Thyself: Transferable Visual Control Policies Through Robot-AwarenessEdward S. Hu, Kun Huang, Oleh Rybkin, Dinesh Jayaraman. [doi]
- How to Inject Backdoors with Better Consistency: Logit Anchoring on Clean DataZhiyuan Zhang, Lingjuan Lyu, Weiqiang Wang, Lichao Sun, Xu Sun 0001. [doi]
- Linking Emergent and Natural Languages via Corpus TransferShunyu Yao, Mo Yu, Yang Zhang, Karthik R. Narasimhan, Joshua B. Tenenbaum, Chuang Gan. [doi]
- Diffusion-Based Voice Conversion with Fast Maximum Likelihood Sampling SchemeVadim Popov, Ivan Vovk, Vladimir Gogoryan, Tasnima Sadekova, Mikhail Sergeevich Kudinov, Jiansheng Wei. [doi]
- Neural Program Synthesis with QueryDi Huang, Rui Zhang 0040, Xing Hu 0001, Xishan Zhang, Pengwei Jin, Nan Li, Zidong Du, Qi Guo, Yunji Chen. [doi]
- Domain Adversarial Training: A Game PerspectiveDavid Acuna, Marc T. Law, Guojun Zhang, Sanja Fidler. [doi]
- Selective Ensembles for Consistent PredictionsEmily Black, Klas Leino, Matt Fredrikson. [doi]
- The Close Relationship Between Contrastive Learning and Meta-LearningRenkun Ni, Manli Shu, Hossein Souri, Micah Goldblum, Tom Goldstein. [doi]
- Omni-Dimensional Dynamic ConvolutionChao Li, Aojun Zhou, Anbang Yao. [doi]
- Sparse DETR: Efficient End-to-End Object Detection with Learnable SparsityByungseok Roh, Jaewoong Shin, Wuhyun Shin, Saehoon Kim. [doi]
- Continual Learning with Filter Atom SwappingZichen Miao, Ze Wang, Wei Chen, Qiang Qiu. [doi]
- Variational Predictive Routing with Nested Subjective TimescalesAlexey Zakharov, Qinghai Guo, Zafeirios Fountas. [doi]
- Resolving Training Biases via Influence-based Data RelabelingShuming Kong, Yanyan Shen, Linpeng Huang. [doi]
- Constructing Orthogonal Convolutions in an Explicit MannerTan Yu, Jun Li, Yunfeng Cai, Ping Li. [doi]
- Data Efficient Language-Supervised Zero-Shot Recognition with Optimal Transport DistillationBichen Wu, Ruizhe Cheng, Peizhao Zhang, Tianren Gao, Joseph E. Gonzalez, Peter Vajda. [doi]
- On the role of population heterogeneity in emergent communicationMathieu Rita, Florian Strub, Jean-Bastien grill, Olivier Pietquin, Emmanuel Dupoux. [doi]
- THOMAS: Trajectory Heatmap Output with learned Multi-Agent SamplingThomas Gilles, Stefano Sabatini, Dzmitry Tsishkou, Bogdan Stanciulescu, Fabien Moutarde. [doi]
- Ada-NETS: Face Clustering via Adaptive Neighbour Discovery in the Structure SpaceYaohua Wang, Yaobin Zhang, Fangyi Zhang, Senzhang Wang, Ming Lin, Yuqi Zhang, Xiuyu Sun. [doi]
- Churn Reduction via DistillationHeinrich Jiang, Harikrishna Narasimhan, Dara Bahri, Andrew Cotter, Afshin Rostamizadeh. [doi]
- Byzantine-Robust Learning on Heterogeneous Datasets via BucketingSai Praneeth Karimireddy, Lie He, Martin Jaggi. [doi]
- Measuring CLEVRness: Black-box Testing of Visual Reasoning ModelsSpyridon Mouselinos, Henryk Michalewski, Mateusz Malinowski. [doi]
- ComPhy: Compositional Physical Reasoning of Objects and Events from VideosZhenfang Chen, Kexin Yi, Yunzhu Li, Mingyu Ding, Antonio Torralba 0001, Joshua B. Tenenbaum, Chuang Gan. [doi]
- Delaunay Component Analysis for Evaluation of Data RepresentationsPetra Poklukar, Vladislav Polianskii, Anastasiia Varava, Florian T. Pokorny, Danica Kragic Jensfelt. [doi]
- Self-ensemble Adversarial Training for Improved RobustnessHongjun Wang, Yisen Wang 0001. [doi]
- Adversarially Robust Conformal PredictionAsaf Gendler, Tsui-Wei Weng, Luca Daniel, Yaniv Romano. [doi]
- Transformer-based Transform CodingYinhao Zhu, Yang Yang 0010, Taco Cohen. [doi]
- DEPTS: Deep Expansion Learning for Periodic Time Series ForecastingWei Fan 0010, Shun Zheng, Xiaohan Yi, Wei Cao, Yanjie Fu, Jiang Bian 0002, Tie-Yan Liu. [doi]
- On Lottery Tickets and Minimal Task Representations in Deep Reinforcement LearningMarc Aurel Vischer, Robert Tjarko Lange, Henning Sprekeler. [doi]
- Nonlinear ICA Using Volume-Preserving TransformationsXiaojiang Yang, Yi Wang, Jiacheng Sun, Xing Zhang, Shifeng Zhang, Zhenguo Li, Junchi Yan. [doi]
- Learning transferable motor skills with hierarchical latent mixture policiesDushyant Rao, Fereshteh Sadeghi, Leonard Hasenclever, Markus Wulfmeier, Martina Zambelli, Giulia Vezzani, Dhruva Tirumala, Yusuf Aytar, Josh Merel, Nicolas Heess, Raia Hadsell. [doi]
- Controlling the Complexity and Lipschitz Constant improves Polynomial NetsZhenyu Zhu, Fabian Latorre, Grigorios Chrysos 0002, Volkan Cevher. [doi]
- CoordX: Accelerating Implicit Neural Representation with a Split MLP ArchitectureRuofan Liang, Hongyi Sun, Nandita Vijaykumar. [doi]
- Is Fairness Only Metric Deep? Evaluating and Addressing Subgroup Gaps in Deep Metric LearningNatalie Dullerud, Karsten Roth, Kimia Hamidieh, Nicolas Papernot, Marzyeh Ghassemi. [doi]
- Optimal Transport for Causal DiscoveryRuibo Tu, Kun Zhang, Hedvig Kjellström, Cheng Zhang 0005. [doi]
- Comparing Distributions by Measuring Differences that Affect Decision MakingShengjia Zhao, Abhishek Sinha, Yutong He, Aidan Perreault, Jiaming Song, Stefano Ermon. [doi]
- Mention Memory: incorporating textual knowledge into Transformers through entity mention attentionMichiel de Jong, Yury Zemlyanskiy, Nicholas FitzGerald, Fei Sha, William W. Cohen. [doi]
- Interacting Contour Stochastic Gradient Langevin DynamicsWei Deng 0002, Siqi Liang, Botao Hao, Guang Lin, Faming Liang. [doi]
- Lossless Compression with Probabilistic CircuitsAnji Liu, Stephan Mandt, Guy Van den Broeck. [doi]
- Query Embedding on Hyper-Relational Knowledge GraphsDimitrios Alivanistos, Max Berrendorf, Michael Cochez, Mikhail Galkin 0001. [doi]
- SimVLM: Simple Visual Language Model Pretraining with Weak SupervisionZirui Wang, Jiahui Yu, Adams Wei Yu, Zihang Dai, Yulia Tsvetkov, Yuan Cao 0007. [doi]
- Fairness in Representation for Multilingual NLP: Insights from Controlled Experiments on Conditional Language ModelingAda Wan. [doi]
- Missingness Bias in Model DebuggingSaachi Jain, Hadi Salman, Eric Wong, Pengchuan Zhang, Vibhav Vineet, Sai Vemprala, Aleksander Madry. [doi]
- Learning Optimal Conformal ClassifiersDavid Stutz, Krishnamurthy Dvijotham, Ali Taylan Cemgil, Arnaud Doucet. [doi]
- Dive Deeper Into Integral Pose RegressionKerui Gu, Linlin Yang, Angela Yao. [doi]
- Variational Neural Cellular AutomataRasmus Berg Palm, Miguel González Duque, Shyam Sudhakaran, Sebastian Risi. [doi]
- BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech SynthesisMax W. Y. Lam, Jun Wang 0091, Dan Su 0002, Dong Yu 0001. [doi]
- Towards Training Billion Parameter Graph Neural Networks for Atomic SimulationsAnuroop Sriram, Abhishek Das, Brandon M. Wood, Siddharth Goyal, C. Lawrence Zitnick. [doi]
- Towards Deepening Graph Neural Networks: A GNTK-based Optimization PerspectiveWei Huang, Yayong Li, Weitao Du, Richard Y. D. Xu, Jie Yin, Ling Chen, Miao Zhang. [doi]
- Signing the Supermask: Keep, Hide, InvertNils Koster, Oliver Grothe, Achim Rettinger. [doi]
- Learning Weakly-supervised Contrastive RepresentationsYao-Hung Hubert Tsai, Tianqin Li, Weixin Liu, Peiyuan Liao, Ruslan Salakhutdinov, Louis-Philippe Morency. [doi]
- EntQA: Entity Linking as Question AnsweringWenzheng Zhang, Wenyue Hua, Karl Stratos. [doi]
- Recursive Disentanglement NetworkYixuan Chen 0003, Yubin Shi, Dongsheng Li 0002, Yujiang Wang 0001, Mingzhi Dong, Yingying Zhao, Robert P. Dick, Qin Lv, Fan Yang, Li Shang. [doi]
- MetaMorph: Learning Universal Controllers with TransformersAgrim Gupta, Linxi Fan, Surya Ganguli, Li Fei-Fei 0001. [doi]
- Hybrid Memoised Wake-Sleep: Approximate Inference at the Discrete-Continuous InterfaceTuan Anh Le 0001, Katherine M. Collins, Luke Hewitt, Kevin Ellis, Siddharth Narayanaswamy, Samuel Gershman, Joshua B. Tenenbaum. [doi]
- Path Auxiliary Proposal for MCMC in Discrete SpaceHaoran Sun, Hanjun Dai, Wei Xia, Arun Ramamurthy. [doi]
- Steerable Partial Differential Operators for Equivariant Neural NetworksErik Jenner, Maurice Weiler. [doi]
- The Hidden Convex Optimization Landscape of Regularized Two-Layer ReLU Networks: an Exact Characterization of Optimal SolutionsYifei Wang, Jonathan Lacotte, Mert Pilanci. [doi]
- COPA: Certifying Robust Policies for Offline Reinforcement Learning against Poisoning AttacksFan Wu, Linyi Li, Huan Zhang 0001, Bhavya Kailkhura, Krishnaram Kenthapadi, Ding Zhao, Bo Li 0026. [doi]
- Prototypical Contrastive Predictive CodingKyungmin Lee. [doi]
- A Statistical Framework for Efficient Out of Distribution Detection in Deep Neural NetworksMatan Haroush, Tzviel Frostig, Ruth Heller, Daniel Soudry. [doi]
- Closed-form Sample Probing for Learning Generative Models in Zero-shot LearningSamet Çetin, Orhun Bugra Baran, Ramazan Gokberk Cinbis. [doi]
- Learning-Augmented $k$-means ClusteringJon C. Ergun, Zhili Feng, Sandeep Silwal, David P. Woodruff, Samson Zhou. [doi]
- Conditional Object-Centric Learning from VideoThomas Kipf, Gamaleldin Fathy Elsayed, Aravindh Mahendran, Austin Stone, Sara Sabour, Georg Heigold, Rico Jonschkowski, Alexey Dosovitskiy, Klaus Greff. [doi]
- Evaluating Distributional Distortion in Neural Language ModelingBenjamin LeBrun, Alessandro Sordoni, Timothy J. O'Donnell. [doi]
- Analyzing and Improving the Optimization Landscape of Noise-Contrastive EstimationBingbin Liu, Elan Rosenfeld, Pradeep Kumar Ravikumar, Andrej Risteski. [doi]
- A Fine-Tuning Approach to Belief State ModelingSamuel Sokota, Hengyuan Hu, David J. Wu, J. Zico Kolter, Jakob Nicolaus Foerster, Noam Brown. [doi]
- Node Feature Extraction by Self-Supervised Multi-scale Neighborhood PredictionEli Chien, Wei-Cheng Chang, Cho-Jui Hsieh, Hsiang-Fu Yu, Jiong Zhang, Olgica Milenkovic, Inderjit S. Dhillon. [doi]
- $\pi$BO: Augmenting Acquisition Functions with User Beliefs for Bayesian OptimizationCarl Hvarfner, Danny Stoll, Artur L. F. Souza, Marius Lindauer, Frank Hutter, Luigi Nardi. [doi]
- Knowledge Removal in Sampling-based Bayesian InferenceShaopeng Fu, Fengxiang He, Dacheng Tao. [doi]
- Capacity of Group-invariant Linear Readouts from Equivariant Representations: How Many Objects can be Linearly Classified Under All Possible Views?Matthew Farrell, Blake Bordelon, Shubhendu Trivedi, Cengiz Pehlevan. [doi]
- Counterfactual Plans under Distributional AmbiguityNgoc Bui, Duy Nguyen,