Abstract is missing.
- Responsible AI and Autonomous Agents: Governance, Ethics, and Sustainable InnovationVirginia Dignum. 1-2 [doi]
- The Agent Paradox: Can Multi-Agent Systems Replicate the Complexity of Human Cognition and Social Behavior?Rada Mihalcea. 3 [doi]
- Multiagent Systems, and the Search for Appropriate Foundations: A Personal Journey and RetrospectiveJeffrey S. Rosenschein. 4 [doi]
- Enhancing Offline Reinforcement Learning with Curriculum Learning-Based Trajectory ValuationAmir Abolfazli, Zekun Song, Avishek Anand, Wolfgang Nejdl. 5-13 [doi]
- Who Reviews The Reviewers? A Multi-Level Jury ProblemBen Abramowitz, Omer Lev, Nicholas Mattei. 14-22 [doi]
- An Improved Mechanism for Pricing Ride-Hailing FaresMarek Adamczyk, Maurycy Borkowski, Michal Pawlowski. 23-31 [doi]
- EFX Allocations and Orientations on Bipartite Multi-graphs: A Complete PictureMahyar Afshinmehr, Alireza Danaei, Mehrafarin Kazemi, Kurt Mehlhorn, Nidhi Rathi. 32-40 [doi]
- Harmonious Balanced Partitioning of a Network of AgentsPulkit Agarwal, Harshvardhan Agarwal, Vaibhav Raj, Swaprava Nath. 41-49 [doi]
- SCMRAG: Self-Corrective Multihop Retrieval Augmented Generation System for LLM AgentsRishabh Agrawal, Murtaza Asrani, Hadi Youssef, Apurva Narayan. 50-58 [doi]
- Investigating the Perspective of Non-Native Speakers on Foreigner-Directed Speech using Virtual Agents: The Role of Racial Ingroup Affiliation and Language Proficiency on Perception and ComprehensionOhenewa Bediako Akuffo, Birgit Lugrin. 59-68 [doi]
- Impact Measures for Gradual Argumentation SemanticsCaren Al Anaissy, Jérôme Delobelle, Srdjan Vesic, Bruno Yun. 69-77 [doi]
- Approximation Ratio for Preference Aggregation Using Tree CP-NetsAbu Mohammad Hammad Ali, Daniel Ogundare, Boting Yang, Sandra Zilles. 78-86 [doi]
- Geometric Freeze-Tag ProblemSharareh Alipour, Kajal Baghestani, Mahdis Mirzaei, Soroush Sahraei. 87-95 [doi]
- Robin Hood Reachability Bidding GamesShaull Almagor, Guy Avni, Neta Dafni. 96-104 [doi]
- A Hypothesis-Driven Approach to Explainable Goal RecognitionAbeer Alshehri, Hissah Alotaibi, Tim Miller 0001, Mor Vered. 105-114 [doi]
- Algorithmically Fair Maximization of Multiple Submodular Objective FunctionsGeorgios Amanatidis, Georgios Birmpas, Philip Lazos, Stefano Leonardi 0001, Rebecca Reiffenhäuser. 115-123 [doi]
- Truthful and Welfare-maximizing Resource Scheduling with Application to Electric VehiclesRamsundar Anandanarayanan, Swaprava Nath, Prasant Misra. 124-132 [doi]
- Model and Mechanisms of Consent for Responsible AutonomyAnastasia Sophia Apeiron, Davide Dell'Anna, Pradeep K. Murukannaiah, Pinar Yolum. 133-141 [doi]
- FORM: Learning Expressive and Transferable First-Order Logic Reward MachinesLeo Ardon, Daniel Furelos-Blanco, Roko Parac, Alessandra Russo. 142-151 [doi]
- Probably Correct Optimal Stable Matching for Two-Sided Market Under UncertaintyAndreas Athanasopoulos, Anne-Marie George, Christos Dimitrakakis. 152-160 [doi]
- Bidding Games on Markov Decision Processes with Quantitative Reachability ObjectivesGuy Avni, Martin Kurecka, Kaushik Mallik, Petr Novotný 0001, Suman Sadhukhan. 161-169 [doi]
- Fair Allocation of Divisible Goods under Non-Linear ValuationsHaris Aziz 0001, Zixu He, Xinhang Lu, Kaiyang Zhou. 170-178 [doi]
- Condorcet Winners and Anscombe's Paradox Under Weighted Binary VotingCarmel Baharav, Andrei Constantinescu 0001, Roger Wattenhofer. 179-187 [doi]
- Local Topological Information as a Powerful Enhancer for Generalizable Neural Method in Travelling Salesman ProblemXiaoxin Bai, JunYang Yang, Shengchao Yuan, Yinghao Zhang, Hanqian Wu. 188-196 [doi]
- On the Gale-Shapley Algorithm for Stable Matchings with a Partial Honesty Nash RefinementJames P. Bailey, Craig A. Tovey. 197-204 [doi]
- The Price of Anarchy in Spatial Social ChoiceJames P. Bailey, Craig A. Tovey. 205-213 [doi]
- Alternating-time Temporal Logic with Stochastic AbilitiesGabriel Ballot, Vadim Malvone, Jean Leneutre, Jingxuan Ma, Mourad Leslous. 214-222 [doi]
- An AI-Driven Card Playing Robot: An Empirical Study on Communicative Style and Embodiment with Elderly AdultsMichael Banck, Elisabeth Ganal, Hanna-Finja Weichert, Frank Puppe, Birgit Lugrin. 223-232 [doi]
- On the Complexity of Learning to Cooperate in Populations of Socially Rational AgentsSaptarashmi Bandyopadhyay, Mustafa Mert Çelikok, Robert Loftin. 233-241 [doi]
- Beyond Words: Integrating Personality Traits and Context-Driven Gestures in Human-Robot InteractionsTahsin Tariq Banna, Sejuti Rahman, Mohammad Tareq. 242-251 [doi]
- Neural DNF-MT: A Neuro-symbolic Approach for Learning Interpretable and Editable PoliciesKexin Gu Baugh, Luke Dickens, Alessandra Russo. 252-260 [doi]
- Sea-cret Agents: Maritime Abduction for Region Generation to Expose Dark Vessel TrajectoriesDivyagna Bavikadi, Nathaniel Lee, Paulo Shakarian, Chad Parvis. 261-270 [doi]
- Opinion Dynamics with Median AggregationPetra Berenbrink, Martin Hoefer 0001, Dominik Kaaser, Marten Maack, Malin Rau, Lisa Wilhelmi. 271-279 [doi]
- Speed vs Accuracy in Goal Recognition for Time-Sensitive Applications: A Game-Theoretic ApproachSara Bernardini, Fabio Fagnani, Santiago Franco. 280-288 [doi]
- To Spend or to Gain: Online Learning in Repeated Karma AuctionsDamien Berriaud, Ezzat Elokda, Devansh Jalota, Emilio Frazzoli, Marco Pavone 0001, Florian Dörfler. 289-297 [doi]
- Towards Envy-Freeness Relaxations for General Nonmonotone ValuationsUmang Bhaskar, Gunjan Kumar, Yeshwant Pandit, Rakshitha. 298-306 [doi]
- Maximizing Value in Challenge the Champ TournamentsUmang Bhaskar, Juhi Chaudhary, Palash Dey. 307-315 [doi]
- Agent-based Modeling and Simulation of Ambiguity in Catastrophe Insurance MarketsYu Bi, Lingxiao Zhao, Jinyun Tong, Zhe Feng, Carmine Ventre. 316-324 [doi]
- Equilibrium Analysis in Markets with Asymmetric Utility FunctionsMartin Bichler, Markus Ewert, Axel Ockenfels. 325-333 [doi]
- Temporal Network Creation Games: The Impact of Non-Locality and TerminalsDavide Bilò, Sarel Cohen, Tobias Friedrich 0001, Hans Gawendowicz, Nicolas Klodt, Pascal Lenzner, George Skretas. 334-342 [doi]
- Minimizing Rosenthal's Potential in Monotone Congestion GamesVittorio Bilò, Angelo Fanelli 0001, Laurent Gourvès, Christos Tsoufis, Cosimo Vinci. 343-351 [doi]
- Synergistic Traffic AssignmentThomas Bläsius, Adrian Feilhauer, Markus Jung, Moritz Laupichler, Peter Sanders 0001, Michael Zündorf. 352-360 [doi]
- EnEnv 1.0: Energy Grid Environment for Multi-Agent Reinforcement Learning BenchmarkingDominik Jacek Bogucki, Lukasz Lepak, Sonam Parashar, Bartlomiej Blachowski, Pawel Wawrzynski. 361-370 [doi]
- Monte Carlo Tree Search with Velocity Obstacles for Safe and Efficient Motion Planning in Dynamic EnvironmentsLorenzo Bonanni, Daniele Meli, Alberto Castellini, Alessandro Farinelli. 371-380 [doi]
- Feature Engineering for Agents: An Adaptive Cognitive Architecture for Interpretable ML MonitoringGusseppe Bravo Rocca, Peini Liu, Jordi Guitart, Rodrigo M. Carrillo-Larco, Ajay Dholakia, David Ellison. 381-389 [doi]
- Computing Efficient Envy-Free Partial Allocations of Indivisible GoodsRobert Bredereck, Andrzej Kaczmarczyk 0001, Junjie Luo 0001, Bin Sun. 390-398 [doi]
- Compositional Shielding and Reinforcement Learning for Multi-Agent SystemsAsger Horn Brorholt, Kim Guldstrand Larsen, Christian Schilling 0001. 399-407 [doi]
- Scalable Offline Reinforcement Learning for Mean Field GamesAxel Brunnbauer, Julian Lemmel, Zahra Babaiee, Sophie A. Neubauer, Radu Grosu. 408-417 [doi]
- Welfare Approximation in Additively Separable Hedonic GamesMartin Bullinger, Vaggos Chatziafratis, Parnian Shahkar. 418-426 [doi]
- Towards Fair and Efficient Public Transportation: A Bus Stop ModelMartin Bullinger, Edith Elkind, Mohamad Latifian. 427-435 [doi]
- Who Am I Dealing With? Explaining the Designer's Hidden IntentionsTurgay Caglar, Sarath Sreedharan, Mor Vered. 436-444 [doi]
- Emit As You Go: Enumerating Edges of a Spanning TreeKatrin Casel, Stefan Neubert. 445-453 [doi]
- On the Fairness of Additive Welfarist RulesKaren Frilya Celine, Warut Suksompong, Sheung Man Yuen. 454-462 [doi]
- Game-Theoretically Secure Distributed Protocols for Fair Allocation in Coalitional GamesT.-H. Hubert Chan, Qipeng Kuang, Quan Xue. 463-471 [doi]
- Fair Division in a Variable SettingHarish Chandramouleeswaran, Prajakta Nimbhorkar, Nidhi Rathi. 472-480 [doi]
- Human-Agent Coordination in Games under Incomplete Information via Multi-Step IntentShenghui Chen, Ruihan Zhao 0001, Sandeep Chinchali, Ufuk Topcu. 481-489 [doi]
- Azorus: Commitments over Protocols for BDI AgentsAmit K. Chopra, Matteo Baldoni, Samuel Christie, Munindar P. Singh. 490-499 [doi]
- On the Limits of Agency in Agent-based ModelsAyush Chopra, Shashank Kumar, Nurullah Giray Kuru, Ramesh Raskar, Arnau Quera-Bofarull. 500-509 [doi]
- Computing Efficient and Envy-Free Allocations under Dichotomous Preferences using SATAri Conati, Andreas Niskanen, Ronald de Haan, Matti Järvisalo. 510-518 [doi]
- Byzantine Game Theory: Sun Tzu's BoxesAndrei Constantinescu 0001, Roger Wattenhofer. 519-528 [doi]
- Selfish Behavior and Resource Competition in Multi-Agent SystemsCostas Courcoubetis, Antonis Dimakis. 529-537 [doi]
- Approximation Algorithms for Connected Maximum CoverageGianlorenzo D'Angelo, Esmaeil Delfaraz. 538-546 [doi]
- Bayesian Collaborative Bandits with Thompson Sampling for Improved Outreach in Maternal HealthArpan Dasgupta, Gagan Jain, Arun Sai Suggala, Karthikeyan Shanmugam, Milind Tambe, Aparna Taneja. 547-555 [doi]
- Greedy ABA Learning for Case-Based ReasoningEmanuele De Angelis, Maurizio Proietti, Francesca Toni. 556-564 [doi]
- More Efficient Sybil Detection Mechanisms Leveraging Resistance of Users to Attack RequestsAli Safarpoor-Dehkordi, Ahad N. Zehmakan. 565-573 [doi]
- Composing Reinforcement Learning Policies, with Formal GuaranteesFlorent Delgrange, Guy Avni, Anna Lukina, Christian Schilling 0001, Ann Nowé, Guillermo A. Pérez 0001. 574-583 [doi]
- Parameterized Algorithms for Multiagent Pathfinding on TreesArgyrios Deligkas, Eduard Eiben, Robert Ganian, Iyad Kanj, M. S. Ramanujan 0001. 584-592 [doi]
- From Natural Language to Extensive-Form Game RepresentationsShilong Deng, Yongzhao Wang 0001, Rahul Savani. 593-601 [doi]
- Safe Pareto Improvements for Expected Utility Maximizers in Program GamesAnthony DiGiovanni, Jesse Clifton, Nicolas Macé. 602-610 [doi]
- Hitchhiker's Guide to Patrolling: Path-Finding for Energy-Sharing Drone-UGV TeamsJonathan Diller, Qi Han 0001, Robert Byers, James Dotterweich, James Humann. 611-619 [doi]
- Learning Graph Representation of Agent DiffusersYoucef Djenouri, Nassim Belmecheri, Tomasz P. Michalak, Jan Dubinski, Ahmed Nabil Belbachir, Anis Yazidi. 620-629 [doi]
- Selecting Interlacing CommitteesChris Dong, Martin Bullinger, Tomasz Was, Larry Birnbaum, Edith Elkind. 630-638 [doi]
- Simulating and Evaluating Generative Modeling and Collaborative Filtering in Complex Social NetworksWen Dong, Fairul Mohd-Zaid. 639-648 [doi]
- Fast UCB-type Algorithms for Stochastic Bandits with Heavy and Super Heavy Symmetric NoiseYuriy Dorn, Aleksandr Katrutsa, Ilgam Latypov, Andrey Pudovikov. 649-657 [doi]
- Why Instant-Runoff Voting Is So Resilient to Coalitional Manipulation: Phase Transitions in the Perturbed CultureFrançois Durand. 658-666 [doi]
- Boosting Sortition via Proportional RepresentationSoroush Ebadian, Evi Micha. 667-675 [doi]
- Temporal Fair Division of Indivisible ItemsEdith Elkind, Alexander Lam, Mohamad Latifian, Tzeh Yuan Neoh, Nicholas Teh. 676-685 [doi]
- A Simple Integration of Epistemic Logic and Reinforcement LearningThorsten Engesser, Thibaut Le Marre, Emiliano Lorini, François Schwarzentruber, Bruno Zanuttini. 686-694 [doi]
- Mitigating Value Conflicts with Computational Theory of MindEmre Erdogan, Hüseyin Aydin, Frank Dignum, Rineke Verbrugge, Pinar Yolum. 695-703 [doi]
- Learning Real-Life Approval ElectionsPiotr Faliszewski, Lukasz Janeczko, Andrzej Kaczmarczyk 0001, Marcin Kurdziel, Grzegorz Pierczynski, Stanislaw Szufa. 704-712 [doi]
- FedRLHF: A Convergence-Guaranteed Federated Framework for Privacy-Preserving and Personalized RLHFFlint Xiaofeng Fan, Cheston Tan, Yew-Soon Ong, Roger Wattenhofer, Wei Tsang Ooi. 713-721 [doi]
- Automatic Verification of Linear Integer Planning Programs via Forgetting in LIAUPFLiangda Fang, Shikang Chen, Xiaoman Wang, Xiaoyou Lin, Chenyi Zhang 0001, Qingliang Chen, Quanlong Guan, Kaile Su. 722-730 [doi]
- Consistency Policy with Categorical Critic for Autonomous DrivingXing Fang, Qichao Zhang, Haoran Li 0010, Dongbin Zhao. 731-739 [doi]
- Translating Multi-Agent Modal Logics of Knowledge and Belief into Decidable First-Order FragmentsQihui Feng, Hannah Wilk, Shakil M. Khan 0001, Gerhard Lakemeyer. 740-748 [doi]
- Eliminating Majority IllusionFoivos Fioravantes, Abhiruk Lahiri, Antonio Lauerbach, Lluís Sabater, Marie Diana Sieper, Samuel Wolf. 749-757 [doi]
- On the Hardness of Fair Allocation under Ternary ValuationsZack Fitzsimmons, Vignesh Viswanathan, Yair Zick. 758-766 [doi]
- Non-obvious Manipulability in Hedonic Games with Friends Appreciation PreferencesMichele Flammini, Maria Fomenko, Giovanna Varricchio. 767-775 [doi]
- Higher-Order Belief in Incomplete Information MAIDsJack Foxabbott, Rohan Subramani, Francis Rhys Ward. 776-784 [doi]
- The Metric Distortion of Randomized Social Choice Functions: C1 Maximal Lottery Rules and SimulationsFabian Frank, Patrick Lederer. 785-793 [doi]
- Order Symmetry: A New Fairness Criterion for Assignment MechanismsRupert Freeman, Geoffrey Pritchard, Mark C. Wilson. 794-802 [doi]
- Learning Collusion in Episodic, Inventory-Constrained MarketsPaul Friedrich 0001, Barna Pásztor, Giorgia Ramponi. 803-812 [doi]
- Global Behavior of Learning Dynamics in Zero-Sum Games with Memory AsymmetryYuma Fujimoto, Kaito Ariu, Kenshi Abe. 813-819 [doi]
- Optimising Expectation with Guarantees for Window Mean Payoff in Markov Decision ProcessesPranshu Gaba, Shibashis Guha. 820-828 [doi]
- Changing the Rules of the Game: Reasoning About Dynamic Phenomena in Multi-Agent SystemsRustam Galimullin, Maksim Gladyshev, Munyque Mittelmann, Nima Motamed. 829-838 [doi]
- Fairly Allocating Goods in ParallelRohan Garg 0002, Alexandros Psomas 0001. 839-847 [doi]
- Voter Model Meets Rumour Spreading: A Study of Consensus Protocols on Graphs with Agnostic NodesMarcelo Matheus Gauy, Anna Abramishvili, Eduardo Colli, Tiago Madeira, Frederik Mallmann-Trenn, Vinícius Franco Vasconcelos, David Kohan Marzagão. 848-857 [doi]
- On Learning Informative Trajectory Embeddings for Imitation, Classification and RegressionZichang Ge, Changyu Chen, Arunesh Sinha, Pradeep Varakantham. 858-866 [doi]
- MOSMAC: A Multi-agent Reinforcement Learning Benchmark on Sequential Multi-Objective TasksMinghong Geng, Shubham Pateria, Budhitama Subagdja, Ah-Hwee Tan. 867-876 [doi]
- Certified Guidance for Planning with Deep Generative ModelsFrancesco Giacomarra, Mehran Hosseini, Nicola Paoletti, Francesca Cairoli. 877-885 [doi]
- Predictability Awareness for Efficient and Robust Multi-Agent CoordinationRoman Chiva Gil, Daniel Jarne Ornia, Khaled A. Mustafa, Javier Alonso-Mora. 886-894 [doi]
- Simplifying Imperfect Recall GamesHugo Gimbert, Soumyajit Paul, B. Srivathsan. 895-903 [doi]
- Policy Graphs and Intention: Answering 'Why' and 'How' from a Telic PerspectiveVictor Gimenez-Abalos, Sergio Álvarez-Napagao, Adrián Tormos, Ulises Cortés, Javier Vázquez-Salceda. 904-913 [doi]
- Approximating One-Sided and Two-Sided Nash Social Welfare With CapacitiesSalil Gokhale, Harshul Sagar, Rohit Vaish, Jatin Yadav. 914-922 [doi]
- Fairness and Optimality in RoutingSreenivas Gollapudi, Kostas Kollias, Alkmini Sgouritsa, Ali Kemal Sinop. 923-931 [doi]
- Extending Consensus-based Task Allocation Algorithms with Bid Intercession to Foster Mixed-InitiativeVictor Guillet, Charles Lesire, Gauthier Picard, Christophe Grand. 932-940 [doi]
- On the Power of Temporal Locality on Online Routing ProblemsSwapnil Guragain, Gokarna Sharma. 941-949 [doi]
- Coherence-Driven Multimodal Safety Dialogue with Active Learning for Embodied AgentsSabit Hassan, Hye-Young Chung, Xiang Zhi Tan, Malihe Alikhani. 950-959 [doi]
- Tackling Uncertainties in Multi-Agent Reinforcement Learning through Integration of Agent Termination DynamicsSomnath Hazra, Pallab Dasgupta, Soumyajit Dey. 960-968 [doi]
- Learning in Games with Progressive HidingBenjamin Heymann, Marc Lanctot. 969-977 [doi]
- LTL Verification of Memoryful Neural AgentsMehran Hosseini, Alessio Lomuscio, Nicola Paoletti. 978-987 [doi]
- Automating Curriculum Learning for Reinforcement Learning using a Skill-Based Bayesian NetworkVincent Hsiao, Mark Roberts, Laura M. Hiatt, George Dimitri Konidaris, Dana S. Nau. 988-996 [doi]
- PMAT: Optimizing Action Generation Order in Multi-Agent Reinforcement LearningKun Hu, Muning Wen, Xihuai Wang, Shao Zhang, Yiwei Shi, Minne Li, Minglong Li, Ying Wen 0001. 997-1005 [doi]
- Truthful Mechanisms for Linear Bandit Games with Private ContextsYiting Hu, Lingjie Duan. 1006-1014 [doi]
- CAMP: Collaborative Attention Model with Profiles for Vehicle Routing ProblemsChuanbo Hua, Federico Berto, Jiwoo Son, Seunghyun Kang, Changhyun Kwon 0001, Jinkyoo Park. 1015-1024 [doi]
- Human-Aligned Skill Discovery: Balancing Behaviour Exploration and AlignmentMaxence Hussonnois, Thommen George Karimpanal, Santu Rana. 1025-1033 [doi]
- Responsible Uplift ModelingLihi Idan, Ming Li. 1034-1041 [doi]
- Taming Multi-Agent Reinforcement Learning with Estimator Variance ReductionTaher Jafferjee, Juliusz Ziomek, Tianpei Yang, Zipeng Dai, Jianhong Wang, Matthew E. Taylor, Kun Shao, Jun Wang 0012, David Mguni. 1042-1050 [doi]
- Probabilistic Timed ATLWojciech Jamroga, Marta Kwiatkowska, Wojciech Penczek, Laure Petrucci, Teofil Sidoruk. 1051-1059 [doi]
- Tackling Sparsity in Designated Driver Dispatch with Multi-Agent Reinforcement LearningJiaxuan Jiang, Ling Pan, Lin Zhou, Longbo Huang, Zhixuan Fang. 1060-1069 [doi]
- Full Proportional Justified RepresentationYusuf Hakan Kalayci, Jiasen Liu, David Kempe 0001. 1070-1078 [doi]
- A View of the Certainty-Equivalence Method for PAC RL as an Application of the Trajectory Tree MethodShivaram Kalyanakrishnan, Sheel Shah, Santhosh Kumar Guguloth. 1079-1087 [doi]
- Game of Thoughts: Iterative Reasoning in Game-Theoretic Domains with Large Language ModelsBenjamin Kempinski, Ian Gemp, Kate Larson, Marc Lanctot, Yoram Bachrach, Tal Kachman. 1088-1097 [doi]
- Causes and Strategies in Multiagent SystemsSylvia S. Kerkhove, Natasha Alechina, Mehdi Dastani. 1098-1106 [doi]
- GUIDE-CoT: Goal-driven and User-Informed Dynamic Estimation for Pedestrian Trajectory using Chain-of-ThoughtSungsik Kim, Janghyun Baek, Jinkyu Kim, Jaekoo Lee. 1107-1116 [doi]
- Practical Abstractions for Model Checking Continuous-Time Multi-Agent SystemsYan Kim, Wojciech Jamroga, Wojciech Penczek, Laure Petrucci. 1117-1126 [doi]
- k-ApprovalVeto: A Spectrum of Voting Rules Balancing Metric Distortion and Minority ProtectionFatih Erdem Kizilkaya, David Kempe 0001. 1127-1135 [doi]
- Robustness of Epistemic Gossip Protocols Against Data LossYoshikatsu Kobayashi, Koji Hasebe. 1136-1144 [doi]
- Ranking Joint Policies in Dynamic Games using Evolutionary DynamicsNatalia Koliou, George A. Vouros. 1145-1153 [doi]
- Uncertain Machine Ethics PlanningSimon Kolker, Louise A. Dennis, Ramon Fraga Pereira, Mengwei Xu. 1154-1162 [doi]
- Policy Abstraction and Nash Refinement in Tree-Exploiting PSROChristine Konicki, Mithun Chakraborty, Michael P. Wellman. 1163-1171 [doi]
- Free Argumentative Exchanges for Explaining Image ClassifiersAvinash Kori, Antonio Rago 0001, Francesca Toni. 1172-1180 [doi]
- Offline Multi-Agent Preference-based Reinforcement Learning with Agent-aware Direct Preference OptimizationQian Kou, Mingyang Li, Zeyang Liu 0001, Long Qian, Zhuoran Chen, Lipeng Wan 0003, Xingyu Chen, Xuguang Lan. 1181-1190 [doi]
- Game Theory with Simulation in the Presence of Unpredictable RandomisationVojtech Kovarík, Nathaniel Sauerberg, Lewis Hammond, Vincent Conitzer. 1191-1199 [doi]
- Tighter Value-Function Approximations for POMDPsMerlijn Krale, Wietze Koops, Sebastian Junges, Thiago D. Simão, Nils Jansen 0001. 1200-1208 [doi]
- The Bakers and Millers Game with Restricted LocationsSimon Krogmann, Pascal Lenzner, Alexander Skopalik. 1209-1217 [doi]
- Near-Linear Time Leader Election in Multiagent NetworksAjay D. Kshemkalyani, Manish Kumar, Anisur Rahaman Molla, Gokarna Sharma. 1218-1226 [doi]
- Dynamic Coalition Structure Detection in Natural-Language-based InteractionsAbhishek Ninad Kulkarni, Andy Liu, Jean-Raphaël Gaglione, Daniel Fried, Ufuk Topcu. 1227-1234 [doi]
- Emergence of Recursive Language through Bootstrapping and Iterated LearningVikas Kumar, Ajin George Joseph. 1235-1243 [doi]
- AdaCred: Adaptive Causal Decision Transformers with Feature CreditingHemant Kumawat, Saibal Mukhopadhyay. 1244-1252 [doi]
- Soft Condorcet Optimization for Ranking of General AgentsMarc Lanctot, Kate Larson, Michael Kaisers, Quentin Berthet, Ian Gemp, Manfred Diaz, Roberto-Rafael Maura-Rivero, Yoram Bachrach, Anna Koop, Doina Precup. 1253-1262 [doi]
- MacLight: Multi-scene Aggregation Convolutional Learning for Traffic Signal ControlSunbowen Lee, Hongqin Lyu, Yicheng Gong, Yingying Sun, Chao Deng. 1263-1271 [doi]
- Timed Obstruction Logic: A Timed Approach to Dynamic Game ReasoningJean Leneutre, Vadim Malvone, James Ortiz 0001. 1272-1281 [doi]
- Curiosity-Driven Partner Selection Accelerates Convention Emergence in Language GamesChin-wing Leung, Paolo Turrini, Ann Nowé. 1282-1290 [doi]
- Self-Supervised Multi-Agent Diversity with Nonparametric Entropy MaximizationTianxu Li, Kun Zhu 0001. 1291-1299 [doi]
- OGS-SLAM: Hybrid ORB-Gaussian Splatting SLAMXiaohan Li, Wenxiang Shen, Dong Liu, Jun Wu. 1300-1308 [doi]
- Rational Capability in Concurrent GamesYinfeng Li, Emiliano Lorini, Munyque Mittelmann. 1309-1317 [doi]
- Nucleolus Credit Assignment for Effective Coalitions in Multi-agent Reinforcement LearningYugu Li, Zehong Cao, Jianglin Qiao, Siyi Hu. 1318-1326 [doi]
- Dynamic Sight Range Selection in Multi-Agent Reinforcement LearningWei-Chen Liao, Ti-Rong Wu, I-Chen Wu. 1327-1335 [doi]
- Adaptive Bi-Level Multi-Robot Task Allocation and Learning under Uncertainty with Temporal Logic ConstraintsXiaoshan Lin, Roberto Tron. 1336-1344 [doi]
- Reinforcement Learning-based Approach for Vehicle-to-Building Charging with Heterogeneous Agents and Long Term RewardsFangqi Liu, Rishav Sen, Jose Paolo Talusan, Ava Pettet, Aaron Kandel, Yoshinori Suzue, Ayan Mukhopadhyay, Abhishek Dubey. 1345-1353 [doi]
- Efficient and Optimal Policy Gradient Algorithm for Corrupted Multi-armed BanditsJiayuan Liu, Siwei Wang, Zhixuan Fang. 1354-1361 [doi]
- Teamwork Makes the Defense Work: Comprehensive Vulnerability Defense Resource AllocationSiyu Liu, Rida A. Bazzi, Fei Fang, Tiffany Bao. 1362-1370 [doi]
- Games in Public Announcement: How to Reduce System Losses in Optimistic Blockchain MechanismsSiyuan Liu, Yulong Zeng. 1371-1379 [doi]
- Data Pricing for Graph Neural Networks without Pre-purchased InspectionYiping Liu, Mengxiao Zhang, Jiamou Liu, Song Yang 0001. 1380-1388 [doi]
- Leveraging Score-based Models for Generating Penalization in Model-based Offline Reinforcement LearningZeyuan Liu, Zhirui Fang, Jiafei Lyu, Xiu Li 0001. 1389-1398 [doi]
- MAGNET: A Multi-Agent Graph Neural Network for Efficient Bipartite Task AssignmentDonald Loveland, James Usevitch, Zachary Serlin, Danai Koutra, Rajmonda Caceres. 1399-1407 [doi]
- Multi-Ship Future Interaction Trajectory Prediction via Pre-Initializer Diffusion ModelKun Ma, Qilong Han, Jingzheng Yao. 1408-1417 [doi]
- Minimizing Makespan with Conflict-Based Search for Optimal Multi-Agent Path FindingAmir Maliah, Dor Atzmon, Ariel Felner. 1418-1426 [doi]
- Beyond Goal Recognition: A Reinforcement Learning-based Approach to Inferring Agent BehaviourSheryl Mantik, Michael Dann, Minyi Li 0001, Huong Ha 0001, Julie Porteous. 1427-1435 [doi]
- Multi-agent Multi-armed Bandits with Minimum Reward Guarantee FairnessPiyushi Manupriya, Himanshu, Saketha Nath Jagarlapudi, Ganesh Ghalme. 1436-1444 [doi]
- On Stateful Value Factorization in Multi-Agent Reinforcement LearningEnrico Marchesini, Andrea Baisero, Rupali Bhati, Christopher Amato. 1445-1453 [doi]
- ApproxED: Approximate Exploitability Descent via Learned Best ResponsesCarlos Martin 0001, Tuomas Sandholm. 1454-1463 [doi]
- Improving Policy Optimization via ε-RetrainLuca Marzari, Priya L. Donti, Changliu Liu, Enrico Marchesini. 1464-1472 [doi]
- Discovery and Deployment of Emergent Robot Swarm Behaviors via Representation Learning and Real2Sim2Real TransferConnor Mattson, Varun Raveendra, Ricardo Vega, Cameron Nowzari, Daniel S. Drew, Daniel S. Brown. 1473-1482 [doi]
- Generalised BDI PlanningFelipe Meneguzzi, Ramon Fraga Pereira, Nir Oren. 1483-1491 [doi]
- Multi-agent Reinforcement Learning in the All-or-Nothing Public Goods game on NetworksBenedikt Valentin Meylahn. 1492-1500 [doi]
- Leveraging Large Language Models for Effective and Explainable Multi-Agent Credit AssignmentKartik Nagpal, Dayi Dong, Negar Mehr. 1501-1510 [doi]
- Explaining Facial Expression RecognitionSanjeev Nahulanthran, Leimin Tian, Dana Kulic, Mor Vered. 1511-1519 [doi]
- Evaluation-Time Policy Switching for Offline Reinforcement LearningNatinael Solomon Neggatu, Jeremie Houssineau, Giovanni Montana. 1520-1528 [doi]
- Resource Task GamesJessica L. Newman, Enrico H. Gerding, Enrico Marchioni, Baharak Rastegari. 1529-1537 [doi]
- Personality-Driven Decision Making in LLM-Based Autonomous AgentsLewis Newsham, Daniel Prince. 1538-1547 [doi]
- Contrastive Explainable Clustering with Differential PrivacyDung Nguyen 0002, Ariel Vetzler, Sarit Kraus, Anil Vullikanti. 1548-1556 [doi]
- DUPRE: Data Utility Prediction for Efficient Data ValuationKieu Thao Nguyen Pham, Rachael Hwee Ling Sim, Quoc Phong Nguyen, See-Kiong Ng, Bryan Kian Hsiang Low. 1557-1565 [doi]
- Counterfactual Explanations for Model Ensembles Using Entropic Risk MeasuresErfaun Noorani, Pasan Dissanayake, Faisal Hamman, Sanghamitra Dutta. 1566-1575 [doi]
- Conformal Set-based Human-AI Complementarity with Multiple ExpertsHelbert Paat, Guohao Shen. 1576-1585 [doi]
- Together We Rise: Optimizing Real-Time Multi-Robot Task Allocation using Coordinated Heterogeneous PlaysAritra Pal, Anandsingh Chauhan, Mayank Baranwal. 1586-1594 [doi]
- Smooth Information Gathering in Two-Player Noncooperative GamesFernando Palafox, Jesse Milzman, Dong-Ho Lee, Ryan Park, David Fridovich-Keil. 1595-1603 [doi]
- Hierarchical Learning-based Graph Partition for Large-scale Vehicle Routing ProblemsYuxin Pan, Ruohong Liu, Yize Chen, Zhiguang Cao, Fangzhen Lin. 1604-1612 [doi]
- An Extended Benchmarking of Multi-Agent Reinforcement Learning Algorithms in Complex Fully Cooperative TasksGeorge Papadopoulos 0006, Andreas Kontogiannis, Foteini Papadopoulou, Chaido Poulianou, Ioannis Koumentis, George A. Vouros. 1613-1622 [doi]
- Enhancing Graph-based Coordination with Evolutionary Algorithms for Episodic Multi-agent Reinforcement LearningKexing Peng, Pengyi Li 0001, Jianye Hao. 1623-1631 [doi]
- Multi-objective Reinforcement Learning with Nonlinear Preferences: Provable Approximation for Maximizing Expected Scalarized ReturnNianli Peng, Muhang Tian, Brandon Fain. 1632-1640 [doi]
- ShipNaviSim: Data-Driven Simulation for Real-World Maritime NavigationQuang Anh Pham, Janaka Chathuranga Brahmanage, Akshat Kumar. 1641-1649 [doi]
- Artificial Agents Mitigate The Punishment Dilemma Of Indirect ReciprocityAlexandre S. Pires, Fernando P. Santos. 1650-1659 [doi]
- Anytime Fairness Guarantees in Stochastic Combinatorial MABs: A Novel Learning FrameworkSubham Pokhriyal, Shweta Jain 0002, Ganesh Ghalme, Vaneet Aggarwal. 1660-1669 [doi]
- Indifferential Privacy: A New Paradigm and Its Applications to Optimal Matching in Dark Pool AuctionsAntigoni Polychroniadou, T.-H. Hubert Chan, Adya Agrawal. 1670-1678 [doi]
- EconoJax: A Fast & Scalable Economic Simulation in JAXKoen Ponse, Aske Plaat, Niki van Stein, Thomas M. Moerland. 1679-1687 [doi]
- Decentralized Planning Using Probabilistic HyperpropertiesFrancesco Pontiggia, Filip Macák, Roman Andriushchenko, Michele Chiari, Milan Ceska 0002. 1688-1697 [doi]
- Uncertainty Expression for Human-Robot Task CommunicationDavid Porfirio, Mark Roberts, Laura M. Hiatt. 1698-1707 [doi]
- Combining Planning and Reinforcement Learning for Solving Relational Multiagent DomainsNikhilesh Prabhakar, Ranveer Singh, Harsha Kokel, Sriraam Natarajan, Prasad Tadepalli. 1708-1717 [doi]
- Reinforcement Learning Based Simulated AnnealingNathan Qiu, Daniel Liang. 1718-1726 [doi]
- Planning, Scheduling, and Execution on the Moon: The CADRE Technology Demonstration MissionGregg R. Rabideau, Joseph A. Russino, Andrew Branch, Nihal Dhamani, Tiago Stegun Vaquero, Steve A. Chien, Jean-Pierre de la Croix, Federico Rossi 0001. 1727-1735 [doi]
- Reputation-Filtered Reward Reshaping: Encouraging Cooperation in High Dimensional Semi-Cooperative Multi-agent SettingsHassan Raissouni, Wissal Bekhti, Btissam El Khamlichi, Amal El Fallah-Seghrouchni. 1736-1744 [doi]
- Bottom-Up Reputation Promotes Cooperation with Multi-Agent Reinforcement LearningTianyu Ren, Xuan Yao, Yang Li, Xiao-Jun Zeng. 1745-1754 [doi]
- The Effect of Agent-based Feedback on Prosociality in Social DilemmasJennifer Renoux, Filipa Correia, Joana Campos 0001, Lucas Morillo-Mendez, Neziha Akalin, Fernando P. Santos, Ana Paiva 0001. 1755-1763 [doi]
- Real-World Testing Matters in Reinforcement Learning for EducationAnna Riedmann, Carlo D'Eramo, Birgit Lugrin. 1764-1773 [doi]
- Divide and Conquer: Provably Unveiling the Pareto Front with Multi-Objective Reinforcement LearningWillem Röpke, Mathieu Reymond, Patrick Mannion, Diederik M. Roijers, Ann Nowé, Roxana Radulescu. 1774-1783 [doi]
- On Some Fundamental Problems for Multi-Agent Systems Over Multilayer NetworksDaniel J. Rosenkrantz, Madhav V. Marathe, Zirou Qiu, S. S. Ravi, Richard Edwin Stearns. 1784-1792 [doi]
- Factorised Active Inference for Strategic Multi-Agent InteractionsJaime Ruiz-Serra, Patrick Sweeney, Michael S. Harré. 1793-1802 [doi]
- Multi-Objective Planning with Contextual Lexicographic Reward PreferencesPulkit Rustagi, Yashwanthi Anand, Sandhya Saisubramanian. 1803-1811 [doi]
- Gricean Norms as a Basis for Effective CollaborationFardin Saad, Pradeep K. Murukannaiah, Munindar P. Singh. 1812-1820 [doi]
- Surprise! Surprise! Learn and AdaptHuma Samin, Dylan J. Walton, Nelly Bencomo. 1821-1829 [doi]
- Training Language Models for Social Deduction with Multi-Agent Reinforcement LearningBidipta Sarkar, Warren Xia, C. Karen Liu, Dorsa Sadigh. 1830-1839 [doi]
- Formalising Overdetermination in a Labelled Transition SystemCamilo Sarmiento, Gauvain Bourgne, Jean-Gabriel Ganascia. 1840-1848 [doi]
- Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement LearningLukas Schäfer 0001, Oliver Slumbers, Stephen McAleer, Yali Du 0001, Stefano V. Albrecht, David Mguni. 1849-1857 [doi]
- Candidate Nomination for Condorcet-consistent Voting RulesIldikó Schlotter, Katarína Cechlárová. 1858-1866 [doi]
- The Strong Core of Housing Markets with Partial Order PreferencesIldikó Schlotter, Lydia Mirabel Mendoza Cadena. 1867-1875 [doi]
- Socratic: Enhancing Human Teamwork via AI-enabled CoachingSangwon Seo, Bing Han, Rayan Ebnali Harari, Roger D. Dias, Marco A. Zenati, Eduardo Salas, Vaibhav V. Unhelkar. 1876-1885 [doi]
- Hierarchical Imitation Learning of Team Behavior from Heterogeneous DemonstrationsSangwon Seo, Vaibhav V. Unhelkar. 1886-1894 [doi]
- Towards Efficient Online Goal Recognition through Deep LearningLorenzo Serina, Mattia Chiari, Alfonso Emilio Gerevini, Luca Putelli, Ivan Serina. 1895-1903 [doi]
- Learning Symbolic Task Decompositions for Multi-Agent TeamsAmeesh Shah, Niklas Lauffer, Thomas Chen, Nikhil Pitta, Sanjit A. Seshia. 1904-1913 [doi]
- Learning with Limited Shared Information in Multi-agent Multi-armed BanditJunning Shao, Siwei Wang, Zhixuan Fang. 1914-1922 [doi]
- Incentivizing Truth Exploration and Honest Reporting: A Contract Design ApproachYuming Shao, Zhixuan Fang. 1923-1931 [doi]
- xSRL: Safety-Aware Explainable Reinforcement Learning - Safety as a Product of ExplainabilityRisal Shahriar Shefin, Md Asifur Rahman, Thai Le, Sarra Alqahtani. 1932-1940 [doi]
- Modeling the Centaur: Human-Machine Synergy in Sequential Decision MakingDavid Shoresh, Yonatan Loewenstein. 1941-1949 [doi]
- Tackling Temporal Deontic Challenges with Equilibrium LogicDavide Soldà, Pedro Cabalar, Agata Ciabattoni, Emery A. Neufeld. 1950-1958 [doi]
- Housing Market on NetworksXinwei Song, Tianyi Yang, Dengji Zhao. 1959-1967 [doi]
- An Organizationally-Oriented Approach to Enhancing Explainability and Control in Multi-Agent Reinforcement LearningJulien Soulé, Jean-Paul Jamont, Michel Occello, Louis-Marie Traonouez, Paul Théron. 1968-1976 [doi]
- Revisiting Communication Efficiency in Multi-Agent Reinforcement Learning from the Dimensional Analysis PerspectiveChuxiong Sun, Peng He, Rui Wang 0079, Changwen Zheng. 1977-1986 [doi]
- Salience-Invariant Consistent Policy Learning for Generalization in Visual Reinforcement LearningJingbo Sun, Songjun Tu, Qichao Zhang, Ke Chen, Dongbin Zhao. 1987-1995 [doi]
- The Many Challenges of Human-Like Agents in Virtual Game EnvironmentsMaciej Swiechowski, Dominik Slezak. 1996-2005 [doi]
- Value Iteration for Learning Concurrently Executable Robotic Control TasksSheikh A. Tahmid, Gennaro Notomista. 2006-2014 [doi]
- Ready, Bid, Go! On-Demand Delivery Using Fleets of Drones with Unknown, Heterogeneous Energy Storage ConstraintsMohamed S. Talamali, Genki Miyauchi, Thomas Watteyne, Micael S. Couceiro, Roderich Groß. 2015-2023 [doi]
- The Degree of (Extended) Justified Representation and Its OptimizationBiaoshuai Tao, Chengkai Zhang, Houyu Zhou. 2024-2032 [doi]
- Logic of Knowledge and Cognitive AbilityJia Tao, Xinran Zhang. 2033-2041 [doi]
- EduQate: Generating Adaptive Curricula through RMABs in Education SettingsSidney Tio, Dexun Li, Pradeep Varakantham. 2042-2050 [doi]
- Large Language Models for Virtual Human Gesture SelectionParisa Ghanad Torshizi, Laura B. Hensel, Ari Shapiro, Stacy C. Marsella. 2051-2059 [doi]
- Conditional Max-Sum for Asynchronous Multiagent Decision MakingDimitrios Troullinos, Georgios Chalkiadakis, Ioannis Papamichail, Markos Papageorgiou. 2060-2068 [doi]
- Online Preference-based Reinforcement Learning with Self-augmented Feedback from Large Language ModelSongjun Tu, Jingbo Sun, Qichao Zhang, Xiangyuan Lan, Dongbin Zhao. 2069-2077 [doi]
- Maximizing Truth Learning in a Social Network is NP-hardFilip Úradník, Amanda Wang, Jie Gao. 2078-2086 [doi]
- Networked Agents in the Dark: Team Value Learning under Partial ObservabilityGuilherme S. Varela, Alberto Sardinha, Francisco S. Melo. 2087-2095 [doi]
- HAVA: Hybrid Approach to Value-Alignment through Reward Weighing for Reinforcement LearningKryspin Varys, Federico Cerutti 0001, Adam J. Sobey, Timothy J. Norman. 2096-2104 [doi]
- A Minimax Approach to Ad Hoc TeamworkVictor Villin, Thomas Kleine Buening, Christos Dimitrakakis. 2105-2114 [doi]
- Implicit Repair with Reinforcement Learning in Emergent CommunicationFábio Vital, Alberto Sardinha, Francisco S. Melo. 2115-2124 [doi]
- FLIGHT: Facility Location Integrating Generalized, Holistic Theory of WelfareAvyukta Manjunatha Vummintala, Shivam Gupta 0004, Shweta Jain 0002, Sujit Gujar. 2125-2133 [doi]
- InCLET: Large Language Model In-context Learning can Improve Embodied Instruction-followingPeng-Yuan Wang, Jing-Cheng Pang, Chenyang Wang, Xu-Hui Liu, Tian-Shuo Liu, Si-Hang Yang, Hong Qian, Yang Yu. 2134-2142 [doi]
- On Diffusion Models for Multi-Agent Partial Observability: Shared Attractors, Error Bounds, and Composite FlowTonghan Wang 0001, Heng Dong 0001, Yanchen Jiang, David C. Parkes, Milind Tambe. 2143-2152 [doi]
- ReSCOM: Reward-Shaped Curriculum for Efficient Multi-Agent Communication LearningXinghai Wei, Tingting Yuan 0001, Jie Yuan 0001, Dongxiao Liu, Xiaoming Fu 0001. 2153-2161 [doi]
- Goal Recognition via Variational CausalityJiaqi Wen, Leonardo Amado. 2162-2170 [doi]
- A Scoresheet for Explainable AIMichael Winikoff, John Thangarajah, Sebastian Rodriguez. 2171-2180 [doi]
- FGLight: Learning Neighbor-level Information for Traffic Signal ControlHang Xiao, Huale Li, Shuhan Qi, Jiajia Zhang 0001, Dingzhong Cai. 2181-2189 [doi]
- ACORN: Acyclic Coordination with Reachability Network to Reduce Communication Redundancy in Multi-Agent SystemsYi Xie, Ziqing Zhou, Chun Ouyang 0002, Siao Liu, Linqiang Hu, Zhongxue Gan. 2190-2198 [doi]
- Finite-Horizon Single-Pull Restless Bandits: An Efficient Index Policy For Scarce Resource AllocationGuojun Xiong, Haichuan Wang, Yuqi Pan, Saptarshi Mandal, Sanket Shah, Niclas Boehmer, Milind Tambe. 2199-2207 [doi]
- On the Effective Horizon of Inverse Reinforcement LearningYiqing Xu, Finale Doshi-Velez, David Hsu. 2208-2216 [doi]
- Uncertainty-Aware Opponent Modeling for Deep Reinforcement LearningLikun Yang, Pei Xu, Shiyue Cao, Yongjian Ren, Xiaotang Chen, Kaiqi Huang. 2217-2225 [doi]
- Dual Ensembled Multiagent Q-Learning with Hypernet RegularizerYaodong Yang 0002, Guangyong Chen, Hongyao Tang, Furui Liu, Danruo Deng, Pheng-Ann Heng. 2226-2234 [doi]
- Self-Interpretable Reinforcement Learning via Rule EnsemblesYue Yang, Fan Yang, Yu Bai, Hao Wang. 2235-2243 [doi]
- Asymptotic Existence of Class Envy-free MatchingsTomohiko Yokoyama, Ayumi Igarashi 0001. 2244-2252 [doi]
- Adaptive Episode Length Adjustment for Multi-agent Reinforcement LearningByunghyun Yoo, Younghwan Shin, Hyunwoo Kim, Euisok Chung, Jeongmin Yang. 2253-2261 [doi]
- Task-Agnostic Contrastive pre-Training for Inter-Agent CommunicationPeihong Yu, Manav Mishra, Syed Zaidi, Pratap Tokekar. 2262-2270 [doi]
- Imitation from Diverse Behaviors: Wasserstein Quality Diversity Imitation Learning with Single-Step Archive ExplorationXingrui Yu, Zhenglin Wan, David Mark Bossens, Yueming Lyu, Qing Guo 0005, Ivor W. Tsang. 2271-2280 [doi]
- Insights Regarding the Success of Damping in Improving Belief PropagationUriel Zaed, Omer Lev, Roie Zivan. 2281-2289 [doi]
- Enhancing Sub-Optimal Trajectory Stitching: Spatial Composition RvS for Offline RLSheng Zang, Zhiguang Cao, Bo An 0001, Senthilnath Jayavelu, Xiaoli Li 0001. 2290-2298 [doi]
- Loss of Plasticity: A New Perspective on Solving Multi-Agent Exploration for Sparse Reward TasksZehua Zang, Chuxiong Sun, Lixiang Liu, Fuchun Sun 0001, Changwen Zheng. 2299-2308 [doi]
- On the Structure of EFX Orientations on GraphsJinghan A. Zeng, Ruta Mehta. 2309-2316 [doi]
- β-DQN: Improving Deep Q-Learning By Evolving the BehaviorHongming Zhang 0003, Fengshuo Bai, Chenjun Xiao, Chao Gao, Bo Xu 0002, Martin Müller 0003. 2317-2326 [doi]
- Incentives for Early Arrival in Cost SharingJunyu Zhang, Yao Zhang, Yaoxin Ge, Dengji Zhao, Hu Fu 0001, Zhihao Gavin Tang, Pinyan Lu. 2327-2335 [doi]
- Offline Goal-Conditioned Reinforcement Learning with Elastic-Subgoal Diffused Policy LearningYaocheng Zhang, Yuanheng Zhu, Yuqian Fu, Songjun Tu, Dongbin Zhao. 2336-2344 [doi]
- Unveiling Decision Intention for Cooperative Multi-Agent Reinforcement LearningZeren Zhang, Zhiwei Xu 0005, Guangchong Zhou, Dapeng Li 0001, Bin Zhang 0052, Guoliang Fan. 2345-2354 [doi]
- Agent-Based Analysis of Green Disclosure Policies and Their Market-Wide Impact on Firm BehaviorLingxiao Zhao, Maria Polukarov, Carmine Ventre. 2355-2363 [doi]
- Mean Field Correlated Imitation LearningZhiyu Zhao, Chengdong Ma, Qirui Mi, Ning Yang 0005, Xue Yan, Mengyue Yang, Haifeng Zhang, Jun Wang 0012, Yaodong Yang 0001. 2364-2372 [doi]
- Offline-to-Online Multi-Agent Reinforcement Learning with Offline Value Function Memory and Sequential ExplorationHai Zhong, Xun Wang 0013, Zhuoran Li, Longbo Huang. 2373-2381 [doi]
- Single-Agent Planning in a Multi-Agent System: A Unified Framework for Type-Based PlannersFengming Zhu, Fangzhen Lin. 2382-2391 [doi]
- Robust Policy Learning for Multi-UAV Collision Avoidance with Causal Feature SelectionJiafan Zhuang, Gaofei Han, Zihao Xia, Che Lin, Boxi Wang, Dongliang Wang, Wenji Li, Zhifeng Hao, Ruichu Cai, Zhun Fan. 2392-2401 [doi]
- Decision-Making in Evolving Environments: A Bayesian Multi-Agent Bandit FrameworkMohammad Essa Alsomali, Leandro Soriano Marcolino, Barry Porter, Roberto Rodrigues Filho. 2402-2404 [doi]
- Combining LLMs with a Logic-Based Framework to Explain MCTSZiyan An, Xia Wang, Hendrik Baier, Zirong Chen, Abhishek Dubey, Taylor T. Johnson, Jonathan Sprinkle, Ayan Mukhopadhyay, Meiyi Ma. 2405-2407 [doi]
- Adaptive Multi-Round Influence Maximization with Limited InformationVincenzo Auletta, Francesco Carbone, Diodato Ferraioli, Cosimo Vinci. 2408-2410 [doi]
- Safe Entropic Agents under Team ConstraintsAyhan Alp Aydeniz, Enrico Marchesini, Robert Loftin, Christopher Amato, Kagan Tumer. 2411-2413 [doi]
- Group Fairness in Multi-period Mobile Facility Location ProblemsHaris Aziz 0001, Hau Chan, Xingchen Sha, Toby Walsh, Lirong Xia. 2414-2416 [doi]
- Weighted Envy-free Allocation with SubsidyHaris Aziz 0001, Xin Huang, Kei Kimura, Indrajit Saha, Zhaohong Sun 0001, Mashbat Suzuki, Makoto Yokoo. 2417-2419 [doi]
- Neighborhood Stability in Assignments on GraphsHaris Aziz 0001, Grzegorz Lisowski, Mashbat Suzuki, Jeremy Vollen. 2420-2422 [doi]
- On the Distortion of Multi-Winner Elections on the Line MetricNegar Babashah, Hasti Karimi, Masoud Seddighin, Golnoosh Shahkarami. 2423-2425 [doi]
- Interaction Protocols in an Imperative Agent-Oriented Programming Language: the case of BSPL and SARLMatteo Baldoni, Cristina Baroglio, Stéphane Galland, Roberto Micalizio, Fatma Outay, Stefano Tedeschi 0001. 2426-2427 [doi]
- Multi-Agent Pickup and Delivery with BatteriesMarcello Bavaro, Francesco Amigoni. 2428-2430 [doi]
- Efficient Multi-Agent Delegated SearchCurtis Bechtel, Shaddin Dughmi. 2431-2433 [doi]
- Bridging the Gap between Partially Observable Stochastic Games and Sparse POMDP MethodsTyler J. Becker, Zachary Sunberg. 2434-2436 [doi]
- Robust Strategies for Stochastic Multi-Agent SystemsRaphaël Berthon, Joost-Pieter Katoen, Munyque Mittelmann, Aniello Murano. 2437-2439 [doi]
- Multiplayer Games With Incomplete Information for Hyperproperty VerificationRaven Beutner, Bernd Finkbeiner. 2440-2442 [doi]
- Planning for Temporally Extended Goals based on alpha-CTLViviane Bonadia dos Santos, Leliane Nunes de Barros, Maria Viviane de Menezes, Silvio do Lago Pereira. 2443-2445 [doi]
- Formal Verification of Manipulation DialoguesAndreas Brännström, Chiaki Sakama, Juan Carlos Nieves. 2446-2448 [doi]
- (Submodular) Hedonic Games with Common Ranking PropertyBugra Çaskurlu, Ali Eser. 2449-2451 [doi]
- Agreement Games in Multi-Agent SystemsDavide Catta, Angelo Ferrando 0001, Vadim Malvone. 2452-2454 [doi]
- The Costly Bargain: Economic Impacts of Price-Seeking Behavior in Aging PopulationsFuguang Chen, Alan Tsang. 2455-2456 [doi]
- Dynamic Conservative Degree Allocation for Offline Multi-Agent Reinforcement LearningHaosheng Chen, Yun Hua, Junjie Sheng, Wenhao Li 0001, Bo Jin 0003, Xiangfeng Wang. 2457-2459 [doi]
- Hierarchical Multi-Agent Framework for Dynamic Macroeconomic Modelling Using Large Language ModelsZhixun Chen, Zijing Shi, Yaodong Yang 0001, Meng Fang, Yali Du 0001. 2460-2462 [doi]
- Traffic Anomaly Detection through Generative Modeling of Multi-Agent Interactions in Traffic FlowZhuojun Chen, Tacitus Hui, Xinghua Zhu, Dongzhe Su. 2463-2465 [doi]
- Optimal Mechanism Design for Crowdfunding of Public GoodsYukun Cheng, Xiaotie Deng, Baqiao Quan. 2466-2468 [doi]
- Fairness in Cooperative Multi-agent Multi-objective Reinforcement Learning using the Expected Scalarized ReturnFarès Chouaki, Aurélie Beynier, Nicolas Maudet, Paolo Viappiani. 2469-2471 [doi]
- Open-World Classification with Bayesian Gaussian Mixture ModelsJustin Clarke, Przemyslaw Grabowicz, David D. Jensen. 2472-2474 [doi]
- Egalitarianism in Online Coalition FormationSaar Cohen 0001, Noa Agmon. 2475-2477 [doi]
- Resolving Multiple-Dynamic Model Uncertainty in Hypothesis-Driven Belief-MDPsOfer Dagan, Tyler J. Becker, Zachary N. Sunberg. 2478-2480 [doi]
- Multi-Agent Reinforcement Learning with Selective State-Space ModelsJemma Daniel, Ruan John de Kock, Louay Ben Nessir, Sasha Abramowitz, Omayma Mahjoub, Wiem Khlifi, Juan Claude Formanek, Arnu Pretorius. 2481-2483 [doi]
- Voter Participation Control in Online PollsKoustav De, Palash Dey, Swagato Sanyal. 2484-2486 [doi]
- f SynthesisGiuseppe De Giacomo, Yves Lespérance, Gianmarco Parretti, Fabio Patrizi, Renzo Schram. 2487-2489 [doi]
- Is an Exponentially Growing Action Space Really that Bad? Validating a Core Assumption for using Multi-Agent RLRuan de Kock, Arnu Pretorius, Jonathan P. Shock. 2490-2492 [doi]
- Symplex: Learning Social Norm Hierarchies by Combining Autonomous Exploration and Expert ImitationOliver Deane, Oliver Ray. 2493-2495 [doi]
- Asynchronous Cooperative Multi-Agent Reinforcement Learning with Limited CommunicationSydney Dolan, Siddharth Nayak, Jasmine Jerry Aloor, Hamsa Balakrishnan. 2496-2498 [doi]
- Parameterized Complexity of Hedonic Games with Enemy-Oriented PreferencesMartin Durand, Laurin Erlacher, Johanne Müller Vistisen, Sofia Simola. 2499-2501 [doi]
- Distributed Adaptive Macroscopic Ensemble Task Allocation of Heterogeneous Robot Teams in Dynamic EnvironmentsVictoria M. Edwards, M. Ani Hsieh. 2502-2503 [doi]
- Weighted Envy Freeness With Bounded SubsidiesNoga Klein Elmalem, Rica Gonen, Erel Segal-haLevi. 2504-2506 [doi]
- Agential AI for Integrated Continual Learning, Deliberative Behavior, and Comprehensible ModelsZeki Doruk Erden, Boi Faltings. 2507-2509 [doi]
- ADAGE: A Generic Two-layer Framework for Adaptive Agent based ModellingBenjamin Patrick Evans, Sihan Zeng, Sumitra Ganesh, Leo Ardon. 2510-2513 [doi]
- Participatory Budgeting Project Strength via Candidate ControlPiotr Faliszewski, Lukasz Janeczko, Dusan Knop, Jan Pokorný, Simon Schierreich, Mateusz Sluszniak, Krzysztof Sornat. 2514-2516 [doi]
- Quantitative Operational Monitoring for BDI AgentsMarie Farrell, Angelo Ferrando 0001, Mengwei Xu. 2517-2519 [doi]
- Bidirectional Distillation: A Mixed-Play Framework for Multi-Agent Generalizable BehaviorsLang Feng 0002, Jiahao Lin, Dong Xing, Li Zhang 0045, De Ma, Gang Pan 0001. 2520-2522 [doi]
- Action-Dependent Optimality-Preserving Reward ShapingGrant C. Forbes, Jianxun Wang 0002, Leonardo Villalobos-Arias, Arnav Jhala, David I. Roberts. 2523-2525 [doi]
- Learning Flexible Heterogeneous Coordination With Capability-Aware Shared HypernetworksKevin Fu, Pierce Howell, Shalin Jain, Harish Ravichandar. 2526-2528 [doi]
- m-Action GamesYuma Fujimoto, Kaito Ariu, Kenshi Abe. 2529-2531 [doi]
- Adaptive Budget Optimization for Multichannel Advertising Using Combinatorial BanditsBriti Gangopadhyay, Zhao Wang 0009, Alberto Silvio Chiappa, Shingo Takamatsu. 2532-2534 [doi]
- Matching Markets with ChoresJugal Garg, Thorben Tröbst, Vijay V. Vazirani. 2535-2537 [doi]
- Learning Bayesian Game Families, with Application to Mechanism DesignMadelyn Gatchel, Michael P. Wellman. 2538-2540 [doi]
- ChatBDI: Think BDI, Talk LLMAndrea Gatti 0002, Viviana Mascardi, Angelo Ferrando 0001. 2541-2543 [doi]
- Satisfactory Budget DivisionLaurent Gourvès, Michael Lampis, Nikolaos Melissinos, Aris Pagourtzis. 2544-2546 [doi]
- Social Ranking for Feature SelectionLaurent Gourvès, Stefano Moretti 0001, Satya Tamby. 2547-2549 [doi]
- Can you see how I learn? Human Observers' Inferences about Reinforcement Learning Agents' Learning ProcessesBernhard Hilpert, Muhan Hou, Kim Baraka, Joost Broekens. 2550-2552 [doi]
- Making Universal Policies UniversalNiklas Höpner, David Kuric, Herke van Hoof. 2553-2555 [doi]
- Prompt Tuning with Diffusion for Few-Shot Pre-trained Policy GeneralizationShengchao Hu, Wanru Zhao, Weixiong Lin, Li Shen 0008, Ya Zhang 0002, Dacheng Tao. 2556-2558 [doi]
- Fair Assignment on Multi-Stage GraphsVibulan J, Swapnil Dhamal, Shweta Jain 0002, Ojassvi Kumar, Aman Kumar, Harpreet Singh. 2559-2561 [doi]
- Decoding Negotiation Dynamics: The Impact of Opponent Identity and Privacy on Strategy, Deception, and Emotional Transparency in Human-Agent InteractionNusrath Jahan, Johnathan Mell. 2562-2564 [doi]
- Predicting Team Performance from Communications in Simulated Search-and-RescueAli Jalal-Kamali, Nikolos Gurney, David V. Pynadath. 2565-2567 [doi]
- FedHPD: Heterogeneous Federated Reinforcement Learning via Policy DistillationWenzheng Jiang, Ji Wang 0002, Xiongtao Zhang, Weidong Bao 0001, Cheston Tan, Flint Xiaofeng Fan. 2568-2570 [doi]
- When to Stop Getting Tested: The Theory of Diagnostic TestsAnson Kahng, Joseph Saber. 2571-2573 [doi]
- Evaluating and Improving Graph-based Explanation Methods for Multi-Agent CoordinationSiva Kailas, Shalin Jain, Harish Ravichandar. 2574-2576 [doi]
- Resource Allocation under the Latin Square ConstraintYasushi Kawase, Bodhayan Roy, Mohammad Azharuddin Sanpui. 2577-2578 [doi]
- RallyDiffuser: A Representation-Guided Diffusion Model Framework for Strategic Planning in BadmintonBing-Zhi Ke, Kuang-Da Wang, Wen-Chih Peng. 2579-2581 [doi]
- Adaptive Microtolling in Competitive Online Congestion Games via Multiagent Reinforcement LearningBehrad Koohy, Sebastian Stein 0001, Enrico H. Gerding. 2582-2584 [doi]
- Compensating Latent Nonlinear Dynamics for Practical Consensus ControlKrzysztof Kowalczyk, Dominik Baumann, Cristian R. Rojas, Pawel Wachel. 2585-2587 [doi]
- Online Competitive Information Gathering for Partially Observable Trajectory GamesMel Krusniak, Hang Xu, Parker Palermo, Forrest Laine. 2588-2590 [doi]
- DECAF: Learning to be Fair in Multi-agent Resource AllocationAshwin Kumar, William Yeoh 0001. 2591-2593 [doi]
- Truman: A Large Language Model-based Multi-agent Simulator for Synthetic Money Laundering Data GenerationDattatray Vishnu Kute, Zihao Xu, Yuekang Li, Fethi Rabhi. 2594-2596 [doi]
- Knowledge Transfer in Model-Based Reinforcement Learning Agents for Efficient Multi-Task LearningDmytro Kuzmenko, Nadiya Shvai. 2597-2599 [doi]
- Model of the Influence of External Signals on the Trust of the Agent in Multi Agent SystemFrédérique Lalieu, Tomasz Zurek, Tom M. van Engers. 2600-2602 [doi]
- To Stand on the Shoulders of Giants: Should We Protect Initial Discoveries in Multi-Agent Exploration?Hodaya Lampert, Reshef Meir, Kinneret Teodorescu. 2603-2605 [doi]
- Equilibrium Selection via Communication PartitionWei-chen Lee, Alessandro Abate, Michael J. Wooldridge. 2606-2608 [doi]
- Observer-Aware Probabilistic Planning under Partial ObservabilitySalomé Lepers, Vincent Thomas, Olivier Buffet. 2609-2611 [doi]
- Offline Meta Reinforcement Learning with Weighted Policy Constraints and Proximal Context CollectionHaorui Li, Jiaqi Liang 0002, Linjing Li, Daniel Zeng 0001. 2612-2614 [doi]
- Group-fair Facility Location Games with ExternalitiesMinming Li, Cheng Peng, Ying Wang, Houyu Zhou. 2615-2617 [doi]
- Lite-DIO Is Actually What You Need for Efficient Inertial LocalizationYan Li, Meng Liu, Zhongchen Shi, Yanqing Hou, Liang Xie 0012, Hongbo Chen, Erwei Yin. 2618-2620 [doi]
- Diversity-seeking Swap Games in NetworksYaqiao Li, Lata Narayanan, Jaroslav Opatrny, Yi Tian Xu. 2621-2623 [doi]
- Fusing Physical and Cognitive Stimuli: An Eye Movement Emotion Recognition Framework Based on Hierarchical Attention MechanismZhilin Li, Xiaomei Tao. 2624-2626 [doi]
- What Is a Counterfactual Cause in Action Theories?Daxin Liu 0002, Vaishak Belle. 2627-2629 [doi]
- Tacit Learning with Adaptive Information Selection for Cooperative Multi-Agent Reinforcement LearningLunjun Liu, Weilai Jiang, Yaonan Wang 0001. 2630-2632 [doi]
- Policies with Sparse Inter-Agent Dependencies in Dynamic Games: A Dynamic Programming ApproachXinjie Liu, Jingqi Li 0001, Filippos Fotiadis, Mustafa O. Karabag, Jesse Milzman, David Fridovich-Keil, Ufuk Topcu. 2633-2635 [doi]
- Adaptive Offline Data Replay in Offline-to-Online Reinforcement LearningXu Liu, Tong Yu 0001, Shuai Li 0010. 2636-2638 [doi]
- RainbowArena: A Multi-Agent Toolkit for Reinforcement Learning and Large Language Models in Competitive Tabletop GamesYingzhuo Liu, Shuodi Liu, Hongsong Tang, Yubing Ma, Zikang Li, Junge Zhang, Liuyu Xiang, Zhaofeng He. 2639-2641 [doi]
- CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement LearningZeyuan Liu, Kai Yang, Jiafei Lyu, Xiu Li 0001. 2642-2644 [doi]
- Tools in the Loop: Quantifying Uncertainty of LLM Question Answering Systems That Use ToolsPanagiotis Lymperopoulos, Vasanth Sarathy. 2645-2647 [doi]
- Mitigating Non-Stationarity in Deep Reinforcement Learning with Clustering Orthogonal Weight ModificationGuoqing Ma 0003, Yuhan Zhang, Yuming Dai, Guangfu Hao, Yang Chen, Shan Yu. 2648-2650 [doi]
- DyLam: A Dynamic Reward Weighting Framework for Reinforcement Learning AlgorithmsMateus Machado, Hansenclever Bassani. 2651-2653 [doi]
- IBGP: Imperfect Byzantine Generals Problem for Zero-Shot Robustness in Communicative Multi-Agent SystemsYihuan Mao, Yipeng Kang, Peilun Li, Ning Zhang, Wei Xu, Chongjie Zhang. 2654-2656 [doi]
- AlphaZeroES: Direct Score Maximization Outperforms Planning Loss MinimizationCarlos Martin 0001, Tuomas Sandholm. 2657-2659 [doi]
- Learning Fair and Preferable Allocations through Neural NetworkRyota Maruo, Koh Takeuchi 0001, Hisashi Kashima. 2660-2662 [doi]
- Rethinking Explainable AI: Explanations can be DeceivingPeta Masters, Daniel Gallagher, Luc Moreau 0001, Mor Vered. 2663-2665 [doi]
- Where is the Nearest EV Charging Station? Evolutionary Optimization of the Gas/charging Stations TopologyEnrique Mateos-Melero, Javier Moralejo-Piñas, Ángela Durán-Pinto, Francisco Martinez-gil, María Soriano, Fernando Fernández 0001. 2666-2668 [doi]
- Predictive Improvement through Latent Space OptimisationAlexander McCaffrey, Eduardo Alonso, Esther Mondragón. 2669-2671 [doi]
- Dynamic Option Creation in Option-Critic Reinforcement LearningMateus Begnini Melchiades, Gabriel de Oliveira Ramos, Bruno C. da Silva 0001. 2672-2674 [doi]
- Adapting Beyond the Depth Limit: Counter Strategies in Large Imperfect Information GamesDavid Milec, Vojtech Kovarík, Viliam Lisý. 2675-2677 [doi]
- Context Adaptive Memory-Efficient LLM Inference for Edge Multi-Agent SystemsHamza Mohammed, Hang Yin, Sai Chand Boyapati. 2678-2680 [doi]
- Learning Heterogeneous Agent Collaboration in Decentralized Multi-Agent Systems via Intrinsic MotivationJahir Sadik Monon, Deeparghya Dutta Barua, Md. Mosaddek Khan. 2681-2683 [doi]
- Improving the Effectiveness of Potential-based Reward Shaping in Reinforcement LearningHenrik Müller, Daniel Kudenko. 2684-2686 [doi]
- Boosting Robustness in Preference-Based Reinforcement Learning with Dynamic SparsityCalarina Muslimani, Bram Grooten, Deepak Ranganatha Sastry Mamillapalli, Mykola Pechenizkiy, Decebal Constantin Mocanu, Matthew E. Taylor. 2687-2689 [doi]
- A Minimalist Approach to Augmentation-based Self-supervised Representation Learning for On-policy Reinforcement LearningNasik Muhammad Nafi, William H. Hsu. 2690-2692 [doi]
- Navigating Social Dilemmas with LLM-based Agents via Consideration of Future ConsequencesDung Nguyen 0001, Hung Le 0002, Kien Do, Sunil Gupta 0001, Svetha Venkatesh, Truyen Tran 0001. 2693-2695 [doi]
- k-Submodular Bandits with Full Bandit FeedbackGuanyu Nie, Vaneet Aggarwal, Christopher John Quinn. 2696-2698 [doi]
- Reasoning and Planning with Dynamic Social NormsTaylor Olson, Roberto Salas-Damian, Kenneth D. Forbus. 2699-2701 [doi]
- Multi-Objective Reinforcement Learning for Water ManagementZuzanna Osika, Roxana Radulescu, Jazmin Zatarain Salazar, Frans A. Oliehoek, Pradeep K. Murukannaiah. 2702-2704 [doi]
- Decentralized Deep Reinforcement Learning for Cooperative Multi-Agent Flight Trajectory Planning in Adverse WeatherBizhao Pang, Xinting Hu, Mingcheng Zhang, Sameer Alam, Guglielmo Lulli 0001. 2705-2707 [doi]
- Learning to Explore when Mistakes are Not AllowedCharly Pecqueux-Guézénec, Stéphane Doncieux, Nicolas Perrin-Gilbert. 2708-2710 [doi]
- Enhancing Lifelong Multi-Agent Path-finding by Using Artificial Potential FieldsArseniy Pertzovsky, Roni Stern, Ariel Felner, Roie Zivan. 2711-2713 [doi]
- Diverse Heterogeneous Graph Conditioned Diffusion for Multi-Agent TeamingLuis Pimentel, Sean Ye, James Ellis Grant Pagan, Matthew C. Gombolay. 2714-2716 [doi]
- Enhancing Robot Navigation Policies with Task-Specific Uncertainty ManagementGokul Puthumanaillam, Paulo Padrao, Jose Fuentes, Leonardo Bobadilla, Melkior Ornik. 2717-2719 [doi]
- Transformer Guided Coevolution: Improved Team Formation in Multiagent Adversarial GamesPranav Rajbhandari, Prithviraj Dasgupta, Donald A. Sofge. 2720-2722 [doi]
- Shapley Value-based Approach for Distributing Revenue of Matchmaking of Private Transactions in BlockchainsRasheed, Parth Nimish Desai, Yash Chaurasia, Sujit Gujar. 2723-2725 [doi]
- Requirements-based Explainability for Multi Agent SystemsSebastian Rodriguez, John Thangarajah, Michael Winikoff. 2726-2728 [doi]
- Towards Automating the Design of Value-Aligned Clinical ProtocolsManel Rodriguez-Soto, Nardine Osman 0001, Carles Sierra, Rocio Cintas Garcia, Cristina Farriols Danes, Montserrat Garcia Retortillo, Silvia Minguez Maso, Jordi Martinez Roldan. 2729-2731 [doi]
- Liquid Welfare and Revenue Monotonicity in Adaptive Clinching AuctionsRyosuke Sato 0002. 2732-2734 [doi]
- On the Existence of EFX Allocations in MultigraphsAlkmini Sgouritsa, Minas Marios Sotiriou. 2735-2737 [doi]
- Environmental Policies within Cournot OligopolyLiang Shan 0016, Zhengyang Liu 0002, Haoqiang Huang, Zihe Wang 0001. 2738-2740 [doi]
- Negotiated Reasoning: On Provably Addressing Relative Over-GeneralizationJunjie Sheng, Wenhao Li 0001, Bo Jin 0003, Hongyuan Zha, Jun Wang 0006, Xiangfeng Wang. 2741-2743 [doi]
- Towards Fair and Efficient Policy Learning in Cooperative Multi-Agent Reinforcement LearningUmer Siddique, Peilang Li, Yongcan Cao. 2744-2746 [doi]
- Hierarchical Multi-agent Reinforcement Learning for Cyber Network DefenseAditya Vikram Singh, Ethan Rathbun, Emma Graham, Lisa Oakley, Simona Boboila, Peter Chin 0001, Alina Oprea. 2747-2749 [doi]
- PANDA: Priority-Based Collision Avoidance Framework for Heterogeneous UAVs Navigating in Dense AirspaceAgamdeep Singh, Jaskirat Singh, P. B. Sujit. 2750-2752 [doi]
- Modeling the Collaborative Edge Data Caching Problem via a Dynamic DCOPZiyang Song, Ziyu Chen, Jinhui Huang, Cheng Zhang, Jingyuan He. 2753-2755 [doi]
- Pure Nash Equilibrium and Strong Nash Equilibrium Computation in Additive Aggregate GamesJared Soundy, Mohammad T. Irfan, Hau Chan. 2756-2758 [doi]
- Coordinating Competing Electric Vehicle Fleets: An Agent-Based Charging Capacity MarketLennard Sund, Janik Muires, Ramin Ahadi, Konstantina Valogianni, Wolfgang Ketter. 2759-2761 [doi]
- Regret Guarantees for a UCB-based Algorithm for Volatile Combinatorial BanditsAbhishek Kumar, Andra Siva Sai Teja, Ganesh Ghalme, Sujit Gujar, Y. Narahari. 2762-2764 [doi]
- Practical Comparisons of Reservoir Topology Performance and Input Distribution in Digital Reservoir ComputersLewis Thelen, Vikram Ravindra. 2765-2767 [doi]
- Dynamic Reward Sharing to Enhance Learning in the Context of Multiagent TeamsKyle Tilbury, David Radke. 2768-2770 [doi]
- Cultural Evolution of Cooperation among LLM AgentsAron Vallinder, Edward Hughes 0001. 2771-2773 [doi]
- Distributed Value Decomposition Networks with Networked AgentsGuilherme S. Varela, Alberto Sardinha, Francisco S. Melo. 2774-2776 [doi]
- Shifting Power: Leveraging LLMs to Simulate Human Aversion in ABMs of Bilateral Financial Exchanges, A bond market studyAlicia Vidler, Toby Walsh. 2777-2779 [doi]
- Trading-off Accuracy and Communication Cost in Federated LearningMattia Jacopo Villani, Emanuele Natale, Frederik Mallmann-Trenn. 2780-2782 [doi]
- Leveraging Fully-Observable Solutions for Improved Partially-Observable Offline Reinforcement LearningChulabhaya Wijesundara, Andrea Baisero, Gregory D. Castañón, Alan Carlin, Robert Platt 0001, Christopher Amato. 2783-2785 [doi]
- Will Systems of LLM Agents Lead to Cooperation: An Investigation into a Social DilemmaRichard Willis, Yali Du 0001, Joel Z. Leibo. 2786-2788 [doi]
- Combining Normative Ethics Principles to Learn Prosocial BehaviourJessica Woodgate, Nirav Ajmeri. 2789-2791 [doi]
- On-Policy Reinforcement Learning From Failure via Sparse Reward DensificationMingkang Wu, Yongcan Cao. 2792-2794 [doi]
- Integrating Large Language Models with Reinforcement Learning for Generalization in Strategic Card GamesWannian Xia, Meng Fang, Zihao Guo, Yali Du 0001, Bo Xu 0002. 2795-2797 [doi]
- Heuristics-Assisted Experience Replay Strategy for Cooperative Multi-Agent Reinforcement LearningYi Xie, Ziqing Zhou, Chun Ouyang 0002, Siao Liu, Linqiang Hu, Zhongxue Gan. 2798-2800 [doi]
- Empowering Generalization for Deep Reinforcement Learning via Symbolic PlanningTianpei Yang, Srijita Das 0001, Christabel Wayllace, Matthew E. Taylor. 2801-2803 [doi]
- vGOALYi Yang 0034, Tom Holvoet. 2804-2806 [doi]
- Using Assistance Rewards Without Introducing Bias: Overcoming Sparse Rewards in Multi-Agent Reinforcement LearningYue Yang, Bernd Meyer, Frits de Nijs. 2807-2809 [doi]
- CPE: A New Paradigm for Policy Extraction in Offline Reinforcement LearningZhaohui Yang, Xiaoxuan Wang, Linjing Li. 2810-2812 [doi]
- Learning Pre-Trained Tacit Behavior for Efficient Multi-Agent Adversarial CoordinationShiqing Yao, Jiajun Chai, Haixin Yu, Yongzhe Chang, Yuanheng Zhu, Xueqian Wang. 2813-2815 [doi]
- Local Anomaly Detection with Partial Observation in Multi-agent Systems as a Data Matching GameZixin Ye, Tansu Alpcan, Christopher Leckie. 2816-2818 [doi]
- Fast Adaption by Policy Deviation Integral Meta-reinforcement Learning with Applications to High-speed Trains OperationHaotong Zhang 0004, Wanyuan Wang. 2819-2821 [doi]
- Enhancing Offline Safe Reinforcement Learning with Trajectory-Constrained Diffusion PlanningHengrui Zhang, Youfang Lin, Shuo Shen, Hanfeng Lin, Peng Cheng 0013, Sheng Han 0001, Kai Lv 0002. 2822-2824 [doi]
- SFedRec: A Federated Learning Framework for Dynamic Session-based RecommendationHexiao Zhang, Yanni Tang, Jiamou Liu, Wu Chen 0005. 2825-2828 [doi]
- Experience-replay Innovative DynamicsTuo Zhang, Leonardo Stella, Julian Barreiro-Gomez. 2829-2831 [doi]
- Efficient Training of Generalizable Visuomotor Policies via Control-Aware AugmentationYinuo Zhao, Kun Wu 0001, Tianjiao Yi, Zhiyuan Xu, Zhengping Che, Chi Harold Liu, Jian Tang 0008. 2832-2834 [doi]
- Multi-Agent Systems for Bullying InterventionLuis Zhinin-Vera, José J. González-García, Víctor López-Jaquero, Elena Navarro 0001, Pascual González. 2835-2837 [doi]
- CADP: Towards Better Centralized Learning for Decentralized Execution in MARLYihe Zhou, Shunyu Liu 0001, Yunpeng Qing, Tongya Zheng, Kaixuan Chen 0004, Jie Song 0011, Mingli Song. 2838-2840 [doi]
- Reducing Variance Caused by Communication in Decentralized Multi-agent Deep Reinforcement LearningChangxi Zhu, Mehdi Dastani, Shihan Wang 0001. 2841-2843 [doi]
- Multimodal Agentic Model Predictive ControlSaptarashmi Bandyopadhyay, John (Jack) Cole, Tom Goldstein, David Jacobs 0001. 2844-2848 [doi]
- Safe Systems with Unsafe Agents: Challenges and OpportunitiesJeremy Bellay, J. Timothy Balint, Stephen A. Boxwell, Jeffrey Geppert. 2849-2853 [doi]
- Contesting Black-Box AI DecisionsVirginia Dignum, Loizos Michael, Juan Carlos Nieves, Marija Slavkovik 0001, Julliett Suarez, Andreas Theodorou. 2854-2858 [doi]
- The Next Level of Long-Term Agent Autonomy - Proactively Acquiring Knowledge and AbilitiesHermine J. Grosinger. 2859-2864 [doi]
- Tyranny of the Minority in Social Choice: a Call to ArmsReshef Meir. 2865-2869 [doi]
- Tackling the Protocol Problem in Automated NegotiationYasser Mohammad. 2870-2874 [doi]
- Grounding Agent Reasoning in Image Schemas: A Neurosymbolic Approach to Embodied CognitionFrançois Olivier, Zied Bouraoui. 2875-2879 [doi]
- Market-based Architectures in RL and BeyondAbhimanyu Pallavi Sudhir, Long Tran-Thanh. 2880-2884 [doi]
- Empirical Hardness in Multi-Agent Pathfinding: Research Challenges and OpportunitiesJingyao Ren, Eric Ewing, T. K. Satish Kumar, Sven Koenig, Nora Ayanian. 2885-2889 [doi]
- Multi-Agent Reinforcement Learning Simulation for Environmental Policy SynthesisJames Rudd-Jones, Mirco Musolesi, María Pérez-Ortiz 0001. 2890-2895 [doi]
- Unlocking the Potential of Decentralized LLM-based MAS: Privacy Preservation and Monetization in Collective IntelligenceYingxuan Yang, Qiuying Peng, Jun Wang 0012, Ying Wen 0001, Weinan Zhang 0001. 2896-2900 [doi]
- Towards Foundation-model-based Multiagent System to Accelerate AI for Social ImpactYunfan Zhao, Niclas Boehmer, Aparna Taneja, Milind Tambe. 2901-2907 [doi]
- Responsible Autonomy for Hybrid IntelligenceAnastasia Sophia Apeiron. 2911-2913 [doi]
- Learning Diverse Multiagent BehaviorsAyhan Alp Aydeniz. 2914-2916 [doi]
- Role of State in Partially Observable Reinforcement LearningAndrea Baisero. 2917-2919 [doi]
- Balancing Fairness and Efficiency in the Allocation of Indivisible GoodsKaren Frilya Celine. 2920-2922 [doi]
- Human Influences on Decision Making in Multi-Agent SystemsDaniel E. Collins. 2923-2925 [doi]
- Collective Decision Making via Automated ReasoningAri Conati. 2926-2928 [doi]
- Game-Family Learning for Simulation-Based GamesMadelyn Gatchel. 2929-2931 [doi]
- Hierarchical Frameworks for Scaling-up Multi-agent CoordinationMinghong Geng. 2932-2934 [doi]
- Influence Based Reward Shaping in Multiagent SystemsEverardo Gonzalez. 2935-2937 [doi]
- Extending Consensus-based Task Allocation Algorithms with Bid Intercession to Foster Mixed-InitiativeVictor Guillet. 2938-2940 [doi]
- Informed Decision-Making via VotingQishen Han. 2941-2943 [doi]
- Causality in Multi-Agent SystemsSylvia S. Kerkhove. 2944-2946 [doi]
- Efficient Offline Reinforcement Learning Through Dataset Characterization and ReductionEnrique Mateos-Melero. 2950-2952 [doi]
- Environment-Centered Design of Ethical EnvironmentsArnau Mayoral-Macau. 2953-2955 [doi]
- Modeling and Optimizing Agent-Based Model of Conflict-Induced Forced MigrationZakaria Mehrab. 2956-2958 [doi]
- Safe Multi-Agent Learning via Shielding in Decentralized EnvironmentsDaniel Melcer. 2959-2961 [doi]
- Agent-Based Modeling of Smart Sustainable Mobility Services, Markets, and PolicyJanik Muires. 2965-2967 [doi]
- Humanlike Emergent Language in Multi-Agent SystemsJannik Peters 0002. 2971-2973 [doi]
- The Impact of Artificial Agents in Human Cooperation Through Indirect ReciprocityAlexandre S. Pires. 2974-2976 [doi]
- Bi-Level Reinforcement Learning for Multi-Robot SystemsArjun Prakash. 2977-2978 [doi]
- Multi-Agent Multi-Objective Planning with Contextual Lexicographic Reward PreferencesPulkit Rustagi. 2982-2984 [doi]
- Deep Learning approaches to Goal RecognitionLorenzo Serina. 2985-2987 [doi]
- Different Models for Fair and Efficient Resource AllocationBin Sun. 2988-2990 [doi]
- Ethical Decision-Making in Multi-Agent SystemsJessica Woodgate. 2991-2993 [doi]
- Learning with Less Effort: Efficient Training and Generalization in (Multi-)Robot SystemsPeihong Yu. 2994-2996 [doi]
- FindMe: A Prototype Videogame AI based on CTL with an Optimized Synthesis AlgorithmMarco Aruta, Vadim Malvone, Aniello Murano, Vincenzo Pio Palma, Salvatore Romano. 2997-2999 [doi]
- [COMP24] The Automated Negotiating Agents Competition (ANAC) 2024 Challenges and ResultsReyhan Aydogan, Tim Baarslag, Tamara C. P. Florijn, Katsuhide Fujita, Catholijn M. Jonker, Yasser Mohammad. 3000-3002 [doi]
- A JAX-Accelerated Simulation Framework for Multi-Agent Energy Management in Energy CommunitiesHicham Azmani, Andries Rosseau, Marjon Blondeel, Ann Nowé. 3003-3005 [doi]
- Orpheus: Programming Protocol-Based BDI AgentsMatteo Baldoni, Samuel Christie, Munindar P. Singh, Amit K. Chopra. 3006-3008 [doi]
- LUNAR: A Runtime Verification Tool for Anomaly Detection in Gas NetworksJulius Gasson, Francesco Belardinelli. 3009-3011 [doi]
- BitML2MCMAS: Strategic Reasoning for Bitcoin Smart ContractsLuigi Bellomarini, Marco Favorito, Giuseppe Galano. 3012-3014 [doi]
- Recommending Green Routes for Pedestrians to Reduce the Exposure to Air Pollutants in BarcelonaFilippo Bistaffa, Sergio Calo Oliveira. 3015-3017 [doi]
- Serious Games for Ethical Preference ElicitationJayati Deshmukh, Zijie Liang, Vahid Yazdanpanah, Sebastian Stein 0001, Sarvapali D. Ramchurn. 3018-3020 [doi]
- VITAMIN: VerIficaTion of A MultI ageNt systemAngelo Ferrando 0001, Vadim Malvone. 3023-3025 [doi]
- CRLLK: Constrained Reinforcement Learning for Lane Keeping in Autonomous DrivingXinwei Gao, Arambam James Singh, Gangadhar Royyuru, Michael Yuhas, Arvind Easwaran. 3026-3028 [doi]
- Leveraging Graph Structures and Large Language Models for End-to-End Synthetic Task-Oriented DialoguesMaya Medjad, Hugo Imbert, Bruno Yun, Raphaël Szymocha, Frédéric Armetta. 3029-3031 [doi]
- Personalized Language Learning: A Multi-Agent System Leveraging LLMs for Teaching LuxembourgishTebourbi Hedi, Sana Nouzri, Yazan Mualla, Amro Najjar. 3032-3034 [doi]
- Eva: An LLM-based Multilingual Voice-agent Network for Restaurant OperationsZhiwei (Tony) Qin, Jianming Zhou. 3035-3037 [doi]
- Simulating Tracking Data to Advance Sports Analytics ResearchDavid Radke, Kyle Tilbury. 3038-3040 [doi]
- Chat4Elderly: A Multi-Agent System for Personalized Wellness Using Generative AI and Wearable TechnologyVítor Crista, Diogo Martinho, Goreti Marreiros. 3041-3043 [doi]
- The Game Academy: Learn while playing, and play while learning!Simon Rey, Ulle Endriss. 3044-3046 [doi]
- Simulating Blockchain Applications in Large-Value Payment Systems through Agent-Based ModelingKenneth See, Nicholas MacGregor Garcia, Xiaofan Li. 3047-3049 [doi]
- UAV Marketplace Simulation Tool for BVLOS OperationsKivanç Serefoglu, Önder Gürcan, Reyhan Aydogan. 3050-3052 [doi]
- SmartPilot: Agent-Based CoPilot for Intelligent ManufacturingChathurangi Shyalika, Renjith Prasad, Alaa T. Al Ghazo, Darssan Eswaramoorthi, Sara Shree Muthuselvam, Amit P. Sheth. 3053-3055 [doi]
- Pabuviz.org: A Visualisation Platform to Explore Participatory Budgeting ElectionsMarkus Utke, Simon Rey, Ulle Endriss. 3056-3058 [doi]
- MapBot: A Multi-Modal Agent for Geospatial AnalysisMartin Weiss, Nasim Rahaman, Chris Pal. 3059-3061 [doi]
- Intention Recognition in Real-Time Interactive Navigation MapsPeijie Zhao, Zunayed Arefin, Felipe Meneguzzi, Ramon Fraga Pereira. 3062-3064 [doi]
- When Is It Acceptable to Break the Rules? Knowledge Representation of Moral Judgements Based on Empirical Data (Extended Abstract)Edmond Awad, Sydney Levine, Andrea Loreggia, Nicholas Mattei, Iyad Rahwan, Francesca Rossi 0001, Kartik Talamadupula, Joshua B. Tenenbaum, Max Kleiman-Weiner. 3065-3067 [doi]
- Beyond the Echo Chamber: Modelling Open-Mindedness in Citizens' AssembliesJake Barrett, Kobi Gal, Loizos Michael, Dan Vilenchik. 3068-3070 [doi]
- Contest Partitioning in Binary Contests: Costly, yet BeneficialPriel Levy, Yonatan Aumann, David Sarne. 3071-3073 [doi]
- A summary of: Tackling School Segregation with Transportation Network Interventions - An Agent-Based Modelling ApproachDimitris Michailidis, Mayesha Tasnim, Sennay Ghebreab, Fernando P. Santos. 3074-3076 [doi]
- Epistemic Selection of Costly Alternatives: The Case of Participatory Budgeting (Extended Abstract)Simon Rey, Ulle Endriss. 3077-3079 [doi]
- Strategic Manipulation of Preferences in the Rank Minimization MechanismMayesha Tasnim, Youri Weesie, Sennay Ghebreab, Max Baak. 3080-3082 [doi]
- Carbon Trading Supply Chain Management Based on Constrained Deep Reinforcement LearningQinghao Wang, Yaodong Yang 0001. 3083-3086 [doi]
- Navigating in a Space of Game Views (extended abstract)Michael P. Wellman, Katherine Mayo. 3087-3088 [doi]
- Resolving Social Dilemmas with Minimal Reward Transfer - Extended AbstractRichard Willis, Yali Du 0001, Joel Z. Leibo, Michael Luck. 3089-3091 [doi]