- AI for Advancing Scientific Discovery for a Sustainable Future. Carla P. Gomes. 1
- Automatic Curricula in Deep Multi-Agent Reinforcement Learning. Thore Graepel. 2
- Building Cities from Slime Mould, Agents and Quantum Field Theory. Alison J. Heppenstall, Nick Malleson. 3-4
- Unsupervised Reinforcement Learning. Sergey Levine. 5-6
- Reconfigurable Interaction for MAS Modelling. Yehia Abd Alrahman, Giuseppe Perelli, Nir Piterman. 7-15
- Elessar: Ethics in Norm-Aware Agents. Nirav Ajmeri, Hui Guo 0002, Pradeep K. Murukannaiah, Munindar P. Singh. 16-24
- Formal Verification of Neural Agents in Non-deterministic Environments. Michael E. Akintunde, Elena Botoeva, Panagiotis Kouvaros, Alessio Lomuscio. 25-33
- Explainable Multi Agent Path Finding. Shaull Almagor, Morteza Lahijanian. 34-42
- Rational vs Byzantine Players in Consensus-based Blockchains. Yackolley Amoussou-Guenou, Bruno Biais, Maria Potop-Butucaru, Sara Tucci Piergiovanni. 43-51
- Strategic Decision-Making for Power Network Investments with Distributed Renewable Generation. Merlinda Andoni, Valentin Robu, Wolf Gerrit Früh, David Flynn. 52-60
- A Design-Methodology for Epidemic Dynamics via Time-Varying Hypergraphs. Alessia Antelmi, Gennaro Cordasco, Carmine Spagnuolo, Vittorio Scarano. 61-69
- A General Framework for Energy-Efficient Cloud Computing Mechanisms. Antonios Antoniadis, Andrés Cristi, Tim Oosterwijk, Alkmini Sgouritsa. 70-78
- Improved Algorithms for Learning Equilibria in Simulation-Based Games. Enrique Areyan Viqueira, Cyrus Cousins, Amy Greenwald. 79-87
- Learning an Interpretable Traffic Signal Control Policy. James Ault, Josiah P. Hanna, Guni Sharon. 88-96
- Summer Internship Matching with Funding Constraints. Haris Aziz 0001, Anton Baychkov, Péter Biró. 97-104
- HMMs for Anomaly Detection in Autonomous Robots. Davide Azzalini, Alberto Castellini, Matteo Luperto, Alessandro Farinelli, Francesco Amigoni. 105-113
- Peer Reviewing in Participatory Guarantee Systems: Modelisation and Algorithmic Aspects. Nathanaël Barrot, Sylvaine Lemeilleur, Nicolas Paget, Abdallah Saffidine. 114-122
- Learning to Optimize Autonomy in Competence-Aware Systems. Connor Basich, Justin Svegliato, Kyle Hollins Wray, Stefan J. Witwicki, Joydeep Biswas, Shlomo Zilberstein. 123-131
- Manipulation of Opinion Polls to Influence Iterative Elections. Dorothea Baumeister, Ann-Kathrin Selker, Anaëlle Wilczynski. 132-140
- Optimising Game Tactics for Football. Ryan Beal, Georgios Chalkiadakis, Timothy J. Norman, Sarvapali D. Ramchurn. 141-149
- Candidate Selections with Proportional Fairness Constraints. Xiaohui Bei, Shengxin Liu, Chung Keung Poon, Hongao Wang. 150-158
- Multi-Agent Path Finding in Configurable Environments. Matteo Bellusci, Nicola Basilico, Francesco Amigoni. 159-167
- Automated Justification of Collective Decisions via Constraint Solving. Arthur Boixel, Ulle Endriss. 168-176
- Input Addition and Deletion in Reinforcement: Towards Learning with Structural Changes. Iago Bonnici, Abdelkader Gouaïch, Fabien Michel. 177-185
- Majority-Strategyproofness in Judgment Aggregation. Sirin Botan, Ulle Endriss. 186-194
- Finding and Recognizing Popular Coalition Structures. Felix Brandt 0001, Martin Bullinger. 195-203
- Fair Allocation of Resources with Uncertain Availability. Jan Bürmann, Enrico H. Gerding, Baharak Rastegari. 204-212
- Pareto-Optimality in Cardinal Hedonic Games. Martin Bullinger. 213-221
- Task Allocation Strategy for Heterogeneous Robot Teams in Offshore Missions. Yaniel Carreno, Èric Pairet, Yvan Pétillot, Ronald P. A. Petrick. 222-230
- Weighted Envy-Freeness in Indivisible Item Allocation. Mithun Chakraborty, Ayumi Igarashi, Warut Suksompong, Yair Zick. 231-239
- Schelling Models with Localized Social Influence: A Game-Theoretic Framework. Hau Chan, Mohammad T. Irfan, Cuong Viet Than. 240-248
- RMB-DPOP: Refining MB-DPOP by Reducing Redundant Inference. Ziyu Chen, Wenxin Zhang, Yanchen Deng, Dingding Chen, Qiang Li. 249-257
- Refinement for Multiagent Protocols. Samuel H. Christie V., Amit K. Chopra, Munindar P. Singh. 258-266
- Policy Synthesis for Factored MDPs with Graph Temporal Logic Specifications. Murat Cubuktepe, Zhe Xu 0005, Ufuk Topcu. 267-275
- Leader Election and Compaction for Asynchronous Silent Programmable Matter. Gianlorenzo D'Angelo, Mattia D'Emidio, Shantanu Das 0001, Alfredo Navarra, Giuseppe Prencipe. 276-284
- Intention-Aware Multiagent Scheduling. Michael Dann, John Thangarajah, Yuan Yao 0007, Brian Logan. 285-293
- Goal Formation through Interaction in the Situation Calculus: A Formal Account Grounded in Behavioral Science. Giuseppe De Giacomo, Yves Lespérance. 294-302
- Risk-Aware Conditional Replanning for Globally Constrained Multi-Agent Sequential Decision Making. Frits de Nijs, Peter J. Stuckey. 303-311
- Testing Axioms Against Human Reward Divisions in Cooperative Games. Greg d'Eon, Kate Larson. 312-320
- Manipulating Node Similarity Measures in Networks. Palash Dey, Sourav Medya. 321-329
- Gaussian Processes as Multiagent Reward Models. Gaurav Dixit, Stéphane Airiau, Kagan Tumer. 330-338
- Alternative Function Approximation Parameterizations for Solving Games: An Analysis of ƒ-Regression Counterfactual Regret Minimization. Ryan D'Orazio, Dustin Morrill, James R. Wright, Michael Bowling. 339-347
- Dueling Bandits: From Two-dueling to Multi-dueling. Yihan Du, Siwei Wang, Longbo Huang. 348-356
- Private and Byzantine-Proof Cooperative Decision-Making. Abhimanyu Dubey, Alex Pentland. 357-365
- Algorithms for Swap and Shift Bribery in Structured Elections. Edith Elkind, Piotr Faliszewski, Sushmita Gupta, Sanjukta Roy. 366-374
- Adaptive Autonomy in Wireless Sensor Networks. Mirgita Frasheri, José Manuel Cano-García, Eva González-Parada, Baran Çürüklü, Mikael Ekström, Alessandro Vittorio Papadopoulos, Cristina Urdiales. 375-383
- Equitable Allocations of Indivisible Chores. Rupert Freeman, Sujoy Sikdar, Rohit Vaish, Lirong Xia. 384-392
- Threshold Task Games: Theory, Platform and Experiments. Kobi Gal, Ta Duy Nguyen, Quang Nhat Tran, Yair Zick. 393-401
- Mechanism Design for Defense Coordination in Security Games. Jiarui Gan, Edith Elkind, Sarit Kraus, Michael J. Wooldridge. 402-410
- Multi Type Mean Field Reinforcement Learning. Sriram Ganapathi Subramanian, Pascal Poupart, Matthew E. Taylor, Nidhi Hegde. 411-419
- Computing Competitive Equilibria with Mixed Manna. Jugal Garg, Peter McGlaughlin. 420-428
- Toward Genuine Robot Teammates: Improving Human-Robot Team Performance Using Robot Shared Mental Models. Felix Gervits, Dean Thurston, Ravenna Thielstrom, Terry Fong, Quinn Pham, Matthias Scheutz. 429-437
- Improving Performance in Reinforcement Learning by Breaking Generalization in Neural Networks. Sina Ghiassian, Banafsheh Rafiee, Yat Long Lo, Adam White. 438-446
- Towards Deployment of Robust Cooperative AI Agents: An Algorithmic Framework for Learning Adaptive Policies. Ahana Ghosh, Sebastian Tschiatschek, Hamed Mahdavi, Adish Singla. 447-455
- A Bridge between Polynomial Optimization and Games with Imperfect Recall. Hugo Gimbert, Soumyajit Paul, B. Srivathsan. 456-464
- Integrating Behavior Cloning and Reinforcement Learning for Improved Performance in Dense and Sparse Reward Environments. Vinicius G. Goecks, Gregory M. Gremillion, Vernon J. Lawhern, John Valasek, Nicholas R. Waytowich. 465-473
- Demystifying Emergent Intelligence and Its Effect on Performance In Large Robot Swarms. John Harwell, London Lowmanstone, Maria L. Gini. 474-482
- Cautious Reinforcement Learning with Logical Constraints. Mohammadhosein Hasanbeig, Alessandro Abate, Daniel Kroening. 483-491
- Neural Replicator Dynamics: Multiagent Learning via Hedging Policy Gradients. Daniel Hennes, Dustin Morrill, Shayegan Omidshafiei, Rémi Munos, Julien Pérolat, Marc Lanctot, Audrunas Gruslys, Jean-Baptiste Lespiau, Paavo Parmas, Edgar A. Duéñez-Guzmán, Karl Tuyls. 492-501
- New Algorithms for Continuous Distributed Constraint Optimization Problems. Khoi D. Hoang, William Yeoh 0001, Makoto Yokoo, Zinovi Rabinovich. 502-510
- The Effect of Strategic Noise in Linear Regression. Safwan Hossain, Nisarg Shah 0001. 511-519
- Inducing Cooperation through Reward Reshaping based on Peer Evaluations in Deep Multi-Agent Reinforcement Learning. David Earl Hostallero, Daewoo Kim, Sangwoo Moon, Kyunghwan Son, Wan Ju Kang, Yung Yi. 520-528
- Green Security Game with Community Engagement. Taoan Huang, Weiran Shen, David Zeng, Tianyu Gu, Rohit Singh, Fei Fang. 529-537
- Learning to Resolve Alliance Dilemmas in Many-Player Zero-Sum Games. Edward Hughes, Thomas W. Anthony, Tom Eccles, Joel Z. Leibo, David Balduzzi, Yoram Bachrach. 538-547
- CopyCAT: Taking Control of Neural Policies with Constant Attacks. Léonard Hussenot, Matthieu Geist, Olivier Pietquin. 548-556
- Snooping Attacks on Deep Reinforcement Learning. Matthew Inkawhich, Yiran Chen, Hai Helen Li. 557-565
- It's Not Whom You Know, It's What You, or Your Friends, Can Do: Coalitional Frameworks for Network Centralities. Gabriel Istrate, Cosmin Bonchis, Claudiu Gatina. 566-574
- Influence Maximization in Unknown Social Networks: Learning Policies for Effective Graph Sampling. Harshavardhan Kamarthi, Priyesh Vijayan, Bryan Wilder, Balaraman Ravindran, Milind Tambe. 575-583
- On Stable Matchings with Pairwise Preferences and Matroid Constraints. Naoyuki Kamiyama. 584-592
- Combining No-regret and Q-learning. Ian A. Kash, Michael Sullins, Katja Hofmann. 593-601
- Approximately Stable Matchings with General Constraints. Yasushi Kawase. 602-610
- Inducing Equilibria in Networked Public Goods Games through Network Structure Modification. David Kempe 0001, Sixie Yu, Yevgeniy Vorobeychik. 611-619
- Learning Hierarchical Teaching Policies for Cooperative Agents. Dong Ki Kim, Miao Liu 0001, Shayegan Omidshafiei, Sebastian Lopez-Cot, Matthew Riemer, Golnaz Habibi, Gerald Tesauro, Sami Mourad, Murray Campbell, Jonathan P. How. 620-628
- Adversarial Patrolling with Drones. David Klaska, Antonín Kucera 0001, Vojtech Rehák. 629-637
- Incentivising Participation in Liquid Democracy with Breadth-First Delegation. Grammateia Kotsialou, Luke Riley. 638-644
- Strategic Manipulation with Incomplete Preferences: Possibilities and Impossibilities for Positional Scoring Rules. Justin Kruger, Zoi Terzopoulou. 645-653
- Increasing Evacuation during Disaster Events. Chris J. Kuhlman, Achla Marathe, Anil Vullikanti, Nafisa Halim, Pallab Mozumder. 654-662
- Convexity of Hypergraph Matching Game. Soh Kumabe, Takanori Maehara. 663-671
- Optimal Swarm Strategy for Dynamic Target Search and Tracking. Hian Lee Kwa, Jabez Leong Kit, Roland Bouffanais. 672-680
- On the Model-Checking of Branching-time Temporal Logic with BDI Modalities. Salvatore La Torre, Gennaro Parlato. 681-689
- Hindsight Planner. Yaqing Lai, Wufan Wang, Yunjie Yang, Jihong Zhu, Minchi Kuang. 690-698
- A Deliberate BIAT Logic for Modeling Manipulations. Christopher Leturc, Grégory Bonnet. 699-707
- Fair Resource Sharing and Dorm Assignment. Bo Li, Yingkai Li. 708-716
- Spatial-Temporal Moving Target Defense: A Markov Stackelberg Game Model. Henger Li, Wen Shen 0008, Zizhan Zheng. 717-725
- Moving Agents in Formation in Congested Environments. Jiaoyang Li 0001, Kexuan Sun 0002, Hang Ma 0001, Ariel Felner, T. K. Satish Kumar, Sven Koenig. 726-734
- On Emergent Communication in Competitive Multi-Agent Teams. Paul Pu Liang, Jeffrey Chen, Ruslan Salakhutdinov, Louis-Philippe Morency, Satwik Kottur. 735-743
- A Story of Two Streams: Reinforcement Learning Models from Human Behavior and Neuropsychiatry. Baihan Lin, Guillermo A. Cecchi, Djallel Bouneffouf 0001, Jenna Reinen, Irina Rish. 744-752
- Off-Policy Deep Reinforcement Learning with Analogous Disentangled Exploration. Anji Liu, Yitao Liang, Guy Van den Broeck. 753-761
- Parameterised Verification of Strategic Properties in Probabilistic Multi-Agent Systems. Alessio Lomuscio, Edoardo Pirovano. 762-770
- Competitive Ratios for Online Multi-capacity Ridesharing. Meghna Lowalekar, Pradeep Varakantham, Patrick Jaillet. 771-779
- A Budget-Limited Mechanism for Category-Aware Crowdsourcing Systems. Yuan Luo, Nicholas R. Jennings. 780-788
- Gifting in Multi-Agent Reinforcement Learning. Andrei Lupu, Doina Precup. 789-797
- Likelihood Quantile Networks for Coordinating Multi-Agent Reinforcement Learning. Xueguang Lyu, Christopher Amato. 798-806
- Penalty Bidding Mechanisms for Allocating Resources and Overcoming Present-Bias. Hongyao Ma, Reshef Meir, David C. Parkes, Elena Wu-Yan. 807-815
- Feudal Multi-Agent Deep Reinforcement Learning for Traffic Signal Control. Jinming Ma, Feng Wu. 816-824
- AED: An Anytime Evolutionary DCOP Algorithm. Saaduddin Mahmud, Moumita Choudhury, Md. Mosaddek Khan, Long Tran-Thanh, Nicholas R. Jennings. 825-833
- Learning Probably Approximately Correct Maximin Strategies in Simulation-Based Games with Infinite Strategy Spaces. Alberto Marchesi, Francesco Trovò, Nicola Gatti 0001. 834-842
- Optimal Temporal Plan Merging. Gilberto Marcon dos Santos, Julie A. Adams. 851-859
- Policy-Gradient Algorithms Have No Guarantees of Convergence in Linear Quadratic Games. Eric Mazumdar, Lillian J. Ratliff, Michael I. Jordan, S. Shankar Sastry. 860-868
- Social Diversity and Social Preferences in Mixed-Motive Reinforcement Learning. Kevin R. McKee, Ian M. Gemp, Brian McWilliams, Edgar A. Duéñez-Guzmán, Edward Hughes, Joel Z. Leibo. 869-877
- Trajectory-User Linking with Attentive Recurrent Network. Congcong Miao, Jilong Wang, Heng Yu, Weichen Zhang, Yinyao Qi. 878-886
- Approximate Nash Equilibria of Imitation Games: Algorithms and Complexity. Aniket Murhekar, Ruta Mehta. 887-894
- Massive Cross-Platform Simulations of Online Social Networks. Goran Muric, Alexey Tregubov, Jim Blythe, Andrés Abeliuk, Divya Choudhary, Kristina Lerman, Emilio Ferrara. 895-903
- Duty to Warn in Strategic Games. Pavel Naumov, Jia Tao. 904-912
- Generalized Optimistic Q-Learning with Provable Efficiency. Grigory Neustroev, Mathijs Michiel de Weerdt. 913-921
- The Complexity of Cloning Candidates in Multiwinner Elections. Marc Neveling, Jörg Rothe. 922-930
- DCRAC: Deep Conditioned Recurrent Actor-Critic for Multi-Objective Partially Observable Environments. Xiaodong Nian, Athirai Aravazhi Irissappane, Diederik M. Roijers. 931-938
- Is the Policy Gradient a Gradient? Chris Nota, Philip S. Thomas. 939-947
- Driving Exploration by Maximum Distribution in Gaussian Process Bandits. Alessandro Nuara, Francesco Trovò, Dominic Crippa, Nicola Gatti 0001, Marcello Restelli. 948-956
- Multiwinner Candidacy Games. Svetlana Obraztsova, Maria Polukarov, Edith Elkind, Marek Grzesiuk. 957-965
- Towards a Computational Framework for Automating Substance Use Counseling with Virtual Agents. Stefan Olafsson, Byron Wallace, Timothy W. Bickmore. 966-974
- Analyzing Reinforcement Learning Benchmarks with Random Weight Guessing. Declan Oller, Tobias Glasmachers, Giuseppe Cuccu. 975-982
- Non-Uniform Policies for Multi-Robot Asymmetric Perimeter Patrol in Adversarial Domains. Yaniv Oshrat, Noa Agmon, Sarit Kraus. 983-991
- Who and When to Screen: Multi-Round Active Screening for Network Recurrent Infectious Diseases Under Uncertainty. Han-Ching Ou, Arunesh Sinha, Sze-Chuan Suen, Andrew Perrault, Alpan Raval, Milind Tambe. 992-1000
- Multi-Path Policy Optimization. Ling Pan, Qingpeng Cai, Longbo Huang. 1001-1009
- Navigating the Combinatorics of Virtual Agent Design Space to Maximize Persuasion. Dhaval Parmar, Stefán Ólafsson, Dina Utami, Prasanth Murali, Timothy W. Bickmore. 1010-1018
- Real-time Learning and Planning in Environments with Swarms: A Hierarchical and a Parameter-based Simulation Approach. Lukasz Pelcner, Shaling Li, Matheus Aparecido do Carmo Alves, Leandro Soriano Marcolino, Alex Collins. 1019-1027
- GAPCoD: A Generic Assembly Planner by Constrained Disassembly. Florian Pescher, Nils Napp, Benoît Piranda, Julien Bourgeois. 1028-1036
- Inference-Based Strategy Alignment for General-Sum Differential Games. Lasse Peters, David Fridovich-Keil, Claire J. Tomlin, Zachary N. Sunberg. 1037-1045
- On Algorithmic Decision Procedures in Emergency Response Systems in Smart and Connected Communities. Geoffrey Pettet, Ayan Mukhopadhyay, Mykel J. Kochenderfer, Yevgeniy Vorobeychik, Abhishek Dubey. 1046-1054
- Learning and Testing Resilience in Cooperative Multi-Agent Systems. Thomy Phan, Thomas Gabor, Andreas Sedlmeier, Fabian Ritz, Bernhard Kempter, Cornel Klein, Horst Sauer, Reiner N. Schmid, Jan Wieghardt, Marc Zeller, Claudia Linnhoff-Popien. 1055-1063
- Objective Social Choice: Using Auxiliary Information to Improve Voting Outcomes. Silviu Pitis, Michael R. Zhang. 1064-1071
- Goal Recognition Using Off-The-Shelf Process Mining Techniques. Artem Polyvyanyy, Zihang Su, Nir Lipovetzky, Sebastian Sardiña. 1072-1080
- Extending Narrative Planning Domains with Linguistic Resources. Julie Porteous, João F. Ferreira, Alan Lindsay, Marc Cavazza. 1081-1089
- Yesterday's Reward is Today's Punishment: Contrast Effects in Human Feedback to Reinforcement Learning Agents. Divya Ramesh, Anthony Z. Liu, Andres J. Echeverria, Jean Y. Song, Nicholas R. Waytowich, Walter S. Lasecki. 1090-1097
- Toll-Based Learning for Minimising Congestion under Heterogeneous Preferences. Gabriel de Oliveira Ramos, Roxana Radulescu, Ann Nowé, Anderson R. Tavares. 1098-1106
- Culture-Based Explainable Human-Agent Deconfliction. Alex Raymond, Hatice Gunes, Amanda Prorok. 1107-1115
- Automated Configuration of Negotiation Strategies. Bram M. Renting, Holger H. Hoos, Catholijn M. Jonker. 1116-1124
- Capacity, Bandwidth, and Compositionality in Emergent Language Learning. Cinjon Resnick, Abhinav Gupta 0002, Jakob N. Foerster, Andrew M. Dai, KyungHyun Cho. 1125-1133
- Employing Models of Human Social Motor Behavior for Artificial Agent Trainers. Lillian M. Rigoli, Patrick Nalepka, Hannah M. Douglas, Rachel W. Kallen, Simon Hosking, Christopher Best, Elliot Saltzman, Michael J. Richardson. 1134-1142
- Multi-level Fitness Critics for Cooperative Coevolution. Golden Rockefeller, Shauharda Khadka, Kagan Tumer. 1143-1151
- A Structural Solution to Sequential Moral Dilemmas. Manel Rodriguez-Soto, Maite López-Sánchez, Juan A. Rodríguez-Aguilar. 1152-1160
- Human-Centered Decision Support for Agenda Scheduling. Stephanie Rosenthal, Laura M. Hiatt. 1161-1168
- Viral vs. Effective: Utility Based Influence Maximization. Yael Sabato, Amos Azaria, Noam Hazon. 1169-1177
- Multirobot Coverage of Modular Environments. Mirko Salaris, Alessandro Riva, Francesco Amigoni. 1178-1186
- Designing Effective and Practical Interventions to Contain Epidemics. Prathyush Sambaturu, Bijaya Adhikari, B. Aditya Prakash, Srinivasan Venkatramanan, Anil Vullikanti. 1187-1195
- MGpi: A Computational Model of Multiagent Group Perception and Interaction. Navyata Sanghvi, Ryo Yonetani, Kris Kitani. 1196-1205
- Bayesian Active Malware Analysis. Riccardo Sartea, Georgios Chalkiadakis, Alessandro Farinelli, Matteo Murari. 1206-1214
- Maximizing Information Gain in Partially Observable Environments via Prediction Rewards. Yash Satsangi, Sungsu Lim, Shimon Whiteson, Frans A. Oliehoek, Martha White. 1215-1223
- Limitations of Greed: Influence Maximization in Undirected Networks Re-visited. Grant Schoenebeck, Biaoshuai Tao, Fang-Yi Yu. 1224-1232
- A Qualitative Approach to Composing Value-Aligned Norm Systems. Marc Serramia, Maite López-Sánchez, Juan A. Rodríguez-Aguilar. 1233-1241
- Learning to Design Coupons in Online Advertising Markets. Weiran Shen, Pingzhong Tang, Xun Wang, Yadong Xu, Xiwang Yang. 1242-1250
- Epistemic Plan Recognition. Maayan Shvo, Toryn Q. Klassen, Shirin Sohrabi, Sheila A. McIlraith. 1251-1259
- Playing Games in the Dark: An Approach for Cross-Modality Transfer in Reinforcement Learning. Rui Silva, Miguel Vasco, Francisco S. Melo, Ana Paiva, Manuela Veloso. 1260-1268
- Safe Policy Improvement with an Estimated Baseline Policy. Thiago D. Simão, Romain Laroche, Rémi Tachet des Combes. 1269-1277
- Hierarchical Multiagent Reinforcement Learning for Maritime Traffic Management. Arambam James Singh, Akshat Kumar, Hoong Chuin Lau. 1278-1286
- Signed Graph Games: Coalitional Games with Friends, Enemies and Allies. Oskar Skibski, Takamasa Suzuki, Tomasz Grabowski, Tomasz P. Michalak, Makoto Yokoo. 1287-1295
- Strategyproof Reinforcement Learning for Online Resource Allocation. Sebastian Stein 0001, Mateusz Ochal, Ioana-Adriana Moisoiu, Enrico Gerding, Raghu K. Ganti, Ting He 0001, Tom La Porta. 1296-1304
- Minimizing Margin of Victory for Fair Political and Educational Districting. Ana-Andreea Stoica, Abhijnan Chakraborty, Palash Dey, Krishna P. Gummadi. 1305-1313
- Multi-Robot Planning Under Uncertainty with Congestion-Aware Models. Charlie Street, Bruno Lacerda, Manuel Mühlig, Nick Hawes. 1314-1322
- Games of Miners. Jingchang Sun, Pingzhong Tang, Yulong Zeng. 1323-1331
- Can Agents Learn by Analogy?: An Inferable Model for PAC Reinforcement Learning. Yanchao Sun, Furong Huang. 1332-1340
- Drawing a Map of Elections in the Space of Statistical Cultures. Stanislaw Szufa, Piotr Faliszewski, Piotr Skowron, Arkadii Slinko, Nimrod Talmon. 1341-1349
- Capturing Oracle Guided Hiders. Akshat Tandon, Kamalakar Karlapalem. 1350-1358
- Optimized Cost per Mille in Feeds Advertising. Pingzhong Tang, Xun Wang, Zihe Wang, Yadong Xu, Xiwang Yang. 1359-1367
- Differentially Private Contextual Dynamic Pricing. Wei Tang, Chien-Ju Ho, Yang Liu. 1368-1376
- An Active Learning Method for the Comparison of Agent-based Models. Swapna Thorve, Zhihao Hu, Kiran Lakkaraju, Joshua Letchford, Anil Vullikanti, Achla Marathe, Samarth Swarup. 1377-1385
- Deployment of a Plug-In Multi-Agent System for Traffic Signal Timing. Behnam Torabi, Rym Zalila-Wenkstern, Robert Saylor, Patrick Ryan. 1386-1394
- A Novel Individually Rational Objective In Multi-Agent Multi-Armed Bandits: Algorithms and Regret Bounds. Aristide C. Y. Tossou, Christos Dimitrakakis, Jaroslaw Rzepecki, Katja Hofmann. 1395-1403
- The Effects of Autonomy and Task meaning in Algorithmic Management of Crowdwork. Yuushi Toyoda, Gale M. Lucas, Jonathan Gratch. 1404-1412
- Using Cognitive Models to Train Big Data Models with Small Data. J. Gregory Trafton, Laura M. Hiatt, Benjamin Brumback, J. Malcolm McCurry. 1413-1421
- Agent Ontology Alignment Repair through Dynamic Epistemic Logic. Line van den Berg, Manuel Atencia, Jérôme Euzenat. 1422-1430
- Plannable Approximations to MDP Homomorphisms: Equivariance under Actions. Elise van der Pol, Thomas Kipf, Frans A. Oliehoek, Max Welling. 1431-1439
- Learning Context-aware Task Reasoning for Efficient Meta Reinforcement Learning. Haozhe Wang, Jiale Zhou, Xuming He. 1440-1448
- Scalable Game-Focused Learning of Adversary Models: Data-to-Decisions in Network Security Games. Kai Wang, Andrew Perrault, Aditya Mate, Milind Tambe. 1449-1457
- Bayesian Nash Equilibrium in First-Price Auction with Discrete Value Distributions. Zihe Wang, Weiran Shen, Song Zuo. 1458-1466
- The Manipulability of Centrality Measures - An Axiomatic Approach. Tomasz Was, Marcin Waniek, Talal Rahwan, Tomasz P. Michalak. 1467-1475
- Predicting Persuasive Effectiveness for Multimodal Behavior Adaptation using Bipolar Weighted Argument Graphs. Klaus Weber 0001, Kathrin Janowski, Niklas Rach, Katharina Weitz, Wolfgang Minker, Stefan Ultes, Elisabeth André. 1476-1484
- Adaptive Knowledge Transfer based on Transfer Neural Kernel Network. Pengfei Wei, Xinghua Qu, Yiping Ke, Tze-Yun Leong, Yew-Soon Ong. 1485-1493
- Uncertainty Modelling in Multi-agent Information Fusion Systems. Jiali Weng, Fuyuan Xiao, Zehong Cao. 1494-1502
- A Performance-Based Start State Curriculum Framework for Reinforcement Learning. Jan Wöhlke, Felix Schmitt, Herke van Hoof. 1503-1511
- FRESH: Interactive Reward Shaping in High-Dimensional State Spaces using Human Feedback. Baicen Xiao, Qifan Lu, Bhaskar Ramasubramanian, Andrew Clark, Linda Bushnell, Radha Poovendran. 1512-1520
- On the Complexity of Sequential Posted Pricing. Tao Xiao, Zhengyang Liu, Wenhan Huang. 1521-1529
- Size-Relaxed Committee Selection under the Chamberlin-Courant Rule. Tao Xiao, Sujoy Sikdar. 1530-1538
- Strategyproof Mechanisms for Activity Scheduling. Xinping Xu, Minming Li, Lingjie Duan. 1539-1547
- Game Theoretic Analysis for Two-Sided Matching with Resource Allocation. Kentaro Yahiro, Makoto Yokoo. 1548-1556
- Optimal Control in Partially Observable Complex Social Systems. Fan Yang, Bruno Lepri, Wen Dong 0001. 1557-1565
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery. Jiachen Yang, Igor Borovikov, Hongyuan Zha. 1566-1574
- αα-Rank: Practically Scaling α-Rank through Stochastic Optimisation. Yaodong Yang, Rasul Tutunov, Phu Sakulwongtana, Haitham Bou-Ammar. 1575-1583
- On the Complexity of Destructive Bribery in Approval-Based Multi-winner Voting. Yongjie Yang 0001. 1584-1592
- Report-Sensitive Spot-Checking in Peer-Grading Systems. Hedayat Zarkoob, Hu Fu, Kevin Leyton-Brown. 1593-1601
- The Power of Suggestion. Nicholas Zerbel, Kagan Tumer. 1602-1610
- Deep Residual Reinforcement Learning. Shangtong Zhang, Wendelin Boehmer, Shimon Whiteson. 1611-1619
- Redistribution Mechanism on Networks. Wen Zhang, Dengji Zhao, Hanyu Chen. 1620-1628
- Collaborative Data Acquisition. Wen Zhang, Yao Zhang, Dengji Zhao. 1629-1637
- SwarmTalk - Towards Benchmark Software Suites for Swarm Robotics Platforms. Yihan Zhang, Lyon Zhang, Hanlin Wang, Fabián E. Bustamante, Michael Rubenstein. 1638-1646
- META-Learning State-based Eligibility Traces for More Sample-Efficient Policy Evaluation. Mingde Zhao, Sitao Luan, Ian Porada, Xiao-Wen Chang, Doina Precup. 1647-1655
- Competitive and Cooperative Heterogeneous Deep Reinforcement Learning. Han Zheng, Jing Jiang 0002, Pengfei Wei, Guodong Long, Chengqi Zhang. 1656-1664
- Parameterized Complexity of Shift Bribery in Iterative Elections. Aizhong Zhou, Jiong Guo. 1665-1673
- Learning by Reusing Previous Advice in Teacher-Student Paradigm. Changxi Zhu, Yi Cai, Ho-Fung Leung, Shuyue Hu. 1674-1682
- Towards Reality: Smoothed Analysis in Computational Social Choice. Dorothea Baumeister, Tobias Hogrebe, Jörg Rothe. 1691-1695
- A Multi-Robot Platform for the Autonomous Operation and Maintenance of Offshore Wind Farms. Sara Bernardini, Ferdian Jovan, Zhengyi Jiang, Simon Watson, Andrew Weightman, Peiman Moradi, Tom Richardson 0002, Rasoul Sadeghian, Sina Sareh. 1696-1700
- Agents are Dead. Long live Agents! Virginia Dignum, Frank Dignum. 1701-1705
- New Foundations of Ethical Multiagent Systems. Pradeep K. Murukannaiah, Nirav Ajmeri, Catholijn M. Jonker, Munindar P. Singh. 1706-1710
- Research Challenges and Opportunities in Multi-Agent Path Finding and Multi-Agent Pickup and Delivery Problems. Oren Salzman, Roni Stern. 1711-1715
- We Need Fairness and Explainability in Algorithmic Hiring. Candice Schumann, Jeffrey S. Foster, Nicholas Mattei, John P. Dickerson. 1716-1720
- Live Simulations. Samarth Swarup, Henning S. Mortveit. 1721-1725
- Multiagent Climate Change Research. Vahid Yazdanpanah, Sara Mehryar, Nicholas R. Jennings, Swenja Surminski, Martin J. Siegert, Jos van Hillegersberg. 1726-1731
- Designing Truthful Contextual Multi-Armed Bandits based Sponsored Search AuctionsKumar Abhishek 0003, Shweta Jain 0002, Sujit Gujar. 1732-1734 [doi]
- Boolean Games: Inferring Agents' Goals Using Taxation QueriesAbhijin Adiga, Sarit Kraus, S. S. Ravi. 1735-1737 [doi]
- Leveraging Communication Topologies Between Learning Agents in Deep Reinforcement LearningDhaval Adjodah, Dan Calacci, Abhimanyu Dubey, Anirudh Goyal, P. M. Krafft, Esteban Moro, Alex Pentland. 1738-1740 [doi]
- Learning Transferable Cooperative Behavior in Multi-Agent TeamsAkshat Agarwal, Sumit Kumar, Katia P. Sycara, Michael Lewis 0001. 1741-1743 [doi]
- Evolving Meta-Level Reasoning with Reinforcement Learning and A* for Coordinated Multi-Agent Path-planningMona Alshehri, Napoleon N. Reyes, Andre L. C. Barczak. 1744-1746 [doi]
- Privacy-Preserving Dark PoolsGilad Asharov, Tucker Hybinette Balch, Antigoni Polychroniadou, Manuela Veloso. 1747-1749 [doi]
- Long-Run Multi-Robot Planning With Uncertain Task DurationsCarlos Azevedo, Bruno Lacerda, Nick Hawes, Pedro U. Lima. 1750-1752 [doi]
- The Temporary Exchange ProblemHaris Aziz 0001, Edward Lee. 1753-1755 [doi]
- Mechanism Design for School Choice with Soft Diversity ConstraintsHaris Aziz 0001, Serge Gaspers, Zhaohong Sun 0001. 1756-1758 [doi]
- Multiple Levels of Importance in Matching with Distributional Constraints: Extended AbstractHaris Aziz 0001, Serge Gaspers, Zhaohong Sun 0001, Makoto Yokoo. 1759-1761 [doi]
- Learning Complementary Representations of the Past using Auxiliary Tasks in Partially Observable Reinforcement LearningAndrea Baisero, Christopher Amato. 1762-1764 [doi]
- Autonomous Shape Formation and Morphing in a Dynamic Environment by a Swarm of Robots: Extended AbstractVaibhav Bajaj, Sachit Rao. 1765-1767 [doi]
- Reinforcement Learning Dynamics in the Infinite Memory LimitWolfram Barfuss. 1768-1770 [doi]
- Complexity of Election Evaluation and Probabilistic Robustness: Extended AbstractDorothea Baumeister, Tobias Hogrebe. 1771-1773 [doi]
- Irresolute Approval-based BudgetingDorothea Baumeister, Linus Boes, Tessa Seeger. 1774-1776 [doi]
- Hedonic Seat Arrangement Problems. Hans L. Bodlaender, Tesshu Hanaka, Lars Jaffke, Hirotaka Ono, Yota Otachi, Tom C. van der Zanden. 1777-1779
- Stable Roommate Problem With Diversity Preferences. Niclas Boehmer, Edith Elkind. 1780-1782
- Encapsulating Reactive Behaviour in Goal-Based Plans for Programming BDI Agents: Extended Abstract. Rafael H. Bordini, Rem Collier, Jomi Fred Hübner, Alessandro Ricci. 1783-1785
- Finding Spatial Clusters Susceptible to Epidemic Outbreaks due to Undervaccination. Jose Cadena, Achla Marathe, Anil Vullikanti. 1786-1788
- Adaptive and Collaborative Agent-based Traffic Regulation using Behavior Trees. Arthur Casals, Assia Belbachir, Amal El Fallah-Seghrouchni. 1789-1791
- Option-Critic in Cooperative Multi-agent Systems. Jhelum Chakravorty, Patrick Nadeem Ward, Julien Roy, Maxime Chevalier-Boisvert, Sumana Basu, Andrei Lupu, Doina Precup. 1792-1794
- The Price of Anarchy of Self-Selection in Tullock Contests. Hau Chan, David C. Parkes, Karim R. Lakhani. 1795-1797
- Human-in-the-loop Planning and Monitoring of Swarm Search and Service Missions. Meghan Chandarana, Michael Lewis 0001, Katia P. Sycara, Sebastian A. Scherer. 1798-1800
- A New Framework for Multi-Agent Reinforcement Learning - Centralized Training and Exploration with Decentralized Execution via Policy Distillation. Gang Chen 0002. 1801-1803
- Aggregation of Support-Relations of Bipolar Argumentation Frameworks. Weiwei Chen 0006. 1804-1806
- Social Structure Emergence: A Multi-agent Reinforcement Learning Framework for Relationship Building. Yang Chen, Jiamou Liu, He Zhao, Hongyi Su. 1807-1809
- The Fair Contextual Multi-Armed Bandit. Yifang Chen, Alex Cuellar, Haipeng Luo, Jignesh Modi, Heramb Nemlekar, Stefanos Nikolaidis. 1810-1812
- Limiting the Deviation Incentives in Resource Sharing Networks. Yukun Cheng, Xiaotie Deng, Yuhao Li. 1813-1815
- An Abstract Framework for Agent-Based Explanations in AI. Giovanni Ciatto, Davide Calvaresi, Michael Ignaz Schumacher, Andrea Omicini. 1816-1818
- Fear of Punishment Promotes the Emergence of Cooperation and Enhanced Social Welfare in Social Dilemmas. Theodor Cimpeanu, The Anh Han. 1819-1821
- Voting with Random Classifiers (VORACE). Cristina Cornelio, Michele Donini, Andrea Loreggia, Maria Silvia Pini, Francesca Rossi. 1822-1824
- Translating Embedding with Local Connection for Knowledge Graph Completion. Zeyuan Cui, Shijun Liu, Li Pan, Qiang He. 1825-1827
- Distributed, Automated Calibration of Agent-based Model Parameters and Agent Behaviors. Matteo D'Auria, Eric O. Scott, Rajdeep Singh Lather, Javier Hilty, Sean Luke. 1828-1830
- Distributed Reinforcement Learning for Cooperative Multi-Robot Object Manipulation. Guohui Ding, Joewie J. Koh, Kelly Merckaert, Bram Vanderborght, Marco M. Nicotra, Christoffer Heckman, Alessandro Roncone, Lijun Chen 0001. 1831-1833
- Decomposed Deep Reinforcement Learning for Robotic Control. Yinzhao Dong, Chao Yu, Paul Weng, Ahmed Maustafa, Hui Cheng, Hongwei Ge. 1834-1836
- Computationally Grounded Quantitative Trust with Time. Nagat Drawel, Jamal Bentahar, Hongyang Qu 0001. 1837-1839
- Microbribery in Group Identification. Gábor Erdélyi, Yongjie Yang 0001. 1840-1842
- Decentralized Task Assignment for Multi-item Pickup and Delivery in Logistic Scenarios. Alessandro Farinelli, Antonello Contini, Davide Zorzi. 1843-1845
- Distance Hedonic Games. Michele Flammini, Bojana Kodric, Martin Olsen, Giovanna Varricchio. 1846-1848
- Ballooning Multi-Armed Bandits. Ganesh Ghalme, Swapnil Dhamal, Shweta Jain 0002, Sujit Gujar, Y. Narahari. 1849-1851
- Cluster-Based Social Reinforcement Learning. Mahak Goindani, Jennifer Neville. 1852-1854
- Multi-agent Adversarial Inverse Reinforcement Learning with Latent Variables. Nate Gruver, Jiaming Song, Mykel J. Kochenderfer, Stefano Ermon. 1855-1857
- Networked Multi-Agent Reinforcement Learning with Emergent Communication. Shubham Gupta, Rishi Hazra, Ambedkar Dukkipati. 1858-1860
- Winning an Election: On Emergent Strategic Communication in Multi-Agent Networks. Shubham Gupta, Ambedkar Dukkipati. 1861-1863
- Matching Affinity Clustering: Improved Hierarchical Clustering at Scale with Guarantees. MohammadTaghi Hajiaghayi, Marina Knittel. 1864-1866
- Automating Coordinated Autonomous Vehicle Control. Allen Huang, Geoff Nitschke. 1867-1868
- Anchor Attention for Hybrid Crowd Forecasts Aggregation. Yuzhong Huang, Andrés Abeliuk, Fred Morstatter, Pavel Atanasov, Aram Galstyan. 1869-1871
- Mastering Basketball With Deep Reinforcement Learning: An Integrated Curriculum Training Approach. Hangtian Jia, Chunxu Ren, Yujing Hu, Yingfeng Chen, Tangjie Lv, Changjie Fan, Hongyao Tang, Jianye Hao. 1872-1874
- Multi-agent Path Planning based on MA-RRT* Fixed Nodes. Jinmingwu Jiang, Kaigui Wu. 1875-1877
- An Agent-Based Model for Trajectory Modelling in Shared Spaces: A Combination of Expert-Based and Deep Learning Approaches. Fatema T. Johora, Hao Cheng, Jörg P. Müller, Monika Sester. 1878-1880
- Anchoring Theory in Sequential Stackelberg Games. Jan Karwowski, Jacek Mandziuk, Adam Zychowski. 1881-1883
- Efficient Hybrid Fault Detection for Autonomous Robots. Eliahu Khalastchi, Meir Kalech. 1884-1886
- Silly Rules Improve the Capacity of Agents to Learn Stable Enforcement and Compliance Behaviors. Raphael Koster, Dylan Hadfield-Menell, Gillian K. Hadfield, Joel Z. Leibo. 1887-1888
- Signaling Friends and Head-Faking Enemies Simultaneously: Balancing Goal Obfuscation and Goal Legibility. Anagha Kulkarni 0002, Siddharth Srivastava 0001, Subbarao Kambhampati. 1889-1891
- Deep Reinforcement Learning for Market Making. Pankaj Kumar. 1892-1894
- Computing the Shapley Value for Ride-Sharing and Routing Games. Chaya Levinger, Noam Hazon, Amos Azaria. 1895-1897
- Lifelong Multi-Agent Path Finding in Large-Scale Warehouses. Jiaoyang Li 0001, Andrew Tinka, Scott Kiesel, Joseph W. Durham, T. K. Satish Kumar, Sven Koenig. 1898-1900
- Graph Neural Networks for Decentralized Path Planning. Qingbiao Li, Fernando Gama, Alejandro Ribeiro, Amanda Prorok. 1901-1903
- PANDA: Privacy-Aware Double Auction for Divisible Resources without a Mediator. Bingyu Liu, Shangyu Xie, Yuan Hong. 1904-1906
- Two-sided Auctions with Budgets: Fairness, Incentives and Efficiency. Xiang Liu, Weiwei Wu 0001, Minming Li, Wanyuan Wang. 1907-1909
- Robust Following with Hidden Information in Travel Partners. Shih-Yun Lo, Elaine Schaertl Short, Andrea Lockerd Thomaz. 1910-1912
- A Decentralized Multi-Agent Coordination Method for Dynamic and Constrained Production Planning. Marin Lujak, Alberto Fernández 0002, Eva Onaindia. 1913-1915
- Normalizing Flow Model for Policy Representation in Continuous Action Multi-agent Systems. Xiaobai Ma, Jayesh K. Gupta, Mykel J. Kochenderfer. 1916-1918
- Genetic Deep Reinforcement Learning for Mapless Navigation. Enrico Marchesini, Alessandro Farinelli. 1919-1921
- A Game Theoretic Approach For k-Core Minimization. Sourav Medya, Tianyi Ma, Arlei Silva, Ambuj K. Singh. 1922-1924
- Modified Actor-Critics. Erinc Merdivan, Sten Hanke, Matthieu Geist. 1925-1927
- Multi-Vehicle Mixed Reality Reinforcement Learning for Autonomous Multi-Lane Driving. Rupert Mitchell, Jenny Fletcher, Jacopo Panerati, Amanda Prorok. 1928-1930
- Maximizing Plan Legibility in Stochastic Environments. Shuwa Miura, Shlomo Zilberstein. 1931-1933
- Cooperative Real-Time Inertial Parameter Estimation. Marina Moreira, Brian Coltin, Rodrigo Ventura. 1934-1936
- Towards a Value-driven Explainable Agent for Collective Privacy. Francesca Mosca, Jose M. Such, Peter McBurney. 1937-1939
- Argumentation is More Important than Appearance for Designing Culturally Tailored Virtual Agents. Prasanth Murali, Ameneh Shamekhi, Dhaval Parmar, Timothy W. Bickmore. 1940-1942
- Mining International Political Norms from the GDELT Database. Rohit Murali, Suravi Patnaik, Stephen Cranefield. 1943-1945
- Robust Self-organization in Games: Symmetries, Conservation Laws and Dimensionality Reduction. Sai Ganesh Nagarajan, David Balduzzi, Georgios Piliouras. 1946-1948
- Mini-batch Bayesian Inverse Reinforcement Learning for Multiple Dynamics. Yusuke Nakata, Sachiyo Arai. 1949-1950
- A Study of Incentive Compatibility and Stability Issues in Fractional Matchings. Shivika Narang, Yadati Narahari. 1951-1953
- Conditional Updates of Answer Set Programming and Its Application in Explainable Planning. Van Nguyen, Tran Cao Son, Vasileiou Loukas Stylianos, William Yeoh 0001. 1954-1956
- Explicit Modelling of Resources for Multi-Agent MicroServices using the CArtAgO Framework. Eoin O'Neill, David Lillis, Gregory M. P. O'Hare, Rem W. Collier. 1957-1959
- Vulcano: Operational Fire Suppression Management Using Deep Reinforcement Learning. Cristobal Pais. 1960-1962
- Hierarchical Reinforcement Learning with Integrated Discovery of Salient Subgoals. Shubham Pateria, Budhitama Subagdja, Ah-Hwee Tan. 1963-1965
- Sequential Advertising Agent with Interpretable User Hidden Intents. Zhaoqing Peng, Junqi Jin, Lan Luo, Yaodong Yang, Rui Luo, Jun Wang, Weinan Zhang 0001, Miao Xu, Chuan Yu, Tiejian Luo, Han Li, Jian Xu, Kun Gai. 1966-1968
- Discovering Imperfectly Observable Adversarial Actions using Anomaly Detection. Olga Petrova, Karel Durkota, Galina Alperovich, Karel Horák, Michal Najman, Branislav Bosanský, Viliam Lisý. 1969-1971
- Aplib: An Agent Programming Library for Testing Games. I. S. W. B. Prasetya, Mehdi Dastani. 1972-1974
- Modeling Disinformation and the Effort to Counter It: A Cautionary Tale of When the Treatment Can Be Worse Than the Disease. Amirarsalan Rajabi, Chathika Gunaratne, Alexander V. Mantzaris, Ivan Garibay. 1975-1977
- GUESs: Generative modeling of Unknown Environments and Spatial Abstraction for Robots. Francesco Riccio, Roberto Capobianco, Daniele Nardi. 1978-1980
- Continuous Influence Maximisation for the Voter Dynamics: Is Targeting High-Degree Nodes a Good Strategy? Guillermo Romero Moreno, Long Tran-Thanh, Markus Brede. 1981-1983
- Mitigating the Negative Side Effects of Reasoning with Imperfect Models: A Multi-Objective Approach. Sandhya Saisubramanian, Ece Kamar, Shlomo Zilberstein. 1984-1986
- ExTra: Transfer-guided Exploration. Anirban Santara, Rishabh Madan, Pabitra Mitra, Balaraman Ravindran. 1987-1989
- C-CoCoA: A Continuous Cooperative Constraint Approximation Algorithm to Solve Functional DCOPs. Amit Sarker, Abdullahil Baki Arif, Moumita Choudhury, Md. Mosaddek Khan. 1990-1992
- Heuristic Strategies in Uncertain Approval Voting Environments. Jaelle Scheuerman, Jason L. Harman, Nicholas Mattei, Kristen Brent Venable. 1993-1995
- Not all Mistakes are Equal. Murat Sensoy, Maryam Saleki, Simon Julier, Reyhan Aydogan, John Reid. 1996-1998
- On-line Estimators for Ad-hoc Task Allocation. Elnaz Shafipour Yourdshahi, Matheus Aparecido do Carmo Alves, Leandro Soriano Marcolino, Plamen Angelov 0001. 1999-2001
- Theme Park Simulation based on Questionnaires for Maximizing Visitor Surplus. Hitoshi Shimizu, Tatsushi Matsubayashi, Akinori Fujino, Hiroshi Sawada. 2002-2004
- Fair Cake-Cutting Algorithms with Real Land-Value Data. Itay Shtechman, Rica Gonen, Erel Segal-haLevi. 2005-2007
- BitcoinF: Achieving Fairness For Bitcoin In Transaction Fee Only Model. Shoeb Siddiqui, Ganesh Vanahalli, Sujit Gujar. 2008-2010
- An Axiomatic Approach to Truth Discovery. Joseph Singleton, Richard Booth. 2011-2013
- Robust Market Making via Adversarial Reinforcement Learning. Thomas Spooner, Rahul Savani. 2014-2016
- Analyzing the Effects of Memory Biases and Mood Disorders on Social Performance. Nanda Kishore Sreenivas, Shrisha Rao. 2017-2019
- Neural MMO v1.3: A Massively Multiagent Game Environment for Training and Evaluating Neural Networks. Joseph Suarez, Yilun Du, Igor Mordatch, Phillip Isola. 2020-2022
- Restricted Domains of Dichotomous Preferences with Possibly Incomplete Information. Zoi Terzopoulou, Alexander Karpov, Svetlana Obraztsova. 2023-2025
- Verification-Guided Tree Search. Alvaro Velasquez, Daniel Melcer. 2026-2028
- Thompson Sampling for Factored Multi-Agent Bandits. Timothy Verstraeten, Eugenio Bargiacchi, Pieter J. K. Libin, Diederik M. Roijers, Ann Nowé. 2029-2031
- Too Many Cooks: Coordinating Multi-agent Collaboration Through Inverse Planning. Rose E. Wang, Sarah A. Wu, James A. Evans, Joshua B. Tenenbaum, David C. Parkes, Max Kleiman-Weiner. 2032-2034
- Online Algorithms for Multi-shop Ski Rental with Machine Learned Predictions. Shufan Wang, Jian Li. 2035-2037
- An Interpretable Multimodal Visual Question Answering System using Attention-based Weighted Contextual Features. Yu Wang, Yilin Shen, Hongxia Jin. 2038-2040
- Automatic Synthesis of Generalized Winning Strategy of Impartial Combinatorial Games. Kaisheng Wu, Yong Qiao, Kaidong Chen, Fei Rong, Liangda Fang, Zhao-Rong Lai, Qian Dong, Liping Xiong. 2041-2043
- Embedding Preference Elicitation Within the Search for DCOP Solutions. Yuanming Xiao, Atena M. Tabakhi, William Yeoh 0001. 2044-2046
- A Supervised Topic Model Approach to Learning Effective Styles within Human-Agent Negotiation. Yuyu Xu, David C. Jeong, Pedro Sequeira, Jonathan Gratch, Javed Aslam, Stacy Marsella. 2047-2049
- An Information Distribution Method for Avoiding Hunting Phenomenon in Theme Parks. Hiroaki Yamada 0001, Naoyuki Kamiyama. 2050-2052
- Efficient Deep Reinforcement Learning through Policy Transfer. Tianpei Yang, Jianye Hao, Zhaopeng Meng, Zongzhang Zhang, Yujing Hu, Yingfeng Chen, Changjie Fan, Weixun Wang, Zhaodong Wang, Jiajie Peng. 2053-2055
- Task Coordination in Multiagent Systems. Vahid Yazdanpanah, Mehdi Dastani, Shaheen Fatima, Nicholas R. Jennings, Devrim Murat Yazan, W. Henk Zijm. 2056-2058
- The Sequential Online Chore Division Problem - Definition and Application. Harel Yedidsion, Shani Alkoby, Peter Stone. 2059-2061
- A Computational Model of Hurricane Evacuation Decision. Nutchanon Yongsatianchot, Stacy Marsella. 2062-2064
- Interactive RL via Online Human Demonstrations. Chao Yu, Tianpei Yang, Wenxuan Zhu, Yinzhao Dong, Guangliang Li. 2065-2067
- CoMet: A Meta Learning-Based Approach for Cross-Dataset Labeling Using Co-Training. Guy Zaks, Gilad Katz. 2068-2070
- Explainable and Contextual Preferences based Decision Making with Assumption-based Argumentation for Diagnostics and Prognostics of Alzheimer's Disease. Zhiwei Zeng, Zhiqi Shen, Jing Jih Chin, Cyril Leung, Yu Wang, Ying Chi, Chunyan Miao. 2071-2073
- A POMDP-based Method for Analyzing Blockchain System Security Against Long Delay Attack: (Extended Abstract). Shuangfeng Zhang, Yuan Liu, Xingren Chen, Xin Zhou. 2074-2076
- Learning to Cooperate: Application of Deep Reinforcement Learning for Online AGV Path Finding. Yi Zhang, Yu Qian, Yichen Yao, Haoyuan Hu, Yinghui Xu. 2077-2079
- Opponent Modelling for Reinforcement Learning in Multi-Objective Normal Form Games. Yijie Zhang, Roxana Radulescu, Patrick Mannion, Diederik M. Roijers, Ann Nowé. 2080-2082
- Integrating Independent and Centralized Multi-agent Reinforcement Learning for Traffic Signal Network Optimization. Zhi Zhang, Jiachen Yang, Hongyuan Zha. 2083-2085
- Coalitional Games with Stochastic Characteristic Functions Defined by Private Types. Dengji Zhao, Yiqing Huang, Liat Cohen, Tal Grinshpoun. 2086-2088
- A Generic Metaheuristic Approach to Sequential Security Games. Adam Zychowski, Jacek Mandziuk. 2089-2091
- A Framework for Collaborative and Interactive Agent-oriented Developer Operations. Cleber Jorge Amaral, Timotheus Kampik, Stephen Cranefield. 2092-2094
- Hierarchical and Non-Hierarchical Multi-Agent Interactions Based on Unity Reinforcement Learning. Zehong Cao, Kaichiu Wong, Quan Bai, Chin-Teng Lin. 2095-2097
- A Consensus-based Group Decision Support System using a Multi-Agent MicroServices Approach: Demonstration. João Carneiro, Rui Andrade, Patrícia Alves, Luís Conceição, Paulo Novais, Goreti Marreiros. 2098-2100
- AI-assisted Schedule Explainer for Nurse Rostering. Kristijonas Cyras, Amin Karamlou, Myles Lee, Dimitrios Letsios, Ruth Misener, Francesca Toni. 2101-2103
- Coordination of Prosumer Agents via Distributed Optimal Power Flow: An Edge Computing Hardware Prototype. Daniel Gebbran, Gregor Verbic, Archie C. Chapman, Sleiman Mhanna. 2104-2106
- Trading Agent Competition with Autonomous Economic Agents. David Minarsch, Marco Favorito, Ali Hosseini, Jonathan Ward. 2107-2110
- MsATL: A Tool for SAT-Based ATL Satisfiability Checking. Artur Niewiadomski 0001, Magdalena Kacprzak, Damian Kurpiewski, Michal Knapik, Wojciech Penczek, Wojciech Jamroga. 2111-2113
- MARTINE: Multi-Agent based Real-Time INfrastructure for Energy. Tiago Pinto, Luis Gomes 0001, Pedro Faria, Filipe Sousa, Zita A. Vale. 2114-2116
- User-Models to Drive an Adaptive Virtual Advisor: Demonstration. Hedieh Ranjbartabar, Deborah Richards, Ayse Aysin Bilgin, Cat Kutay, Samuel Mascarenhas. 2117-2119
- DALI: An Agent-Plug-In System to "Smartify" Conventional Traffic Control Systems. Behnam Torabi, Rym Zalila-Wenkstern. 2120-2122
- VerSecTis - An Agent based Model Checker for Security Protocols. Agnieszka M. Zbrzezny, Andrzej Zbrzezny, Sabina Szymoniak, Olga Siedlecka-Lamch, Miroslaw Kurkowski. 2123-2125
- VERIFCAR: A Framework for Modeling and Model checking Communicating Autonomous Vehicles. Johan Arcile, Raymond R. Devillers, Hanna Klaudel. 2126-2127
- Strategyproof Multi-Item Exchange Under Single-Minded Dichotomous Preferences. Haris Aziz 0001. 2128-2130
- Sequential Voting in Multi-agent Soft Constraint Aggregation. Cristina Cornelio, Maria Silvia Pini, Francesca Rossi, Kristen Brent Venable. 2131-2133
- Strategic Negotiations for Extensive-Form Games. Dave De Jonge, Dongmo Zhang. 2134-2136
- Inferring True Voting Outcomes in Homophilic Social Networks. John A. Doucette, Alan Tsang, Hadi Hosseini, Kate Larson, Robin Cohen. 2137-2139
- COMBIMA: Truthful, Budget Maintaining, Dynamic Combinatorial Market. Rica Gonen, Ozi Egri. 2140-2142
- Probabilistic Physical Search on General Graphs: Approximations and Heuristics. Noam Hazon, Mira Gonen. 2143-2145
- A Very Condensed Survey and Critique of Multiagent Deep Reinforcement Learning. Pablo Hernandez-Leal, Bilal Kartal, Matthew E. Taylor. 2146-2148
- A Formal Framework for Reasoning about Opportunistic Propensity in Multi-agent Systems. Jieting Luo, John-Jules Ch. Meyer, Max Knobbout. 2149-2151
- Norm Emergence in Multiagent Systems: A Viewpoint Paper. Andreasa Morris-Martin, Marina De Vos, Julian A. Padget. 2152-2154
- Solving the Fair Electric Load Shedding Problem in Developing Countries. Olabambo I. Oluwasuji, Obaid Malik, Jie Zhang 0008, Sarvapali D. Ramchurn. 2155-2157
- Multi-Objective Multi-Agent Decision Making: A Utility-based Analysis and Survey. Roxana Radulescu, Patrick Mannion, Diederik M. Roijers, Ann Nowé. 2158-2160
- Why, Who, What, When and How about Explainability in Human-Agent Systems. Avi Rosenfeld, Ariella Richardson. 2161-2164
- Agents Teaching Agents: A Survey on Inter-agent Transfer Learning. Felipe Leno da Silva, Garrett Warnell, Anna Helena Reali Costa, Peter Stone. 2165-2167
- Long-Run Multi-Robot Planning Under Uncertain Task Durations. Carlos Azevedo. 2168-2170
- Modeling and Comparing Robot Behaviors for Anomaly Detection. Davide Azzalini. 2171-2173
- Competence-Aware Systems for Long-Term Autonomy. Connor Basich. 2174-2175
- Computer-aided Reasoning about Collective Decision Making. Arthur Boixel. 2176-2178
- Vision for Decisions: Utilizing Uncertain Real-Time Information and Signaling for Conservation. Elizabeth Bondi. 2179-2181
- Efficiency and Fairness of Resource Utilisation under Uncertainty. Jan Bürmann. 2182-2184
- Computing Desirable Partitions in Coalition Formation Games. Martin Bullinger. 2185-2187
- Cost Effective Interventions in Complex Networks Using Agent-Based Modelling and Simulations. Theodor Cimpeanu. 2188-2190
- A Theoretical Framework for Self-Organized Task Allocation in Large Swarms. John Harwell. 2191-2192
- Adaptive Agent-Based Simulation for Individualized Training. Johan Källström. 2193-2195
- Decentralised Runtime Norm Synthesis. Andreasa Morris-Martin. 2196-2198
- Value-Aligned and Explainable Agents for Collective Decision Making: Privacy Application. Francesca Mosca. 2199-2200
- Reinforcement Learning Algorithms for Autonomous Adaptive Agents. Sindhu Padakandla. 2201-2203
- Achieving Emergent Governance in Competitive Multi-Agent Systems. Michael Pernpeintner. 2204-2206
- A Utility-Based Perspective on Multi-Objective Multi-Agent Decision Making. Roxana Radulescu. 2207-2208
- Computational Methods for Simulating Biased Agents. Jaelle Scheuerman. 2209-2210
- Truth Discovery: Who to Trust and What to Believe. Joseph Singleton. 2211-2213
- Algorithmic Fairness for Networked Algorithms. Ana-Andreea Stoica. 2214-2216
- Towards Multi-Robot Coordination under Temporal Uncertainty. Charlie Street. 2217-2218
- New Challenges in Matching with Constraints. Zhaohong Sun 0001. 2219-2221
- Incomplete Opinions in Collective Decision Making. Zoi Terzopoulou. 2222-2224
- Multimodal Representation Learning for Robotic Cross-Modality Policy Transfer. Miguel Vasco. 2225-2227
- Balance Between Scalability and Optimality in Network Security Games. Kai Wang. 2228-2230
- Implementing Securities Based Decision Markets with Stochastic Decision Rules. Wenlong Wang. 2231-2233
- Incentive Mechanisms for Data Privacy Preservation and Pricing. Mengxiao Zhang. 2234-2236