Learning and Exploiting Shaped Reward Models for Large Scale Multiagent RL

Arambam James Singh, Akshat Kumar, Hoong Chuin Lau. Learning and Exploiting Shaped Reward Models for Large Scale Multiagent RL. In Susanne Biundo, Minh Do, Robert Goldman, Michael Katz, Qiang Yang, Hankz Hankui Zhuo, editors, Proceedings of the Thirty-First International Conference on Automated Planning and Scheduling, ICAPS 2021, Guangzhou, China (virtual), August 2-13, 2021, pages 588-596. AAAI Press, 2021.
