Learning and Exploiting Shaped Reward Models for Large Scale Multiagent RL

Arambam James Singh, Akshat Kumar, Hoong Chuin Lau. Learning and Exploiting Shaped Reward Models for Large Scale Multiagent RL. In Susanne Biundo, Minh Do, Robert Goldman, Michael Katz, Qiang Yang 0001, Hankz Hankui Zhuo, editors, Proceedings of the Thirty-First International Conference on Automated Planning and Scheduling, ICASP 2021, Guangzhou, China (virtual), August 2-13, 2021. pages 588-596, AAAI Press, 2021. [doi]

@inproceedings{SinghKL21,
  title = {Learning and Exploiting Shaped Reward Models for Large Scale Multiagent RL},
  author = {Arambam James Singh and Akshat Kumar and Hoong Chuin Lau},
  year = {2021},
  url = {https://ojs.aaai.org/index.php/ICAPS/article/view/16007},
  researchr = {https://researchr.org/publication/SinghKL21},
  cites = {0},
  citedby = {0},
  pages = {588-596},
  booktitle = {Proceedings of the Thirty-First International Conference on Automated Planning and Scheduling, ICASP 2021, Guangzhou, China (virtual), August 2-13, 2021},
  editor = {Susanne Biundo and Minh Do and Robert Goldman and Michael Katz and Qiang Yang 0001 and Hankz Hankui Zhuo},
  publisher = {AAAI Press},
  isbn = {978-1-57735-867-1},
}