Average Reward Optimization with Multiple Discounting Reinforcement Learners

Chris Reinke, Eiji Uchibe, Kenji Doya. Average Reward Optimization with Multiple Discounting Reinforcement Learners. In Derong Liu, Shengli Xie, Yuanqing Li, Dongbin Zhao, El-Sayed M. El-Alfy, editors, Neural Information Processing - 24th International Conference, ICONIP 2017, Guangzhou, China, November 14-18, 2017, Proceedings, Part I. Volume 10634 of Lecture Notes in Computer Science, pages 789-800, Springer, 2017.
