Exploration Bonuses Based on Upper Confidence Bounds for Sparse Reward Games

Naoki Mizukami, Jun Suzuki, Hirotaka Kameko, Yoshimasa Tsuruoka. Exploration Bonuses Based on Upper Confidence Bounds for Sparse Reward Games. In Mark H. M. Winands, H. Jaap van den Herik, Walter A. Kosters, editors, Advances in Computer Games - 15th International Conferences, ACG 2017, Leiden, The Netherlands, July 3-5, 2017, Revised Selected Papers. Volume 10664 of Lecture Notes in Computer Science, pages 165-175, Springer, 2017. [doi]

Abstract

Abstract is missing.