Dyna-like reinforcement learning based on accumulative and average rewards

Kao-Shing Hwang, Chia-Yue Lo. Dyna-like reinforcement learning based on accumulative and average rewards. In Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, Istanbul, Turkey, 10-13 October 2010. pages 1250-1254, IEEE, 2010. [doi]

Abstract

Abstract is missing.