The asymptotic equipartition property in reinforcement learning and its relation to return maximization

Kazunori Iwata, Kazushi Ikeda, Hideaki Sakai. The asymptotic equipartition property in reinforcement learning and its relation to return maximization. Neural Networks, 19(1):62-75, 2006. [doi]

Abstract

Abstract is missing.