A Reinforcement Learning Algorithm with Polynomial Interaction Complexity for Only-Costly-Observable MDPs

Roy Fox, Moshe Tennenholtz. A Reinforcement Learning Algorithm with Polynomial Interaction Complexity for Only-Costly-Observable MDPs. In Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, July 22-26, 2007, Vancouver, British Columbia, Canada. pages 553-558, AAAI Press, 2007.

Abstract

Abstract is missing.